DATA SCIENCE • FAULT TOLERANCE • ML

Data Scientist. Fault Tolerance

Research-driven Data Scientist & ML Engineer with a PhD focus on fault tolerance in high-performance computing. I apply data science, machine learning, and large language models (LLMs) to solve complex problems, from building predictive models and analytics pipelines to designing dependable systems. My work bridges practical data-driven insights with research-level rigor—making complex pipelines feel simple and impactful.

Ahmer Jamil — portrait
|

Education

PhDDependable Systems

The University of Florida — PhD in Computer Science

Aug 2025 – Present • Advisor: Prof. Guanpeng Li • Focus: dependable HPC, fault tolerance, GNN, DBN, SDC modeling, LLVM, Fault Injection.

Current
PhDDependable Systems

The University of Iowa — PhD in Computer Science

Aug 2024 – 2025 • Advisor: Prof. Guanpeng Li • Focus: dependable HPC, fault tolerance, LLVM-based fault injection & SDC modeling.

Transferred CGPA 4.0
BScDistinction

LUMS — BSc Computer Science

2019 – 2023 • CGPA 3.78 • Dean’s Honor List (2019–2022). Core: ML, DL, AI, Speech, Data Mining, HCI, Security, SE, Automata, NCC.

Honors CGPA 3.78

Skills & Abilities

Machine Learning & Data Science

PyTorch85%
TensorFlow80%
Scikit-learn75%
Pandas / NumPy85%

Development

JavaScript (ES6+)80%
React / React Native85%
HTML5 / CSS375%

Programming Languages

Python90%
C++80%
SQL75%

Systems & Tools

LLVM75%
Linux80%
Git / Docker75%

Projects

Project image
SnowflakeDataikuSMOTE

Fraud Detection

Developed scalable credit card fraud detection models using ensemble learning, SMOTE, and cloud-based tools.

  • Focused on refining credit card transaction security through techniques such as ensemble learning, data preprocessing, and the application of SMOTE to address class imbalance challenges.
  • Executed comprehensive literature reviews, conducted exploratory data analysis, and applied and compared classification and clustering techniques for fraud detection.
  • Integrated Snowflake and Dataiku tools to harness scalable computational power for machine learning model development and assessment.
  • Addressed practical challenges in credit card fraud detection, including class imbalance and memory constraints, showcasing adaptability and solution-oriented thinking.
CLARIBA LUMS
Project image
LLMRAG

LLM-Powered Chatbot

Built a knowledge-based chatbot with RAG, ChromaDB, and GPT-3.5, achieving ~85% accuracy on domain-specific queries.

  • Developed a knowledge-based chatbot leveraging Retrieval-Augmented Generation (RAG) and ChromaDB vector database.
  • Integrated OpenAI GPT-3.5 to enhance conversational fluency and contextual understanding.
  • Implemented a clean HTML/CSS interface with backend support in Python for seamless interaction.
  • Delivered ~85% accuracy on domain-specific queries, supporting knowledge retrieval use cases.
Systems Limited
Project image
MLRetention

Churn Prediction

Predictive churn modeling with engineered RFM/tenure features and uplift analysis.

  • Developed predictive models using RFM features, tenure, and support ticket history with logistic regression and gradient boosting.
  • Built end-to-end pipelines including feature engineering, model training, and uplift analysis in PySpark and Python.
  • Monitored model drift in production-like settings, ensuring long-term reliability.
  • Identified at-risk customers with ~80% precision, providing actionable insights for retention strategies.
Systems Limited
Project image
MLSHAP

Customer Segmentation

Supervised pipelines with SHAP explainability to power targeted campaigns.

  • Engineered supervised learning pipelines in Python using scikit-learn and XGBoost to generate customer clusters.
  • Incorporated behavioral and demographic features, applying probability calibration and ensemble trees.
  • Leveraged SHAP values to explain model outputs, aligning insights with marketing strategies.
  • Increased simulated campaign response rates by ~12% through targeted customer segmentation.
Systems Limited
Project image
SeleniumFlaskGeoJSON

Interactive Geospatial Constituency Plotting

Built an interactive geospatial web app by automating data collection to GeoJSON and visualizing it with Flask + Leaflet.

  • Automated web scraping with Selenium to collect comprehensive National and Provincial Constituency data in Pakistan.
  • Standardized and formatted acquired datasets to prepare them for advanced analysis and visualization.
  • Engineered an interactive webpage using Python, Flask, and HTML to display constituency- level information.
  • Integrated GeoJSON outputs into the platform, enabling seamless exploration of geospatial data.
  • Enhanced graphical representation of data using online plotting tools for improved accessibility.
  • Implemented user-friendly features such as zoom, pan, and constituency-level detail retrieval to maximize usability.
VentureDive
Project image
ONNXWebGL

In-Browser MNIST Classifier

Implemented a MNIST digit classifier using ONNX Runtime Web with WebGL acceleration and WASM fallback.

Try Live
  • Digit prediction fully client-side via ONNX Runtime Web.
  • WebGL acceleration with graceful WASM fallback.
  • Path-robust model loading for GitHub Pages.
Personal

Experience

View full list
UF

Graduate Research Assistant — University of Florida

Aug 2025 – Present · Part-time

ML × HPC · fault tolerance · LLVM/LLFI

Iowa

Research Assistant — Dependable Systems Lab, University of Iowa

Jan 2025 – Aug 2025

Researched fault tolerance at the intersection of compiler analysis and ML, developing LLVM-based tools to study silent data corruption

UIowa

Teaching Assistant — Artificial Intelligence, University of Iowa

Aug 2024 – Dec 2024

Supported delivery of the Artificial Intelligence course, mentoring students and enabling stronger outcomes through hands-on guidance.

DebloatBench

Research Assistant — DebloatBench (SRI • UArizona • LUMS)

Aug 2023 – Aug 2024

Contributed to DebloatBench, a unified framework for evaluating container debloaters across diverse design and execution environments, in collaboration with SRI and University of Arizona.

Systems

Associate Consultant — AI, Systems Limited

Jun 2023 – Jul 2024

Delivered ML & LLM solutions for global clients while leading training and enablement efforts on modern NLP/LLM practices.

LUMS

Fraud Detection — Clariba SEIDOR × LUMS

Aug 2022 – Aug 2023

Credit-card fraud detection with Clariba SEIDOR during my time at LUMS (ISPL); built scalable ML pipelines on large, real-world transaction data.

VentureDive

Data Analysis Intern — VentureDive

Jul 2022 – Oct 2022

Built automated geospatial data pipelines and interactive visualization tools for constituency-level insights in Pakistan.

HBL

Branch Banking Intern — Habib Bank Limited

Jun 2022 – Jul 2022

Applied ML and analytics to measure branch performance and employee effectiveness while supporting internal recognition programs.

LUMS

Peer Advisor — LUMS

Aug 2021 – Jun 2023

Served as a Peer Advisor at LUMS, mentoring undergraduate students on course planning and academic success strategies.

LUMS

Teaching Assistant - LUMS

Aug 2021 – Dec 2022

Teaching Assistant for Computational Problem Solving at LUMS, supporting over 100 students in programming fundamentals.

JS Bank

Data Science Intern — JS Bank

Jun 2021 – Aug 2021

ETL optimization · SQL · predictive analytics

Certifications

☁️Cloud & Databases

  • Microsoft Azure SQL (Coursera)
  • Explore Core Data Concepts in Microsoft Azure (Coursera)
  • Intermediate SQL Server (DataCamp)
  • Introduction to SQL Server (DataCamp)
  • Joining Data in SQL (DataCamp)

🤖AI & Machine Learning

  • Generative AI Fundamentals (Databricks)
  • Large Language Models: Application through Production (Databricks)
  • Unsupervised Learning in Python (DataCamp)
  • Machine Learning with scikit-learn (DataCamp)
  • Introduction to Python (DataCamp)

📊Data Analysis & Visualization

  • Data Analysis in Excel (DataCamp)
  • Intermediate Data Modeling in Power BI (DataCamp)
  • Data Modeling in Power BI (DataCamp)
  • Introduction to Power BI (DataCamp)
  • Data Analysis in Spreadsheets (DataCamp)
  • Introduction to Spreadsheets (DataCamp)

Draw-a-Digit (0–9)

▼ Try the in-browser demo

Draw below and hit Predict.

Probabilities

Get in touch

Based in the US • Open to research collabs & ML systems work.

ahmerjamil.aj@gmail.com +1 319-936-1014
Copied!