Skip to content
View Emart29's full-sized avatar
☺️
open to work
☺️
open to work

Block or report Emart29

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Emart29/README.md

Hey, I'm Emmanuel Nwanguma 👋

Machine Learning Engineer · Data Scientist · AI Systems Builder

I don't just build models — I build systems that make decisions.

LinkedIn Portfolio Medium Email


🧭 About Me

ML Engineer who takes models from notebook to production. I build end-to-end AI systems — prediction services, RAG pipelines, LLM evaluation frameworks — with observability, explainability, and deployment baked in from day one. Currently focused on LLM reliability, AI agents, and production-grade ML infrastructure.


🛠️ Tech Stack

Languages

Python SQL Bash

ML & Deep Learning

PyTorch TensorFlow scikit-learn Hugging Face Keras

LLMs & AI Agents

LangChain OpenAI RAG Prompt Engineering AI Agents

MLOps & Deployment

Docker Kubernetes FastAPI MLflow GitHub Actions

Cloud & Data

AWS GCP BigQuery PostgreSQL

Data & Visualization

Pandas NumPy Tableau Jupyter

Explainability & Monitoring

SHAP Evidently AI


🚀 Featured Projects

End-to-end ML system · 88.5% accuracy · FastAPI + Streamlit + SHAP + MLflow + Docker

Production healthcare ML system with SHAP explainability, MLflow experiment tracking, REST API, interactive dashboard, and full Docker deployment. Built for trust, not just performance.


QLoRA fine-tuning · +69% ROUGE-L over base · Live Hugging Face demo

Fine-tuned Microsoft Phi-4 Mini 3.8B on SEC 10-K financial Q&A. Demonstrates practical LLM adaptation for domain-specific enterprise use cases.


Automated prompt regression detection · CI/CD for LLMs

Production evaluation pipeline that catches quality regressions before prompts reach users. CI/CD for LLM behavior — because shipping a broken prompt is just as bad as shipping broken code.


Persistent memory · Semantic search · Multi-tenant · Python + TypeScript SDKs

Memory infrastructure for AI agents with semantic search, multi-tenant isolation, 8 framework adapters, and dual-language SDK support.


Multiple search strategies · Semantic chunking · Real-time document processing

Production-ready RAG pipeline with hybrid retrieval, semantic chunking, and real-time ingestion. Goes beyond naive vector search.


Fine-tuned DistilBERT · 19 categories · Drift detection · React dashboard

NLP classifier with FastAPI serving, real-time drift detection via Evidently AI, and a React analytics dashboard. Fully containerized.


📈 GitHub Activity

GitHub Stats Top Languages

GitHub Streak


🤝 Open to Opportunities

I'm actively looking for ML Engineer and Data Scientist roles where I can build production AI systems end-to-end. Especially interested in teams working on LLM reliability, MLOps infrastructure, or AI-powered products.

📬 nwangumaemmanuel29@gmail.com · 🌐 emart29.vercel.app

Pinned Loading

  1. phi4-finance-finetuning phi4-finance-finetuning Public

    Fine-tuning Microsoft Phi-4 Mini 3.8B on SEC 10-K financial Q&A using QLoRA — +69% ROUGE-L over base model. Live demo on Hugging Face.

    Jupyter Notebook 2

  2. llm-quality-gate llm-quality-gate Public

    pytest for LLMs — automated quality gate that catches prompt regressions and model degradations before they reach production. CI/CD integration, multi-provider support, and a live monitoring dashbo…

    Python 1

  3. emartai/remembr emartai/remembr Public

    Persistent memory infrastructure for AI agents — semantic search, multi-tenant isolation, 8 framework adapters, Python + TypeScript SDKs.

    Python 1

  4. ecommerce-product-classifier ecommerce-product-classifier Public

    Production-ready NLP classifier: fine-tuned DistilBERT across 19 e-commerce categories with FastAPI serving, real-time drift detection via Evidently AI, and a React analytics dashboard. Fully conta…

    Jupyter Notebook 1

  5. ragwell ragwell Public

    A production-ready Retrieval-Augmented Generation (RAG) pipeline with multiple search strategies, semantic chunking, and real-time document processing.

    Python 1

  6. Heart-Disease-Prediction Heart-Disease-Prediction Public

    🫀 Production-ready ML system for heart disease risk assessment with 88.5% accuracy. Features FastAPI REST API, Streamlit dashboard, SHAP explainability, MLflow tracking, and Docker deployment. Demo…

    Jupyter Notebook 2