Skip to content
View JatinYadav2006's full-sized avatar

Block or report JatinYadav2006

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
JatinYadav2006/README.md

Jatin Yadav

Final-year CS undergrad at MNNIT Allahabad building AI systems at the intersection of reinforcement learning, LLMs, and production engineering.


🏆 Achievements

Meta × Hugging Face Hackathon 2026 — Top 800 of 31,000+ teams globally (Top 2.6%)
Built Pulse-ER: a Python-based RL training environment for AI-driven trauma care

Sankalp Innovation Challenge 2026 — National Finalist, Team Lead
Built Governance Memory AI: multi-agent civic platform for crisis detection and dispatch
National AI & Innovation Summit, MNNIT Allahabad


🔨 Featured Projects

RL training environment for AI-driven trauma patient management

  • Fine-tuned Qwen2.5-3B-Instruct with GRPO + LoRA (rank 16) for ATLS-aligned protocol sequencing
  • Engineered reward shaping pipelines achieving 8.33 expert reward vs −17.15 random baseline (25× improvement)
  • Stack: Python · TRL · GRPO · LoRA · FastAPI · Docker · Pulse Physiology Engine

Scalable multi-agent civic platform for complaint clustering and crisis detection

  • Architected end-to-end RESTful API pipeline for multilingual complaint intake and dispatch
  • Built semantic clustering pipeline with SentenceTransformers + DBSCAN for real-time crisis detection
  • Stack: Python · FastAPI · PostgreSQL · SentenceTransformers · scikit-learn · Streamlit

🛠 Stack

Core: Python · PyTorch · FastAPI · Docker
LLMs & Training: Hugging Face Transformers · TRL · GRPO · LoRA · PEFT · Qwen2.5
ML: scikit-learn · Pandas · NumPy · DBSCAN · SentenceTransformers
Databases: PostgreSQL · MongoDB · SQLite
Infra: Git · Docker · GCP · REST APIs
Interests: Reinforcement Learning · LLM Fine-Tuning · Agentic AI · RAG · NLP


📬 Connect

LinkedIn HuggingFace Email


Currently building agentic AI systems and exploring LLM fine-tuning for domain-specific reasoning. Open to AI/ML Engineer and Research roles — 2027 graduate.

Pinned Loading

  1. Pulse-ER-env Pulse-ER-env Public

    RL training environment for AI-driven trauma care. Meta × HuggingFace Hackathon finalist — top 2.6% of 31,000+ teams.

    Python 1 1

  2. governance-memory-ai governance-memory-ai Public

    Multi-agent civic AI platform for multilingual complaint clustering, crisis detection, and real-time dispatch. Sankalp Innovation Challenge finalist — National AI & Innovation Summit, MNNIT Allahabad.

    Python