Skip to content
View HemantBK's full-sized avatar

Block or report HemantBK

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
HemantBK/README.md

Hemant Kumar B K

Typing SVG

LinkedIn Email arXiv


ML Engineer building production-grade AI systems with safety at the core. Currently researching Multi-Agent RL for cybersecurity at the University of Arizona and co-authoring StepShield β€” a safety benchmark for autonomous code agents (submitted to ICML 2026). Previously built recommendation engines at Escape LLC (30% engagement lift) and agentic RAG chatbots at Omdena (95% reduction in harmful responses).

I don't treat AI safety as a checkbox β€” I treat it as an engineering discipline.


πŸ”¬ Research

πŸ›‘οΈ StepShield β€” Co-Author Β  Paper ICML 2026

First benchmark for evaluating when autonomous code agents go rogue β€” not just whether they do. Detects specification violations (data exfiltration, unauthorized access) in real-time across 9,213 agent trajectories. Early detection cuts monitoring costs by 75% (~$108M projected savings).

Python PyTorch LLM Safety Red-Teaming Autonomous Agents


πŸš€ Featured Projects

πŸ’° Dynamic Pricing Engine

Production-grade ML pricing system

XGBoost demand forecasting + price elasticity estimation + scipy revenue optimization. FastAPI serving, Streamlit dashboard, MLflow tracking, Evidently drift monitoring.

Python XGBoost FastAPI MLflow Streamlit

Code

πŸ—£οΈ AI Voice Assistant

Full-stack speech pipeline: STT β†’ LLM β†’ TTS

End-to-end voice assistant with FastAPI backend, React frontend, and Docker containerization. Speech-to-Text, LLM reasoning, and Text-to-Speech in one pipeline.

JavaScript FastAPI React Docker LLM

Code

🌐 Multilingual Sentiment & Emotion Engine

5 languages + Hindi-English code-switching

Multi-task XLM-RoBERTa with LoRA adapters, ONNX INT8 inference, and cross-lingual transfer. Production-grade multilingual NLP pipeline.

Python XLM-RoBERTa LoRA ONNX NLP

Code

πŸ“ˆ AI-Driven Algorithmic Trading

Sentiment-aware stock prediction system

Combines NLP sentiment analysis on financial headlines with quantitative indicators. TimeGPT predictions + Power BI dashboard. 20% higher prediction accuracy.

Python NLP TimeGPT Sentiment Analysis

Code


πŸ› οΈ Tech Stack

Languages

Python C++ SQL Java Bash

ML & DL

PyTorch TensorFlow HuggingFace scikit-learn W&B

LLM & Agents

LangChain CrewAI RAG Prompt Engineering

MLOps & Cloud

AWS Docker FastAPI GitHub Actions MLflow GCP

Data

PostgreSQL Pandas Power BI Tableau


Open to ML Engineer, AI Safety, and AI Researcher roles β€” remote & relocation
Let's build AI systems that are powerful AND trustworthy.

Pinned Loading

  1. dynamic-pricing-engine dynamic-pricing-engine Public

    Production-grade ML pricing system β€” XGBoost demand forecasting + price elasticity estimation + scipy revenue optimization. FastAPI serving, Streamlit dashboard, MLflow tracking, Evidently drift mo…

    Python

  2. AI-Voice-Assistant AI-Voice-Assistant Public

    Full-stack AI voice assistant: Speech-to-Text β†’ LLM β†’ Text-to-Speech | FastAPI + React + Docker

    JavaScript

  3. Multilingual-Sentiment-Emotion-Intelligence-Engine Multilingual-Sentiment-Emotion-Intelligence-Engine Public

    Production-grade multilingual sentiment & emotion analysis engine covering 5 languages + Hindi-English code-switching. Multi-task XLM-RoBERTa with LoRA adapters, ONNX INT8 inference, and cross-ling…

    Python

  4. Algorithmic-Trading-AI Algorithmic-Trading-AI Public

    AI-powered algorithmic trading system that combines FinBERT sentiment analysis, spaCy NER, and TimeGPT forecasting to generate BUY/SELL/HOLD signals from real-time financial news.

    Jupyter Notebook 18

  5. LLaMA-Sum-Fine-Tuning LLaMA-Sum-Fine-Tuning Public

    Fine-tuned Meta's LLaMA 3.2 1B for text summarization using QLoRA (4-bit quantization + LoRA), achieving 40%+ improvement in ROUGE-2 over the base model on CNN/DailyMail dataset.

    Python 3

  6. MLShield MLShield Public

    ML-infrastructure-aware anomaly detection system for protecting model weights against exfiltration, using a 3-layer cascaded architecture (Rules β†’ ML β†’ LLM).

    Python