SarathL754

SarathL754

Pinned Loading

vigil3d-video-inference vigil3d-video-inference Public

End-to-end video violence detection system using a 3D CNN, deployed with FastAPI, Docker, AWS EC2, S3, and a React frontend on Vercel.

Python
Reducing-Hallucinations-with-Direct-Preference-Optimization Reducing-Hallucinations-with-Direct-Preference-Optimization Public

An RLHF-inspired DPO framework that explicitly teaches LLMs when to refuse, significantly reducing hallucinations.
Decision-Transformer-from-Scratch-HalfCheetah-Minari-BC-vs-Return-Conditioned-DT Decision-Transformer-from-Scratch-HalfCheetah-Minari-BC-vs-Return-Conditioned-DT Public

Implementing Decision Transformers from scratch for offline RL, benchmarking return-conditioned policies against Behavior Cloning.

Python
VulneraAI-agent VulneraAI-agent Public

An agentic LLM security scanner that analyzes applications against OWASP Top 10 using tool-calling, LangGraph, and AWS Bedrock.

Python
Email-Assistant-langgraph Email-Assistant-langgraph Public

Python
Multi-agent-RL-texas-holdem-aec Multi-agent-RL-texas-holdem-aec Public

An engineering-focused multi-agent reinforcement learning system for Texas Hold’em using PettingZoo AEC and a custom PyTorch PPO self-play setup.

Python