Search, cherry-pick, and export examples from public AI evaluation datasets. Built for AI engineers and AI agents.
Updated Mar 30, 2026 · Python
Transform vague optimization problems into fully scaffolded autonomous experiment loops. Claude Code skill.
Python LangGraph port of FitFi's recommendation pipeline, with an eval suite. Exercise project for framework practice.