eval-suite

Here are 3 public repositories matching this topic...

Search, cherry-pick, and export examples from public AI evaluation datasets. Built for AI engineers and AI agents.

Topics: machine-learning, ai, mcp, evaluation, benchmarks, datasets, eval, llm, eval-suite

Transform vague optimization problems into fully scaffolded autonomous experiment loops. Claude Code skill.

Python LangGraph port of FitFi's recommendation pipeline, with an eval suite. Exercise project for framework practice.

Topics: python, recommendation-system, uv, pydantic, langgraph, eval-suite
