🎯
Focusing
Founder @ SecPortal.io | OWASP Project Lead | Security Researcher | CVE Disclosures
Pinned Loading
-
inspect_ai
inspect_ai PublicForked from UKGovernmentBEIS/inspect_ai
Inspect: A framework for large language model evaluations
Python
-
UKGovernmentBEIS/inspect_ai
UKGovernmentBEIS/inspect_ai PublicInspect: A framework for large language model evaluations
-
AlexsJones/llmfit
AlexsJones/llmfit PublicHundreds of models & providers. One command to find what runs on your hardware.
-
OWASP/Agent-Security-Regression-Harness
OWASP/Agent-Security-Regression-Harness PublicExecutable security regression testing for agentic applications and MCP-integrated systems.
-
openrepobench
openrepobench PublicA reproducible benchmark harness for evaluating coding agents on realistic repository maintenance tasks.
Python 1
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.



