Skip to content
View mertsatilmaz's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report mertsatilmaz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. inspect_ai inspect_ai Public

    Forked from UKGovernmentBEIS/inspect_ai

    Inspect: A framework for large language model evaluations

    Python

  2. UKGovernmentBEIS/inspect_ai UKGovernmentBEIS/inspect_ai Public

    Inspect: A framework for large language model evaluations

    Python 2.1k 517

  3. AlexsJones/llmfit AlexsJones/llmfit Public

    Hundreds of models & providers. One command to find what runs on your hardware.

    Rust 26.5k 1.6k

  4. OWASP/Agent-Security-Regression-Harness OWASP/Agent-Security-Regression-Harness Public

    Executable security regression testing for agentic applications and MCP-integrated systems.

    Python 22 21

  5. openrepobench openrepobench Public

    A reproducible benchmark harness for evaluating coding agents on realistic repository maintenance tasks.

    Python 1