Skip to content
@Handshake-AI-Research

Handshake AI Research

Untangling the science of frontier data and evaluation

Popular repositories Loading

  1. bankertoolbench bankertoolbench Public

    AI Benchmark for Investment Banking Workflows

    Python 11

  2. gandalf-the-grader gandalf-the-grader Public

    Agent-as-a-Judge grading framework for evaluating AI outputs/deliverables

    Python 6

  3. rle-pkg rle-pkg Public

    Packaging architecture for RL environments

    Dockerfile 5

  4. harbor harbor Public

    Forked from harbor-framework/harbor

    Harbor is a framework for running agent evaluations and creating and using RL environments.

    Python 1

  5. hart-org-gh-actions hart-org-gh-actions Public

    Common GitHub Actions for supporting our workflows

    JavaScript

  6. assets assets Public

    Binary assets to support other repos' READMEs

Repositories

Showing 6 of 6 repositories

Top languages

Loading…

Most used topics

Loading…