Skip to content

Add gemini 3.1 flash lite both as model-to-be-evaluated and LLM judge.#133

Open
daanschouten-kaiko wants to merge 1 commit into
MedARC-AI:mainfrom
daanschouten-kaiko:main
Open

Add gemini 3.1 flash lite both as model-to-be-evaluated and LLM judge.#133
daanschouten-kaiko wants to merge 1 commit into
MedARC-AI:mainfrom
daanschouten-kaiko:main

Conversation

@daanschouten-kaiko

Copy link
Copy Markdown

Small PR that adds support for Gemini 3.1 flash lite to be used as LLM judge and to evaluate the model itself. While adding this model to the medmarks-endpoints.toml enables its direct evaluation, usage of Gemini 3.1 flash lite as LLM judge with the official Google endpoint requires some additional tweaking through removal of unsupported sampling parameters. Verified with a quick 5-sample test on pubhealthbench that both routes of model-under-evaluation and LLM judge work as expected.

@CLAassistant

CLAassistant commented Jun 12, 2026

Copy link
Copy Markdown

CLA assistant check
All committers have signed the CLA.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants