Modified mt_bench with API and HF scripts for LFMs.
Updated Jul 9, 2025 - Python
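⚖️ Dual-Judge: Making AI test results truly convincing | Dual-LLM cross-validation to eliminate single-model bias | A general evaluation framework independent of any specific agent | Making AI Evaluation Trustworthy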