bfcl
Here are 4 public repositories matching this topic...
Independent audit of a fine-tuned LLM tool-calling PoC — BFCL regression decomposition, inference stack risk assessment, and production recommendation for a FinTech client. Qwen-2.5, LoRA, SGLang, H100.
-
Updated
Apr 27, 2026 - Python
Pre-generation tool-call gating via linear probes on LLM hidden states. F1 ≈ 0.91–0.94 on BFCL v4, 14–22× faster than full generation. Cross-architecture transfer across Llama / Qwen / Phi / Mistral (3B–7B) with ≥96% retention.
-
Updated
May 8, 2026 - Jupyter Notebook
Tool-call reliability fine-tuning lab with open-weight model training, benchmark evaluation, and serving notes.
-
Updated
May 11, 2026 - Python
Improve this page
Add a description, image, and links to the bfcl topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the bfcl topic, visit your repo's landing page and select "manage topics."