Independent audit of a fine-tuned LLM tool-calling PoC: BFCL regression decomposition, inference-stack risk assessment, and production recommendation for a FinTech client. Qwen-2.5, LoRA, SGLang, H100.
machine-learning inference fintech lora model-evaluation fine-tuning mlops huggingface llm vllm function-calling qwen sglang bfcl toolace
Updated Apr 27, 2026 - Python