Binary safety verdicts (SAFE/HELD/LEAK/MISS/BROKE) + persona fan-out for LLM pipeline evals
python testing jailbreak evaluation safety safety-critical guardrails llm prompt-injection crisis-detection eval-framework verdicts
Updated May 5, 2026 - Python