A Framework for LLM-based Multi-Agent Reinforced Training and Inference
-
Updated
Nov 20, 2025 - Python
A Framework for LLM-based Multi-Agent Reinforced Training and Inference
Train smarter: adaptive task selection that targets your model's learning frontier. Plugs directly into TRL, OpenRLHF, Axolotl, and lm-evaluation-harness.
🌐 Streamline LLM development with ready-to-use environment templates for efficient setup and deployment.
A list of uv environments templates for LLM development.
Add a description, image, and links to the openrlhf topic page so that developers can more easily learn about it.
To associate your repository with the openrlhf topic, visit your repo's landing page and select "manage topics."