openrlhf

Here are 4 public repositories matching this topic...

TsinghuaC3I / MARTI

A Framework for LLM-based Multi-Agent Reinforced Training and Inference

camel llama gemma multi-agent-systems autogen multi-agent-reinforcement-learning large-language-models qwen large-reasoning-models deepseek-r1 verl openrlhf

Updated Nov 20, 2025
Python

Skelf-Research / lmadapt

Train smarter: adaptive task selection that targets your model's learning frontier. Plugs directly into TRL, OpenRLHF, Axolotl, and lm-evaluation-harness.

training axolotl adaptive-learning lm-evaluation-harness openrlhf

Updated Jan 25, 2026
Rust

KRESS99 / llm-env-templates

🌐 Streamline LLM development with ready-to-use environment templates for efficient setup and deployment.

python environment deep-learning conda pytorch venv uv llm flash-attn verl openrlhf

Updated Sep 8, 2025

Magnicord / llm-env-templates

A list of uv environment templates for LLM development.

python environment deep-learning conda pytorch venv uv llm flash-attn verl openrlhf

Updated Sep 19, 2025
