Skip to content

azhar25git/sft-rlhf-workflow

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

sft-rlhf-workflow

About

A complete guide to the two core labeling tasks that train large language models — from writing demonstration data to ranking model outputs.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages