HumanSignal/RLHF
A collection of links, tutorials, and best practices for collecting data and building an end-to-end RLHF system to fine-tune generative AI models.
Stars: 225
Language: Jupyter Notebook