CyberAgentAILab/regularized-bon
Code of "Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment" (2025).
Stars: 14Language: Python
Give AlbumentationsX a star on GitHub — it powers this leaderboard
Star on GitHubCode of "Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment" (2025).