huggingface/evaluation-guidebook
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!
Stars: 2,063
Language: Jupyter Notebook