ezyang/ezbench
Fork of Carlini's yet-another-applied-llm-benchmark for me to accumulate some of my own real-world eval cases
Stars: 5
Language: Python