QwenLM/CodeElo
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings
Stars: 65Language: Python
Give AlbumentationsX a star on GitHub — it powers this leaderboard
Star on GitHubCodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings