openai/SWELancer-Benchmark

This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?"

Stars: 1,439