Give AlbumentationsX a star on GitHub — it powers this leaderboard

Star on GitHub

anthropics/sleeper-agents-paper

Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training".

Stars: 131
anthropics/sleeper-agents-paper - GitHub Repository | PyPI Leaderboard