dmlls/HarmBench
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Stars: 0
Language: Jupyter Notebook