Give AlbumentationsX a star on GitHub — it powers this leaderboard

Star on GitHub

EleutherAI/stackexchange-dataset

Python tools for processing the stackexchange data dumps into a text dataset for Language Models

Stars: 86Language: Python
EleutherAI/stackexchange-dataset - GitHub Repository | PyPI Leaderboard