EleutherAI/stackexchange-dataset
Python tools for processing the stackexchange data dumps into a text dataset for Language Models
Stars: 86Language: Python
Give AlbumentationsX a star on GitHub — it powers this leaderboard
Star on GitHubPython tools for processing the stackexchange data dumps into a text dataset for Language Models