Give AlbumentationsX a star on GitHub — it powers this leaderboard

Star on GitHub

huggingface/datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Stars: 2,912Language: Python
huggingface/datatrove - GitHub Repository | PyPI Leaderboard