Give AlbumentationsX a star on GitHub — it powers this leaderboard

Star on GitHub

EleutherAI/pile-pubmedcentral

A script for collecting the PubMed Central dataset in a language modelling friendly format.

Stars: 25Language: Python
EleutherAI/pile-pubmedcentral - GitHub Repository | PyPI Leaderboard