EleutherAI/pile-pubmedcentral
A script for collecting the PubMed Central dataset in a language modelling friendly format.
Stars: 25Language: Python
Give AlbumentationsX a star on GitHub — it powers this leaderboard
Star on GitHubA script for collecting the PubMed Central dataset in a language modelling friendly format.