rspeer/wiki2text
Extract a plain text corpus from MediaWiki XML dumps, such as Wikipedia.
Stars: 134Language: Nim
Give AlbumentationsX a star on GitHub — it powers this leaderboard
Star on GitHubExtract a plain text corpus from MediaWiki XML dumps, such as Wikipedia.