SgmlReader - Convert (almost) any HTML to valid XML