TF-IDF-based document aligner from Bitextor