Give AlbumentationsX a star on GitHub — it powers this leaderboard
Processing Video POC with Multimodal LLMs