Generate embeddings for images and text using CLIP with an LLM
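As a rough illustration of what generating CLIP embeddings for images and text can look like, here is a minimal sketch. It assumes the Hugging Face `transformers` library (4.x behavior, where the feature methods return tensors), `torch`, and Pillow are installed. The checkpoint name `openai/clip-vit-base-patch32` and the function names `embed_with_clip` and `cosine_similarity` are illustrative choices, not something the source specifies.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length sequences of floats."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def embed_with_clip(image_paths, texts, model_name="openai/clip-vit-base-patch32"):
    """Return (image_embeddings, text_embeddings) as lists of float lists.

    Assumption: model_name is one public CLIP checkpoint, chosen only for
    illustration; it is not necessarily the model this page uses.
    """
    # Heavy third-party imports stay local so cosine_similarity works without them.
    import torch
    from PIL import Image
    from transformers import CLIPModel, CLIPProcessor

    model = CLIPModel.from_pretrained(model_name)
    processor = CLIPProcessor.from_pretrained(model_name)
    model.eval()

    images = [Image.open(path).convert("RGB") for path in image_paths]
    with torch.no_grad():
        # Images and texts are preprocessed separately, then projected
        # into the shared CLIP embedding space.
        image_inputs = processor(images=images, return_tensors="pt")
        text_inputs = processor(
            text=texts, return_tensors="pt", padding=True, truncation=True
        )
        image_emb = model.get_image_features(pixel_values=image_inputs["pixel_values"])
        text_emb = model.get_text_features(
            input_ids=text_inputs["input_ids"],
            attention_mask=text_inputs["attention_mask"],
        )
    return image_emb.tolist(), text_emb.tolist()
```

Because image and text embeddings share one vector space, a call such as `embed_with_clip(["photo.jpg"], ["a dog", "a cat"])` (the file name is hypothetical) yields vectors that can be compared directly with `cosine_similarity` to rank captions against an image.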