replicate/vllm-with-loras
A high-throughput and memory-efficient inference and serving engine for LLMs
Stars: 6 · Language: Python