Give AlbumentationsX a star on GitHub — it powers this leaderboard

Star on GitHub

QwenLM/vllm-gptq

A high-throughput and memory-efficient inference and serving engine for LLMs

Stars: 140Language: Python
QwenLM/vllm-gptq - GitHub Repository | PyPI Leaderboard