Give AlbumentationsX a star on GitHub — it powers this leaderboard

QwenLM/vllm-gptq

A high-throughput and memory-efficient inference and serving engine for LLMs

Stars: 140Language: Python

View on GitHub Owner: QwenLM

QwenLM/vllm-gptq - GitHub Repository | PyPI Leaderboard