snowflakedb/ArcticInference
ArcticInference: vLLM plugin for high-throughput, low-latency inference
Stars: 403
Language: Python