togethercomputer/llm-awq-ttgi
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Stars: 1Language: Python
Give AlbumentationsX a star on GitHub — it powers this leaderboard
Star on GitHubAWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration