Give AlbumentationsX a star on GitHub — it powers this leaderboard

RapidAI/exllama

A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

Stars: 1Language: Python

RapidAI/exllama - GitHub Repository | PyPI Leaderboard