HIGGS - a ISTA-DASLab Collection

ISTA-DASLab 's Collections

HIGGS

updated 6 days ago

Models prequantized with [HIGGS](https://arxiv.org/abs/2411.17525) zero-shot quantization. Requires the latest `transformers` to run.

Pushing the Limits of Large Language Model Quantization via the Linearity Theorem

Paper • 2411.17525 • Published Nov 26, 2024 • 5
ISTA-DASLab/Llama-3.3-70B-Instruct-HIGGS-GPTQ-4bit

19B • Updated Dec 12, 2024 • 25 • 7
ISTA-DASLab/Llama-3.1-8B-Instruct-HIGGS-GPTQ-4bit

Text Generation • 3B • Updated Dec 10, 2024 • 1
ISTA-DASLab/Llama-3.1-8B-Instruct-HIGGS-GPTQ-3bit

Text Generation • 2B • Updated Dec 10, 2024 • 1
ISTA-DASLab/Llama-3.1-8B-HIGGS-GPTQ-4bit

Text Generation • 3B • Updated Dec 10, 2024 • 2
ISTA-DASLab/Llama-3.1-8B-HIGGS-GPTQ-3bit

Text Generation • 2B • Updated Dec 10, 2024 • 1
ISTA-DASLab/Llama-3.3-70B-Instruct-HIGGS-4bit

Text Generation • 19B • Updated Dec 9, 2024 • 4 • 3
ISTA-DASLab/Llama-3.3-70B-Instruct-HIGGS-3bit

Text Generation • 15B • Updated Dec 6, 2024
ISTA-DASLab/Llama-3.1-70B-Instruct-HIGGS-4bit

Text Generation • 19B • Updated Dec 6, 2024
ISTA-DASLab/Llama-3.1-70B-Instruct-HIGGS-3bit

Text Generation • 15B • Updated Dec 6, 2024
ISTA-DASLab/Llama-3.1-8B-Instruct-HIGGS-4bit

Text Generation • 3B • Updated Dec 6, 2024 • 3
ISTA-DASLab/Llama-3.1-8B-Instruct-HIGGS-3bit

Text Generation • 2B • Updated Dec 6, 2024 • 1
ISTA-DASLab/Llama-3.1-8B-HIGGS-4bit

Text Generation • 3B • Updated Dec 6, 2024
ISTA-DASLab/Llama-3.1-8B-HIGGS-3bit

Text Generation • 2B • Updated Dec 6, 2024 • 9
ISTA-DASLab/gemma-2-9b-it-HIGGS-4bit

Text Generation • 3B • Updated Dec 6, 2024 • 1 • 1
ISTA-DASLab/gemma-2-9b-it-HIGGS-3bit

Text Generation • 3B • Updated Dec 6, 2024
ISTA-DASLab/gemma-2-9b-HIGGS-4bit

Text Generation • 3B • Updated Dec 6, 2024
ISTA-DASLab/gemma-2-9b-HIGGS-3bit

Text Generation • 3B • Updated Dec 6, 2024 • 1