Pushing the Limits of Large Language Model Quantization via the Linearity Theorem
Paper: arXiv:2411.17525
Models prequantized with the [HIGGS](https://arxiv.org/abs/2411.17525) zero-shot quantization method. Running them requires the latest `transformers` release.
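As a rough sketch of how a prequantized checkpoint might be loaded: the code below assumes the checkpoints are causal language models, that the quantization settings are stored in each checkpoint's config (so none are passed explicitly), and that `accelerate` is installed for `device_map="auto"`. The helper names `installed_transformers_version`, `load_prequantized`, and `generate` are hypothetical, and the repository id is a placeholder to be replaced with a model from this collection. Extra GPU kernel packages may also be needed; check the `transformers` quantization documentation.

```python
from importlib import metadata


def installed_transformers_version():
    """Return the installed transformers version string, or None if it is missing."""
    try:
        return metadata.version("transformers")
    except metadata.PackageNotFoundError:
        return None


def load_prequantized(repo_id, device_map="auto"):
    """Load a prequantized checkpoint and its tokenizer (hypothetical helper).

    Assumption: the checkpoint is a causal LM and carries its own
    quantization config, so no quantization arguments are passed here.
    """
    # Imported lazily so this file can be imported without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    model = AutoModelForCausalLM.from_pretrained(repo_id, device_map=device_map)
    return model, tokenizer


def generate(model, tokenizer, prompt, max_new_tokens=32):
    """Generate a short completion for `prompt` and return it as text."""
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


if __name__ == "__main__":
    # Upgrade with `pip install -U transformers` if this prints None or an old version.
    print("transformers version:", installed_transformers_version())
    # Example call (placeholder id, replace with a model from this collection):
    # model, tokenizer = load_prequantized("<repo-id-from-this-collection>")
    # print(generate(model, tokenizer, "Hello, my name is"))
```

Keeping the `transformers` import inside `load_prequantized` lets the version check run even on a machine where the library is not yet installed.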