ISTA-DASLab/Llama-3.2-1B-AQLM-PV-2Bit-2x8
Text Generation • 0.5B
WUSH: Near-Optimal Adaptive Transforms for LLM Quantization
CAGE: Curvature-Aware Gradient Estimation For Accurate Quantization-Aware Training