UniQL: Unified Quantization and Low-rank Compression for Adaptive Edge LLMs Paper • 2512.03383 • Published 8 days ago • 3
Quamba2: A Robust and Scalable Post-training Quantization Framework for Selective State Space Models Paper • 2503.22879 • Published Mar 28 • 9
Quamba: A Post-Training Quantization Recipe for Selective State Space Models Paper • 2410.13229 • Published Oct 17, 2024 • 1
Efficient Low-rank Backpropagation for Vision Transformer Adaptation Paper • 2309.15275 • Published Sep 26, 2023 • 1
MobileTL: On-device Transfer Learning with Inverted Residual Blocks Paper • 2212.03246 • Published Dec 5, 2022 • 1