NoWag: A Unified Framework for Shape Preserving Compression of Large Language Models Paper • 2504.14569 • Published Apr 20
ARMOR: High-Performance Semi-Structured Pruning via Adaptive Matrix Factorization Paper • 2510.05528 • Published Oct 7 • 2