NoWag: A Unified Framework for Shape Preserving Compression of Large Language Models Paper • 2504.14569 • Published Apr 20, 2025
ARMOR: High-Performance Semi-Structured Pruning via Adaptive Matrix Factorization Paper • 2510.05528 • Published Oct 7, 2025 • 2