SpenseGPT: Practical One-shot Pruning Enabling Sparse and Dense GEMMs for LLM Inference Paper • 2606.10445 • Published 12 days ago