arxiv:2409.00492
Michael Goin
mgoin
AI & ML interests
LLM inference optimization, compression, quantization, pruning, distillation
Recent Activity
updated a model about 4 hours ago: mgoin/GLM-5.2-speculator.dspark-testing
published a model about 4 hours ago: mgoin/GLM-5.2-speculator.dspark-testing
updated a model 1 day ago: mgoin/Qwen3-8B-speculator.dspark-reasoning