Learning from the Self-future: On-policy Self-distillation for dLLMs Paper • 2606.18195 • Published 3 days ago • 67
TRIAGE: Dialectical Reasoning for Explainable Risk Prediction on Irregularly Sampled Medical Time Series with LLMs Paper • 2606.09030 • Published 11 days ago • 27
OPD-Evolver: Cultivating Holistic Agent Evolver via On-Policy Distillation Paper • 2606.17628 • Published 3 days ago • 25
GameCraft-Bench: Can Agents Build Playable Games End-to-End in a Real Game Engine? Paper • 2606.17861 • Published 3 days ago • 41
LoopCoder-v2: Only Loop Once for Efficient Test-Time Computation Scaling Paper • 2606.18023 • Published 3 days ago • 130
Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients Paper • 2606.18216 • Published 3 days ago • 48
DavidAU/Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking-NEO-CODE-Di-IMatrix-MAX-GGUF Image-Text-to-Text • 39B • Updated 7 days ago • 529k • 388
Article Party is over: regularizing ColBERT models to fix efficient ANN methods lightonai • 2 days ago • 19
FastContext: Training Efficient Repository Explorer for Coding Agents Paper • 2606.14066 • Published 7 days ago • 81
VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models Paper • 2606.16140 • Published 4 days ago • 91
Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories Paper • 2606.11176 • Published 10 days ago • 112
Memory is Reconstructed, Not Retrieved: Graph Memory for LLM Agents Paper • 2606.06036 • Published 15 days ago • 66
usermma/Huihui-LFM2.5-8B-A1B-abliterated-mlx-8Bit Text Generation • 8B • Updated 15 days ago • 301 • 2
usermma/Huihui-LFM2.5-8B-A1B-abliterated-mlx-6Bit Text Generation • 8B • Updated 15 days ago • 83 • 1
usermma/Huihui-LFM2.5-8B-A1B-abliterated-mlx-fp16 Text Generation • 8B • Updated 15 days ago • 71 • 1
usermma/Huihui-LFM2.5-8B-A1B-abliterated-mlx-4Bit Text Generation • 1B • Updated 15 days ago • 127 • 1