Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published 9 days ago • 312
DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models Paper • 2603.26164 • Published 21 days ago • 353
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation Paper • 2604.10098 • Published 6 days ago • 74
KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance Paper • 2604.12627 • Published 3 days ago • 94
LightThinker++: From Reasoning Compression to Memory Management Paper • 2604.03679 • Published 13 days ago • 35
OpenWorldLib: A Unified Codebase and Definition of Advanced World Models Paper • 2604.04707 • Published 11 days ago • 200
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression Paper • 2604.04921 • Published 11 days ago • 107
SpikingBrain Technical Report: Spiking Brain-inspired Large Models Paper • 2509.05276 • Published Sep 5, 2025 • 5
SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization Paper • 2604.02268 • Published 15 days ago • 93
ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers Paper • 2603.24414 • Published 23 days ago • 183
Dynin-Omni: Omnimodal Unified Large Diffusion Language Model Paper • 2604.00007 • Published Mar 9 • 19
Data Darwinism Part I: Unlocking the Value of Scientific Data for Pre-training Paper • 2602.07824 • Published Feb 8 • 18
LongCat-Next: Lexicalizing Modalities as Discrete Tokens Paper • 2603.27538 • Published 19 days ago • 143
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published 28 days ago • 337
TAPS: Task Aware Proposal Distributions for Speculative Sampling Paper • 2603.27027 • Published 20 days ago • 142