N-GRPO: Embedding-Level Neighbor Mixing for Enhanced Policy Optimization Paper • 2606.10768 • Published 4 days ago • 18
Breaking Entropy Bounds: Accelerating RL Training via MTP with Rejection Sampling Paper • 2606.12370 • Published 3 days ago • 19
Redesign Mixture-of-Experts Routers with Manifold Power Iteration Paper • 2606.12397 • Published 3 days ago • 80
LoopUS: Recasting Pretrained LLMs into Looped Latent Refinement Models Paper • 2605.11011 • Published May 10 • 10
On Subquadratic Architectures: From Applications to Principles Paper • 2606.12364 • Published 2 days ago • 22
Article Fine-tune FLUX.2 [klein] with a LoRA under 60 minutes black-forest-labs • 8 days ago • 18
Article Thousand Token Wood: shipping a multi-agent economy on a 3B model build-small-hackathon • 7 days ago • 5
Socratic-SWE: Self-Evolving Coding Agents via Trace-Derived Agent Skills Paper • 2606.07412 • Published 8 days ago • 12
Your UnEmbedding Matrix is Secretly a Feature Lens for Text Embeddings Paper • 2606.07502 • Published 8 days ago • 90
SoCRATES: Towards Reliable Automated Evaluation of Proactive LLM Mediation across Domains and Socio-cognitive Variations Paper • 2606.05563 • Published 9 days ago • 49
LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents Paper • 2606.06087 • Published 9 days ago • 60