Gemini Embedding 2: A Native Multimodal Embedding Model from Gemini Paper • 2605.27295 • Published 6 days ago • 19
WorldKV: Efficient World Memory with World Retrieval and Compression Paper • 2605.22718 • Published 11 days ago • 41
PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion Paper • 2605.23902 • Published 10 days ago • 44
HINT-SD: Targeted Hindsight Self-Distillation for Long-Horizon Agents Paper • 2605.17873 • Published 14 days ago • 11
GenRecon: Bridging Generative Priors for Multi-View 3D Scene Reconstruction Paper • 2605.23888 • Published 10 days ago • 13
Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models Paper • 2605.21573 • Published 12 days ago • 106
Rethinking Cross-Layer Information Routing in Diffusion Transformers Paper • 2605.20708 • Published 12 days ago • 109
view article Article EMO: Pretraining mixture of experts for emergent modularity allenai • 23 days ago • 38
view article Article OlmoEarth v1.1: A more efficient family of Earth observation models allenai • 12 days ago • 20
view article Article Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models nvidia • 9 days ago • 27
Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps Paper • 2605.16928 • Published 16 days ago • 93
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published 12 days ago • 204