From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models Paper • 2604.09459 • Published 6 days ago • 12
NVIDIA Ising Collection NVIDIA Ising is a new Model Family to enable building useful Quantum Computers with AI. • 4 items • Updated 4 days ago • 18
Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music Paper • 2604.10905 • Published 6 days ago • 26
QuanBench+: A Unified Multi-Framework Benchmark for LLM-Based Quantum Code Generation Paper • 2604.08570 • Published 25 days ago • 122
Pseudo-Unification: Entropy Probing Reveals Divergent Information Patterns in Unified Multimodal Models Paper • 2604.10949 • Published 6 days ago • 39
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation Paper • 2604.10098 • Published 8 days ago • 74
Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning Paper • 2604.04746 • Published 11 days ago • 70
OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation Paper • 2604.11804 • Published 6 days ago • 69
Qualixar OS: A Universal Operating System for AI Agent Orchestration Paper • 2604.06392 • Published 12 days ago • 16
Automating Database-Native Function Code Synthesis with LLMs Paper • 2604.06231 • Published 17 days ago • 17
MolmoWeb: Open Visual Web Agent and Open Data for the Open Web Paper • 2604.08516 • Published 10 days ago • 41
Structured Causal Video Reasoning via Multi-Objective Alignment Paper • 2604.04415 • Published 13 days ago • 10
OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks Paper • 2604.08539 • Published 10 days ago • 48
view article Article How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs 12 days ago • 56
AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents Paper • 2603.27490 • Published 21 days ago • 17
WildDet3D: Scaling Promptable 3D Detection in the Wild Paper • 2604.08626 • Published 10 days ago • 238
FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling Paper • 2604.06916 • Published 11 days ago • 34