From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models Paper • 2604.09459 • Published 4 days ago • 12
NVIDIA Ising Collection NVIDIA Ising is a new Model Family to enable building useful Quantum Computers with AI. • 4 items • Updated 2 days ago • 16
Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music Paper • 2604.10905 • Published 4 days ago • 26
QuanBench+: A Unified Multi-Framework Benchmark for LLM-Based Quantum Code Generation Paper • 2604.08570 • Published 23 days ago • 121
Pseudo-Unification: Entropy Probing Reveals Divergent Information Patterns in Unified Multimodal Models Paper • 2604.10949 • Published 4 days ago • 38
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation Paper • 2604.10098 • Published 6 days ago • 74
Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning Paper • 2604.04746 • Published 9 days ago • 70
OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation Paper • 2604.11804 • Published 4 days ago • 68
Qualixar OS: A Universal Operating System for AI Agent Orchestration Paper • 2604.06392 • Published 10 days ago • 16
Automating Database-Native Function Code Synthesis with LLMs Paper • 2604.06231 • Published 15 days ago • 17
MolmoWeb: Open Visual Web Agent and Open Data for the Open Web Paper • 2604.08516 • Published 8 days ago • 41
Structured Causal Video Reasoning via Multi-Objective Alignment Paper • 2604.04415 • Published 11 days ago • 10
OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks Paper • 2604.08539 • Published 8 days ago • 48
view article Article How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs 10 days ago • 53
AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents Paper • 2603.27490 • Published 19 days ago • 17
WildDet3D: Scaling Promptable 3D Detection in the Wild Paper • 2604.08626 • Published 8 days ago • 236
FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling Paper • 2604.06916 • Published 9 days ago • 34