RealRestorer: Towards Generalizable Real-World Image Restoration with Large-Scale Image Editing Models Paper • 2603.25502 • Published 4 days ago • 48
Calibri: Enhancing Diffusion Transformers via Parameter-Efficient Calibration Paper • 2603.24800 • Published 4 days ago • 55
PixelSmile: Toward Fine-Grained Facial Expression Editing Paper • 2603.25728 • Published 4 days ago • 113
Introducing Cohere-transcribe: state-of-the-art speech recognition Article • 4 days ago • 26
Omnilingual MT: Machine Translation for 1,600 Languages Paper • 2603.16309 • Published 13 days ago • 20
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild Paper • 2603.17187 • Published 12 days ago • 134
ProactiveBench: Benchmarking Proactiveness in Multimodal Large Language Models Paper • 2603.19466 • Published 10 days ago • 41
Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models Paper • 2603.17051 • Published 13 days ago • 106
HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning Paper • 2603.17024 • Published 13 days ago • 106
Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model Paper • 2603.21986 • Published 7 days ago • 118
IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse Paper • 2603.12201 • Published 18 days ago • 52
Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training Paper • 2603.12255 • Published 18 days ago • 90
LLM2Vec-Gen: Generative Embeddings from Large Language Models Paper • 2603.10913 • Published 19 days ago • 43
Flash-KMeans: Fast and Memory-Efficient Exact K-Means Paper • 2603.09229 • Published 20 days ago • 81