Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model Paper โข 2603.21986 โข Published 20 days ago โข 123
Running on CPU Upgrade Featured 1.31k Open ASR Leaderboard ๐ 1.31k Explore speech recognition model benchmarks and request new ones
daVinci-Dev: Agent-native Mid-training for Software Engineering Paper โข 2601.18418 โข Published Jan 26 โข 126
RAPO++: Cross-Stage Prompt Optimization for Text-to-Video Generation via Data Alignment and Test-Time Scaling Paper โข 2510.20206 โข Published Oct 23, 2025 โข 12
Running Featured 131 Open VLM Video Leaderboard ๐ 131 VLMEvalKit Eval Results in video understanding benchmark
Running on CPU Upgrade 1.01k Open VLM Leaderboard ๐ 1.01k VLMEvalKit Evaluation Results Collection
Running on CPU Upgrade 13.9k Open LLM Leaderboard ๐ 13.9k Track, rank and evaluate open LLMs and chatbots
view article Article Smol2Operator: Post-Training GUI Agents for Computer Use +3 Sep 23, 2025 โข 138