WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation Paper • 2605.25874 • Published 2 days ago • 90
LARY: A Latent Action Representation Yielding Benchmark for Generalizable Vision-to-Action Alignment Paper • 2604.11689 • Published Apr 13 • 21
LongCat-Next: Lexicalizing Modalities as Discrete Tokens Paper • 2603.27538 • Published Mar 29 • 147
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning Paper • 2603.21065 • Published Mar 22 • 78
Scaling Embeddings Outperforms Scaling Experts in Language Models Paper • 2601.21204 • Published Jan 29 • 104
meituan-longcat/LongCat-Flash-Thinking-ZigZag Text Generation • 562B • Updated Feb 2 • 45 • 32
meituan-longcat/LongCat-Flash-Thinking-2601-FP8 Text Generation • 562B • Updated Jan 23 • 53 • 13
meituan-longcat/LongCat-Flash-Thinking-2601 Text Generation • 562B • Updated Jan 23 • 12.4k • 113