2 38

Lani Ko

lanikoworld

https://ko-lani.github.io/

AI & ML interests

generative models, video diffusion models, world models

Recent Activity

upvoted a paper about 3 hours ago

Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

upvoted a paper about 3 hours ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

upvoted a paper 3 days ago

VGenST-Bench: A Benchmark for Spatio-Temporal Reasoning via Active Video Synthesis

View all activity

Organizations

upvoted 2 papers about 3 hours ago

Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

Paper • 2605.28816 • Published 1 day ago • 43

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published 2 days ago • 104

upvoted a paper 3 days ago

VGenST-Bench: A Benchmark for Spatio-Temporal Reasoning via Active Video Synthesis

Paper • 2605.22570 • Published 7 days ago • 23

upvoted 3 papers about 2 months ago

Generative World Renderer

Paper • 2604.02329 • Published Apr 2 • 102

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 342

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Paper • 2603.25716 • Published Mar 26 • 156

upvoted a collection 2 months ago

Gemma 3 Release

Collection

28 items • Updated Mar 12 • 638

upvoted a paper 2 months ago

Repurposing Geometric Foundation Models for Multi-view Diffusion

Paper • 2603.22275 • Published Mar 23 • 48

submitted a paper to Daily Papers 2 months ago

2Xplat: Two Experts Are Better Than One Generalist

Paper • 2603.21064 • Published Mar 22 • 25

upvoted 5 papers 2 months ago

2Xplat: Two Experts Are Better Than One Generalist

Paper • 2603.21064 • Published Mar 22 • 25

PEARL: Personalized Streaming Video Understanding Model

Paper • 2603.20422 • Published Mar 20 • 40

Omni-WorldBench: Towards a Comprehensive Interaction-Centric Evaluation for World Models

Paper • 2603.22212 • Published Mar 23 • 126

Group3D: MLLM-Driven Semantic Grouping for Open-Vocabulary 3D Object Detection

Paper • 2603.21944 • Published Mar 23 • 26

Versatile Editing of Video Content, Actions, and Dynamics without Training

Paper • 2603.17989 • Published Mar 18 • 18

authored 4 papers 2 months ago

upvoted 2 papers 2 months ago

HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning

Paper • 2603.17024 • Published Mar 17 • 110

Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models

Paper • 2603.17051 • Published Mar 17 • 109

Lani Ko

AI & ML interests

Recent Activity

Organizations

lanikoworld's activity