sherry's picture

sherry

rain305

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 5 hours ago

Enhancing Spatial Understanding in Image Generation via Reward Modeling

upvoted a paper 2 days ago

UniG2U-Bench: Do Unified Models Advance Multimodal Understanding?

upvoted a paper 2 days ago

Beyond Language Modeling: An Exploration of Multimodal Pretraining

View all activity

Organizations

None yet

upvoted a paper about 5 hours ago

Enhancing Spatial Understanding in Image Generation via Reward Modeling

Paper • 2602.24233 • Published 8 days ago • 48

upvoted 3 papers 2 days ago

UniG2U-Bench: Do Unified Models Advance Multimodal Understanding?

Paper • 2603.03241 • Published 4 days ago • 79

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Paper • 2603.03276 • Published 4 days ago • 71

UniT: Unified Multimodal Chain-of-Thought Test-time Scaling

Paper • 2602.12279 • Published 23 days ago • 20

upvoted a paper 10 days ago

CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation

Paper • 2601.10061 • Published Jan 15 • 31

upvoted a paper 11 days ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published 12 days ago • 508

upvoted an article 27 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

+1

Jan 28, 2025

•

888

upvoted a paper 5 months ago

Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense

Paper • 2510.07242 • Published Oct 8, 2025 • 30

upvoted a collection 5 months ago

Qwen3

84 items • Updated Dec 31, 2025 • 1.71k

upvoted 2 papers 5 months ago

LUMINA: Detecting Hallucinations in RAG System with Context-Knowledge Signals

Paper • 2509.21875 • Published Sep 26, 2025 • 10

Understanding Language Prior of LVLMs by Contrasting Chain-of-Embedding

Paper • 2509.23050 • Published Sep 27, 2025 • 15