sherry

rain305

30 8

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

Hide-and-Seek in Trajectories: Discovering Failure Signals for VLA Runtime Monitoring

upvoted a paper 8 days ago

Neglected Free Lunch from Post-training: Progress Advantage for LLM Agents

upvoted a paper 9 days ago

Qwen-Image-Agent: Bridging the Context Gap in Real-World Image Generation

View all activity

Organizations

None yet

upvoted 2 papers 8 days ago

Hide-and-Seek in Trajectories: Discovering Failure Signals for VLA Runtime Monitoring

Paper • 2605.30834 • Published May 29 • 10

Neglected Free Lunch from Post-training: Progress Advantage for LLM Agents

Paper • 2606.26080 • Published 13 days ago • 10

upvoted a paper 9 days ago

Qwen-Image-Agent: Bridging the Context Gap in Real-World Image Generation

Paper • 2606.26907 • Published 12 days ago • 49

upvoted a paper 15 days ago

DF3DV-1K: A Large-Scale Dataset and Benchmark for Distractor-Free Novel View Synthesis

Paper • 2604.13416 • Published 19 days ago • 33

upvoted a paper 23 days ago

Kwai Keye-VL-2.0 Technical Report

Paper • 2606.10651 • Published 28 days ago • 192

upvoted 4 papers 26 days ago

Vision-OPD: Learning to See Fine Details for Multimodal LLMs via On-Policy Self-Distillation

Paper • 2605.18740 • Published May 18 • 5

Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts

Paper • 2606.05922 • Published Jun 4 • 69

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Paper • 2606.11025 • Published 28 days ago • 41

SCAIL-2: Unifying Controlled Character Animation with End-to-end In-Context Conditioning

Paper • 2606.10804 • Published 28 days ago • 53

upvoted a paper 3 months ago

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

Paper • 2604.11626 • Published Apr 13 • 103

liked a model 3 months ago

Skywork/Skywork-UniPic-1.5B

Any-to-Any • Updated Sep 8, 2025 • 19 • 116

upvoted a paper 3 months ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 113

liked a model 3 months ago

jdopensource/JoyAI-Image-Edit

Image-to-Image • Updated May 7 • 193 • • 130

upvoted a paper 3 months ago

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Paper • 2603.25746 • Published Mar 26 • 155

liked a model 4 months ago

CodeGoat24/UniGenBench-EvalModel-qwen-72b-v1

Image-Text-to-Text • 73B • Updated Oct 25, 2025 • 46 • 4

upvoted a paper 4 months ago

Evaluating and Steering Modality Preferences in Multimodal Large Language Model

Paper • 2505.20977 • Published May 27, 2025 • 10

upvoted an article 4 months ago

Article

NEO-unify: Building Native Multimodal Unified Models End to End

sensenova

•

Mar 5

• 167

upvoted 2 papers 4 months ago

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 198

CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation

Paper • 2603.08652 • Published Mar 9 • 41

liked a dataset 4 months ago

LanguageBind/UniWorld-V1

Viewer • Updated Jun 16, 2025 • 7.11k • 1.03k • 26

sherry

AI & ML interests

Recent Activity

Organizations

rain305's activity

NEO-unify: Building Native Multimodal Unified Models End to End