
zuijiang

AI & ML interests

None yet

Organizations

Recent Activity

upvoted a paper about 23 hours ago

Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding

Paper • 2605.29707 • Published 7 days ago • 133

upvoted a paper 6 days ago

LiteCoder-Terminal: Scaling Long-Horizon Terminal Environments for Learning Language Agents

Paper • 2605.29559 • Published 7 days ago • 15

upvoted a paper 9 days ago

MetaphorVU: Towards Metaphorical Video Understanding

Paper • 2605.25461 • Published 10 days ago • 8

upvoted a paper 13 days ago

MolmoAct2: Action Reasoning Models for Real-world Deployment

Paper • 2605.02881 • Published May 4 • 348

upvoted 2 papers 15 days ago

GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment

Paper • 2605.19577 • Published 16 days ago • 58

Draft Less, Retrieve More: Hybrid Tree Construction for Speculative Decoding

Paper • 2605.20104 • Published 16 days ago • 7

upvoted a paper 20 days ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published 22 days ago • 159

upvoted a paper 23 days ago

RigidFormer: Learning Rigid Dynamics using Transformers

Paper • 2605.09196 • Published 26 days ago • 14

upvoted a paper about 1 month ago

Large Language Models Explore by Latent Distilling

Paper • 2604.24927 • Published Apr 27 • 74

upvoted an article about 2 months ago

Releasing LiteCoder-Terminal-SFT

Article • Lite-Coder • Apr 13 • 4

upvoted a paper about 2 months ago

RAGEN-2: Reasoning Collapse in Agentic RL

Paper • 2604.06268 • Published Apr 7 • 67

liked a model 2 months ago

Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled

Image-Text-to-Text • 28B • Updated Apr 6 • 163k • 2.87k

upvoted 2 papers 2 months ago

Composer 2 Technical Report

Paper • 2603.24477 • Published Mar 25 • 18

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Paper • 2603.24472 • Published Mar 25 • 57

upvoted 4 papers 3 months ago

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published Mar 19 • 69

upvoted 2 papers 4 months ago

CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs

Paper • 2602.03048 • Published Feb 3 • 32

Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning

Paper • 2602.01058 • Published Feb 1 • 45
