Reinforcing Few-step Generators via Reward-Tilted Distribution Matching Paper • 2605.26108 • Published 3 days ago • 3 • 3
RTDMD Collection Reinforcing Few-step Generators via Reward-Tilted Distribution Matching • 3 items • Updated 1 day ago • 2
GenRL Collection Model collections trained with our framework: https://github.com/ModelTC/GenRL • 3 items • Updated Feb 11 • 3
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models Paper • 2602.03392 • Published Feb 3 • 59
Light Forcing: Accelerating Autoregressive Video Diffusion via Sparse Attention Paper • 2602.04789 • Published Feb 4 • 3
LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit Paper • 2405.06001 • Published May 9, 2024
Focus-dLLM: Accelerating Long-Context Diffusion LLM Inference via Confidence-Guided Context Focusing Paper • 2602.02159 • Published Feb 2 • 2