Open-Sourced model and data for ULTRAIF: Advancing Instruction Following from the Wild.
li sheng
bambisheng
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 month ago
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe upvoted a paper about 2 months ago
Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models updated a dataset about 2 months ago
dynn-datasets/Evaluation