Longhui Yu

Longhui98

5 25 34

https://yulonghui.github.io/

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

moonshotai/Kimi-K2.7-Code

liked a model 3 months ago

moonshotai/Kimi-K2.6

new activity 6 months ago

moonshotai/Kimi-K2.5:Context Management Reproducibility | 可复现性 ?

View all activity

Organizations

upvoted a paper 6 months ago

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published Feb 2 • 277

upvoted a paper 8 months ago

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 135

upvoted a paper 12 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 320

upvoted an article about 1 year ago

Article

Introducing smolagents: simple agents that write actions in code.

m-ric, merve, thomwolf

•

Dec 31, 2024

• 1.21k

upvoted a collection about 1 year ago

Kimi-K2

Collection

Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intellegence • 5 items • Updated Jan 27 • 174

upvoted an article about 1 year ago

Article

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

moonshotai

•

Jun 21, 2025

• 77

upvoted 4 papers about 1 year ago

AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset

Paper • 2504.16891 • Published Apr 23, 2025 • 27

upvoted 6 papers over 1 year ago

Kimi-VL Technical Report

Paper • 2504.07491 • Published Apr 10, 2025 • 142

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22, 2025 • 130

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 46

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 87

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 381

SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator

Paper • 2412.12094 • Published Dec 16, 2024 • 11

upvoted a collection almost 2 years ago

Qwen2-Math

Collection

Math-specific model series based on Qwen2 • 7 items • Updated Mar 2 • 53

upvoted a collection about 2 years ago

NuminaMath

Collection

Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 7 items • Updated Feb 10, 2025 • 83

upvoted a paper about 2 years ago

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Paper • 2407.13623 • Published Jul 18, 2024 • 56

upvoted an article about 2 years ago

Article

RegMix: Data Mixture as Regression for Language Model Pre-training

SivilTaram

•

Jul 11, 2024

• 16

Longhui Yu

AI & ML interests

Recent Activity

Organizations

Longhui98's activity

Introducing smolagents: simple agents that write actions in code.

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

RegMix: Data Mixture as Regression for Language Model Pre-training