388 521

Yu li

Yukkkop

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

rubricreward/R3-Qwen3-4B-14k

liked a model 1 day ago

MegaScience/Qwen3-4B-MegaScience

liked a model 1 day ago

TIGER-Lab/General-Reasoner-Qwen3-4B

View all activity

Organizations

None yet

liked 3 models 1 day ago

upvoted a paper 5 days ago

GSQ: Highly-Accurate Low-Precision Scalar Quantization for LLMs via Gumbel-Softmax Sampling

Paper • 2604.18556 • Published Apr 20 • 7

liked a Space 8 days ago

Nanbeige 4.1 3B

🔮

Chat with Nanbeige AI locally in your browser

upvoted 8 papers 12 days ago

Nonsense Helps: Prompt Space Perturbation Broadens Reasoning Exploration

Paper • 2605.05566 • Published 16 days ago • 37

Echoes as Anchors: Probabilistic Costs and Attention Refocusing in LLM Reasoning

Paper • 2602.06600 • Published Feb 6 • 3

PowerInfer-2: Fast Large Language Model Inference on a Smartphone

Paper • 2406.06282 • Published Jun 10, 2024 • 40

ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning

Paper • 2605.00380 • Published 22 days ago • 7

EMO: Pretraining Mixture of Experts for Emergent Modularity

Paper • 2605.06663 • Published 16 days ago • 12

Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published 16 days ago • 78

Introspective Diffusion Language Models

Paper • 2604.11035 • Published Apr 13 • 25

Evaluating the Progression of Large Language Model Capabilities for Small-Molecule Drug Design

Paper • 2604.16279 • Published Apr 17 • 1

liked 2 models 14 days ago

oumoumad/ltx-2.3-dearchive-lora

Video-to-Video • Updated 14 days ago • 34

lablab-ai-amd-developer-hackathon/CyberSecQwen-4B

Text Generation • 4B • Updated 15 days ago • 736 • 11

upvoted 2 articles 14 days ago

Article

CyberSecQwen-4B: Why Defensive Cyber Needs Small, Specialized, Locally-Runnable Models

lablab-ai-amd-developer-hackathon

•

14 days ago

• 8

Article

EMO: Pretraining mixture of experts for emergent modularity

allenai

•

15 days ago

• 38

upvoted 3 papers 17 days ago

Generative Video Motion Editing with 3D Point Tracks

Paper • 2512.02015 • Published Dec 1, 2025 • 4

ÜberWeb: Insights from Multilingual Curation for a 20-Trillion-Token Dataset

Paper • 2602.15210 • Published Feb 25 • 1

Kakugo: Distillation of Low-Resource Languages into Small Language Models

Paper • 2601.14051 • Published Jan 20 • 1

Yu li

AI & ML interests

Recent Activity

Organizations

Yukkkop's activity

Nanbeige 4.1 3B

CyberSecQwen-4B: Why Defensive Cyber Needs Small, Specialized, Locally-Runnable Models

EMO: Pretraining mixture of experts for emergent modularity