Richard Zhuang PRO

RZ412

https://richardzhuang0412.github.io

AI & ML interests

LLM Routing, LLM + Games, Post-Training, Agents

Recent Activity

updated a dataset about 11 hours ago

DCAgent3/terminal_bench_2_gptlong_continue_nemotron_terminal_step5400__Qwen3_32B_20260518_002122

published a dataset about 11 hours ago

DCAgent3/terminal_bench_2_gptlong_continue_nemotron_terminal_step5400__Qwen3_32B_20260518_002122

updated a dataset about 12 hours ago

DCAgent3/terminal_bench_2_sft__g1_gptlong_top8_train_on_obs_fa3__40_0__Qwen3_32B_20260518_002839

Organizations

New activity in laion/exp_tas_optimal_combined_traces-Qwen3.5-9B about 2 months ago

Add preprocessor_config.json from Qwen/Qwen3.5-9B base model

#2 opened about 2 months ago by RZ412

Upload preprocessor_config.json

#1 opened about 2 months ago by RZ412

New activity in open-r1/README about 1 year ago

[Experiment] Training R1-Zero-like models with Open R1

#20 opened about 1 year ago by lewtun

New activity in huggingface/HuggingDiscussions over 1 year ago

[FEEDBACK] Daily Papers

#32 opened almost 2 years ago by kramp

New activity in RZ412/PokerBench over 1 year ago

Fix formatting

#4 opened over 1 year ago by nielsr

Add task category, paper and code links

#3 opened over 1 year ago by nielsr

add minimal metadata

#2 opened over 1 year ago by davanstrien
