Richard Zhuang PRO
RZ412
AI & ML interests
LLM Routing, LLM + Games, Post-Training, Agents
Recent Activity
updated a dataset about 11 hours ago
DCAgent3/terminal_bench_2_gptlong_continue_nemotron_terminal_step5400__Qwen3_32B_20260518_002122
published a dataset about 11 hours ago
DCAgent3/terminal_bench_2_gptlong_continue_nemotron_terminal_step5400__Qwen3_32B_20260518_002122
updated a dataset about 12 hours ago
DCAgent3/terminal_bench_2_sft__g1_gptlong_top8_train_on_obs_fa3__40_0__Qwen3_32B_20260518_002839