DCAgent/neulab-code-feedback-sandboxes_glm_4.7_traces_jupiter • Updated about 7 hours ago • 9.93k
DCAgent/rl__32GPU_shaped_entropy__mix_v2_h2_language_balanced__GLM-4_7-swesmith-san__20-0 • Updated about 12 hours ago • 1.02k
DCAgent/rl__32GPU_shaped_entropy__mix_v2_baseline_uniform__GLM-4_7-swesmith-san__20-0 • Updated about 13 hours ago • 979
DCAgent/neulab-agenttuning-os-sandboxes_glm_4.7_traces_jupiter • Updated about 13 hours ago • 10k
DCAgent/neulab-agenttuning-kg-sandboxes_glm_4.7_traces_jupiter • Updated about 15 hours ago • 10.4k
DCAgent/rl__32GPU_shaped_entropy__mix_v2_h4_dense_rewards_hard__GLM-4_7-swesmith-san__20-0 • Updated about 16 hours ago • 896
DCAgent/rl__32GPU_shaped_entropy__mix_v2_h1_struggle_zone__GLM-4_7-swesmith-san__20-0 • Updated about 17 hours ago • 880
DCAgent/neulab-agenttuning-mind2web-sandboxes_glm_4.7_traces_jupiter • Updated about 17 hours ago • 10k
DCAgent/exp_rpt_stack-php-large_10k_glm_4.7_traces_jupiter • Updated about 23 hours ago • 13.1k