DCAgent/neulab-code-feedback-sandboxes_glm_4.7_traces_jupiter Viewer • Updated about 6 hours ago • 9.93k
DCAgent/rl__32GPU_shaped_entropy__mix_v2_h2_language_balanced__GLM-4_7-swesmith-san__20-0 Viewer • Updated about 11 hours ago • 1.02k
DCAgent/rl__32GPU_shaped_entropy__mix_v2_baseline_uniform__GLM-4_7-swesmith-san__20-0 Viewer • Updated about 12 hours ago • 979
DCAgent/neulab-agenttuning-os-sandboxes_glm_4.7_traces_jupiter Viewer • Updated about 12 hours ago • 10k
DCAgent/neulab-agenttuning-kg-sandboxes_glm_4.7_traces_jupiter Viewer • Updated about 14 hours ago • 10.4k
DCAgent/rl__32GPU_shaped_entropy__mix_v2_h4_dense_rewards_hard__GLM-4_7-swesmith-san__20-0 Viewer • Updated about 15 hours ago • 896
DCAgent/rl__32GPU_shaped_entropy__mix_v2_h1_struggle_zone__GLM-4_7-swesmith-san__20-0 Viewer • Updated about 16 hours ago • 880
DCAgent/neulab-agenttuning-mind2web-sandboxes_glm_4.7_traces_jupiter Viewer • Updated about 16 hours ago • 10k
DCAgent/exp_rpt_stack-php-large_10k_glm_4.7_traces_jupiter Viewer • Updated about 22 hours ago • 13.1k
DCAgent/neulab-agenttuning-db-sandboxes_glm_4.7_traces_jupiter Viewer • Updated about 22 hours ago • 10k
DCAgent/rl__32GPU_shaped_entropy__mix_v2_h2_language_balanced__GLM-4_7-swesmith-san Viewer • Updated 1 day ago • 852 • 6
DCAgent/rl__32GPU_shaped_entropy__mix_v2_h1_struggle_zone__GLM-4_7-swesmith-san Viewer • Updated 1 day ago • 831 • 8
DCAgent/neulab-agenttuning-alfworld-sandboxes_glm_4.7_traces_jupiter Viewer • Updated 1 day ago • 11.1k • 3
DCAgent/rl__32GPU_shaped_entropy__mix_v2_baseline_uniform__GLM-4_7-swesmith-san Viewer • Updated 1 day ago • 1.03k • 6
DCAgent/rl__32GPU_shaped_entropy__mix_v2_h4_dense_rewards_hard__GLM-4_7-swesmith-san Viewer • Updated 1 day ago • 879 • 3
DCAgent/rl__32GPU_shaped_entropy__mix_v2_h2_language_proportional__GLM-4_7-swesmith-san Viewer • Updated 1 day ago • 859 • 3
DCAgent/rl__24GPU_shaped_entropy__mix_v2_baseline_uniform__qwen3base-GLM-4_7-sw Viewer • Updated 1 day ago • 720 • 7
DCAgent/rl__24GPU_shaped_entropy__mix_v2_h4_dense_rewards_hard__qwen3base-GLM-4_7-sw Viewer • Updated 1 day ago • 850 • 3
DCAgent/rl__24GPU_shaped_entropy__mix_v2_h2_language_proportional__qwen3base-GLM-4_7-sw Viewer • Updated 1 day ago • 762 • 5
DCAgent/rl__24GPU_shaped_entropy__mix_v2_h2_language_balanced__qwen3base-GLM-4_7-sw Viewer • Updated 1 day ago • 764 • 4
DCAgent/rl__24GPU_shaped_entropy__mix_v2_h1_struggle_zone__qwen3base-GLM-4_7-sw Viewer • Updated 1 day ago • 801 • 3
DCAgent/stackexchange-superuser-sandboxes_glm_4.7_traces_jupiter Viewer • Updated 1 day ago • 10.1k • 2
DCAgent/exp_rpt_stack-pytest-synthetic-gpt5nano_glm_4.7_traces_jupiter Viewer • Updated 1 day ago • 10.4k • 2
DCAgent/exp_rpt_stack-pytest-withtests_glm_4.7_traces_jupiter Viewer • Updated 2 days ago • 12.6k • 3