mzio/aprm-sft_thinkact-Eact_prm_tw_coin_easy_sp-Gnobandit_aprm_qwen3_ap-S0-R1-train_eval-b039 Viewer • Updated about 5 hours ago • 1.22k
mzio/aprm-sft_thinkact-Eact_prm_tw_coin_easy_sp-Gaprm_qwen3_ap-S42-R0-train_eval-b019 Viewer • Updated about 7 hours ago • 1.22k
mzio/aprm-sft_thinkact-Eact_prm_tw_coin_hard_sp-Gaprm_qwen3_ap-S42-Rlr1e4-train_eval-b019 Viewer • Updated about 7 hours ago • 1.22k
mzio/aprm-sft_thinkact-Eact_prm_tw_coin_easy_sp-Gnobandit_aprm_qwen3_ap-S0-R1-train_eval-b029 Viewer • Updated about 11 hours ago • 1.22k
mzio/aprm-sft_thinkact-Eaprm_tw_coin_easy_sp-Gaprm_qw3_ap-S42-Rmt128_reg_coin_easy-train_eval-b0 Viewer • Updated about 13 hours ago • 1.22k • 10
mzio/aprm-sft_thinkact-Eaprm_tw_coin_medium_sp-Gaprm_qw3_ap-S42-Rmt128_reg_coin_medium_g4-train_ Viewer • Updated about 3 hours ago • 1.22k • 3
mzio/aprm-sft_thinkact-Eact_prm_tw_coin_easy_sp-Gaprm_qwen3_ap-S42-R0-train_eval-b009 Viewer • Updated about 14 hours ago • 1.22k • 3
mzio/aprm-sft_thinkact-Eaprm_tw_treasure_medium_sp-Gaprm_qw3_ap-S42-Rmt128_reg_treasure_medium-t Updated about 2 hours ago • 13
mzio/aprm-sft_thinkact-Eaprm_tw_treasure_hard_sp-Gaprm_qw3_ap-S42-Rmt128_reg_treasure_hard-train Updated about 1 hour ago • 3
mzio/aprm-sft_thinkact-Eact_prm_tw_coin_easy_sp-Gaprm_qwen3_ap-S0-R1-train_eval-b029 Viewer • Updated about 15 hours ago • 1.22k • 3
mzio/aprm-sft_thinkact-Eact_prm_tw_coin_hard_sp-Gaprm_qwen3_ap-S42-Rlr1e4-train_eval-b009 Viewer • Updated about 15 hours ago • 1.22k • 9
mzio/aprm-snorkelai_agent_finance_reasoning-aligned-v2 Viewer • Updated about 16 hours ago • 1.16k • 4