Auto-formalized versions of GSM8K and MATH500 auto-formalized and filtered with Goedel models
Ujan PRO
Ujan
·
AI & ML interests
NLP, Speech
Recent Activity
updated a dataset about 5 hours ago
Ujan/math500_formal_eval_Falcon-H1R-7B_prover published a dataset about 5 hours ago
Ujan/math500_formal_eval_Falcon-H1R-7B_prover updated a dataset about 6 hours ago
Ujan/math500_formal_eval_Qwen3.5-9B_prover_judgeOrganizations
Formal v1
Auto-formalized versions of GSM8K with the state-of-the-art Goedel-Prover-V2 and filtered using Deepseek-Prover-V2
-
Ujan/gsm8k_formal_goedel_few_shot_filtered_Goedel-Prover-V2-32B
Viewer • Updated • 1.18k • 33 -
Ujan/gsm8k_formal_goedel_zero_shot_filtered_DeepSeek-Prover-V2-7B
Viewer • Updated • 1.15k • 34 -
Ujan/gsm8k_formal_goedel_zero_shot_filtered_Goedel-Prover-V2-32B
Viewer • Updated • 1.23k • 8 -
Ujan/gsm8k_formal_goedel_few_shot
Viewer • Updated • 1.3k • 6
Formal v2
Auto-formalized versions of GSM8K and MATH500 auto-formalized and filtered with Goedel models
Formal v1
Auto-formalized versions of GSM8K with the state-of-the-art Goedel-Prover-V2 and filtered using Deepseek-Prover-V2
-
Ujan/gsm8k_formal_goedel_few_shot_filtered_Goedel-Prover-V2-32B
Viewer • Updated • 1.18k • 33 -
Ujan/gsm8k_formal_goedel_zero_shot_filtered_DeepSeek-Prover-V2-7B
Viewer • Updated • 1.15k • 34 -
Ujan/gsm8k_formal_goedel_zero_shot_filtered_Goedel-Prover-V2-32B
Viewer • Updated • 1.23k • 8 -
Ujan/gsm8k_formal_goedel_few_shot
Viewer • Updated • 1.3k • 6
models 8
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_10000_seq_2048_epoch_1
Text Generation • 4B • Updated • 3 •
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_50000_seq_16384_epoch_1
Text Generation • 4B • Updated • 2
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_50000_seq_8192_epoch_1
Text Generation • 4B • Updated • 5
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_50000_seq_4096_epoch_1
Text Generation • 4B • Updated • 4
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_10000_seq_16384_epoch_1
Text Generation • 4B • Updated • 2
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_10000_seq_4096_epoch_1
Text Generation • 4B • Updated • 3 •
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_10000_seq_8192_epoch_1
Text Generation • 4B • Updated • 3
Ujan/whisper-small_moe_k_means
Automatic Speech Recognition • Updated • 6
datasets 85
Ujan/math500_formal_eval_Falcon-H1R-7B_prover
Viewer • Updated • 104
Ujan/math500_formal_eval_Qwen3.5-9B_prover_judge
Viewer • Updated • 137
Ujan/math500_formal_eval_Qwen3-4B-Thinking-2507_prover_judge
Viewer • Updated • 109
Ujan/math500_formal_eval_Qwen3-8B_prover_judge
Viewer • Updated • 88
Ujan/math500_formal_eval_Olmo-3-7B-Think_prover_judge
Viewer • Updated • 47
Ujan/math500_formal_eval_Falcon-H1R-7B
Viewer • Updated • 118
Ujan/math500_formal_eval_Ministral-3-8B-Reasoning-2512_prover_judge
Viewer • Updated • 7
Ujan/math500_formal_eval_NVIDIA-Nemotron-Nano-12B-v2_prover
Viewer • Updated • 63 • 3
Ujan/math500_formal_eval_Olmo-3-7B-Think_prover
Viewer • Updated • 56 • 3
Ujan/math500_formal_eval_Ministral-3-8B-Reasoning-2512_prover
Viewer • Updated • 7 • 2