VladShash/deepseek-math-7B-lean-prover-dpo-300k-mistral-150k-olmo Text Generation • 7B • Updated 7 days ago • 1.84k
VladShash/deepseek-math-full-7B-lean-prover-dpo-mistral Text Generation • 7B • Updated 12 days ago • 1.17k
VladShash/deepseek-math-7B-lean-prover-grpo-olmo-weighed Text Generation • 7B • Updated 21 days ago • 3.89k • 1