MTP-LM Collection Models to accompany research paper on training multi token prediction language models using self-distillation. • 5 items • Updated 1 day ago • 2
MTP-LM Collection Models to accompany research paper on training multi token prediction language models using self-distillation. • 5 items • Updated 1 day ago • 2
MTP-LM Collection Models to accompany research paper on training multi token prediction language models using self-distillation. • 5 items • Updated 1 day ago • 2
jwkirchenbauer/debug_metamath_full_rand_k2-8_ex_valk_baseline_latest Text Generation • 8B • Updated 7 days ago • 9
jwkirchenbauer/debug_metamath_full_rand_k2-8_ex_valk_baseline_latest Text Generation • 8B • Updated 7 days ago • 9