SynthLabsAI/ALP_DeepScaleR_1.5B_C16K
Reinforcement Learning
โข
2B
โข
Updated
โข
10
โข
3
Scaling up good synthetic reasoning. Post-training and synthetic data research lab.