- Open-Reasoner-Zero/Open-Reasoner-Zero-32B
  Reinforcement Learning • Updated • 103 • 33
- Open-Reasoner-Zero/Open-Reasoner-Zero-7B
  Reinforcement Learning • 8B • Updated • 2.45k • 33
- Open-Reasoner-Zero/Open-Reasoner-Zero-1.5B
  Reinforcement Learning • 2B • Updated • 182 • 1
- Open-Reasoner-Zero/Open-Reasoner-Zero-0.5B
  Reinforcement Learning • 0.5B • Updated • 37
AI & ML interests
Scale up the Reasoner-Zero Training