ads
sxcasf
AI & ML interests
None yet
Recent Activity
upvoted a paper about 16 hours ago
Lightning OPD: Efficient Post-Training for Large Reasoning Models with Offline On-Policy Distillation commentedon a paper about 16 hours ago
Lightning OPD: Efficient Post-Training for Large Reasoning Models with Offline On-Policy Distillation upvoted a paper 15 days ago
A Survey of On-Policy Distillation for Large Language Models