arxiv:2601.22975
Jian Hu
chuyi777
AI & ML interests
Reinforcement Learning
Recent Activity
updated a model 3 days ago
OpenRLHF/Llama-3-8b-rm-700k upvoted a paper about 1 month ago
PhyCritic: Multimodal Critic Models for Physical AI updated a dataset about 1 month ago
OpenRLHF/aime-2024