Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
39
36
30
Shizhe Diao
shizhediao2
Follow
cmhungsteve's profile picture
sukabluat's profile picture
dark-pen's profile picture
18 followers
·
13 following
https://shizhediao.github.io/
shizhediao
shizhediao
shizhediao
AI & ML interests
LLM pre-training and reasoning
Recent Activity
upvoted
a
paper
about 7 hours ago
PhyCritic: Multimodal Critic Models for Physical AI
upvoted
a
paper
1 day ago
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text
upvoted
a
paper
about 1 month ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
View all activity
Organizations
shizhediao2
's models
3
Sort:Â Recently updated
shizhediao2/ToolOrchestrator-8B
Updated
Oct 15, 2025
•
2
shizhediao2/Llama-Nemotron-8B-v1-Prorl
Updated
Aug 25, 2025
shizhediao2/Nemotron-Research-Reasoning-Qwen-1.5B
Updated
May 14, 2025
•
1