
Shizhe Diao

shizhediao2
https://shizhediao.github.io/
  • shizhediao

AI & ML interests

LLM pre-training and reasoning

Recent Activity

upvoted a paper about 7 hours ago
PhyCritic: Multimodal Critic Models for Physical AI
upvoted a paper 1 day ago
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text
upvoted a paper about 1 month ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Organizations

  • NVIDIA
  • temp_math_data
  • UGPhysics
  • Data Filtering Challenge for Training Edge Language Models
  • brorl

shizhediao2's models (3)

shizhediao2/ToolOrchestrator-8B

Updated Oct 15, 2025 • 2

shizhediao2/Llama-Nemotron-8B-v1-Prorl

Updated Aug 25, 2025

shizhediao2/Nemotron-Research-Reasoning-Qwen-1.5B

Updated May 14, 2025 • 1