Shizhe Diao's picture

Shizhe Diao

shizhediao2

·

https://shizhediao.github.io/

AI & ML interests

LLM pre-training and reasoning

Recent Activity

upvoted a paper about 7 hours ago

PhyCritic: Multimodal Critic Models for Physical AI

upvoted a paper 1 day ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

upvoted a paper about 1 month ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

View all activity

Organizations

shizhediao2 's models 3

shizhediao2/ToolOrchestrator-8B

Updated Oct 15, 2025 • 2

shizhediao2/Llama-Nemotron-8B-v1-Prorl

Updated Aug 25, 2025

shizhediao2/Nemotron-Research-Reasoning-Qwen-1.5B

Updated May 14, 2025 • 1