Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Andrew Zhao's picture
3 27 22

Andrew Zhao

andrewzh
Danielrf314's profile picture draftman's profile picture 21world's profile picture
·
https://andrewzh112.github.io/
  • _AndrewZhao
  • Andrewzh112
  • andrewqzhao

AI & ML interests

Reinforcement Learning, Agents

Recent Activity

upvoted a paper about 18 hours ago
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning
authored a paper 7 days ago
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
authored a paper 7 days ago
ExCyTIn-Bench: Evaluating LLM agents on Cyber Threat Investigation
View all activity

Organizations

Tsinghua-LeapLab's profile picture

andrewzh 's collections 1

Absolute Zero Reasoner
  • andrewzh/Absolute_Zero_Reasoner-Coder-7b

    8B • Updated May 5, 2025 • 396 • 20
  • andrewzh/Absolute_Zero_Reasoner-Coder-14b

    15B • Updated May 6, 2025 • 23 • 29
  • andrewzh/Absolute_Zero_Reasoner-Coder-3b

    3B • Updated May 6, 2025 • 12 • 14
  • andrewzh2/Absolute_Zero_Reasoner-Base-14b

    15B • Updated May 6, 2025 • 6 • 10
Absolute Zero Reasoner
  • andrewzh/Absolute_Zero_Reasoner-Coder-7b

    8B • Updated May 5, 2025 • 396 • 20
  • andrewzh/Absolute_Zero_Reasoner-Coder-14b

    15B • Updated May 6, 2025 • 23 • 29
  • andrewzh/Absolute_Zero_Reasoner-Coder-3b

    3B • Updated May 6, 2025 • 12 • 14
  • andrewzh2/Absolute_Zero_Reasoner-Base-14b

    15B • Updated May 6, 2025 • 6 • 10
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs