Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
HanyangWang's picture
3

HanyangWang

Hanyang-W

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago
Text2Grad: Reinforcement Learning from Natural Language Feedback
upvoted a paper 9 days ago
SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning
upvoted a paper 3 months ago
UFO^3: Weaving the Digital Agent Galaxy
View all activity

Organizations

None yet

models 4

Hanyang-W/zephyr-7b-cw-dpo-full

Updated Aug 16, 2025

Hanyang-W/zephyr-7b-dpo-full

Text Generation • 266k • Updated Aug 8, 2025 • 2

Hanyang-W/SmolLM3-DPO

Updated Aug 7, 2025

Hanyang-W/llama3.1-8b-instruct-dpo-full

Text Generation • 175k • Updated Aug 6, 2025 • 3

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs