Hugging Face
3
HanyangWang (Hanyang-W)
AI & ML interests
None yet
Recent Activity
Upvoted a paper 8 days ago: Text2Grad: Reinforcement Learning from Natural Language Feedback
Upvoted a paper 9 days ago: SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning
Upvoted a paper 3 months ago: UFO^3: Weaving the Digital Agent Galaxy
Organizations
None yet
models
4
Hanyang-W/zephyr-7b-cw-dpo-full • Updated Aug 16, 2025
Hanyang-W/zephyr-7b-dpo-full • Text Generation • 266k • Updated Aug 8, 2025 • 2
Hanyang-W/SmolLM3-DPO • Updated Aug 7, 2025
Hanyang-W/llama3.1-8b-instruct-dpo-full • Text Generation • 175k • Updated Aug 6, 2025 • 3
datasets
0
None public yet