Andy Xu

andaero

AI & ML interests

Computational Materials Generation | AI4Science | Reinforcement Learning

Recent Activity

liked a Space 8 days ago

LeMaterial/LeMat-GenBench

authored a paper about 2 months ago

PLaID++: A Preference Aligned Language Model for Targeted Inorganic Materials Design

updated a model 4 months ago

HOPE-Lab-HMC/PLaID

upvoted an article 10 months ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Article • Published Feb 7, 2025 • 276

upvoted a paper 11 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18, 2025 • 144

upvoted a paper almost 2 years ago

RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published May 13, 2024 • 71