Ariel Kwiatkowski
RedTachyon
AI & ML interests
RL, MARL, Crowd Simulation
Recent Activity
upvoted
a
paper
2 days ago
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability
upvoted
a
paper
4 months ago
Soft Tokens, Hard Truths
upvoted
a
paper
12 months ago
PILAF: Optimal Human Preference Sampling for Reward Modeling