Seonil Son

sonsus

AI & ML interests

LLM alignment and evals.

Recent Activity

upvoted a paper about 1 hour ago

How Much Heavy Lifting Can an Agent Harness Do?: Measuring the LLM's Residual Role in a Planning Agent

upvoted a paper about 1 hour ago

Becoming Experienced Judges: Selective Test-Time Learning for Evaluators

upvoted a paper about 1 hour ago

V-Agent: An Interactive Video Search System Using Vision-Language Models

Organizations

None yet

sonsus's datasets

None public yet