Seonil Son
sonsus
AI & ML interests
LLM alignment and evals.
Recent Activity
upvoted a paper about 1 hour ago
How Much Heavy Lifting Can an Agent Harness Do?: Measuring the LLM's Residual Role in a Planning Agent
upvoted a paper about 1 hour ago
Becoming Experienced Judges: Selective Test-Time Learning for Evaluators
upvoted a paper about 1 hour ago
V-Agent: An Interactive Video Search System Using Vision-Language Models
Organizations
None yet