shijie xia
seven-cat
AI & ML interests
LLMs
Recent Activity
upvoted
a
paper
about 11 hours ago
AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts
upvoted
a
paper
11 days ago
One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling