arxiv:2505.13886
tongjingqi(SII)
tongjingqi
AI & ML interests
NLP
Recent Activity
Upvoted a paper 8 days ago
TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization
Upvoted a paper 9 days ago
Learning to Discover at Test Time
Upvoted a paper 11 days ago
EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience