WebGuard: Building a Generalizable Guardrail for Web Agents Paper • 2507.14293 • Published Jul 18, 2025 • 1
Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization Paper • 2602.23008 • Published 6 days ago • 34
World Models with Hints of Large Language Models for Goal Achieving Paper • 2406.07381 • Published Jun 11, 2024 • 1
ADG: Ambient Diffusion-Guided Dataset Recovery for Corruption-Robust Offline Reinforcement Learning Paper • 2505.23871 • Published May 29, 2025 • 1
Multi-Agent Coordination via Multi-Level Communication Paper • 2209.12713 • Published Sep 26, 2022 • 2
When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning Paper • 2602.10560 • Published 22 days ago • 29
SEAD: Self-Evolving Agent for Multi-Turn Service Dialogue Paper • 2602.03548 • Published 29 days ago • 4
Reasoning-Enhanced Large Language Models for Molecular Property Prediction Paper • 2510.10248 • Published Oct 11, 2025 • 2
MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning Paper • 2601.21468 • Published Jan 29 • 25
Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents Paper • 2509.23040 • Published Sep 27, 2025 • 12
TRIP-Bench: A Benchmark for Long-Horizon Interactive Agents in Real-World Scenarios Paper • 2602.01675 • Published about 1 month ago • 9
MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook Paper • 2509.14142 • Published Sep 17, 2025 • 10