Agentic Memory: Learning Unified Long-Term and Short-Term Memory Management for Large Language Model Agents Paper • 2601.01885 • Published 7 days ago • 1
Atlas: Orchestrating Heterogeneous Models and Tools for Multi-Domain Complex Reasoning Paper • 2601.03872 • Published 5 days ago • 37
From Entropy to Epiplexity: Rethinking Information for Computationally Bounded Intelligence Paper • 2601.03220 • Published 6 days ago • 1
From Failure to Mastery: Generating Hard Samples for Tool-use Agents Paper • 2601.01498 • Published 8 days ago • 2
MAGMA: A Multi-Graph based Agentic Memory Architecture for AI Agents Paper • 2601.03236 • Published 6 days ago • 2
MemRL: Self-Evolving Agents via Runtime Reinforcement Learning on Episodic Memory Paper • 2601.03192 • Published 6 days ago • 1
Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning Paper • 2512.15687 • Published 26 days ago • 18
GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators Paper • 2512.19682 • Published 21 days ago • 15
Reinforcement Learning for Self-Improving Agent with Skill Library Paper • 2512.17102 • Published 24 days ago • 32
view article Article Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance Dec 9, 2025 • 82
Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning Paper • 2512.20605 • Published 20 days ago • 59
Live-SWE-agent: Can Software Engineering Agents Self-Evolve on the Fly? Paper • 2511.13646 • Published Nov 17, 2025 • 8
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling Paper • 2511.11793 • Published Nov 14, 2025 • 171
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning Paper • 2511.14460 • Published Nov 18, 2025 • 20
Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning Paper • 2511.16043 • Published Nov 20, 2025 • 108
O-Mem: Omni Memory System for Personalized, Long Horizon, Self-Evolving Agents Paper • 2511.13593 • Published Nov 17, 2025 • 25
The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation Paper • 2511.20256 • Published Nov 25, 2025 • 27