20 611 640

Chmielewski

Eryk-Chmielewski

AI & ML interests

None yet

Recent Activity

liked a model 19 minutes ago

cerebras/GLM-4.7-REAP-218B-A32B

liked a model 20 minutes ago

BAAI/RoboBrain2.5-8B-MT

liked a model 20 minutes ago

BAAI/RoboBrain2.5-8B-NV

View all activity

Organizations

upvoted a paper 2 days ago

Evolving Programmatic Skill Networks

Paper • 2601.03509 • Published 5 days ago • 71

upvoted 7 papers 3 days ago

Agentic Memory: Learning Unified Long-Term and Short-Term Memory Management for Large Language Model Agents

Paper • 2601.01885 • Published 7 days ago • 1

End-to-End Test-Time Training for Long Context

Paper • 2512.23675 • Published 14 days ago • 16

Atlas: Orchestrating Heterogeneous Models and Tools for Multi-Domain Complex Reasoning

Paper • 2601.03872 • Published 5 days ago • 37

From Entropy to Epiplexity: Rethinking Information for Computationally Bounded Intelligence

Paper • 2601.03220 • Published 6 days ago • 1

From Failure to Mastery: Generating Hard Samples for Tool-use Agents

Paper • 2601.01498 • Published 8 days ago • 2

MAGMA: A Multi-Graph based Agentic Memory Architecture for AI Agents

Paper • 2601.03236 • Published 6 days ago • 2

MemRL: Self-Evolving Agents via Runtime Reinforcement Learning on Episodic Memory

Paper • 2601.03192 • Published 6 days ago • 1

upvoted a collection 5 days ago

DFlash

Collection

Block Diffusion for Flash Speculative Decoding • 2 items • Updated 7 days ago • 11

upvoted a paper 7 days ago

Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning

Paper • 2512.15687 • Published 26 days ago • 18

upvoted 2 papers 15 days ago

GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators

Paper • 2512.19682 • Published 21 days ago • 15

Reinforcement Learning for Self-Improving Agent with Skill Library

Paper • 2512.17102 • Published 24 days ago • 32

upvoted an article 15 days ago

Article

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

Dec 9, 2025

•

upvoted a paper 16 days ago

Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

Paper • 2512.20605 • Published 20 days ago • 59

upvoted 6 papers 22 days ago

Live-SWE-agent: Can Software Engineering Agents Self-Evolve on the Fly?

Paper • 2511.13646 • Published Nov 17, 2025 • 8

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published Nov 14, 2025 • 171

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Paper • 2511.14460 • Published Nov 18, 2025 • 20

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published Nov 20, 2025 • 108

O-Mem: Omni Memory System for Personalized, Long Horizon, Self-Evolving Agents

Paper • 2511.13593 • Published Nov 17, 2025 • 25

The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation

Paper • 2511.20256 • Published Nov 25, 2025 • 27

Chmielewski

AI & ML interests

Recent Activity

Organizations

Eryk-Chmielewski's activity

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance