CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation Paper • 2602.24286 • Published Feb 27, 2026 • 98
Astra: A Multi-Agent System for GPU Kernel Performance Optimization Paper • 2509.07506 • Published Sep 9, 2025
Understanding the Challenges in Iterative Generative Optimization with LLMs Paper • 2603.23994 • Published 28 days ago • 28
FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling Paper • 2604.06916 • Published 14 days ago • 34
Efficient Memory Management for Large Language Model Serving with PagedAttention Paper • 2309.06180 • Published Sep 12, 2023 • 53