Performance Optimization - a srygaard-mila Collection

srygaard-mila 's Collections

Performance Optimization

Performance Optimization

updated about 8 hours ago

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Paper • 2602.24286 • Published Feb 27 • 98
Astra: A Multi-Agent System for GPU Kernel Performance Optimization

Paper • 2509.07506 • Published Sep 9, 2025
Understanding the Challenges in Iterative Generative Optimization with LLMs

Paper • 2603.23994 • Published 28 days ago • 28
FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling

Paper • 2604.06916 • Published 14 days ago • 34
Efficient Memory Management for Large Language Model Serving with PagedAttention

Paper • 2309.06180 • Published Sep 12, 2023 • 53