In a Training Loop 🔄

13 21 41

Honglin Guo

KYLN24

KYLN24

AI & ML interests

None yet

Recent Activity

reacted to danieldk's post with 🔥 3 minutes ago

kernels 0.12 is out! 🎉 Changes: * Support for kernel version branches to gracefully roll out kernel API changes. * Support for PyTorch 2.10. * kernel-builder is now merged into the kernels repo. * Initial support for standardized kernel benchmarks. https://github.com/huggingface/kernels/releases/tag/v0.12.0

upvoted a paper 3 days ago

TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization

authored a paper 9 days ago

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

View all activity

Organizations

authored a paper 9 days ago

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

Paper • 2601.11077 • Published 13 days ago • 63

authored a paper 13 days ago

OctoBench: Benchmarking Scaffold-Aware Instruction Following in Repository-Grounded Agentic Coding

Paper • 2601.10343 • Published 14 days ago

authored a paper about 1 month ago

Memory in the Age of AI Agents

Paper • 2512.13564 • Published Dec 15, 2025 • 146

authored 4 papers about 2 months ago

Better Process Supervision with Bi-directional Rewarding Signals

Paper • 2503.04618 • Published Mar 6, 2025

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Paper • 2509.08755 • Published Sep 10, 2025 • 57

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21, 2025 • 84

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

Paper • 2512.04987 • Published Dec 4, 2025 • 80

authored 2 papers 5 months ago

Pre-Trained Policy Discriminators are General Reward Models

Paper • 2507.05197 • Published Jul 7, 2025 • 39

BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset

Paper • 2507.03483 • Published Jul 4, 2025 • 24

authored 3 papers 11 months ago

DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting

Paper • 2503.00784 • Published Mar 2, 2025 • 13

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

Paper • 2402.05808 • Published Feb 8, 2024

CritiQ: Mining Data Quality Criteria from Human Preferences

Paper • 2502.19279 • Published Feb 26, 2025 • 10

authored a paper over 1 year ago

AgentGym: Evolving Large Language Model-based Agents across Diverse Environments

Paper • 2406.04151 • Published Jun 6, 2024 • 24

authored a paper almost 2 years ago

Code Needs Comments: Enhancing Code LLMs with Comment Augmentation

Paper • 2402.13013 • Published Feb 20, 2024 • 1

authored a paper about 2 years ago

CoLLiE: Collaborative Training of Large Language Models in an Efficient Way

Paper • 2312.00407 • Published Dec 1, 2023 • 3

Honglin Guo

AI & ML interests

Recent Activity

Organizations

KYLN24's activity