Huining Yuan

HuiningYuan

HuiningYuan

AI & ML interests

Reinforcement learning, LLM Agents, World models

Recent Activity

updated a model 2 days ago

nics-efc/MARSHAL-Mini-Hanabi-Qwen3-4B

updated a model 2 days ago

nics-efc/MARSHAL-Kuhn-Poker-Qwen3-4B

updated a model 2 days ago

nics-efc/MARSHAL-Tic-Tac-Toe-Qwen3-4B

View all activity

Organizations

updated 5 models 2 days ago

upvoted a paper 2 days ago

RLinf-USER: A Unified and Extensible System for Real-World Online Policy Learning in Embodied AI

Paper • 2602.07837 • Published 5 days ago • 51

upvoted a paper 4 days ago

RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation

Paper • 2509.15965 • Published Sep 19, 2025 • 18

upvoted a paper 8 days ago

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning

Paper • 2602.04634 • Published 9 days ago • 92

upvoted a collection 2 months ago

MARSHAL

Collection

MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs 🎉 Accepted by ICLR 2026 • 6 items • Updated 2 days ago • 2

upvoted a paper 2 months ago

DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle

Paper • 2512.04324 • Published Dec 3, 2025 • 154

updated a collection 2 months ago

MARSHAL

Collection

MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs 🎉 Accepted by ICLR 2026 • 6 items • Updated 2 days ago • 2

updated a collection 3 months ago

MARSHAL

Collection

MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs 🎉 Accepted by ICLR 2026 • 6 items • Updated 2 days ago • 2

published 4 models 3 months ago

nics-efc/MARSHAL-Mini-Hanabi-Qwen3-4B

Text Generation • 4B • Updated 2 days ago • 5

nics-efc/MARSHAL-Kuhn-Poker-Qwen3-4B

Text Generation • 4B • Updated 2 days ago • 7

nics-efc/MARSHAL-Tic-Tac-Toe-Qwen3-4B

Text Generation • 4B • Updated 2 days ago • 8

nics-efc/MARSHAL-Generalist-Qwen3-8B

Text Generation • 8B • Updated 2 days ago • 9

Huining Yuan

AI & ML interests

Recent Activity

Organizations

HuiningYuan's activity