Longxu Dou

dreamerdeo

https://longxudou.github.io/

AI & ML interests

Natural Language Processing

Recent Activity

liked a model 16 days ago

miromind-ai/MiroThinker-1.7-mini

liked a model 16 days ago

miromind-ai/MiroThinker-1.7

upvoted a collection 16 days ago

MiroThinker-1.7

View all activity

Organizations

liked 2 models 16 days ago

miromind-ai/MiroThinker-1.7-mini

Text Generation • 31B • Updated 7 days ago • 1.72k • 85

miromind-ai/MiroThinker-1.7

Text Generation • 235B • Updated 7 days ago • 3.63k • 121

upvoted a collection 16 days ago

MiroThinker-1.7

Collection

2 items • Updated 16 days ago • 52

upvoted a paper about 1 month ago

On Data Engineering for Scaling LLM Terminal Capabilities

Paper • 2602.21193 • Published about 1 month ago • 101

liked a dataset about 1 month ago

zai-org/terminal-bench-2-verified

Updated 28 days ago • 2.72k • 63

upvoted a paper about 1 month ago

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

Paper • 2602.17684 • Published Feb 4 • 22

upvoted a paper about 2 months ago

Rethinking the Trust Region in LLM Reinforcement Learning

Paper • 2602.04879 • Published Feb 4 • 37

liked a dataset 4 months ago

Danau5tin/terminal-tasks

Viewer • Updated Sep 12, 2025 • 331 • 11 • 7

upvoted a paper 4 months ago

LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

Paper • 2307.13269 • Published Jul 25, 2023 • 34

authored 2 papers 4 months ago

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5, 2025 • 132

Training Optimal Large Diffusion Language Models

Paper • 2510.03280 • Published Sep 28, 2025

upvoted 3 papers 6 months ago

upvoted a collection 6 months ago

cwm

Collection

Collection for Code World Model, an agentic coding model from FAIR. • 3 items • Updated Sep 24, 2025 • 18

upvoted a paper 9 months ago

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published Jun 25, 2025 • 47

updated a Space 9 months ago

README

💻

upvoted 3 papers 10 months ago

Reinforcing General Reasoning without Verifiers

Paper • 2505.21493 • Published May 27, 2025 • 26

Fostering Video Reasoning via Next-Event Prediction

Paper • 2505.22457 • Published May 28, 2025 • 29

Lifelong Safety Alignment for Language Models

Paper • 2505.20259 • Published May 26, 2025 • 24

Longxu Dou

AI & ML interests

Recent Activity

Organizations

dreamerdeo's activity

README