charliezhang

Clockz

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale

Paper • 2602.23866 • Published 11 days ago • 80

upvoted a paper 7 days ago

MMR-Life: Piecing Together Real-life Scenes for Multimodal Multi-image Reasoning

Paper • 2603.02024 • Published 8 days ago • 42

upvoted 2 papers 11 days ago

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Paper • 2602.08354 • Published 29 days ago • 261

Imagination Helps Visual Reasoning, But Not Yet in Latent Space

Paper • 2602.22766 • Published 12 days ago • 39

upvoted a paper 12 days ago

HyTRec: A Hybrid Temporal-Aware Attention Architecture for Long Behavior Sequential Recommendation

Paper • 2602.18283 • Published 18 days ago • 53

upvoted a paper 14 days ago

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

Paper • 2602.05400 • Published Feb 5 • 345

upvoted 3 papers about 1 month ago

Reinforcement Learning via Self-Distillation

Paper • 2601.20802 • Published Jan 28 • 40

ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation

Paper • 2601.21420 • Published Jan 29 • 42

Dancing in Chains: Strategic Persuasion in Academic Rebuttal via Theory of Mind

Paper • 2601.15715 • Published Jan 22 • 14

updated 2 datasets about 1 month ago

Interplay-LM-Reasoning/composition

Viewer • Updated Jan 26 • 129M • 906 • 1

Interplay-LM-Reasoning/context

Viewer • Updated Jan 26 • 33.7M • 29 • 2

upvoted 2 papers 3 months ago

Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

Paper • 2512.19673 • Published Dec 22, 2025 • 64

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 105

liked a model 3 months ago

allenai/Olmo-3.1-7B-RL-Zero-Math

Text Generation • 528k • Updated Jan 5 • 904 • 11

New activity in Interplay-LM-Reasoning/extrapolation_midtrain 3 months ago

Add pipeline tag, GitHub link, and improved model description

#1 opened 3 months ago by nielsr

New activity in Interplay-LM-Reasoning/extrapolation_rl 3 months ago

Improve model card: Add pipeline tag and GitHub link

#1 opened 3 months ago by nielsr

updated 2 models 3 months ago

Interplay-LM-Reasoning/extrapolation_rl

Text Generation • Updated Dec 14, 2025

Interplay-LM-Reasoning/extrapolation_midtrain