Rui Sun's picture

Rui Sun PRO

ThreeSR

·

https://threesr.github.io/

AI & ML interests

Vision and Language Multimodal Learning, CV, NLP, LLM

Recent Activity

updated a collection 13 days ago

updated a collection 14 days ago

upvoted a paper 14 days ago

Image Generators are Generalist Vision Learners

View all activity

Organizations

updated a collection 13 days ago

New Papers

104 items • Updated 13 days ago • 1

updated a collection 14 days ago

New Papers

104 items • Updated 13 days ago • 1

upvoted a paper 14 days ago

Image Generators are Generalist Vision Learners

Paper • 2604.20329 • Published 23 days ago • 20

updated a collection 14 days ago

New Papers

104 items • Updated 13 days ago • 1

updated a collection 15 days ago

New Papers

104 items • Updated 13 days ago • 1

upvoted 2 papers 15 days ago

Co-Director: Agentic Generative Video Storytelling

Paper • 2604.24842 • Published 18 days ago • 16

AutoResearchBench: Benchmarking AI Agents on Complex Scientific Literature Discovery

Paper • 2604.25256 • Published 17 days ago • 29

updated a collection 19 days ago

New Papers

104 items • Updated 13 days ago • 1

updated a collection 21 days ago

New Papers

104 items • Updated 13 days ago • 1

upvoted a paper 22 days ago

Mind DeepResearch Technical Report

Paper • 2604.14518 • Published 28 days ago • 23

upvoted a paper 26 days ago

DR^{3}-Eval: Towards Realistic and Reproducible Deep Research Evaluation

Paper • 2604.14683 • Published 29 days ago • 36

updated a collection about 1 month ago

New Papers

104 items • Updated 13 days ago • 1

upvoted 7 papers about 1 month ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 245

OpenSpatial: A Principled Data Engine for Empowering Spatial Intelligence

Paper • 2604.07296 • Published Apr 8 • 40

MolmoWeb: Open Visual Web Agent and Open Data for the Open Web

Paper • 2604.08516 • Published Apr 9 • 43

Act Wisely: Cultivating Meta-Cognitive Tool Use in Agentic Multimodal Models

Paper • 2604.08545 • Published Apr 9 • 41

LPM 1.0: Video-based Character Performance Model

Paper • 2604.07823 • Published Apr 9 • 78

Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering

Paper • 2604.08224 • Published Apr 9 • 51

KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation

Paper • 2604.08455 • Published Apr 9 • 47