Guohui Zhang

zghhui

zghhui

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

upvoted a paper 12 days ago

Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

upvoted a paper about 1 month ago

UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision

View all activity

Organizations

None yet

upvoted 2 papers 12 days ago

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Paper • 2601.22060 • Published 17 days ago • 151

Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

Paper • 2602.02185 • Published 13 days ago • 125

upvoted a paper about 1 month ago

UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision

Paper • 2601.03193 • Published Jan 6 • 47

updated a collection about 2 months ago

MaskFocus

Collection

MaskFocus • 2 items • Updated Dec 21, 2025 • 2

updated 2 models about 2 months ago

zghhui/Meissonic_MaskFocus_HPS

Text-to-Image • Updated Dec 20, 2025 • 4 • 1

zghhui/Meissonic_MaskFocus_GenEval

Text-to-Image • Updated Dec 20, 2025 • 11 • 1

published a model about 2 months ago

zghhui/Meissonic_MaskFocus_HPS

Text-to-Image • Updated Dec 20, 2025 • 4 • 1

updated a model about 2 months ago

zghhui/Janus-Pro-1B-GCPO-HPS

Text-to-Image • 2B • Updated Dec 20, 2025

published a model about 2 months ago

zghhui/Meissonic_MaskFocus_GenEval

Text-to-Image • Updated Dec 20, 2025 • 11 • 1

upvoted 3 papers 4 months ago

IF-VidCap: Can Video Caption Models Follow Instructions?

Paper • 2510.18726 • Published Oct 21, 2025 • 26

MT-Video-Bench: A Holistic Video Understanding Benchmark for Evaluating Multimodal LLMs in Multi-Turn Dialogues

Paper • 2510.17722 • Published Oct 20, 2025 • 20

RLFR: Extending Reinforcement Learning for LLMs with Flow Environment

Paper • 2510.10201 • Published Oct 11, 2025 • 36

updated 4 models 5 months ago

upvoted a collection 5 months ago

GCPO

Collection

6 items • Updated Sep 26, 2025 • 1

updated a collection 5 months ago

GCPO

Collection

6 items • Updated Sep 26, 2025 • 1

published a model 5 months ago

zghhui/LlamaGen-T2I-GCPO

Text-to-Image • 0.8B • Updated Sep 28, 2025 • 1

Guohui Zhang

AI & ML interests

Recent Activity

Organizations

zghhui's activity