Min-Hung Chen

cmhungsteve

https://minhungchen.netlify.app/

AI & ML interests

Multimodal AI, Transfer Learning, Unsupervised Learning, Video Understanding, Vision Transformer, Computer Vision, Deep Learning

Recent Activity

upvoted a paper about 1 hour ago

TIPO: Text to Image with Text Presampling for Prompt Optimization

upvoted a collection about 1 hour ago

TIPO

upvoted a paper about 21 hours ago

Expected Harm: Rethinking Safety Evaluation of (Mis)Aligned LLMs

View all activity

Organizations

upvoted a paper about 1 hour ago

TIPO: Text to Image with Text Presampling for Prompt Optimization

Paper • 2411.08127 • Published Nov 12, 2024 • 4

upvoted a collection about 1 hour ago

TIPO

Collection

Text to Image with text presampling for Prompt Optimization • 6 items • Updated Jan 22, 2025 • 6

upvoted a paper about 21 hours ago

Expected Harm: Rethinking Safety Evaluation of (Mis)Aligned LLMs

Paper • 2602.01600 • Published 2 days ago • 18

upvoted a paper about 23 hours ago

PaperBanana: Automating Academic Illustration for AI Scientists

Paper • 2601.23265 • Published 4 days ago • 62

upvoted 5 papers 20 days ago

GenRecal: Generation after Recalibration from Large to Small Vision-Language Models

Paper • 2506.15681 • Published Jun 18, 2025 • 41

Quantile Rendering: Efficiently Embedding High-dimensional Feature on 3D Gaussian Splatting

Paper • 2512.20927 • Published Dec 24, 2025 • 16

OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding

Paper • 2601.09575 • Published 20 days ago • 25

Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning

Paper • 2601.09708 • Published 20 days ago • 51

3AM: Segment Anything with Geometric Consistency in Videos

Paper • 2601.08831 • Published 21 days ago • 34

upvoted a paper 26 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published 26 days ago • 218

upvoted an article 29 days ago

Article

NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI

29 days ago

•

upvoted 8 papers about 1 month ago

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2512.20848 • Published Dec 23, 2025 • 37

NVIDIA Nemotron 3: Efficient and Open Intelligence

Paper • 2512.20856 • Published Dec 24, 2025 • 36

FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos

Paper • 2512.10927 • Published Dec 11, 2025 • 6

Generative Refocusing: Flexible Defocus Control from a Single Image

Paper • 2512.16923 • Published Dec 18, 2025 • 39

4D-RGPT: Toward Region-level 4D Understanding via Perceptual Distillation

Paper • 2512.17012 • Published Dec 18, 2025 • 46

upvoted a paper about 2 months ago

Zoom-Zero: Reinforced Coarse-to-Fine Video Understanding via Temporal Zoom-in

Paper • 2512.14273 • Published Dec 16, 2025 • 9

Min-Hung Chen

AI & ML interests

Recent Activity

Organizations

cmhungsteve's activity

NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI