Presenting a Paper is an Art: Self-Improvement Aesthetic Agents for Academic Presentations Paper ⢠2510.05571 ⢠Published Oct 7, 2025 ⢠15
Exploring Vision Language Models for Facial Attribute Recognition: Emotion, Race, Gender, and Age Paper ⢠2410.24148 ⢠Published Oct 31, 2024 ⢠3
Aesthetic Alignment Risks Assimilation: How Image Generation and Reward Models Reinforce Beauty Bias and Ideological "Censorship" Paper ⢠2512.11883 ⢠Published Dec 9, 2025 ⢠7
view article Article Unified Models for Image Understanding and Generation: Understanding Cutting-Edge Model Architectures Sep 15, 2025 ⢠3
TIIF-Bench: How Does Your T2I Model Follow Your Instructions? Paper ⢠2506.02161 ⢠Published Jun 2, 2025 ⢠13
TiDAR: Think in Diffusion, Talk in Autoregression Paper ⢠2511.08923 ⢠Published Nov 12, 2025 ⢠124
Diffusion Language Models are Super Data Learners Paper ⢠2511.03276 ⢠Published Nov 5, 2025 ⢠129
VSF: Simple, Efficient, and Effective Negative Guidance in Few-Step Image Generation Models By Value Sign Flip Paper ⢠2508.10931 ⢠Published Aug 11, 2025 ⢠1
Position: The Pitfalls of Over-Alignment: Overly Caution Health-Related Responses From LLMs are Unethical and Dangerous Paper ⢠2509.08833 ⢠Published Aug 27, 2025 ⢠1
CamSAM2: Segment Anything Accurately in Camouflaged Videos Paper ⢠2503.19730 ⢠Published Mar 25, 2025 ⢠1
NegVSR: Augmenting Negatives for Generalized Noise Modeling in Real-World Video Super-Resolution Paper ⢠2305.14669 ⢠Published May 24, 2023 ⢠2
ZS-VCOS: Zero-Shot Video Camouflaged Object Segmentation By Optical Flow and Open Vocabulary Object Detection Paper ⢠2505.01431 ⢠Published Apr 10, 2025 ⢠1
LangGas: Introducing Language in Selective Zero-Shot Background Subtraction for Semi-Transparent Gas Leak Detection with a New Dataset Paper ⢠2503.02910 ⢠Published Mar 4, 2025 ⢠1