Revisiting Multimodal Positional Encoding in Vision-Language Models Paper โข 2510.23095 โข Published Oct 27, 2025 โข 22
RefHCM: A Unified Model for Referring Perceptions in Human-Centric Scenarios Paper โข 2412.14643 โข Published Dec 19, 2024