JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation Paper • 2512.22905 • Published 28 days ago • 19
OralGPT-Omni: A Versatile Dental Multimodal Large Language Model Paper • 2511.22055 • Published Nov 27, 2025 • 8
OralGPT-Omni: A Versatile Dental Multimodal Large Language Model Paper • 2511.22055 • Published Nov 27, 2025 • 8
OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe Paper • 2511.16334 • Published Nov 20, 2025 • 93
Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds Paper • 2511.08892 • Published Nov 12, 2025 • 208
The Station: An Open-World Environment for AI-Driven Discovery Paper • 2511.06309 • Published Nov 9, 2025 • 37
Towards Universal Video Retrieval: Generalizing Video Embedding via Synthesized Multimodal Pyramid Curriculum Paper • 2510.27571 • Published Oct 31, 2025 • 18
Foundation Models for Scientific Discovery: From Paradigm Enhancement to Paradigm Transition Paper • 2510.15280 • Published Oct 17, 2025 • 15
ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall Paper • 2510.07896 • Published Oct 9, 2025 • 2
ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall Paper • 2510.07896 • Published Oct 9, 2025 • 2