OmniCT: Towards a Unified Slice-Volume LVLM for Comprehensive CT Analysis Paper • 2602.16110 • Published 15 days ago • 1
OpenCodeInstruct: A Large-scale Instruction Tuning Dataset for Code LLMs Paper • 2504.04030 • Published Apr 5, 2025 • 3
rStar-Coder: Scaling Competitive Code Reasoning with a Large-Scale Verified Dataset Paper • 2505.21297 • Published May 27, 2025 • 30
SWITCH: Benchmarking Modeling and Handling of Tangible Interfaces in Long-horizon Embodied Scenarios Paper • 2511.17649 • Published Nov 20, 2025 • 4
Audio MultiChallenge: A Multi-Turn Evaluation of Spoken Dialogue Systems on Natural Human Interaction Paper • 2512.14865 • Published Dec 16, 2025 • 2
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations Paper • 2412.07626 • Published Dec 10, 2024 • 29
Identity-Preserving Text-to-Video Generation by Frequency Decomposition Paper • 2411.17440 • Published Nov 26, 2024 • 38
zELO: ELO-inspired Training Method for Rerankers and Embedding Models Paper • 2509.12541 • Published Sep 16, 2025 • 8
Refusal in Language Models Is Mediated by a Single Direction Paper • 2406.11717 • Published Jun 17, 2024 • 7
GrapHist: Graph Self-Supervised Learning for Histopathology Paper • 2603.00143 • Published 8 days ago • 3
BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing? Paper • 2603.03194 • Published 1 day ago • 44
Ref-Adv: Exploring MLLM Visual Reasoning in Referring Expression Tasks Paper • 2602.23898 • Published 5 days ago • 10