Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs Paper • 2603.16932 • Published 14 days ago • 84
CARES: Context-Aware Resolution Selector for VLMs Paper • 2510.19496 • Published Oct 22, 2025 • 9
NLE: Non-autoregressive LLM-based ASR by Transcript Editing Paper • 2603.08397 • Published 19 days ago • 21
NLE: Non-autoregressive LLM-based ASR by Transcript Editing Paper • 2603.08397 • Published 19 days ago • 21
view article Article Granite 4.0 1B Speech: Compact, Multilingual, and Built for the Edge 19 days ago • 14
ibm-granite/granite-4.0-1b-speech Automatic Speech Recognition • 2B • Updated about 17 hours ago • 50.2k • 194
Granite 3.1 Language Models Collection A series of language models with 128K context length trained by IBM licensed under Apache 2.0 license. • 9 items • Updated 12 days ago • 69
Charting and Navigating Hugging Face's Model Atlas Paper • 2503.10633 • Published Mar 13, 2025 • 93
Advancing Speech Understanding in Speech-Aware Language Models with GRPO Paper • 2509.16990 • Published Sep 21, 2025 • 22
Advancing Speech Understanding in Speech-Aware Language Models with GRPO Paper • 2509.16990 • Published Sep 21, 2025 • 22
view article Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers +5 Sep 11, 2025 • 185