CACARA: Cross-Modal Alignment Leveraging a Text-Centric Approach for Cost-Effective Multimodal and Multilingual Learning Paper • 2512.00496 • Published Nov 29, 2025
HuggingFaceTB/SmolLM2-135M-Instruct Text Generation • 0.1B • Updated Sep 22, 2025 • 455k • 292
Article • MIEB: The Benchmark That Stress-Tests Image-Text Embeddings Like Never Before • Apr 24, 2025 • 17
visheratin/nllb-clip-large-siglip Zero-Shot Image Classification • Updated Mar 2, 2025 • 299 • 8
The Smol Training Playbook 📚 • The secrets to building world-class LLMs • 3k
ClassiCC-Corpus/Curio-1.1b-intermediate-checkpoint-50B Text Generation • 1B • Updated Aug 10, 2025 • 2