Image Datasets
google/wit • Viewer • Updated Jul 4, 2022 • 2.66M • 110 • 62
google/docci • Updated Jul 24, 2024 • 319 • 75
google/imageinwords • Updated May 25, 2024 • 97 • 120
lygsbw/UMG-41M • Updated Jul 13, 2024 • 19 • 5
Vision-Language
moonshotai/Kimi-VL-A3B-Thinking • Image-Text-to-Text • 16B • Updated 10 days ago • 77.8k • 445
OpenGVLab/InternViT-300M-448px-V2_5 • Image Feature Extraction • 0.3B • Updated Dec 9, 2024 • 5.14k • 48
LifuWang/DistillT5 • 0.1B • Updated Apr 11, 2025 • 109 • 29