AI & ML interests
multi-modal foundation models
Recent Activity
-
mvp-lab/LLaVA-OneVision-1.5-Instruct-Data
Viewer β’ Updated β’ 21.9M β’ 67k β’ 65 -
mvp-lab/LLaVA-OneVision-1.5-Mid-Training-85M
Viewer β’ Updated β’ 91.5M β’ 162k β’ 59 -
lmms-lab/LLaVA-OneVision-1.5-8B-Instruct
Image-Text-to-Text β’ 9B β’ Updated β’ 43.3k β’ 62 -
lmms-lab/LLaVA-OneVision-1.5-4B-Instruct
Image-Text-to-Text β’ 5B β’ Updated β’ 5.09k β’ 18
-
mvp-lab/LLaVA-OneVision-1.5-Instruct-Data
Viewer β’ Updated β’ 21.9M β’ 67k β’ 65 -
mvp-lab/LLaVA-OneVision-1.5-Mid-Training-85M
Viewer β’ Updated β’ 91.5M β’ 162k β’ 59 -
lmms-lab/LLaVA-OneVision-1.5-8B-Instruct
Image-Text-to-Text β’ 9B β’ Updated β’ 43.3k β’ 62 -
lmms-lab/LLaVA-OneVision-1.5-4B-Instruct
Image-Text-to-Text β’ 5B β’ Updated β’ 5.09k β’ 18
spaces 19
Configuration error
SyncAI
π
Generate a friendly greeting for any name
Running on Zero
Sketch2MotionAI
π
An AI-powered keyframe in-betweening tool
Paused
Canny_ControlNet
π»
Canny-Edge Guided Image Generation with ControlNet
Running
1
Dungeon of Decisions
β
Play an AIβpowered text D&D adventure with narration
Paused
2
Audio Generation
π
Generate smooth DJ transitions between two songs
Paused
Glasses Removal
π
Glasses Removal
datasets 6
mvp-lab/LLaVA-OneVision-1.5-RL-Data
Viewer
β’ Updated
β’ 69.2k β’ 345 β’ 6
mvp-lab/LLaVA-OneVision-1.5-Mid-Training-85M
Viewer
β’ Updated
β’ 91.5M β’ 162k β’ 59
mvp-lab/LLaVA-OneVision-1.5-Instruct-Data
Viewer
β’ Updated
β’ 21.9M β’ 67k β’ 65
mvp-lab/LLaVA-558K-Webdataset
Updated
β’ 478 β’ 4
mvp-lab/LLaVA-NeXT-780k-webdataset
Updated
β’ 1.14k
mvp-lab/LLaVA-OneVision-1.5-Mid-Training-Webdataset-Quick-Start-3M
Updated
β’ 6.68k β’ 2