VITA-QinYu: Expressive Spoken Language Model for Role-Playing and singing
AI & ML interests
Multimodal LLM
Recent Activity
Organization Card
[2024-09-06] ππ We release
VITA, including the training code, deployment code, and model weights.
models 23
VITA-MLLM/VITA-QinYu-4B
Audio-to-Audio β’ 5B β’ Updated β’ 34
VITA-MLLM/VITA-QinYu-Models
Updated
VITA-MLLM/VITA-QinYu-8B
9B β’ Updated β’ 10
VITA-MLLM/VITA-E
Updated β’ 2
VITA-MLLM/VITA-Audio-Plus-Boost
11B β’ Updated β’ 4 β’ 3
VITA-MLLM/VITA-Audio-Boost
10B β’ Updated β’ 2 β’ 3
VITA-MLLM/VITA-Audio-Plus-Vanilla
8B β’ Updated β’ 16 β’ 5
VITA-MLLM/VITA-Audio-Balance
10B β’ Updated β’ 3 β’ 3
VITA-MLLM/Long-VITA-1M_HF
15B β’ Updated β’ 1 β’ 1
VITA-MLLM/Long-VITA-1M_MG
Updated β’ 1
datasets 7
VITA-MLLM/VITA-QinYu-ToySample
Updated β’ 4
VITA-MLLM/VITA-Audio-Data
Preview β’ Updated β’ 19 β’ 7
VITA-MLLM/Emotion_NaturalConv_FunctionCall
Preview β’ Updated β’ 34 β’ 2
VITA-MLLM/AudioQA-1M
Preview β’ Updated β’ 63 β’ 3
VITA-MLLM/Comic-9K
Viewer β’ Updated β’ 239k β’ 184 β’ 6
VITA-MLLM/MovieNet-Summary
Updated β’ 8 β’ 2
VITA-MLLM/Long-VITA-Data
Viewer β’ Updated β’ 17.8M β’ 79 β’ 2