Video-Text-to-Text
Transformers
Safetensors
English
internvl_chat
feature-extraction
multimodal
custom_code
Eval Results (legacy)
Instructions to use OpenGVLab/InternVideo2_5_Chat_8B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use OpenGVLab/InternVideo2_5_Chat_8B with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("OpenGVLab/InternVideo2_5_Chat_8B", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
Error on get_vision_tower
#3
by Clip-AI - opened
AttributeError: 'InternVLChatModel' object has no attribute 'get_vision_tower'
Any ideas on how to get past this? I saw there might be a version error here: https://github.com/haotian-liu/LLaVA/issues/15 - but not sure how to solve with InternVideo2_5; you can comment it out, it's not necessary.
Clip-AI changed discussion status to closed
I've encountered a similar problem, did you find a solution?