MobileVLM : A Fast, Reproducible and Strong Vision Language Assistant for Mobile Devices
Paper: arXiv:2312.16886
MobileLLaMA-2.7B-Chat is obtained from MobileLLaMA-2.7B-Base by supervised instruction fine-tuning on the ShareGPT dataset.
Model weights can be loaded with Hugging Face Transformers. Examples can be found on GitHub.
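As a rough illustration of loading the weights with Transformers, here is a minimal sketch. The repository id `mtgv/MobileLLaMA-2.7B-Chat` is an assumption (check the model page for the exact id), the helper names `load_chat_model` and `generate` are hypothetical, and the plain-text prompt is a placeholder because the chat format used during fine-tuning is not described here. The official examples on GitHub take precedence over this sketch.

```python
# Assumed Hub repository id; verify it against the model page before use.
MODEL_ID = "mtgv/MobileLLaMA-2.7B-Chat"


def load_chat_model(model_id: str = MODEL_ID):
    """Load tokenizer and causal LM weights (hypothetical helper)."""
    # Imported lazily so that defining the helper needs no heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Use half precision on GPU, full precision on CPU.
    use_cuda = torch.cuda.is_available()
    dtype = torch.float16 if use_cuda else torch.float32

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=dtype)
    model.to("cuda" if use_cuda else "cpu")
    model.eval()
    return tokenizer, model


def generate(tokenizer, model, prompt: str, max_new_tokens: int = 64) -> str:
    """Run generation with default settings and decode the full sequence."""
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # output_ids[0] contains the prompt tokens followed by the new tokens.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


if __name__ == "__main__":
    tok, mdl = load_chat_model()
    # Placeholder prompt; the expected chat template is an open assumption.
    print(generate(tok, mdl, "What is the capital of France?"))
```

The first call downloads the checkpoint from the Hub, so it needs network access and several gigabytes of disk space.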
For details, please refer to Section 4.1 of our paper, MobileVLM: A Fast, Strong and Open Vision Language Assistant for Mobile Devices.