How To Fine-tune LLaVA Model (From Your Laptop!)

Brev · Intermediate ·👁️ Computer Vision ·2y ago
Follow Along: https://console.brev.dev/launchable/deploy/now?userID=p2mzt91a8&orgID=jnj0c501d&launchableID=env-2hpxJ6HArVk5jzOYgJmFDJfvNmH&instance=A10G%40g5.12xlarge&diskStorage=300&cloudID=devplane-brev-1&python=3.10&cuda=12.2.2&file=https%3A%2F%2Fgithub.com%2Fbrevdev%2Fnotebooks%2Fblob%2Fmain%2Fllava-finetune.ipynb&name=Fine-tune+and+deploy+multimodal+LLaVA-1.5 Join Our Discord: https://discord.com/invite/NVDyv7TUgJ In this guide, we fine tune the popular open sourced model, LLaVA (Large Language-and-Vision Assistant) on a dataset to be used in a visual classification application. You can perform the fine tuning yourself, regardless your level of experience, or the level of compute you have access to. Please leave any future guides you would like made below!
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

Inside SAM 3D: how Meta turns a single image into 3D
Learn how Meta's SAM 3D technology turns a single image into 3D, revolutionizing the field of computer vision
Medium · Machine Learning
Inside SAM 3D: how Meta turns a single image into 3D
Learn how Meta's SAM 3D technology generates 3D models from single images, revolutionizing the field of computer vision
Medium · Deep Learning
Demystifying CNNs: How Convolutional Filters and Max-Pooling Actually Work
Learn how Convolutional Neural Networks (CNNs) use convolutional filters and max-pooling to recognize images
Medium · Data Science
Your "Biometric Age Check" Isn't Verifying Identity — And Defense Lawyers Know It
Biometric age checks don't verify identity, a crucial distinction for developers in computer vision and biometrics
Dev.to AI
Up next
How Transformers Finally Ate Vision – Isaac Robinson, Roboflow
AI Engineer
Watch →