Release v1.1.3 (Bring your own data, LoRA training)
Updates
- Support LoRA for the instruction-tuning stage of LLaVA-1.5, with performance comparable to full-model finetuning and reduced GPU VRAM requirements. (ckpts/logs, script)
- Bring your own data and finetune LLaVA-1.5 on your own task. (instruction)
- Basic support for Windows. (instruction)
- Fix: training with gradient accumulation now behaves the same as large-batch training.
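
The gradient-accumulation fix above relies on a standard identity: if each micro-batch's mean loss is scaled by 1/num_accum_steps before backprop, the accumulated gradient equals the gradient of one large batch. A minimal pure-Python sketch (toy linear model, illustrative names only, not the actual training code):

```python
# Toy check: gradient accumulation matches large-batch gradients when
# each micro-batch mean gradient is scaled by 1 / num_accum_steps.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]
w = 0.5  # scalar weight of a toy model y_hat = w * x

def grad(xs, ys, w):
    # d/dw of mean((w*x - y)^2) = mean(2*(w*x - y)*x)
    return sum(2 * (w * x - y) * x for x, y in zip(xs, ys)) / len(xs)

full_batch = grad(xs, ys, w)
# Two micro-batches of size 2; each contribution scaled by 1/2.
accumulated = 0.5 * grad(xs[:2], ys[:2], w) + 0.5 * grad(xs[2:], ys[2:], w)
print(full_batch, accumulated)  # the two gradients coincide
```

The same scaling argument carries over to mini-batch SGD in any framework; omitting the 1/num_accum_steps factor is what makes accumulated training diverge from large-batch training.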
Notes
- A new LoRA schedule is used for LLaVA-1.5:
- rank: 128
- alpha: 256
- lr (LoRA): 2e-4
- lr (projector): 2e-5
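
The schedule above can be captured as a plain config mapping. A hedged sketch (the key names here are illustrative, not necessarily the flags used by the LLaVA training scripts); note that with rank 128 and alpha 256, the effective LoRA scaling factor alpha/r is 2.0:

```python
# Illustrative config dict for the LoRA schedule above.
# Key names are hypothetical; consult the release's script for actual flags.
lora_schedule = {
    "lora_r": 128,            # LoRA rank
    "lora_alpha": 256,        # LoRA alpha
    "learning_rate": 2e-4,    # lr for LoRA parameters
    "mm_projector_lr": 2e-5,  # lr for the multimodal projector
}

# LoRA updates are applied as (alpha / r) * B @ A, so the scaling is:
scaling = lora_schedule["lora_alpha"] / lora_schedule["lora_r"]
print(scaling)  # 2.0
```

Setting alpha to 2x the rank keeps the update magnitude stable as rank grows, while the projector keeps the smaller learning rate used in full finetuning.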