Release v1.2.0 (LLaVA-1.6)
LLaVA-1.6
- LLaVA-1.6-34B outperforms Gemini Pro on some benchmarks
- Processes 4x more pixels than before, allowing higher-resolution inputs
- Expanded task support: more tasks and applications than LLaVA-1.5
- Models available in Model Zoo; training/eval data and scripts coming soon
LLaVA-1.6 is out! Building on LLaVA-1.5 with additional scaling, LLaVA-1.6-34B outperforms Gemini Pro on some benchmarks. It can now process 4x more pixels and handle more tasks and applications than before. Check out the blog post and explore the demo! Models are available in the Model Zoo. Training/eval data and scripts are coming soon.