Unclaimed project

Are you a maintainer of LLaVA? Claim this project to take control of your public changelog and roadmap.

Changelog

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

haotian-liu/LLaVA·

25k2.8kPythonApache-2.0

·Website

chatbotchatgptfoundation-modelsgpt-4instruction-tuningllama+7

Last updated almost 2 years ago

Back to changelog

NewAI EnhancedJanuary 31, 2024

Release v1.2.0 (LLaVA-1.6)

LLaVA-1.6

34B model now outperforms Gemini Pro on several vision-language benchmarks
4x pixel processing increase — handles significantly higher resolution inputs
Expanded task support — new applications and capabilities beyond 1.5
Models available in Model Zoo; training/eval data and scripts coming soon

LLaVA-1.6 is out! With additional scaling to LLaVA-1.5, LLaVA-1.6-34B outperforms Gemini Pro on some benchmarks. It can now process 4x more pixels and perform more tasks/applications than before. Check out the blog post, and explore the demo! Models are available in Model Zoo. Training/eval data and scripts coming soon.

More Python Projects

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

185.9k

Python

yt-dlp

A feature-rich command-line audio/video downloader

179.1k

Python

markitdown

Python tool for converting files and office documents to Markdown.

168.0k

Python

HelloGitHub

:octocat: 分享 GitHub 上有趣、入门级的开源项目。Share interesting, entry-level open source projects on GitHub.

166.5k

Python

View all Python projects →