Spark NLP 6.3.1 focuses on strengthening distributed local LLM inference by upgrading the jsl-llamacpp backend to a newer llama.cpp release, while also delivering important improvements in document structure handling and metadata consistency.
This enables you to use the latest LLMs and embeddings compatible with llama.cpp and perform advanced ingestion of tables and images.
🔥 Highlights
Upgraded jsl-llamacpp backend to llama.cpp tag b7247, bringing upstream performance improvements, stability fixes, and expanded model compatibility for local LLM inference.
Improved Reader2X annotator capabilities with structural position metadata for tables and images, and integration with AutoGGUFVisionModel.
The jsl-llamacpp backend has been upgraded to llama.cpp tag b7247, applying upstream fixes and enabling the use of the latest LLMs. These changes benefit distributed LLM workloads in Spark NLP and affect the AutoGGUFModel, AutoGGUFEmbeddings, AutoGGUFVisionModel, and AutoGGUFReranker annotators:
Performance and memory improvements, plus bug fixes from upstream llama.cpp, for offline LLM inference within Spark NLP pipelines
Better support for newer GGUF/GGML model variants, meaning you can now load models such as gpt-oss, Qwen3, and embeddinggemma.
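As a quick sanity check before handing a local model file to one of the AutoGGUF annotators, you can inspect the GGUF header yourself. The sketch below is standalone and relies only on the documented GGUF file layout (a `GGUF` magic followed by a little-endian uint32 format version); it does not use any Spark NLP API, and the function name is our own:

```python
import struct

def read_gguf_version(data: bytes) -> int:
    """Return the GGUF format version from a file's first 8 bytes.

    GGUF files begin with the 4-byte magic b"GGUF" followed by a
    little-endian uint32 version number.
    """
    magic = data[:4]
    if magic != b"GGUF":
        raise ValueError("not a GGUF file")
    return struct.unpack_from("<I", data, 4)[0]

# Synthetic header for illustration (version 3, the current GGUF revision).
sample = b"GGUF" + struct.pack("<I", 3)
version = read_gguf_version(sample)
```

In practice you would read the first 8 bytes of the `.gguf` file on disk; a mismatched magic usually means a truncated download or a legacy GGML file that the upgraded backend may not accept.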
Structural Metadata for Document Readers
Previously, our document parsers (HTMLReader, XMLReader, WordReader, PowerPointReader, ExcelReader) relied heavily on positional or page-based coordinates for layout metadata.
However, non-PDF formats such as HTML, XML, DOC(X), PPT(X), and XLS(X) do not have fixed pages.
To ensure deterministic element referencing and structural traceability across all document types, we needed to adopt a unified DOM-like metadata model.
This change standardizes metadata extraction so every element can be uniquely identified and re-located within its source document, independent of visual layout.
These additions enable layout-aware downstream processing and more precise filtering, especially for HTML and rich document formats.
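To make the idea of deterministic, DOM-like element referencing concrete, here is a minimal sketch that derives a stable path for an element from its ancestry of (tag, sibling index) pairs. The function name and path format are illustrative assumptions for this note, not the actual metadata keys emitted by the Spark NLP readers:

```python
from typing import List, Tuple

def element_path(ancestors: List[Tuple[str, int]]) -> str:
    """Build a deterministic DOM-like path such as 'body[0]/div[0]/table[1]'.

    Each ancestor is a (tag, sibling_index) pair; the same element in the
    same source document always yields the same path, independent of any
    visual layout or page geometry.
    """
    return "/".join(f"{tag}[{index}]" for tag, index in ancestors)

# Second <table> inside the first <div> of <body>:
path = element_path([("body", 0), ("div", 0), ("table", 1)])
```

Because the path is derived purely from document structure, it stays valid for formats without fixed pages and lets downstream stages re-locate or filter elements reliably.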
Reader2Image Integration with AutoGGUFVisionModel
Previously, you could use Reader2Image to ingest images from various file formats into Spark NLP. However, processing was limited to Spark NLP's native VLM implementations (such as Qwen2VLTransformer).
Reader2Image now interoperates with AutoGGUFVisionModel on our llama.cpp backend by introducing flexible handling of encoded vs. decoded image bytes and optional prompt output.
Added a new boolean parameter useEncodedImageBytes to control whether the image result stores:
true: Encoded (compressed) file bytes for models like AutoGGUFVisionModel
false: Decoded pixel matrix for models such as Qwen2VLTransformer
Added an outputPromptColumn parameter to optionally output a separate prompt column containing text prompts as Spark NLP Annotations, which is the required input format for AutoGGUFVisionModel.
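To make the encoded-vs-decoded distinction concrete, the following standalone sketch builds a tiny 1x1 grayscale PNG in memory and shows both representations side by side. It uses only the Python standard library and does not call any Spark NLP API; the variable names are illustrative:

```python
import struct
import zlib

def chunk(tag: bytes, data: bytes) -> bytes:
    """Assemble one PNG chunk: length, tag+data, CRC32 of tag+data."""
    body = tag + data
    return struct.pack(">I", len(data)) + body + struct.pack(">I", zlib.crc32(body))

# IHDR: 1x1 image, 8-bit depth, color type 0 (grayscale), no interlace.
ihdr = struct.pack(">IIBBBBB", 1, 1, 8, 0, 0, 0, 0)
# IDAT: one scanline = filter byte 0x00 followed by a single gray pixel 0x7f.
idat = zlib.compress(b"\x00\x7f")
png = (b"\x89PNG\r\n\x1a\n"
       + chunk(b"IHDR", ihdr)
       + chunk(b"IDAT", idat)
       + chunk(b"IEND", b""))

# useEncodedImageBytes = true  -> the result keeps the compressed file bytes
# as-is, which is what GGUF vision models consume.
encoded_bytes = png

# useEncodedImageBytes = false -> the result keeps the decoded pixel matrix
# (here a 1x1 grayscale matrix), as expected by transformers like
# Qwen2VLTransformer.
decoded_pixels = [[0x7F]]
```

The key point: the encoded form preserves the original container (PNG/JPEG headers and all), while the decoded form is the raw pixel data after decompression, so the right choice depends entirely on which model family sits downstream.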
Platform Setup Documentation
Added official documentation and instructions for setting up and running Spark NLP on Microsoft Fabric, simplifying configuration and improving developer onboarding on the platform. You can see them at Spark NLP - Installation
🐛 Bug Fixes
Sentence metadata is now consistently included in DocumentAssembler outputs when using LightPipeline.
Fixed an issue where resetting the cache in ResourceDownloader could fail under certain conditions.
Fixed a document parsing bug where some HTML elements (such as section titles or diagnosis entries) could appear multiple times in the parsed output.
Improved robustness when loading ONNX BertEmbeddings models with non-standard output tensor names.
❤️ Community Support
Slack β real-time discussion with the Spark NLP community and team
GitHub β issue tracking, feature requests, and contributions