Unclaimed project

Are you a maintainer of ScaleLLM? Claim this project to take control of your public changelog and roadmap.

Claim this project

Changelog

ScaleLLM

A high-performance inference system for large language models, designed for production environments.

vectorch-ai/ScaleLLM
49540C++Apache-2.0
Website
cudaefficiencygpuinferencellamallama3+8

Last updated 3 months ago