v1.0.64
EXO v1.0.64 Release Notes
This release comes with support for GLM-4.7-Flash, IP-less RDMA discovery (removing the need for custom network locations) and OpenAI-compatible tool calling via the API. It also includes bug fixes for auto parallelism, fixing various models including Qwen, GPT-OSS and MiniMax that were getting stuck in LOADING / WARMING UP, as well as better error messages when things go wrong.
Model Support
- Added support for GLM-4.7-Flash (#1214)
API
- Added tool calling support to the OpenAI-compatible chat completions API (#1233)