v0.8.0
v0.8.0 - 2025-12-20
This release introduces a breaking change such that the inference workload is unified to StatefulSet. The Deployment resources created by existing workspaces will be removed by the controller and new StatefulSet resources will be created instead. No manual operation is required for this migration, and it is expected that the inference server hits a short period of downtime due to the Pod recreation.
Changelog
Breaking Changes π₯
- 3ab3f3d55a47d8ebb8cd26440c9f7732c259017a feat: [BREAKING] use statefulset for all workspace (#1523)
Features π
- b9664848f4a36a0454147909f2f2dcc78d45640c feat: update gpu-provisioner version to v0.3.8 for kaito (#1698)
- 91819b9e8141c8062c6b2bdb8f6b9ee2dcf8eb65 feat: preset-generator support generic model format and attn arch (#1690)
Bug Fixes π
- 1366f9a7d290453d01c10a800c8202b59bb8c6bb fix: set imagePullPolicy to Always (#1702)
- 8945b5b74fdeb40d8a9e5f1cb05895b021faa764 fix: workload type in ragengine e2e test (#1697)
- dffd5f342d9a980f5ea6b8c5481004d693d6f357 fix: invalid indentation in artifacthub links (#1683)
- e5d77e5c0e34556add8542780f0976212d6c48b2 fix: cancel latest release when it's perrelease (#1680)