v0.7.0
First Maven Central release. DataFrame-only API, 10+ divergences, Elkan acceleration, robust/sparse/multi-view/time-series/spectral/IB clustering, PySpark wrapper.
Changelog
Production-ready K-Means clustering for Apache Spark with pluggable Bregman divergences (KL, Itakura-Saito, L1, etc.). Includes 6 algorithms, 740 tests, and cross-version persistence. Drop-in replacement for MLlib with mathematically correct distance functions for probability distributions, spectral data, and count data.
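To illustrate why the choice of divergence matters for probability distributions, here is a minimal plain-Python sketch. It is not the library's API: the function names (`squared_euclidean`, `kl_divergence`, `itakura_saito`, `centroid`, `assign`) and the sample values are hypothetical and chosen only for illustration. It assumes strictly positive center coordinates, and it relies on the standard result that, for a Bregman divergence with the center as the second argument, the arithmetic mean of a cluster's points remains the optimal center.

```python
import math

def squared_euclidean(p, q):
    # Standard K-Means distance: sum of squared coordinate differences.
    return sum((a - b) ** 2 for a, b in zip(p, q))

def kl_divergence(p, q):
    # KL(p || q); assumes every q_i > 0. Terms with p_i == 0 contribute 0.
    return sum(a * math.log(a / b) for a, b in zip(p, q) if a > 0)

def itakura_saito(p, q):
    # Itakura-Saito divergence; assumes every p_i > 0 and q_i > 0.
    return sum(a / b - math.log(a / b) - 1 for a, b in zip(p, q))

def centroid(points):
    # Arithmetic mean of the points, coordinate by coordinate.
    n = len(points)
    return [sum(col) / n for col in zip(*points)]

def assign(point, centers, divergence):
    # Index of the center with the smallest divergence from the point.
    return min(range(len(centers)), key=lambda k: divergence(point, centers[k]))

# Illustrative (made-up) data: one distribution and two candidate centers.
x = [0.8, 0.19, 0.01]
centers = [
    [0.989, 0.001, 0.01],  # puts almost no mass on the second outcome
    [0.4, 0.3, 0.3],
]

if __name__ == "__main__":
    print("squared Euclidean assigns to center", assign(x, centers, squared_euclidean))
    print("KL divergence assigns to center", assign(x, centers, kl_divergence))
```

With these values the two measures disagree: squared Euclidean distance picks the first center because the coordinates are numerically close, while KL divergence picks the second because the first center assigns almost no probability to an outcome that `x` observes 19% of the time.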