Cluster one-dimensional data optimally?

r cluster-analysis k-means

32,731

Univariate k-means clustering can be solved in O(kn) time (on already sorted input) based on theoretical results on Monge matrices, but the approach was not popular most likely due to numerical instability and also perhaps coding challenges.

A better option is an O(knlgn) method that is now implemented in Ckmeans.1d.dp version 3.4.6. This implementation is as fast as heuristic k-means but offers guaranteed optimality, orders of magnitude better than heuristic k-means especially for large k's.

The generic dynamic programming solution by Richard Bellman (1973) does not touch upon specifics of the k-means problem and the implied runtime is O(kn^3).

32,731

Author by

Laciel

Updated on July 04, 2020

Comments

Laciel almost 4 years

Does anyone have a paper that explains how the Ckmeans.1d.dp algorithm works?

Or: what is the most optimal way to do k-means clustering in one-dimension?

Recents

Why Is PNG file with Drop Shadow in Flutter Web App Grainy?

How to troubleshoot crashes detected by Google Play Store for Flutter app

Cupertino DateTime picker interfering with scroll behaviour

Why does awk -F work for most letters, but not for the letter "t"?

Flutter change focus color and icon color but not works

How to print and connect to printer using flutter desktop via usb?

Critical issues have been reported with the following SDK versions: com.google.android.gms:play-services-safetynet:17.0.0

Flutter Dart - get localized country name from country code

navigatorState is null when using pushNamed Navigation onGenerateRoutes of GetMaterialPage

Android Sdk manager not found- Flutter doctor error

Flutter Laravel Push Notification without using any third party like(firebase,onesignal..etc)

How to change the color of ElevatedButton when entering text in TextField

Related

k-means: Same clusters for every execution

How can I get cluster number correspond to data using k-means clustering techniques in R?

kmeans: Quick-TRANSfer stage steps exceeded maximum

Cluster analysis in R: determine the optimal number of clusters

How to calculate BIC for k-means clustering in R

R draw kmeans clustering with heatmap

NA in clustering functions (kmeans, pam, clara). How to associate clusters to original data?

drawing heatmap with dendrogram along with sample labels

How to fix kmeans error in r : 'more cluster centers than distinct data points'

simple k-means clustering for bag of words model using python