k-means: Same clusters for every execution

r statistics cluster-analysis k-means

12,263

Solution 1

Yes. Use set.seed to set a seed for the random value before doing the clustering.

Using the example in kmeans:

set.seed(1)
x <- rbind(matrix(rnorm(100, sd = 0.3), ncol = 2),
           matrix(rnorm(100, mean = 1, sd = 0.3), ncol = 2))
colnames(x) <- c("x", "y")


set.seed(2)
XX <- kmeans(x, 2)

set.seed(2)
YY <- kmeans(x, 2)

Test for equality:

identical(XX, YY)
[1] TRUE

Solution 2

Yes, calling set.seed(foo) immediately prior to running kmeans(....) will give the same random start and hence the same clustering each time. foo is a seed, like 42 or some other numeric value.

12,263

Author by

Admin

Updated on June 04, 2022

Comments

Admin almost 2 years

Is it possible to get same kmeans clusters for every execution for a particular data set. Just like for a random value we can use a fixed seed. Is it possible to stop randomness for clustering?

Recents

Why Is PNG file with Drop Shadow in Flutter Web App Grainy?

How to troubleshoot crashes detected by Google Play Store for Flutter app

Cupertino DateTime picker interfering with scroll behaviour

Why does awk -F work for most letters, but not for the letter "t"?

Flutter change focus color and icon color but not works

How to print and connect to printer using flutter desktop via usb?

Critical issues have been reported with the following SDK versions: com.google.android.gms:play-services-safetynet:17.0.0

Flutter Dart - get localized country name from country code

navigatorState is null when using pushNamed Navigation onGenerateRoutes of GetMaterialPage

Android Sdk manager not found- Flutter doctor error

Flutter Laravel Push Notification without using any third party like(firebase,onesignal..etc)

How to change the color of ElevatedButton when entering text in TextField

Related

How can I get cluster number correspond to data using k-means clustering techniques in R?

Cluster one-dimensional data optimally?

Calculating the percentage of variance measure for k-means?

kmeans: Quick-TRANSfer stage steps exceeded maximum

Cluster analysis in R: determine the optimal number of clusters

How to calculate BIC for k-means clustering in R

R draw kmeans clustering with heatmap

NA in clustering functions (kmeans, pam, clara). How to associate clusters to original data?

drawing heatmap with dendrogram along with sample labels

How to fix kmeans error in r : 'more cluster centers than distinct data points'