Clustering Algorithms Resources

Published: 27 Aug 2015 Category: machine_learning

K-means

Yinyang K-Means: A Drop-In Replacement of the Classic K-Means with Consistent Speedup

Semi-supervised K-means++

k-Means Clustering Is Matrix Factorization

An efficient K-means algorithm for Massive Data

Boost K-Means

Compressive K-means

Convergence rate of stochastic k-means

Fast and Provably Good Seedings for k-Means using k-MC^2 and AFK-MC^2

An efficient K -means clustering algorithm for massive data

Stream Clustering

Neural Network-based Clustering

Spectral Clustering

On Spectral Clustering: Analysis and an algorithm

Hierarchical Clustering

Online Clustering

Papers

On Clustering Validation Techniques (2001)

Stream Clustering

Neural network-based clustering using pairwise constraints

PAC-Bayesian Online Clustering

Compressive Spectral Clustering

Interactive Bayesian Hierarchical Clustering

Practical Introduction to Clustering Data

Rényi divergence minimization based co-regularized multiview clustering

Consistent Algorithms for Clustering Time Series

Hybridization of Expectation-Maximization and K-Means Algorithms for Better Clustering Performance

mst_clustering: Clustering via Euclidean Minimum Spanning Trees

k2-means for fast and accurate large scale clustering

Context Aware Nonnegative Matrix Factorization Clustering

Clustering by fast search and find of density peaks

http://science.sciencemag.org/content/344/6191/1492

Comment on “Clustering by fast search and find of density peaks”

https://arxiv.org/abs/1501.04267

Datasets

Clustering datasets

https://cs.joensuu.fi/sipu/datasets/

Books

**Introduction to Clustering and Unsupervised Learning PACKT Books**

Blogs

Finding the K in K-means by Parametric Bootstrap

Random walk vectors for clustering

A comparison between PCA and hierarchical clustering

http://www.kdnuggets.com/2016/02/qlucore-comparison-pca-hierarchical-clustering.html

Visualization of Centroid Movements for K-Means Clustering

http://web.cecs.pdx.edu/~lane7/

K-Means Clustering on Handwritten Digits

http://johnloeber.com/docs/kmeans.html

Improved Seeding For Clustering With K-Means++ (★★★★★)

https://datasciencelab.wordpress.com/2014/01/15/improved-seeding-for-clustering-with-k-means/

Spectral Clustering – How Math is Redefining Decision Making

http://www.datasciencecentral.com/profiles/blogs/spectral-clustering-how-math-is-redefining-decision-making?overrideMobileRedirect=1

Visual comparison of machine learning algorithms: Clustering

http://haifengl.github.io/smile/index.html#clustering

Clustering Algorithms: From Start To State Of The Art

https://www.toptal.com/machine-learning/clustering-algorithms

Hierarchical clustering, using it to invest

Spectral Clustering: A quick overview

https://charlesmartin14.wordpress.com/2012/10/09/spectral-clustering/

Why K-Means is not always a good idea

https://datasciencemadesimpler.wordpress.com/2016/03/05/why-k-means-is-not-always-a-good-idea/

**High Quality, High Performance Clustering with HDBSCAN SciPy 2016**

Projects

MusicMappr: Find patterns in your favorite songs and remix them on the fly!

  • intro: MusicMappr finds chunks of songs that are similar, and clusters them accordingly. You can visualize these clusters and play them back at will. This is for music lovers who are curious about the structures inherent to their favorite songs.
  • github: https://github.com/fatsmcgee/MusicMappr

TfKmeans: A implementation of k-means clustering in TensorFlow

CUDA K-Means Clustering: A CUDA implementation of the k-means clustering algorithm

kmeans_cuda: CUDA implementation of k-means

K-means in TensorFlow

VAE-Clustering