1st Edition

Data Clustering with Python From Theory to Implementation

By Guojun Gan Copyright 2026
260 Pages 40 B/W Illustrations
by Chapman & Hall

260 Pages 40 B/W Illustrations
by Chapman & Hall

Data clustering, an interdisciplinary field with diverse applications, has gained increasing popularity since its origins in the 1950s. Over the past six decades, researchers from various fields have proposed numerous clustering algorithms. In 2011, I wrote a book on implementing clustering algorithms in C++ using object-oriented programming. While C++ offers efficiency, its steep learning curve... Read more

I Python Programming Preliminaries

1 Python Programming 101

2 The NumPy Library

3 The Pandas Library

4 The Matplotlib Library

II Data Clustering in Python

5 Introduction to Data Clustering

6 Agglomerative Hierarchical Algorithms

7 DIANA

8 The k-means Algorithm

9 The c-means Algorithm

10 The k-prototypes Algorithm

11 The Genetic k-modes Algorithm

12 The FSC Algorithm

13 The Gaussian Mixture Algorithm

14 The KMTD Algorithm

15 The Probability Propagation Algorithm

16 A Spectral Clustering Algorithm

17 A Mean-Shift Algorithm

Biography

Guojun Gan is an Associate Professor in the Department of Mathematics at the University of Connecticut, where he has been since August 2014. Prior to that, he worked at a large life insurance company in Toronto, Canada for six years and a hedge fund in Oakville, Canada for one year. He earned a BS degree from Jilin University, Changchun, China, in 2001 and MS and PhD degrees from York University, Toronto, Canada, in 2003 and 2007, respectively.