
1
Deep Clustering: A Comprehensive Survey
Yazhou Ren, Member, IEEE, Jingyu Pu, Zhimeng Yang, Jie Xu, Guofeng Li, Xiaorong Pu,
Philip S. Yu, Fellow, IEEE, Lifang He, Member, IEEE
Abstract—Cluster analysis plays an indispensable role in machine learning and data mining. Learning a good data representation is
crucial for clustering algorithms. Recently, deep clustering, which can learn clustering-friendly representations using deep neural
networks, has been broadly applied in a wide range of clustering tasks. Existing surveys for deep clustering mainly focus on the
single-view fields and the network architectures, ignoring the complex application scenarios of clustering. To address this issue, in this
paper we provide a comprehensive survey for deep clustering in views of data sources. With different data sources and initial
conditions, we systematically distinguish the clustering methods in terms of methodology, prior knowledge, and architecture.
Concretely, deep clustering methods are introduced according to four categories, i.e., traditional single-view deep clustering,
semi-supervised deep clustering, deep multi-view clustering, and deep transfer clustering. Finally, we discuss the open challenges and
potential future opportunities in different fields of deep clustering.
Index Terms—Deep clustering; semi-supervised clustering; multi-view clustering; transfer learning
F
1 INTRODUCTION
WITH the development of online media, abundant data with
high complexity can be gathered easily. Through pinpoint
analysis of these data, we can dig the value out and use these
conclusions in many fields, such as face recognition [1], [2],
sentiment analysis [3], [4], intelligent manufacturing [5], [6], etc.
A model which can be used to classify the data with different
labels is the base of many applications. For labeled data, it is
taken granted to use the labels as the most important information
as a guide. For unlabeled data, finding a quantifiable objective as
the guide of the model-building process is the key question of
clustering. Over the past decades, a large number of clustering
methods with shallow models have been proposed, including
centroid-based clustering [7], [8], density-based clustering [9],
[10], [11], [12], [13], distribution-based clustering [14], hierar-
chical clustering [15], ensemble clustering [16], [17], multi-view
clustering [18], [19], [20], [21], [22], [23], etc. These shallow
models are effective only when the features are representative,
while their performance on the complex data is usually limited
due to the poor power of feature learning.
In order to map the original complex data to a feature space
that is easy to cluster, many clustering methods focus on feature
extraction or feature transformation, such as PCA [24], kernel
method [25], spectral method [26], deep neural network [27], etc.
Among these methods, the deep neural network is a promising ap-
proach because of its excellent nonlinear mapping capability and
its flexibility in different scenarios. A well-designed deep learning
based clustering approach (referred to deep clustering) aims at
effectively extracting more clustering-friendly features from data
and performing clustering with learned features simultaneously.
Much research has been done in the field of deep clustering
and there are also some surveys about deep clustering methods
•Yazhou Ren, Jingyu Pu, Zhimeng Yang, Jie Xu, Guofeng Li and Xiaorong
Pu are with University of Electronic Science and Technology of China,
Chengdu 611731, China. Yazhou Ren is the corresponding author. E-mail:
yazhou.ren@uestc.edu.cn.
•Philip S. Yu is with University of Illinois at Chicago, IL 60607, USA.
•Lifang He is with Lehigh University, PA 18015, USA.
Manuscript received Oct. 2022.
[28], [29], [30], [31]. Specifically, existing systematic reviews for
deep clustering mainly focus on the single-view clustering tasks
and the architectures of neural networks. For example, Aljalbout
et al. [28] focus only on deep single-view clustering methods
which are based on deep autoencoder (AE or DAE). Min et
al. [29] classify deep clustering methods from the perspective
of different deep networks. Nutakki et al. [30] divide deep
single-view clustering methods into three categories according
to their training strategies: multi-step sequential deep clustering,
joint deep clustering, and closed-loop multi-step deep clustering.
Zhou et al. [31] categorize deep single-view clustering methods
by the interaction way between feature learning and clustering
modules. But in the real world, the datasets for clustering are
always associated, e.g., the taste for reading is correlated with
the taste for a movie, and the side face and full-face from the
same person should be labeled the same. For these data, deep
clustering methods based on semi-supervised learning, multi-view
learning, and transfer learning have also made significant progress.
Unfortunately, existing reviews do not discuss them too much.
Therefore, it is important to classify deep clustering from
the perspective of data sources and initial conditions. In this
survey, we summarize the deep clustering from the perspective of
initial settings of data combined with deep learning methodology.
We introduce the newest progress of deep clustering from the
perspective of network and data structure as shown in Fig. 1.
Specifically, we organize the deep clustering methods into the
following four categories:
•Deep single-view clustering
For conventional clustering tasks, it is often assumed that
the data are of the same form and structure, as known as single-
view or single-modal data. The extraction of representations for
these data by deep neural networks (DNNs) is a significant
characteristic of deep clustering. However, what is more note-
worthy is the different applied deep learning techniques, which
are highly correlated with the structure of DNNs. To compare the
technical route of specific DNNs, we divide those algorithms into
five categories: deep autoencoder (DAE) based deep clustering,
arXiv:2210.04142v1 [cs.LG] 9 Oct 2022