Image Clustering Algorithms by Deep Convolutional Autoencoders

doi:10.3778/j.issn.1673-9418.1806029

Abstract

Abstract: The deep convolutional embedded clustering (DCEC) algorithm loses too much feature information, especially on complex images. To address this, this paper proposes a 17-layer deep network framework for unsupervised deep image clustering. Subsampling layers are embedded in the encoder to reduce the number of parameters and prevent overfitting, and up-sampling layers are embedded in the decoder to restore the detail lost through subsampling. Combining this framework with the loss functions of deep embedded clustering (DEC) and improved deep embedded clustering (IDEC) yields two deep convolutional autoencoder based image clustering algorithms, named DEC_DCNN (deep embedded clustering based on deep convolutional neural network) and IDEC_DCNN (improved deep embedded clustering based on deep convolutional neural network), respectively. Adam (adaptive moment estimation) and mini-batch SGD (mini-batch stochastic gradient descent) are used to optimize the parameters of the proposed algorithms. The algorithms are tested on three typical image datasets. The experimental results show that the proposed 17-layer framework is robust and general: DEC_DCNN and IDEC_DCNN, both built on it, achieve higher clustering accuracy (ACC) than the compared clustering algorithms. IDEC_DCNN outperforms DEC_DCNN in terms of AMI (adjusted mutual information), ARI (adjusted Rand index) and ACC, which further demonstrates its advantages.

Key words: deep image clustering, convolutional autoencoders, convolutional neural network (CNN), deep learning, clustering
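The abstract combines the proposed network with the DEC and IDEC loss functions but does not spell them out. The sketch below follows the formulation commonly given for those two algorithms: a Student's t soft assignment, a sharpened target distribution, a KL-divergence clustering loss, and, for IDEC, a reconstruction term added to the clustering loss. It is a minimal NumPy illustration, not the authors' implementation; the function names, the weight `gamma=0.1`, and the use of mean squared error for reconstruction are assumptions made here for illustration.

```python
import numpy as np

def soft_assign(z, centers, alpha=1.0):
    """Student's t soft assignment q_ij of embedding i to cluster center j."""
    # Squared Euclidean distances, shape (n_samples, n_clusters).
    d2 = ((z[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    q = (1.0 + d2 / alpha) ** (-(alpha + 1.0) / 2.0)
    return q / q.sum(axis=1, keepdims=True)

def target_distribution(q):
    """Sharpened target p_ij: square q, divide by cluster frequency, renormalise."""
    w = q ** 2 / q.sum(axis=0)
    return w / w.sum(axis=1, keepdims=True)

def kl_clustering_loss(p, q, eps=1e-12):
    """DEC-style clustering loss KL(P || Q); eps guards against log(0)."""
    return float(np.sum(p * np.log((p + eps) / (q + eps))))

def idec_loss(x, x_rec, p, q, gamma=0.1):
    """IDEC-style joint loss: reconstruction error plus weighted clustering loss.

    gamma and the MSE reconstruction term are illustrative assumptions.
    """
    reconstruction = float(np.mean((x - x_rec) ** 2))
    return reconstruction + gamma * kl_clustering_loss(p, q)
```

In a DEC-style setup only `kl_clustering_loss` would drive the encoder, whereas an IDEC-style setup keeps the decoder and also minimises the reconstruction term, which is what preserves the local structure of the embedding.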

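Abstract (translated from Chinese): The existing deep convolutional embedded clustering (DCEC) algorithm suffers from excessive feature loss in its network and fails to extract effective features from complex images. To address this problem, an unsupervised deep clustering framework with a 17-layer network structure is proposed. Subsampling layers are added to the encoder to reduce parameters and prevent overfitting, and up-sampling layers are added to the decoder to restore the detail lost through subsampling. Combining the framework with the loss function of the DEC (deep embedded clustering) algorithm and with the local-structure-preserving loss function of the IDEC (improved deep embedded clustering) algorithm gives two convolutional autoencoder based deep learning image clustering algorithms, DEC_DCNN (deep embedded clustering based on deep convolutional neural network) and IDEC_DCNN (improved deep embedded clustering based on deep convolutional neural network). Two optimization methods, adaptive moment estimation (Adam) and mini-batch stochastic gradient descent (mini-batch SGD), are used to adjust the model parameters. Experimental results on three classic image datasets show that the proposed 17-layer network structure is robust and general with respect to image features, and that the deep clustering algorithms built on it achieve results far better than existing deep clustering algorithms, with clustering accuracy higher than all compared algorithms. A comparison of DEC_DCNN and IDEC_DCNN in terms of clustering accuracy, AMI (adjusted mutual information) and ARI (adjusted Rand index) shows that IDEC_DCNN clusters better than DEC_DCNN, indicating that IDEC_DCNN is the superior algorithm.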

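Key words (translated from Chinese): deep image clustering, convolutional autoencoder, convolutional neural network (CNN), deep learning, clustering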
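Clustering accuracy (ACC), one of the metrics named in the abstract, is usually computed by finding the best one-to-one mapping between predicted cluster labels and ground-truth classes. The sketch below shows that common definition using SciPy's `linear_sum_assignment`; it assumes labels are non-negative integers starting at 0, and the function name `clustering_accuracy` is hypothetical rather than taken from the paper.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

def clustering_accuracy(y_true, y_pred):
    """Fraction of samples correctly labelled under the best cluster-to-class mapping.

    Assumes integer labels starting at 0 (an assumption of this sketch).
    """
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred)
    n = int(max(y_true.max(), y_pred.max())) + 1
    # counts[c, k] = number of samples in predicted cluster c with true class k.
    counts = np.zeros((n, n), dtype=np.int64)
    for t, c in zip(y_true, y_pred):
        counts[c, t] += 1
    # The solver minimises cost, so negate the counts to maximise matches.
    rows, cols = linear_sum_assignment(-counts)
    return counts[rows, cols].sum() / y_true.size
```

A permuted but otherwise perfect labelling, such as true labels `[0, 0, 1, 1]` against predictions `[1, 1, 0, 0]`, scores 1.0 under this definition. AMI and ARI are available in scikit-learn as `adjusted_mutual_info_score` and `adjusted_rand_score`.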

XIE Juanying, HOU Qi, CAO Jiawen. Image Clustering Algorithms by Deep Convolutional Autoencoders[J]. Journal of Frontiers of Computer Science and Technology, 2019, 13(4): 586-595.

谢娟英，侯琦，曹嘉文. 深度卷积自编码图像聚类算法[J]. 计算机科学与探索, 2019, 13(4): 586-595.

