Abstract
Deep clustering obtains feature representation generally and then performs clustering for high dimension real-world data. However, conventional solutions are two-stage embedding learning-based methods and these two processes are separate and independent, which often leads to clustering results cannot feedback to optimize the representation learning and reduces the performance of deep clustering. In this paper, we aim to propose a deep boundary-aware clustering by jointly optimizing unsupervised representation learning. More specifically, we joint boundary-aware variational auto-encoder and deep regularized clustering for deep regularized clustering for unsupervised learning, named Boundary-aware DEep Clustering (BaDEC). BaDEC is able to learn feature representation and clustering simultaneously, and it introduces deep regularized clustering to reduce the unreliability of the similarity measures. In particular, we present a boundary-aware variational auto-encoder that tunes variable evidence lower bounds flexibly to assist feature representation learning better for more accurate clustering. Extensive experiments on various datasets from multiple domains demonstrate that the proposed method outperforms several popular comparison baseline methods.
Similar content being viewed by others
References
Boutsidis C, Drineas P, Mahoney MW (2009) Unsupervised feature selection for the \(k\)-means clustering problem. In: Advances in neural information processing systems, pp. 153–161
Chen X, Kingma DP, Salimans T, Duan Y, Dhariwal P, Schulman J, Sutskever I, Abbeel P (2017) Variational lossy autoencoder. In: 5th international conference on learning representations, Toulon, France, April 24-26
Devlin J, Chang MW, Lee K, Toutanova K Bert (2019) Pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 conference of the north american chapter of the association for computational linguistics: human language technologies, Minneapolis, MN, USA, June 2-7, pp. 4171–4186
Fu Y, Yan Q, Liao J, Xiao C (2020) Joint texture and geometry optimization for RGB-D reconstruction. In: 2020 IEEE/CVF conference on computer vision and pattern recognition, Seattle, WA, USA, June 13-19, pp. 5949–5958
Ghasedi Dizaji K, Herandi A, Deng C, Cai W, Huang H (2017) Deep clustering via joint convolutional autoencoder embedding and relative entropy minimization. In: Proceedings of the IEEE international conference on computer vision, pp. 5736–5745
Gregor K, Besse F, Rezende DJ, Danihelka I, Wierstra, D (2016) Towards conceptual compression. In: Advances in neural information processing systems 29: Annual conference on neural information processing systems, Barcelona, Spain, December 5-10, pp. 3549–3557
Guo X, Gao L, Liu X, Yin J (2017) Improved deep embedded clustering with local structure preservation. In: IJCAI, pp. 1753–1759
Huang C, Sankaran K, Dhekane E, Lacoste A, Courville AC (2019) Hierarchical importance weighted autoencoders. In: Proceedings of the 36th international conference on machine learning, Long Beach, California, USA, June 9-15, pp. 2869–2878
Ji P, Zhang T, Li H, Salzmann M, Reid, I (2017) Deep subspace clustering networks. In: Advances in neural information processing systems, pp. 24–33
Jiang B, Tu W, Yang C, Yuan J (2020) Context-integrated and feature-refined network for lightweight object parsing. IEEE Trans Image Process 29:5079–5093
Jiang Z, Zheng Y, Tan H, Tang B, Zhou H (2017) Variational deep embedding: An unsupervised and generative approach to clustering. In: Proceedings of the twenty-sixth international joint conference on artificial intelligence, pp. 1965–1972
Kingma DP, Welling M (2014) Auto-encoding variational bayes. In: 2nd international conference on learning representations
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp. 1097–1105
Liu D, Long C, Zhang H, Yu H, Dong X, Xiao C (2020) Arshadowgan: Shadow generative adversarial network for augmented reality in single light scenes. In: 2020 IEEE/CVF conference on computer vision and pattern recognition, Seattle, WA, USA, June 13-19, pp. 8136–8145
Martínez-Trinidad JF, Ruiz-Shulcloper J (2001) Fuzzy clustering of semantic spaces. Pattern Recognition 34(4):783–793
Mukherjee S, Asnani H, Lin E, Kannan S (2019) Clustergan: Latent space clustering in generative adversarial networks. In: Proceedings of the thirty-third AAAI conference on artificial intelligence, pp. 1181–1190
Ng AY, Jordan MI, Weiss Y (2002) On spectral clustering: Analysis and an algorithm. In: Advances in neural information processing systems, pp. 849–856
Rainforth T, Kosiorek AR, Le TA, Maddison CJ, Igl M, Wood F, Teh YW (2018) Tighter variational bounds are not necessarily better. In: Proceedings of the 35th international conference on machine learning, pp. 4274–4282
Ren Y, Hu K, Dai X, Pan L, Hoi SC, Xu Z (2019) Semi-supervised deep embedded clustering. Neurocomputing 325:121–130
Solorio-Fernández S, Martínez-Trinidad JF, Carrasco-Ochoa JA (2017) A new unsupervised spectral feature selection method for mixed data: a filter approach. Pattern Recognition 72:314–326
Vincent P, Larochelle H, Lajoie I, Bengio Y, Manzagol PA (2010) Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion. Journal of Machine Learning Research 11:3371–3408
Xiang S, Nie F, Zhang C (2008) Learning a mahalanobis distance metric for data clustering and classification. Pattern recognition 41(12):3600–3612
Xiao H, Rasul K, Vollgraf R (2017) Fashion-mnist: a novel image dataset for benchmarking machine learning algorithms. arXiv:1708.07747
Xie J, Girshick R, Farhadi A (2016) Unsupervised deep embedding for clustering analysis. In: International conference on machine learning, pp. 478–487
Xing EP, Jordan MI, Russell, SJ, Ng AY (2003) Distance metric learning with application to clustering with side-information. In: Advances in neural information processing systems, pp. 521–528
Yang J, Parikh D, Batra D (2016) Joint unsupervised learning of deep representations and image clusters. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 5147–5156
Ye J, Zhao Z, Wu M (2008) Discriminative k-means for clustering. In: Advances in neural information processing systems, pp. 1649–1656
Yu J, Jiang H, Wang G, Guo Q (2012) Clustering-based energy-efficient broadcast tree in wireless networks. Int. J. Comput. Commun. Control 7(4):785–790
Zhang R, Tong H, Xia Y, Zhu Y (2019) Robust embedded deep k-means clustering. In: Proceedings of the 28th international conference on information and knowledge management, Beijing, China, November 3-7, pp. 1181–1190
Zhou F, De la Torre F, Hodgins JK (2012) Hierarchical aligned cluster analysis for temporal clustering of human motion. IEEE Transactions on Pattern Analysis and Machine Intelligence 35(3):582–596
Acknowledgements
This work is supported by the National Natural Science Foundation of China (Grant No. 61602353 and 61373148), Hubei Provincial Natural Science Foundation of China (Grant No. 2017CFA012), the National Social Science Foundation under Award (Grant No. 19BYY076), in part Key R & D project of Shandong Province (Grant No. 2019JZZY010129), and Shandong Provincial Social Science Planning Project (Grant No. 18CXWJ01, 18BJYJ04 and 19BJCJ51).
Author information
Authors and Affiliations
Corresponding authors
Rights and permissions
About this article
Cite this article
Wang, R., Li, L., Wang, P. et al. Deep boundary-aware clustering by jointly optimizing unsupervised representation learning. Multimed Tools Appl 81, 34309–34324 (2022). https://doi.org/10.1007/s11042-021-11597-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-021-11597-2