Abstract
In order to make up for the defect that the traditional spectral clustering algorithm cannot determine the number of clusters and the time-consuming calculation, this paper studies and improves the spectral clustering algorithm. In complex community networks, the spectral clustering algorithm based on modularity optimization is chosen to find the number of communities. In addition, four types of user attribute information are integrated, and a more reasonable user similarity model is constructed. At the same time, the original non-parallelized spectral clustering algorithm is optimized, and its improved scheme is suitable for the application of distributed computing. Many Hadoop optimization strategies are proposed for virtual community discovery scenarios in large-scale communities. Finally, the experimental results show that the efficiency of the parallelized spectral clustering algorithm is greatly improved, which can be applied to the virtual community discovery in large-scale social networks.
Similar content being viewed by others
References
Porter, M. A., Onnela, J. P., & Mucha, P. J. (2009). Communities in networks. Notices American Mathematical Society, 56(9), 1082–1097, 1164–1166.
Mottaghi, S., & Zahabi, M. R. (2015). Optimizing LEACH clustering algorithm with mobile sink and rendezvous nodes. AEUE - International Journal of Electronics and Communications, 69(2), 507–514.
Chopade, P., & Zhan, J. (2015). Structural and functional analytics for community detection in large-scale complex networks. Journal of Big Data, 2(1), 1–28.
Nayak, P., & Devulapalli, A. (2015). A fuzzy logic-based clustering algorithm for WSN to extend the network lifetime. IEEE Sensors Journal, 16(1), 137–144.
Li, X., & Zheng, Y. (2016). Clustering approach based on hi-erarchical expansion for community detection of scientific collaboration network. High Technology Letters, 22(4), 419–425.
Zhao, F., Liu, H., & Fan, J. (2015). A multiobjective spatial fuzzy clustering algorithm for image segmentation. Applied Soft Computing, 30(C), 48–57.
Ludwig, S. A. (2015). MapReduce-based fuzzy c-means clustering algorithm: Implementation and scalability. International Journal of Machine Learning and Cybernetics, 6(6), 923–934.
Ferrari, D. G., & de Castro, L. N. (2015). Clustering algorithm selection by meta-learning systems: A new distance-based problem characterization and ranking combination methods. Information Sciences, 301(C), 181–194.
Begum, S. S. F., & Rajesh, A. (2016). Modified load-balanced clustering algorithm with distributed self-organization for wireless sensor networks. International Journal of Pharmacy & Technology, 8(4), 23349–23356.
Shao, P., Shi, W., He, P., et al. (2016). Novel approach to unsupervised change detection based on a robust semi-supervised FCM clustering algorithm. Remote Sensing, 8(3), 264.
Le, H. S. (2015). A novel kernel fuzzy clustering algorithm for geo-demographic analysis. Information Sciences, 317(10), 202–223.
Ma, A., Zhong, Y., & Zhang, L. (2015). Adaptive multiobjective memetic fuzzy clustering algorithm for remote sensing imagery. IEEE Transactions on Geoscience and Remote Sensing, 53(8), 4202–4217.
Zhang, Y., Wang, J., Han, D., et al. (2017). Fuzzy-logic based distributed energy-efficient clustering algorithm for wireless sensor networks. Sensors, 17(7), 1554.
Zou, J., Peng, C., Xu, H., et al. (2015). A fuzzy clustering algorithm-based dynamic equivalent modeling method for wind farm with DFIG. IEEE Transactions on Energy Conversion, 30(4), 1329–1337.
Nokhanji, N., Hanapi, Z. M., Subramaniam, S., et al. (2015). An energy aware distributed clustering algorithm using fuzzy logic for wireless sensor networks with non-uniform node distribution. Wireless Personal Communications, 84(1), 395–419.
Mustapha, I., Ali, B. M., Rasid, M. F. A., et al. (2015). An energy-efficient spectrum-aware reinforcement learning-based clustering algorithm for cognitive radio sensor networks. Sensors, 15(8), 19783–19818.
Guo, K., Guo, W., Chen, Y., et al. (2015). Community discovery by propagating local and global information based on the MapReduce model. Information Sciences, 323(C), 73–93.
Balouchestani, M., & Krishnan, S. (2016). Advanced K-means clustering algorithm for large ECG data sets based on a collaboration of compressed sensing theory and K-SVD approach. Signal, Image and Video Processing, 10(1), 113–120.
Acknowledgements
The authors acknowledge the National Natural Science Foundation of China (Grant: 61673248), the Natural Science Foundation of Shanxi Province (Grant: 201601D102030).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Zhang, H., Wu, Y. Optimization and Application of Clustering Algorithm in Community Discovery. Wireless Pers Commun 102, 2443–2454 (2018). https://doi.org/10.1007/s11277-018-5264-x
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11277-018-5264-x