The Research on Large Scale Data Set Clustering Algorithm Based on Tag Set

Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 575)

Abstract

This paper proposes a set of SSLOKmeans algorithm that helps to guide the clustering before using tag memory resident, this algorithm can further improve the large-scale data sets clustering efficiency and clustering results of quality.

Keywords

Clustering Tag set SSLOKmeans algorithm Data set 

References

  1. 1.
    Kantabutra, S., Couch, A.L.: Parallel K-means clustering algorithm on NOWs. Tech. J. 1(6), 243–248 (2000)Google Scholar
  2. 2.
    Alsabti, K., Ranka, S., Singh, V.: An efficient K-means clustering algorithm. In: Proceedings of IPPS/SPDP Workshop on High Performance Data Mining, 1998, pp. 1–6. ACM, New York, NY (2011)Google Scholar
  3. 3.
    Chang, H., Yeung, D.Y.: Locally linear metric adaptation with application to semi-supervised clustering and image retrieval. Pattern Recogn. 39(7), 1253–1264 (2006)CrossRefMATHGoogle Scholar
  4. 4.
    Wagstaff, K., Cardie, C., Rogers, S., et al.: Constrained K-means clustering with background knowledge. In: Proceedings of the 18th International Conference on Machine Learning, pp. 577–584. Morgan Kaufmann, San Francisco, CA (2001)Google Scholar
  5. 5.
    Basu, S., Banerjee, A., Mooney. R.: Semi-supervised clustering by seeding. In: Proceedings of the 19th International Conference on Machine Learning, pp. 27–34. Morgan Kaufmann, San Francisco, CA (2002)Google Scholar
  6. 6.
    Basu, S., Bilenko, M., Mooney R.J.: A probabilistic framework for semi-supervised clustering. In: Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 59–68. ACM, New York, NY (2004)Google Scholar
  7. 7.
    Lu, Z., Leen, T.K.: Semi-supervised clustering with pair wise constraints: a discriminative approach. In: Proceedings of the 11th International Conference on Artificial Intelligence and Statistics, AISTATS 2007, pp. 299–306. Microtome Publishing, USA (2007)Google Scholar
  8. 8.
    Bradley, P.S., Fayyad, U., Reina, C.: Scaling clustering algorithms to large databases. In: Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining (KDD 1998), pp. 9–15. AAAI Press, New York, USA (1998)Google Scholar

Copyright information

© Springer Science+Business Media Singapore 2016

Authors and Affiliations

  1. 1.Department of Computer ScienceGuangdong University of Science and TechnologyDongguanChina

Personalised recommendations