Abstract
Domain-Specific features clustering aims to cluster the features from related domains into K clusters. Although traditional clustering algorithms can be used to domain-specific features clustering, the performance may not good as the features have little inter-connection in related domains. In this paper, we develop a solution that uses the domain-independent feature as a bridge to connect the domain-specific features. And we use spectral clustering to cluster the domain-specific features into K clusters. We present theoretical analysis to show that our algorithm is able to produce high quality clusters. The experimental results show that our algorithm improves the clustering performance over the traditional algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Blitzer, J.: Domain Adaptation of Natural Language Processing Systems. PhD thesis, The University of Pennsylvania (2007)
Liu, B.: Web DataMinging: Exploring Hyperlinks contents and Usage Data. Springer (January 2007)
Hagen, L., Kahng, A.B.: New spectral methods for ratio cut partitioning and clustering. IEEE Transactions on CAD (1992)
Verma, D., Meila, M.: Acomparison of spectral clustering Algorithms. UW CSE Technical Report 03-05-01 (2003)
Ng, A.Y., Jardan, M., Weiss, Y.: On spectral clustering: analysis and an algorithm. Neural Information Processing Systems 14 (2002)
Kernighan, B., Lin, S.: An efficient heuristic procedure for partitioning graphs. The Bell System Technical Journal (1970)
Mohar, B.: The Laplacian spectrum of graphs. Graph Theory Combinatorics and Applications, 871–898 (1988)
Hagen, L., Kahng, A.B.: New spectral methods for ratio cut partitioning and clustering. IEEE Transactions on Computer-Aided Design (1992)
Shi, J., Malik, J.: Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence (2000)
Callon, M., Law, J., Rip, A.: Mapping the Dynamics of Science and Technology: Sociology of Science in the RealWorld, London (1986)
Ding, C., He, X.: K-means clustering via principal component analysis. In: Proceedings of the Twenty-First International Conference on Machine learning, pp. 225–232. ACM, Banff (2004)
Fayyad, U., Piatetsky-Shapiro, G., Smyth, P.: From data mining to knowledge discovery: An Overview. In: Advances in Knowledge Discovery and Data Mining, pp. 1–36. MIT Press, Cambridge (1996)
Michael, C.: Discriminative ranking for natural language parsing. In: Proceedings of the 17th International Conference on Machine Learning, ICML (2000)
Li, l.: Ruled Extraction of Rough Set Model Based on Important Connection Degree Anqing Radio and Television University, Anqing 246003, China (2011)
Cover, T.M., Thomas, J.A.: Elements of information theory. Wiley-Interscience (1991)
Francis, R.: Learning Spectral Clustering. Computer Science and Statistics University of California Berkeley, CA 94720 (2003)
Dhillon, I.S.: Co-clustering documents and words using bipartite spectral graph partitioning. In: Proceedings of the 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 269–274. ACM (2001)
Salton, G., McGill, M.J.: Introduction to Modern Retrieval. McGraw-Hill Book Company (1983)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yang, X., Wang, M., Fang, L., Yue, L., Lv, Y. (2012). Research on Domain-Specific Features Clustering Based Spectral Clustering. In: Tan, Y., Shi, Y., Ji, Z. (eds) Advances in Swarm Intelligence. ICSI 2012. Lecture Notes in Computer Science, vol 7332. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31020-1_11
Download citation
DOI: https://doi.org/10.1007/978-3-642-31020-1_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31019-5
Online ISBN: 978-3-642-31020-1
eBook Packages: Computer ScienceComputer Science (R0)