Abstract
This paper proposes a method to measure the performance of keyword extraction based on topic coverage. The answer set of a keyword is required to evaluate keyword extraction by methods such as TF-IDF. However, creating an answer set for a large document is expensive. Thus, this paper proposes a new measurement called topic coverage on the basis of the assumption that the keywords extracted by a superior method can express the topic information efficiently. The experiment using the proceedings of a conference shows the feasibility of our proposed method.
Chapter PDF
Similar content being viewed by others
References
Manning, C.D., Raghavan, P., SchĂĽtze, H.: Introduction to Information Retrieval. Cambridge University Press (2008)
Zhang, K., Xu, H., Tang, J., Li, J.: Keyword Extraction Using Support Vector Machine. In: Yu, J.X., Kitsuregawa, M., Leong, H.-V. (eds.) WAIM 2006. LNCS, vol. 4016, pp. 85–96. Springer, Heidelberg (2006)
Salton, G.: Automatic Text Processing: The Transformation Analysis and Retrieval of Information by Computer. Addison-Wesley Publisher (1988)
New York Times: Artificial Intelligence, With Help From the Humans (2007), http://www.nytimes.com/2007/03/25/business/yourmoney/25Stream.html (accessed in March 2013)
Blei, D.M., Lafferty, J.D.: Dynamic topic models. In: Proceedings of the 23rd International Conference on Machine Learning, pp. 113–120 (2006)
Jayabharathy, J., Kanmani, S., Parveen, A.A.: A Survey of Document Clustering Algorithms with Topic Discovery. Journal of Computing 3, 21–28 (2011)
Han, J., Kamber, M., Pei, J.: Data Mining: Concepts and Techniques, 3rd edn. Morgan Kaufmann (2011)
Newman, M.E.: Detecting community structure in networks. The European Physical Journal B-Condensed Matter and Complex Systems 38, 321–330 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Saga, R., Kobayashi, H., Miyamoto, T., Tsuji, H. (2014). Measurement Evaluation of Keyword Extraction Based on Topic Coverage. In: Stephanidis, C. (eds) HCI International 2014 - Posters’ Extended Abstracts. HCI 2014. Communications in Computer and Information Science, vol 434. Springer, Cham. https://doi.org/10.1007/978-3-319-07857-1_40
Download citation
DOI: https://doi.org/10.1007/978-3-319-07857-1_40
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-07856-4
Online ISBN: 978-3-319-07857-1
eBook Packages: Computer ScienceComputer Science (R0)