Keyword Extraction Using Graph Centrality and WordNet

Sharma, Chhavi; Jain, Minni; Aggarwal, Ayush

doi:10.1007/978-981-13-2348-5_27

Chhavi Sharma⁴,
Minni Jain⁴ &
Ayush Aggarwal⁴

566 Accesses
1 Citations

Abstract

Keywords are a summarized shortened version of any document. While a lot of research has been done on keyword extraction, very few of them analyze the network of a semantic network to identify the most important words in the document. In this research, we present one such method which uses WordNet as our knowledge base to exploit the semantic relatedness of terms and hence determine keywords. This is based upon graph centrality measures which help to identify the central nodes or keywords from the document.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

eBook: USD 16.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Unsupervised Keyword Extraction Using the GoW Model and Centrality Scores

Semantic Measures for Keywords Extraction

A New Extraction Algorithm for Hierarchical Keyword Using Text Social Network

Notes

1.
Stop-words are the words in a language that do not serve much meaning but are used for construction of a sentence and binding it together.

References

Jones, S., & Paynter, G. (2002). Automatic extraction of document keyphrases for use in digital libraries: Evaluation and applications. Journal of the American Society for Information Science and Technology.
Google Scholar
Fellbaum, C. (1998). WordNet: An electronic lexical database. Cambridge, MA: MIT Press.
Google Scholar
Zhang, K., Hui, X., Tang, J., Li, J., Yu, J. X., Kitsuregawa, M., Leong, H. V. (2006). Keyword extraction using support vector machine advances in web-age information management.
Google Scholar
Rose, S., & Engel, D., Cramer, N., Cowley, W. (2010). Automatic keyword extraction from individual documents. Text Mining: Applications and Theory, 1–20. https://doi.org/10.1002/9780470689646.ch1.
Google Scholar
Witten, I. H., Paynter, G. W, Frank, E., Gutwin, C., & Nevill-Manning, C. G. (1999). KEA: Practical automatic keyphrase extraction. ACM DL.
Google Scholar
Turney, P. (1999). Learning to extract keyphrases from text. Information Retrieval.
Google Scholar
Witten, I. H. & Medelyan, O. (2006). Thesaurus based automatic keyphrase indexing. In Proceedings of the 6th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL’06) (pp. 296–297), Chapel Hill, NC.
Google Scholar
Cerbulescu, C., & Leotescu, G. S. (2017). Extracting text keywords using WordNet (pp. 1–4). https://doi.org/10.1145/3136273.3136280.
Beliga, S., Ana, M., & Martinčić-Ipšić, S. (2015). An overview of graph-based keyword extraction methods and approaches. Journal of Information and Organizational Sciences, 39(1), 1–20.
Google Scholar
HaCohen-Kerner, Y. (2003). Automatic extraction of keywords from abstracts.
Google Scholar
Pudotta, A., Dattolo, A., & Baruzzo, A. (2010). New domain independent keyphrase extraction system digital libraries. In 6th Italian Research Conference, IRCDL. Padua, Italy.
Google Scholar
Wei, T., Lu, Y., Chang, H., Zhou, Q., & Bao, X. (2015). A semantic approach for text clustering using WordNet and lexical chains. Expert Systems with Applications, 42(4), 2264–2275, ISSN 0957-4174.
Article Google Scholar
Tsatsaronis, G., Varlamis, I., & Nørvåg, K. (2010). SemanticRank: ranking keywords and sentences using semantic graphs. In Proceedings of the 23rd International Conference on Computational Linguistics (COLING’10).
Google Scholar
Page, L., Brin, S., Motwani, R., & Winograd, T. (1999). The pagerank citation ranking: bringing order to the web. Technical Report, Stanford InfoLab.
Google Scholar
Kleinberg, J. M. (1998). Authoritative sources in a hyperlinked environment. In Proceedings of the Ninth Symposium Discrete Algorithms (pp. 668–677).
Google Scholar
Boudin, F. (2013). A comparison of centrality measures for graph-based keyphrase extraction. In International Joint Conference on Natural Language Processing (IJCNLP) (pp. 834–838), Nagoya, Japan.
Google Scholar
Schluter, N. (2014). Centrality measures for non-contextual graph-based unsupervised single document keyword extraction. In Proceedings of TALN Association for Computational Linguistics.
Google Scholar
Tixier, A., Malliaros, F., & Vazirgiannis, M. (2016). A graph degeneracy-based approach to keyword extraction. In EMNLP.
Google Scholar
Navigli, R., & Lapata, M. (2010). An experimental study of graph connectiv-ity for unsupervised word sense disambiguation. IEEE Transaction on Pattern Analysis and Machine Learning, 32(4).
Article Google Scholar
Jain, A., Mittal, K., & Tayal, D. K. (2014). Automatically incorporating context meaning for query expansion using graph connectivity measures. Progress in Artificial Intelligence, 2, 129–139.
Article Google Scholar
Jain, A., & Lobiyal, D. K. (2014). A new approach for unsupervised word sense disambiguation in Hindi language using graph connectivity measures. International Journal Artificial Intelligence Soft Computing 4(4), 318–334.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Delhi Technological University, Shahbad Daulatpur, Bawana Road, New Delhi, 110042, Delhi, India
Chhavi Sharma, Minni Jain & Ayush Aggarwal

Authors

Chhavi Sharma
View author publications
You can also search for this author in PubMed Google Scholar
Minni Jain
View author publications
You can also search for this author in PubMed Google Scholar
Ayush Aggarwal
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Minni Jain .

Editor information

Editors and Affiliations

Department of Computer Engineering, Netaji Subhas Institute of Technology, University of Delhi, New Delhi, India
Shampa Chakraverty
SAP Canada, Waterloo, ON, Canada
Anil Goel
Department of Electrical and Information Engineering, Covenant University, Ota, Nigeria
Sanjay Misra

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Sharma, C., Jain, M., Aggarwal, A. (2018). Keyword Extraction Using Graph Centrality and WordNet. In: Chakraverty, S., Goel, A., Misra, S. (eds) Towards Extensible and Adaptable Methods in Computing. Springer, Singapore. https://doi.org/10.1007/978-981-13-2348-5_27

Download citation

DOI: https://doi.org/10.1007/978-981-13-2348-5_27
Published: 05 November 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-2347-8
Online ISBN: 978-981-13-2348-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Keyword Extraction Using Graph Centrality and WordNet

Abstract

Access this chapter

Similar content being viewed by others

Unsupervised Keyword Extraction Using the GoW Model and Centrality Scores

Semantic Measures for Keywords Extraction

A New Extraction Algorithm for Hierarchical Keyword Using Text Social Network

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

Keyword Extraction Using Graph Centrality and WordNet

Abstract

Access this chapter

Similar content being viewed by others

Unsupervised Keyword Extraction Using the GoW Model and Centrality Scores

Semantic Measures for Keywords Extraction

A New Extraction Algorithm for Hierarchical Keyword Using Text Social Network

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation