Abstract
Keywords are a set of important thematic words that represent the whole document. This paper discusses a novel approach for determining Punjabi keywords. Earlier keyword extraction systems for Punjabi were not very efficient because they used fewer features for extracting keywords. The proposed Punjabi keywords extraction system is more efficient because it uses six features, more than earlier systems: the TF-ISF feature, the noun frequency feature, the font type feature (bold, italic and underlined fonts), the cue phrase feature, the position feature and the title keyword feature. The proposed approach also uses regression to assign weights to these six features.
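The combination of six feature scores with regression-derived weights can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, not the paper's implementation: the abstract does not give the TF-ISF formula, the scoring function, or the regression procedure. The sketch assumes a common TF-ISF variant (term frequency times the log of inverse sentence frequency), a linear weighted sum as the keyword score, and plain least-squares regression fitted by gradient descent. The names `tf_isf`, `keyword_score` and `fit_weights` are hypothetical.

```python
import math

# Hypothetical feature order; the names mirror the six features in the abstract.
FEATURES = ["tf_isf", "noun_freq", "font", "cue_phrase", "position", "title"]


def tf_isf(word, sentences):
    """Assumed TF-ISF variant: document term frequency times
    log(number of sentences / number of sentences containing the word).
    `sentences` is a list of token lists."""
    tf = sum(s.count(word) for s in sentences)
    sf = sum(1 for s in sentences if word in s)
    if sf == 0:
        return 0.0
    return tf * math.log(len(sentences) / sf)


def keyword_score(feature_values, weights):
    """Assumed scoring rule: weighted sum of the six feature values.
    Both arguments are sequences ordered as in FEATURES."""
    return sum(w * x for w, x in zip(weights, feature_values))


def fit_weights(rows, targets, lr=0.1, epochs=2000):
    """Least-squares linear regression by batch gradient descent.
    Each row holds six feature values for one candidate word;
    each target is a label such as 1.0 (keyword) or 0.0 (not a keyword)."""
    weights = [0.0] * len(FEATURES)
    n = len(rows)
    for _ in range(epochs):
        grad = [0.0] * len(weights)
        for x, y in zip(rows, targets):
            err = keyword_score(x, weights) - y
            for j, xj in enumerate(x):
                grad[j] += 2.0 * err * xj / n
        weights = [w - lr * g for w, g in zip(weights, grad)]
    return weights
```

In use, one would compute the six feature values for each candidate word in labelled training documents, call `fit_weights` on them, and then rank the words of a new document by `keyword_score`. A library regression routine could replace the hand-written gradient descent; it is written out here only to keep the sketch self-contained.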
Copyright information
© 2014 Springer India
About this paper
Cite this paper
Gupta, V. (2014). A New Punjabi Keywords Extraction System. In: Sengupta, S., Das, K., Khan, G. (eds) Emerging Trends in Computing and Communication. Lecture Notes in Electrical Engineering, vol 298. Springer, New Delhi. https://doi.org/10.1007/978-81-322-1817-3_46
DOI: https://doi.org/10.1007/978-81-322-1817-3_46
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-1816-6
Online ISBN: 978-81-322-1817-3
eBook Packages: Engineering, Engineering (R0)