
Development of Punjabi WordNet, Bilingual Dictionaries, Lexical Relations Creation, and Its Challenges

The WordNet in Indian Languages

Abstract

WordNet is an electronic lexical database and a powerful resource for computational linguistics and natural language processing. A WordNet for Hindi has already been developed by IIT Bombay, and WordNets for other Indian languages are being created from the Hindi WordNet using the expansion approach under the IndoWordNet project. This paper describes the creation of the Punjabi WordNet, in which semantic relations are borrowed from the Hindi WordNet while lexical relations, being language dependent, are created specifically for Punjabi. A lexical relation tool is proposed for creating these relations. The paper also presents the development of bilingual Hindi–Punjabi and Punjabi–Hindi dictionaries through this process, and discusses the challenges in developing the Punjabi WordNet and in creating language-specific synsets.
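
As a rough illustration of the expansion approach summarised above, the following Python sketch assumes that Hindi and Punjabi synsets share a common synset ID. Under that assumption, semantic relations can be stored once per ID and reused, lexical relations are kept per language, and a bilingual dictionary falls out of pairing words that share an ID. All data, names, and structures here are hypothetical toy examples; they do not reproduce the IndoWordNet database schema or the lexical relation tool proposed in the chapter.

```python
from collections import defaultdict

# Hypothetical toy data: synset ID -> member words. The IDs are shared
# because Punjabi synsets are assumed to be created by expansion from Hindi.
hindi_synsets = {
    101: ["पानी", "जल"],   # water
    102: ["घर", "मकान"],   # house
    103: ["इमारत"],        # building
}
punjabi_synsets = {
    101: ["ਪਾਣੀ", "ਜਲ"],
    102: ["ਘਰ", "ਮਕਾਨ"],
    103: ["ਇਮਾਰਤ"],
}

# Semantic relations hold between synsets (concepts), so one table keyed by
# synset ID can serve both languages.
semantic_relations = {
    102: {"hypernym": [103]},
}

# Lexical relations hold between individual words, so they are language
# dependent and stored separately for Punjabi (hypothetical example pair).
punjabi_lexical_relations = {
    "antonym": [("ਦਿਨ", "ਰਾਤ")],  # day / night
}


def semantic_relations_for(synset_id):
    """Return the semantic relations of a synset, identical for every
    language that shares the synset ID."""
    return semantic_relations.get(synset_id, {})


def build_bilingual_dictionary(source, target):
    """Map each source-language word to the target-language words that
    belong to the synset with the same ID."""
    dictionary = defaultdict(set)
    for synset_id, words in source.items():
        for word in words:
            dictionary[word].update(target.get(synset_id, []))
    return {word: sorted(targets) for word, targets in dictionary.items()}


hindi_to_punjabi = build_bilingual_dictionary(hindi_synsets, punjabi_synsets)
punjabi_to_hindi = build_bilingual_dictionary(punjabi_synsets, hindi_synsets)
```

In this sketch both dictionary directions come from the same function with the arguments swapped, which mirrors how shared synset IDs could make Hindi–Punjabi and Punjabi–Hindi dictionaries by-products of the same linked data.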

Acknowledgments

This work has been carried out under the research project titled ‘Development of Indradhanush: An Integrated WordNet for Bengali, Gujarati, Kashmiri, Konkani, Oriya, Punjabi and Urdu’ under the leadership of IIT Bombay and Goa University. The project is sponsored by MoCIT, Govt. of India. We also acknowledge the contribution of the Punjabi University, Patiala team to the development of the Punjabi WordNet.

Corresponding author

Correspondence to Parteek Kumar.

Copyright information

© 2017 Springer Science+Business Media Singapore

About this chapter

Cite this chapter

Sharma, R.K., Kumar, P. (2017). Development of Punjabi WordNet, Bilingual Dictionaries, Lexical Relations Creation, and Its Challenges. In: Dash, N., Bhattacharyya, P., Pawar, J. (eds) The WordNet in Indian Languages. Springer, Singapore. https://doi.org/10.1007/978-981-10-1909-8_5

  • DOI: https://doi.org/10.1007/978-981-10-1909-8_5

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-1907-4

  • Online ISBN: 978-981-10-1909-8

  • eBook Packages: Social Sciences, Social Sciences (R0)
