Advertisement

Development of a Micro Hindi Opinion WordNet and Aligning with Hown Ontology for Automatic Recognition of Opinion Words from Hindi Documents

  • D. Teja Santosh
  • Vikram Sunil BajajEmail author
  • Varun Sunil Bajaj
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 827)

Abstract

The Indian languages are deprived in terms of accessibility of natural language tools. Especially, the tools for carrying out the particular opinion mining task: opinion word orientation in native language is not available. Reasoning about such natural language words requires a semantically rich lexical resource. When the ontology is aligned with a lexical resource like WordNet, a rich knowledge base is created which can be useful for various information retrieval and natural language processing applications. In order to do this, a micro level Hindi Opinion WordNet is developed and is aligned with the Hindi Opinion WordNet Ontology (HOWN). The opinion lexicon (both Hindi positive and negative words) for 700 Hindi adjectives is also developed. The synset ID values of Hindi opinion synsets are mapped with the synset ID values of corresponding English opinion WordNet synsets. A front end query interface is designed to query the HOWN ontology for opinion word details. This query is transformed into SPARQL format. This task is for automatic recognition of opinionated terms from Hindi documents by the machine.

Keywords

Semantic web Ontology Hindi WordNet Opinion words SPARQL 

References

  1. 1.
    Krishnamurthi, K., Panuganti, V.R., Bulusu, V.V.: Understanding document semantics from summaries: a case study on Hindi texts. ACM Trans. Asian Low-Resour. Lang. Inf. Process. (TALLIP) 16(1), 7 (2016)Google Scholar
  2. 2.
    Sharma, R., Nigam, S., Jain, R.: Opinion mining in Hindi language: a survey. arXiv preprint arXiv:1404.4935 (2014)
  3. 3.
    Fellbaum, C. (ed.): WordNet: An Electronic Lexical Database. MIT Press, Cambridge (1998)zbMATHGoogle Scholar
  4. 4.
    Chakrabarty, D., Pande, P., Narayan, D., Bhattacharyya, P.: An experience in building the indowordnet - a wordnet for Hindi. In: International Conference on Global WordNet (GWC02), Mysore, India (2002)Google Scholar
  5. 5.
    Vossen, P.: A Multilingual Database with Lexical Semantic Networks. Kluwer Academic Publishers, Dordrecht (1998)CrossRefGoogle Scholar
  6. 6.
    Das, A., Bandyopadhyay, S.: Sentiwordnet for bangla. Knowl. Shar. Event-4: Task 2 (2010)Google Scholar
  7. 7.
    Joshi, A., Balamurali, A.R., Bhattacharyya, P.: A fall-back strategy for sentiment analysis in Hindi: a case study. In: Proceedings of the 8th ICON (2010)Google Scholar
  8. 8.
    Bakliwal, A., Arora, P., Varma, V.: Hindi subjective lexicon: a lexical resource for Hindi polarity classification. In: Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC) (2012)Google Scholar
  9. 9.
    Mittal, N., et al.: Sentiment analysis of Hindi review based on negation and discourse relation. In: Proceedings of International Joint Conference on Natural Language Processing (2013)Google Scholar
  10. 10.
    Bhatt, B., Bhattacharyya, P.: IndoWordnet and its linking with ontology. In: Proceedings of the 9th International Conference on Natural Language Processing (ICON 2011) (2011)Google Scholar
  11. 11.
    Gangemi, A., et al.: Sweetening wordnet with dolce. AI Mag. 24(3), 13 (2003)Google Scholar
  12. 12.
    Niles, I., Pease, A.: Towards a standard upper ontology. In: Proceedings of the International Conference on Formal Ontology in Information Systems-Volume 2001. ACM (2001)Google Scholar
  13. 13.
    Bhattacharyya, P.: IndoWordNet. In: Dash, N., Bhattacharyya, P., Pawar, J. (eds.) The WordNet in Indian Languages, pp. 1–18. Springer, Singapore (2017).  https://doi.org/10.1007/978-981-10-1909-8_1CrossRefGoogle Scholar
  14. 14.
    Álvez, J., Lucio, P., Rigau, G.: Improving the competency of first-order ontologies. In: Proceedings of the 8th International Conference on Knowledge Capture. ACM (2015)Google Scholar
  15. 15.
    Xu, B., Kang, D., Lu, J.: A framework of extracting sub-ontology. In: Chi, C.-H., Lam, K.-Y. (eds.) AWCC 2004. LNCS, vol. 3309, pp. 493–498. Springer, Heidelberg (2004).  https://doi.org/10.1007/978-3-540-30483-8_61CrossRefGoogle Scholar
  16. 16.
  17. 17.
  18. 18.
  19. 19.
    Baek, S., Cho, M., Kim, P.: Matching colors with KANSEI vocabulary using similarity measure based on WordNet. In: Gervasi, O., Gavrilova, M.L., Kumar, V., Laganà, A., Lee, H.P., Mun, Y., Taniar, D., Tan, C.J.K. (eds.) ICCSA 2005. LNCS, vol. 3480, pp. 37–45. Springer, Heidelberg (2005).  https://doi.org/10.1007/11424758_5CrossRefGoogle Scholar
  20. 20.
    Gruber, T.R.: Toward principles for the design of ontologies used for knowledge sharing? Int. J. Hum.-Comput. Stud. 43(5-6), 907–928 (1995)CrossRefGoogle Scholar
  21. 21.
    Alani, H., et al.: Using protege for automatic ontology instantiation (2004)Google Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2018

Authors and Affiliations

  • D. Teja Santosh
    • 1
  • Vikram Sunil Bajaj
    • 2
    Email author
  • Varun Sunil Bajaj
    • 3
  1. 1.GITAM (Deemed to be University)HyderabadIndia
  2. 2.New York UniversityBrooklynUSA
  3. 3.Rochester Institute of TechnologyRochesterUSA

Personalised recommendations