Abstract
Major portion of web pages contains the natural language text and understanding of natural language text from the web pages is a major challenge for machines. Due to this lacking search engines are not able to provide relevant information to the users. This problem is tackled by natural language processing techniques and the development of ontologies from natural language text. With the help of such ontologies search of information can increases manifold. Much research in the field of text processing and automatic ontology building from text has been done to address these challenges. The proposed method in this paper is another effort to build automatic ontology from domain specific text. In this method we first extract concepts from a given domain specific text. We have used a Stanford parser to parse the text and the dictionary of basic concepts is created manually containing all the domain specific concepts and their relationships by recognizing laxico-syntactic patterns in the text corpus. Once concepts and relations among concepts as well as properties of concepts are identified, the extracted information can be represented in the form of graph and OWL.
Similar content being viewed by others
References
ADen L (2004) Natural language understanding. Pearson Publication, New York
Agirre E, Ansa O, Hovy E, Martinez D (2000) Enriching very large ontologies using the WWW. In: Proceedings of the ontology learning workshop, ECAl, Berlin, Germany
Allen JF (1993) Natural language, knowledge representation and logical form. In: Bates M, Weischedel R (eds) Challenges in natural language processing. Cambridge University Press, Cambridge
Bedini I, Nguyen B (2007) Automatic ontology generation: state of the art. In: PRiSM Laboratory Technical Report. University of Versailles
Cho M, Kim H, Kirn P (2006) A new method for ontology merging based on concept using wordnet. In: Advanced communication technology, 2006. ICACT 2006. The 8th international conference, vol 3, p 157311576
Chomsky N (2002) Theory of syntactic structures. ISBN-13: 978-1614278047
Dorr BJ, Jordan PW, Benoit HW (1998) A survey of current paradigms in machine translation. Technical Report: LL\MP-TR-027, UMIACS-TR-98-72, CS-TR-3961, University of Maryland, College Park, December 1998
Fromkin V et al (2000) Linguistics: an introduction to linguistic theory, Chapter 3.2 (Constituent Order, Case Marking and Thematic Roles). Blackwell Publishing, Hoboken
Gruber TR (1995) Toward principles for the design of ontologies used for knowledge sharing. Int J Hum Comput Stud 43:907–928
Jurafsky D, Martin J (2000) Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition upper saddle river. Prentice Hall, New Jersey
Kietz J, Maedche A, Volz R (2000) A method for semi-automatic ontology acquisition from a corporate intranet. In: Proceedings of EKA W-2000 workshop ontologies and text, Juan-Les-Pins, France
Kong H, Hwang M, Kim P (2006) Design of the automatic ontology building system about the specific domain knowledge. In: Advanced communication technology, 2006. ICACT 2006. The 8th international conference, vol 2
MiUer GA (1995) WORDNET: a lexical database for English. Commun ACM 1:39–41
Moldovan DI, Girju R (2000) Domain-specific knowledge acquisition and classification using wordnet. In: Proceedings of the thirteenth international florida artificial intelligence research society conference, pp 224/228. AAAI Press, p 81
Noy NF, McGuinness DL. Ontology Development 101: a Guide to creating your first Ontology. http://protege.stanford.edu/publications/ontology_development/ontology101.pdf
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
None.
Rights and permissions
About this article
Cite this article
Kumar, N., Kumar, M. & Singh, M. Automated ontology generation from a plain text using statistical and NLP techniques. Int J Syst Assur Eng Manag 7 (Suppl 1), 282–293 (2016). https://doi.org/10.1007/s13198-015-0403-1
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13198-015-0403-1