Abstract
This paper describes the an approach to indexing texts by their conceptual content using ontologies. Central to this approach is a two-phase extraction principle divided into a syntactic annotation phase and a semantic generation phase drawing on lexico-syntactic information and semantic role assignment provided by existing lexical resources. Meaningful chunks of text are transformed into conceptual feature structures and mapped into concepts in a generative ontology. By this approach, synonymous but linguistically quite distinct expressions are extracted and mapped to the same concept in the ontology, providing a semantic indexing which enables content-based search.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agichtein, E., Gravano, L.: Snowball - extracting relations from large plain-text collections. In: Proceedings of the 5th ACM International Conference on Digital Libraries, pp. 85–94(2000)
Andreasen, T., Fischer Nilsson, J.: Grammatical specification of domain ontologies. Data & Knowledge Engineering 48(2), 221–230 (2004)
Berners-Lee, T., Hendler, J., Lassila, O.: The semantic web. In: Scientific American Magazine (2001)
Costello, F.J., Veale, T., Dunne, S.: Using wordnet to automatically deduce relations between words in noun-noun compounds. In: Proceedings of COLING/ACL (2006)
Fischer Nilsson, J.: A logico-algebraic framework for ontologies ontolog. In: Anker Jensen, P., Skadhauge, P. (eds.) Proceedings of the First International. OntoQuery Workshop,Ontology-based Interpretation of Noun Phrases,Kolding. Department of Business Communication and Information Science. University of Southern Denmark, Denmark (2001)
Girju, R., Moldovan, D., Tatu, M., Antohe, D.: On the semantics of noun compounds. Computer Speech and Language 19, 479–496 (2005)
Girju, R., Beamer, B., Rozovskaya, A., Fister, A., Bhat, S.: A knowledge-rich approach to identifying semantic relations between nominals. Information Processing and Management 46, 589–610 (2009a)
Girju, R., Nakov, P., Nastase, V., Szpakowicz, S., Turney, P., Yuret, D.: Classification of semantic relations between nominals. Language Resources and Evaluation 43, 105–121 (2009b)
Hearst, M.A.: Automatic acquisition of hyponyms from large text corpora. In: Proceedings of the Fourteenth International Conference on Computational Linguistics, Nantes, France, pp. 539–545 (1992)
Kim, S.N., Baldwin, T.: Automatic Interpretation of Noun Compounds Using WordNet Similarity. In: Dale, R., Wong, K.-F., Su, J., Kwong, O.Y. (eds.) IJCNLP 2005. LNCS (LNAI), vol. 3651, pp. 945–956. Springer, Heidelberg (2005)
Kipper-Schuler, K.: VerbNet: A broad-coverage, comprehensive verb lexicon. Ph.D. thesis, Computer and Information Science Dept. University of Pennsylvania, Philadelphia (2006)
Lassen, T.: Uncovering Prepositional Senses, Computer Science Research Report vol. 131. Ph.D. thesis, Computer Science dept., CBIT, Roskilde University (2010)
Lieber, R., Stekauer, P. (eds.): The Oxford Handbook of Compounding. Oxford Handbooks. Oxford University Press, Oxford (2009)
Litkowski, K.C., Hargraves, O.: The preposition project. In: ACL-SIGSEM Workshop on The Linguistic Dimensions of Prepositions and Their Use in Computational Linguistic Formalisms and Applications, pp. 171–179 (2005)
Maedche, A., Staab, S.: Learning ontologies for the semantic web. In: Semantic Web Worskhop (2001)
Muslea, J.: Extraction patterns for information extraction tasks: A survey. In: AAAI 1999 Workshop on Machine Learning for Information Extraction (1999)
Velardi, P., Navigli, R.: Learning domain ontologies from document warehouses and dedicated web sites. Computational Linguistics 30(2), 151–179 (2004)
Fischer Nilsson, J., Szymczak, B., Jensen, P.A.: ONTOGRABBING: Extracting information from texts using generative ontologies. In: Andreasen, T., Yager, R.R., Bulskov, H., Christiansen, H., Larsen, H.L. (eds.) FQAS 2009. LNCS, vol. 5822, pp. 275–286. Springer, Heidelberg (2009)
Phillips, E., Riloff, W.: Exploiting role-identifying nouns and expressions for information extraction. In: 2007 Proceedings of RANLP 2007 (2007)
Scalise, S., Vogel, I. (eds.): Cross-Disciplinary Issues in Compounding. Current Issues in Linguistic Theory, vol. 311. John Benjamins, University of Bologna / University of Delaware (2010)
Serban, A., ten Teije, R., van Harmelen, F., Marcos, M., Polo-Conde, C.: Extraction and use of linguistic patterns for modelling medical guidelines. Elsevier Arificial Intelligence in Medicine 39(2), 137–149 (2007)
Sánchez, D., Moreno, A.: Pattern-based automatic taxonomy learning from the web. AI Communications Archive 21(1), 27–48 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Andreasen, T., Bulskov, H., Jensen, P.A., Lassen, T. (2011). Extracting Conceptual Feature Structures from Text. In: Kryszkiewicz, M., Rybinski, H., Skowron, A., Raś, Z.W. (eds) Foundations of Intelligent Systems. ISMIS 2011. Lecture Notes in Computer Science(), vol 6804. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21916-0_43
Download citation
DOI: https://doi.org/10.1007/978-3-642-21916-0_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21915-3
Online ISBN: 978-3-642-21916-0
eBook Packages: Computer ScienceComputer Science (R0)