Extracting Conceptual Feature Structures from Text

Andreasen, Troels; Bulskov, Henrik; Jensen, Per Anker; Lassen, Tine

doi:10.1007/978-3-642-21916-0_43

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 6804)

Included in the following conference series:

International Symposium on Methodologies for Intelligent Systems

Abstract

This paper describes an approach to indexing texts by their conceptual content using ontologies. Central to this approach is a two-phase extraction principle divided into a syntactic annotation phase and a semantic generation phase, drawing on lexico-syntactic information and semantic role assignment provided by existing lexical resources. Meaningful chunks of text are transformed into conceptual feature structures and mapped into concepts in a generative ontology. By this approach, synonymous but linguistically quite distinct expressions are extracted and mapped to the same concept in the ontology, providing a semantic indexing which enables content-based search.
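
The two-phase principle can be illustrated with a minimal sketch. It is not the authors' implementation: the toy lexicon, the two chunk patterns, the role label PATIENT and the bracket notation are all hypothetical stand-ins for the lexical resources and ontology the abstract refers to. The sketch only shows how two linguistically distinct expressions might end up as the same conceptual feature structure.

```python
from dataclasses import dataclass

# Hypothetical toy lexicon: word form -> (part of speech, concept name).
# A real system would draw on existing lexical resources instead.
LEXICON = {
    "insulin": ("N", "insulin"),
    "production": ("N", "production"),
    "of": ("P", None),
    "the": ("DET", None),
}


@dataclass(frozen=True)
class CFS:
    """A conceptual feature structure: a head concept plus (role, concept) pairs."""
    head: str
    features: frozenset = frozenset()

    def __str__(self):
        if not self.features:
            return self.head
        inner = ", ".join(f"{role}: {concept}" for role, concept in sorted(self.features))
        return f"{self.head}[{inner}]"


def annotate(text):
    """Phase 1 (syntactic annotation): tag each token using the toy lexicon."""
    tokens = text.lower().split()
    return [(tok, *LEXICON.get(tok, ("X", None))) for tok in tokens]


def generate(annotated):
    """Phase 2 (semantic generation): turn an annotated chunk into a CFS.

    Only two assumed patterns are covered: noun-noun compounds and
    'N of N' phrases. The role PATIENT is an illustrative assumption.
    """
    content = [(pos, concept) for _, pos, concept in annotated if pos != "DET"]
    tags = [pos for pos, _ in content]
    if tags == ["N", "N"]:
        modifier, head = content[0][1], content[1][1]
    elif tags == ["N", "P", "N"]:
        head, modifier = content[0][1], content[2][1]
    else:
        raise ValueError(f"no pattern for tag sequence {tags}")
    return CFS(head, frozenset({("PATIENT", modifier)}))


compound = generate(annotate("insulin production"))
phrase = generate(annotate("the production of insulin"))
print(compound, phrase, compound == phrase)
```

Because both expressions yield the same structure, an index keyed on such structures would retrieve either text for a query phrased in the other form, which is the sense in which the abstract speaks of content-based search.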

Author information

Authors and Affiliations

CBIT, Roskilde University, Universitetsvej 1, DK-4000, Roskilde, Denmark
Troels Andreasen, Henrik Bulskov, Per Anker Jensen & Tine Lassen
Copenhagen Business School, ISV, Dalgas Have 15, DK - 2000, Frederiksberg, Denmark
Troels Andreasen, Henrik Bulskov, Per Anker Jensen & Tine Lassen

Editor information

Editors and Affiliations

Faculty of Electronics and Information Technology, Institute of Computer Science, Warsaw University of Technology, Nowowiejska 15/19, 00-665, Warsaw, Poland
Marzena Kryszkiewicz
Faculty of Electronics and Information Technology, Institute of Computer Science, Warsaw University of Technology, Nowowiejska 15/19, 00-665, Warsaw, Poland
Henryk Rybinski
University of Warsaw, 02-097, Warsaw, Poland
Andrzej Skowron
Faculty of Electronics and Information Technology, Institute of Computer Science, Warsaw University of Technology, Nowowiejska 15/19, 00-665, Warsaw, Poland
Zbigniew W. Raś

About this paper

Cite this paper

Andreasen, T., Bulskov, H., Jensen, P.A., Lassen, T. (2011). Extracting Conceptual Feature Structures from Text. In: Kryszkiewicz, M., Rybinski, H., Skowron, A., Raś, Z.W. (eds) Foundations of Intelligent Systems. ISMIS 2011. Lecture Notes in Computer Science, vol 6804. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21916-0_43

DOI: https://doi.org/10.1007/978-3-642-21916-0_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21915-3
Online ISBN: 978-3-642-21916-0
eBook Packages: Computer Science, Computer Science (R0)
