Populating an Allergens Ontology Using Natural Language Processing and Machine Learning Techniques

  • Conference paper
Artificial Intelligence in Medicine (AIME 2005)

Abstract

Ontologies are becoming increasingly important in the biomedical domain because they enable knowledge to be reused and shared in a formal, homogeneous and unambiguous way. In the rapidly growing field of biomedicine, knowledge evolves continually, so an ontology maintenance process is required to keep the ontological knowledge up to date. This paper presents our approach to populating a formally defined ontology for the allergen domain, exploiting PubMed abstracts on allergens and using natural language processing and machine learning techniques. The approach has two stages: first, locating instances of ontology concepts in the PubMed corpus; second, finding the properties of those instances and the relations between them.
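The two-stage structure described in the abstract can be illustrated with a minimal Python sketch. It is not the method of the paper, which relies on natural language processing and machine learning components that the abstract does not detail. Here an exact lexicon lookup stands in for stage one and a single cue-word pattern stands in for stage two. The concept names `Allergen` and `Source`, the relation name `hasSource`, the seed lexicon, and the sample sentence are all hypothetical and chosen only for illustration.

```python
import re
from dataclasses import dataclass

# Hypothetical seed lexicon: ontology concept -> known surface forms.
SEED_LEXICON = {
    "Allergen": ["Bet v 1", "Ara h 2"],
    "Source": ["birch pollen", "peanut"],
}


@dataclass
class Instance:
    concept: str  # ontology concept the mention belongs to
    text: str     # mention as it appears in the abstract
    start: int    # character offset where the mention starts
    end: int      # character offset just past the mention


def locate_instances(abstract, lexicon=SEED_LEXICON):
    """Stage 1 (toy version): find concept instances by exact lexicon lookup."""
    found = []
    for concept, forms in lexicon.items():
        for form in forms:
            for m in re.finditer(re.escape(form), abstract, flags=re.IGNORECASE):
                found.append(Instance(concept, m.group(0), m.start(), m.end()))
    return sorted(found, key=lambda inst: inst.start)


def find_relations(abstract, instances, cue="from", relation="hasSource"):
    """Stage 2 (toy version): relate an Allergen to a later Source when the
    cue word occurs between them and no full stop separates them."""
    cue_pattern = re.compile(rf"\b{re.escape(cue)}\b")
    relations = []
    for allergen in instances:
        if allergen.concept != "Allergen":
            continue
        for source in instances:
            if source.concept != "Source" or source.start < allergen.end:
                continue
            between = abstract[allergen.end:source.start]
            # A full stop is used as a naive sentence boundary.
            if "." not in between and cue_pattern.search(between):
                relations.append((allergen.text, relation, source.text))
    return relations


sample = ("Bet v 1 is the major allergen from birch pollen. "
          "Ara h 2 was isolated from peanut.")
instances = locate_instances(sample)
print(find_relations(sample, instances))
```

Keeping the stages as separate functions mirrors the abstract: the second stage consumes only the instances found by the first, so either stand-in could be replaced by a learned model without changing the other.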

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Valarakos, A.G., Karkaletsis, V., Alexopoulou, D., Papadimitriou, E., Spyropoulos, C.D. (2005). Populating an Allergens Ontology Using Natural Language Processing and Machine Learning Techniques. In: Miksch, S., Hunter, J., Keravnou, E.T. (eds) Artificial Intelligence in Medicine. AIME 2005. Lecture Notes in Computer Science, vol 3581. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11527770_38

  • DOI: https://doi.org/10.1007/11527770_38

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-27831-3

  • Online ISBN: 978-3-540-31884-2

  • eBook Packages: Computer Science, Computer Science (R0)
