Skip to main content

Biomedical Event Trigger Detection Based on Hybrid Methods Integrating Word Embeddings

  • Conference paper
  • First Online:
Knowledge Graph and Semantic Computing: Semantic, Knowledge, and Linked Big Data (CCKS 2016)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 650))

Included in the following conference series:

Abstract

Trigger detection as the preceding task is of great importance in biomedical event extraction. By now, most of the state-of-the-art systems have been based on single classifiers, and the words encoded by one-hot are unable to represent the semantic information. In this paper, we utilize hybrid methods integrating word embeddings to get higher performance. In hybrid methods, first, multiple single classifiers are constructed based on rich manual features including dependency and syntactic parsed results. Then multiple predicting results are integrated by set operation, voting and stacking method. Hybrid methods can take advantage of the difference among classifiers and make up for their deficiencies and thus improve performance. Word embeddings are learnt from large scale unlabeled texts and integrated as unsupervised features into other rich features based on dependency parse graphs, and thus a lot of semantic information can be represented. Experimental results show our method outperforms the state-of-the-art systems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Björne, J., Heimonen, J., Ginter, F., Airola, A., Pahikkala, T., Salakoski, T.: Extracting complex biological events with rich graph-based feature sets. In: Proceedings of Workshop on Current Trends in Biomedical Natural Language Processing: Shared Task, pp. 10–18. ACL, Boulder, Colorado (2009)

    Google Scholar 

  2. Martinez, D., Baldwin, T.: Word sense disambiguation for event trigger word detection in biomedicine. BMC Bioinform. 12(Suppl. 2), S4 (2011)

    Article  Google Scholar 

  3. Zhang, Y., Lin, H., Yang, Z., Wang, J., Li, Y.: Biomolecular event trigger detection using neighborhood hash features. J. Theoret. Biol. 318, 22–28 (2013)

    Article  Google Scholar 

  4. Majumder, A.: Multiple features based approach to extract bio-molecular event triggers using conditional random field. Int. J. Intell. Syst. Appl. 4(12), 41–47 (2012)

    Google Scholar 

  5. Wang, J., Wu, Y., Lin, H., Yang, Z.: Biological event trigger extraction based on deep parsing. Comput. Eng. 39, 25–30 (2013)

    Google Scholar 

  6. Domingos, P.: A few useful things to know about machine learning. Commun. ACM 55(10), 78–87 (2012)

    Article  Google Scholar 

  7. Li, L., Fan, W., Huang, D., Dang, Y., Sun, J.: Boosting performance of gene mention tagging system by hybrid methods. J. Biomed. Inf. 45(1), 156–164 (2012)

    Article  Google Scholar 

  8. Crammer, K., Dekel, O., Keshet, J., Shalev-Shwartz, S., Singer, Y.: Online passive-aggressive algorithms. J. Mach. Learn. Res. 7, 551–585 (2006)

    MathSciNet  MATH  Google Scholar 

  9. Breiman, L.: Random forests. Mach. Learn. 45, 5–32 (2001)

    Article  MATH  Google Scholar 

  10. Tang, B., Cao, H., Wang, X., Chen, Q., Xu, H.: Evaluating word representation features in biomedical named entity recognition tasks. BioMed. Res. Int. 2014, Article ID 240403, 1–6 (2014). Hindawi Publishing Corporation

    Google Scholar 

  11. Turian, J., Ratinov, L., Bengio, Y.: Word representations: a simple and general method for semi-supervised learning. In: Proceedings of 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, pp. 384–394 (2010)

    Google Scholar 

  12. Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., Kuksa, P., Collins, M.: Natural language processing (almost) from scratch. J. Mach. Learn. Res. 12, 2493–2537 (2011)

    MATH  Google Scholar 

  13. Mnih, A., Hinton, G.: A scalable hierarchical distributed language model. In: NIPS, pp. 1081–1088 (2008)

    Google Scholar 

  14. Mikolov, T., Sutskever, I., Chen, K., Corrado, G., Dean, J.: Distributed representations of words and phrases and their compositionality. Adv. Neural Inf. Process. Syst. 26, 3111–3119 (2013)

    Google Scholar 

  15. Mikolov, T., Yih, W.T., Zweig, G.: Linguistic regularities in continuous space word representations. In: Proceedings of NAACL-HLT, Atlanta, Georgia, pp. 746–751 (2013)

    Google Scholar 

  16. McClosky, D., Charniak, E.: Self-training for biomedical parsing. In: Proceedings of 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies, Columbus, Ohio, pp. 101–104 (2008)

    Google Scholar 

  17. Miyao, Y., Sagae, K., Saetre, R., Matsuzaki, T., Tsujii, J.: Evaluating contributions of natural language parsers to protein–protein interaction extraction. Bioinformatics 25(3), 394–400 (2009)

    Article  Google Scholar 

  18. Miwa, M., Saetre, R., Kim, J.D., Tsujii, J.: Event extraction with complex event classification using rich features. J. Bioinform. Comput. Biol. 8(1), 131–146 (2010). doi:10.1142/S0219720010004586

    Article  Google Scholar 

  19. Vapnik, V.N.: The Nature of Statistical Learning Theory. Springer, Berlin (1995)

    Book  MATH  Google Scholar 

  20. Kim, J.D., Ohta, T., Pyysalo, S., Kano, Y., Tsujii, J.: Overview of BioNLP’09 shared task on event extraction. In: Proceedings of Workshop on BioNLP: Shared Task, Boulder, Colorado, pp. 1–9 (2009)

    Google Scholar 

  21. Kim, J.D., Pyysalo, S., Ohta, T., Bossy, R., Nguyen, N., Tsujii, J.: Overview of BioNLP shared task 2011. In: Proceedings of BioNLP Shared Task 2011 Workshop, pp. 1–6. Association for Computational Linguistics, Portland (2011)

    Google Scholar 

Download references

Acknowledgments

The authors gratefully acknowledge the financial support provided by the National Natural Science Foundation of China under No. 61672126, 61173101.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lishuang Li .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Li, L., Qin, M., Huang, D. (2016). Biomedical Event Trigger Detection Based on Hybrid Methods Integrating Word Embeddings. In: Chen, H., Ji, H., Sun, L., Wang, H., Qian, T., Ruan, T. (eds) Knowledge Graph and Semantic Computing: Semantic, Knowledge, and Linked Big Data. CCKS 2016. Communications in Computer and Information Science, vol 650. Springer, Singapore. https://doi.org/10.1007/978-981-10-3168-7_7

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-3168-7_7

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-3167-0

  • Online ISBN: 978-981-10-3168-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics