Skip to main content

Chronic Disease Related Entity Extraction in Online Chinese Question and Answer Services

  • Conference paper
  • First Online:
Smart Health (ICSH 2015)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9545))

Included in the following conference series:

Abstract

Chinese chronic disease entity extraction aims to extract health related entities from online questions and answers (QA). Our research tackles challenges in Chinese chronic disease entity extraction from three aspects: Chinese health lexicons construction, feature development, and equivalence conjunctions tagging. We construct large scale Chinese health lexicons based on expert knowledge and the Web resources; develop a feature extraction approach that draws out character, part-of-speech, and lexical features from QA data; and improve the performance of answer entity extraction by leveraging equivalence conjunctions (punctuation marks and conjunctional words) in Chinese to capture dependencies between tags of entities. Experiments on question and answer entity extraction demonstrate that the Precision, Recall and F-1 score are improved using our proposed features, and the Precision and F-1 score can be further improved by considering equivalence conjunctions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Big Data Search and Mining Lab, BIT.: Natural Language Processing and Information Retrieval Sharing Platform. http://www.nlpir.org/

  2. Cao, Y., Liu, F., Simpson, P., Antieau, L., Bennett, A., Cimino, J.J., Ely, J., Yu, H.: Askhermes: an online question answering system for complex clinical questions. J. Biomed. Inform. 44(2), 277–288 (2011)

    Article  Google Scholar 

  3. Keretna, S., Lim, C.P., Creighton, D.C., Shaban, K.B.: Enhancing medical named entity recognition with an extended segment representation technique. Comput. Methods Programs Biomed. 119(2), 88–100 (2015)

    Article  Google Scholar 

  4. Kudo, T.: CRF++: Yet Another CRF toolkit. http://taku910.github.io/crfpp/

  5. Lafferty, J., McCallum, A., Pereira, F.C.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data (2001)

    Google Scholar 

  6. Lee, M., Cimino, J., Zhu, H.R., Sable, C., Shanker, V., Ely, J., Yu, H.: Beyond information retrieval—medical question answering. In: AMIA Annual Symposium Proceedings, vol. 2006, p. 469. American Medical Informatics Association (2006)

    Google Scholar 

  7. McCallum, A., Li, W.: Early results for named entity recognition with conditional random fields, feature induction and web-enhanced lexicons. In: Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003, vol. 4, pp. 188–191. Association for Computational Linguistics (2003)

    Google Scholar 

  8. Nadeau, D., Sekine, S.: A survey of named entity recognition and classification. Lingvisticae Investigationes 30(1), 3–26 (2007)

    Article  Google Scholar 

  9. National health and family planning commission of the people’s republic of China (2015). http://www.nhfpc.gov.cn/

  10. Nlm.nih.gov: Unified Medical Language System (UMLS). http://www.nlm.nih.gov/research/umls/

  11. Pasca, M., Lin, D., Bigham, J., Lifchits, A., Jain, A.: Organizing and searching the world wide web of facts-step one: the one-million fact extraction challenge. In: AAAI, vol. 6, pp. 1400–1405 (2006)

    Google Scholar 

  12. Pasupat, P., Liang, P.: Zero-shot entity extraction from web pages. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, ACL 2014. Long Papers, Baltimore, MD, USA, 22–27 June 2014, vol. 1, pp. 391–401 (2014)

    Google Scholar 

  13. Peng, X.Y., Chen, Y., Huang, Z.W.: A Chinese question answering system using web service on restricted domain. In: 2010 International Conference on Artificial Intelligence and Computational Intelligence (AICI), vol. 1, pp. 350–353. IEEE (2010)

    Google Scholar 

  14. Shaalan, K.: A survey of arabic named entity recognition and classification. Comput. Linguis. 40(2), 469–510 (2014). http://dx.doi.org/10.1162/COLI_a_00178

    Google Scholar 

  15. Zhang, H., Xu, S., Li, W., Zhu, L.: XML-based document retrieval in Chinese diseases question answering system. In: (Jong Hyuk) Park, J.J., Adeli, H., Park, N., Woungang, I. (eds.) Mobile, Ubiquitous, and Intelligent Computing. LNEE, vol. 274, pp. 211–217. Springer, Heidelberg (2014)

    Chapter  Google Scholar 

  16. Zhao, H., Kit, C.: Unsupervised segmentation helps supervised learning of character tagging for word segmentation and named entity recognition. In: IJCNLP, pp. 106–111. Citeseer (2008)

    Google Scholar 

Download references

Acknowledgments

This work was supported by the National High-tech R&D Program of China (Grant No. SS2015AA020102), National Basic Research Program of China (Grant No. 2011CB302302), the 1000-Talent program, and the Tsinghua University Initiative Scientific Research Program. We thank the research assistance provided by Qingbo Cao at Tsinghua University.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yan Zhang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Zhang, Y., Zhang, Y., Yin, Y., Xu, J., Xing, C., Chen, H. (2016). Chronic Disease Related Entity Extraction in Online Chinese Question and Answer Services. In: Zheng, X., Zeng, D., Chen, H., Leischow, S. (eds) Smart Health. ICSH 2015. Lecture Notes in Computer Science(), vol 9545. Springer, Cham. https://doi.org/10.1007/978-3-319-29175-8_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-29175-8_6

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-29174-1

  • Online ISBN: 978-3-319-29175-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics