Section Heading Recognition in Electronic Health Records Using Conditional Random Fields

Chen, Chih-Wei; Chang, Nai-Wen; Chang, Yung-Chun; Dai, Hong-Jie

doi:10.1007/978-3-319-13987-6_5

Chih-Wei Chen²¹,
Nai-Wen Chang^22,23,
Yung-Chun Chang²⁴ &
…
Hong-Jie Dai²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8916))

Included in the following conference series:

International Conference on Technologies and Applications of Artificial Intelligence

1616 Accesses
3 Citations

Abstract

Electronic health records (EHRs) contain a wealth of information, such as discharge diagnoses, laboratory results, and pharmacy orders, which can be used to support clinical decision support systems and enable clinical and translational research. Unfortunately, the information is represented in a highly heterogeneous semi-structured or unstructured format with author- and domain-specific idiosyncrasies, acronyms and abbreviations. To take full advantage of health data, text-mining techniques have been applied by researchers to recognize named entities (NEs) mentioned in EHRs. However, the judgment of clinical data cannot be known solely from the NE level. For instance, a disease mention in the section of past medical history has different clinical significance when mentioned in the family medical history section. To obtain high-quality information and improve the understanding of clinical records, this work developed a machine learning-based section heading recognition system and evaluated its performance on a manually annotated corpus. The experiment results showed that the machine learning-based system achieved a satisfactory F-score of 0.939, which outperformed a dictionary-based system by 0.321.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Aronson, A.: Effective Mapping of Biomedical Text to the UMLS Metathesaurus: The MetaMap Program. Journal of Biomedical Informatic 35, 17–21 (2001)
Google Scholar
Denny, J.C., Miller, R.A., Johnson, K.B., Spickard III, A.: Development and evaluation of a clinical note section header terminology. In: AMIA Annu. Symp. Proc., pp. 156–160 (2008)
Google Scholar
Friedman, C., Shagina, L., Lussier, Y., Hripcsak, G.: Automated encoding of clinical documents based on natural language processing. J. Am. Med. Inform. Assoc. 11(5), 392–402 (2004)
Article Google Scholar
Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: Proceedings of the 18th International Conference on Machine Learning (ICML), pp. 282–289 (2001)
Google Scholar
Savova, G.K., Masanz, J.J., Ogren, P.V., Zheng, J., Sohn, S., Kipper-Schuler, K.C., Chute, C.G.: Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications. Journal of the American Medical Informatics Association 17(5), 507–513 (2010)
Article Google Scholar
Smith, L., Rindflesch, T., Wilbur, W.J.: MedPost: a part-of-speech tagger for bioMedical text. Bioinformatics 20(14), 2320–2321 (2004)
Article Google Scholar
Stubbs, A., Kotfila, C., Xu, H., Uzuner, O.: Practical applications for NLP in Clinical Research: the 2014 i2b2/UTHealth shared tasks. In: Proceedings of the i2b2 2014 Shared Task and Workshop Challenges in Natural Language Processing for Clinical Data (2014)
Google Scholar
Tsai, R.T.-H., Sung, C.-L., Dai, H.-J., Hung, H.-C., Sung, T.-Y., Hsu, W.-L.: NERBio: using selected word conjunctions, term normalization, and global patterns to improve biomedical named entity recognition. BMC Bioinformatics7(suppl. 5), S11 (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taiwan
Chih-Wei Chen & Hong-Jie Dai
Institution of Information Science, Academia Sinica, Taiwan
Nai-Wen Chang
Graduate Institute of Biomedical Electronics and Bioinformatics, National Taiwan University, Taiwan
Nai-Wen Chang
Institute of Information Science, Academia Simica, Taiwan
Yung-Chun Chang

Authors

Chih-Wei Chen
View author publications
You can also search for this author in PubMed Google Scholar
Nai-Wen Chang
View author publications
You can also search for this author in PubMed Google Scholar
Yung-Chun Chang
View author publications
You can also search for this author in PubMed Google Scholar
Hong-Jie Dai
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science and Information Engineering, National Taiwan University of Science and Technology, No. 43, Sec. 4, Keelung Rd., Da’an Dist., 106, Taipei City, Taiwan
Shin-Ming Cheng
Department of Information Management, Tamkang University, No. 151, Yingzhuan Rd., Danshui Dist., 25137, New Taipei City, Taiwan
Min-Yuh Day

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chen, CW., Chang, NW., Chang, YC., Dai, HJ. (2014). Section Heading Recognition in Electronic Health Records Using Conditional Random Fields. In: Cheng, SM., Day, MY. (eds) Technologies and Applications of Artificial Intelligence. TAAI 2014. Lecture Notes in Computer Science(), vol 8916. Springer, Cham. https://doi.org/10.1007/978-3-319-13987-6_5

Download citation

DOI: https://doi.org/10.1007/978-3-319-13987-6_5
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13986-9
Online ISBN: 978-3-319-13987-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics