Medical Entity Recognition and Negation Extraction: Assessment of NegEx on Health Records in Spanish

Santiso, Sara; Casillas, Arantza; Pérez, Alicia; Oronoz, Maite

doi:10.1007/978-3-319-56148-6_15

Part of the book series: Lecture Notes in Computer Science (LNBI, volume 10208)

Included in the following conference series:

International Conference on Bioinformatics and Biomedical Engineering

Abstract

This work focuses on biomedical text mining. Its core aim is to advance negation detection for biomedical entities in Electronic Health Records (EHRs), where detecting non-negated entities is as important as identifying negated ones. For instance, treating a negated entity as factual can produce diagnostic errors in decision support systems.

Negated entity recognition comprises two tasks: (1) entity recognition and (2) classification of each entity as negated or not. Both rule-based and machine-learning techniques have been used in the literature to identify negations. This paper presents an adaptation of the rule-based system NegEx, which uses exact matching for both tasks.
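As an illustration of the two tasks, the following is a minimal sketch of a NegEx-style pipeline. It is not the authors' adaptation: the lexicons (`ENTITY_TERMS`, `NEGATION_TRIGGERS`, `TERMINATION_TERMS`), the `WINDOW` size, and all function names are hypothetical, and only triggers that precede the entity are considered.

```python
import re

# Hypothetical lexicons for illustration only; not the resources used in the paper.
ENTITY_TERMS = ["fiebre", "dolor abdominal", "diabetes"]
NEGATION_TRIGGERS = ["no", "sin", "niega"]
TERMINATION_TERMS = ["pero", "aunque"]
WINDOW = 5  # assumed scope: number of tokens inspected to the left of an entity


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def find_entities(tokens, terms=ENTITY_TERMS):
    """Task 1: exact-match dictionary terms, returning (start, end, term) spans."""
    spans = []
    for term in terms:
        term_tokens = term.split()
        n = len(term_tokens)
        for i in range(len(tokens) - n + 1):
            if tokens[i:i + n] == term_tokens:
                spans.append((i, i + n, term))
    return sorted(spans)


def is_negated(tokens, start):
    """Task 2: scan leftwards for a trigger, stopping at a termination term."""
    for j in range(start - 1, max(start - 1 - WINDOW, -1), -1):
        if tokens[j] in TERMINATION_TERMS:
            return False
        if tokens[j] in NEGATION_TRIGGERS:
            return True
    return False


def annotate(text):
    """Return (entity, negated) pairs for every exact-matched entity in the text."""
    tokens = tokenize(text)
    return [(term, is_negated(tokens, start))
            for start, _, term in find_entities(tokens)]


print(annotate("Paciente sin fiebre pero con dolor abdominal"))
```

In this sketch, "fiebre" is marked as negated because "sin" precedes it, while "dolor abdominal" is not, because the termination term "pero" closes the scope of the trigger. An entity missed in task 1 is never passed to task 2, which is one way the quality of entity recognition bounds that of negation detection.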

Our contribution consists in assessing these two tasks and exploring alternatives for each of them, in such a way that negation detection improves as entity recognition detects more entities correctly.

The evaluation was carried out in a real domain of 75 EHRs written in Spanish, obtaining an F-measure of 76.2 for entity recognition and 73.8 for negation detection.
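For readers unfamiliar with the metric, the sketch below shows how an F-measure is commonly computed from true positives, false positives, and false negatives. It assumes the balanced F1 variant (the abstract does not state which variant was used); the function name and the counts in the usage line are hypothetical and unrelated to the paper's data.

```python
def f_measure(tp, fp, fn):
    """Return (precision, recall, F1) from raw counts; F1 is their harmonic mean."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical counts, chosen only to illustrate the arithmetic.
precision, recall, f1 = f_measure(tp=8, fp=2, fn=2)
print(f"P={precision:.3f} R={recall:.3f} F1={f1:.3f}")
```

Scores such as those quoted above are typically this F1 value multiplied by 100.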

Acknowledgments

The authors would like to thank the personnel of Pharmacy and Pharmacovigilance services of the Galdakao-Usansolo Hospital. This work was partially funded by the Spanish Ministry of Science and Innovation (EXTRECM: TIN2013-46616-C2-1-R, TADEEP: TIN2015-70214-P) and the Basque Government (DETEAMI: Ministry of Health 2014111003, Predoctoral Grant: PRE 2015 1 0211).

Author information

Authors and Affiliations

IXA Group, University of the Basque Country (UPV-EHU), T649, 20080, Donostia, Spain
Sara Santiso, Arantza Casillas, Alicia Pérez & Maite Oronoz

Corresponding author

Correspondence to Sara Santiso.

Editor information

Editors and Affiliations

Universidad de Granada, Granada, Spain
Ignacio Rojas
Universidad de Granada, Granada, Spain
Francisco Ortuño

About this paper

Cite this paper

Santiso, S., Casillas, A., Pérez, A., Oronoz, M. (2017). Medical Entity Recognition and Negation Extraction: Assessment of NegEx on Health Records in Spanish. In: Rojas, I., Ortuño, F. (eds) Bioinformatics and Biomedical Engineering. IWBBIO 2017. Lecture Notes in Computer Science, vol 10208. Springer, Cham. https://doi.org/10.1007/978-3-319-56148-6_15

DOI: https://doi.org/10.1007/978-3-319-56148-6_15
Published: 01 April 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-56147-9
Online ISBN: 978-3-319-56148-6
eBook Packages: Computer Science, Computer Science (R0)
