Background

Radiology reports contain a great variety of information about normal and abnormal structures in free text format. Particularly for cancer patients, radiology reports describe measurements of cancer lesions, and interval changes in lesion size are crucial indicators of response or resistance to cancer therapies. Measurements of lesion size (as well as organ size) are the predominant type of quantitative data recorded within radiology reports. However, unlike other numerical, phenotypic evidence, such as lab values in ED notes, measurements are recorded as free text, which hampers the extraction and utilization of such data by computer applications. Consequently, radiologists and clinicians need to ferret out lesion measurements from the radiology report to assess changes in the tumor burden. Radiology reports typically capture lesion measurements, their anatomical locations, and their spatial location, i.e., the image and series number from which the measurements were taken. However, there are still no widely adopted structured reporting standards for measurements in terms of their dimensions or descriptor terminology. In addition to measurement reporting, the use of different templates in general radiology reporting hinders automatic information extraction.

Recent advances in natural language processing (NLP) techniques could provide a fully automated solution for processing free-text radiology reports to extract task-specific information, including measurements [1]. Yet, NLP techniques have been applied to radiological content either in the form of general-purpose systems or as targeted systems addressing one particular task [2,3,4,5,6,7,8,9,10,11,12,13,14,15,16]. Lesion measurement extraction and classification have been investigated in only a few studies [8, 13, 14, 17]. Sevenster et al. reported an average F-measure of 0.942 [12, 13] for classifying measurement descriptors; however, they did not link findings or anatomical locations to radiological measures. Similarly, Yim et al. extracted measurements from free-text radiology reports as tumor characteristics but focused only on hepatocellular carcinoma patients [17].

Despite the increased use of NLP approaches, radiology reports still pose unique challenges for NLP, especially for granular tasks such as determining anatomic relationships and temporal changes [18]. In addition to accurately extracting measurements, an NLP approach is needed that can extract the anatomic and spatial location of lesions and the temporality of lesion measurements, which to our knowledge has not yet been undertaken. Moreover, no studies exist in the literature that extract measurements as target concepts and link them with their descriptors of anatomical entities and imaging "slice (image) number" and "series number." Recently, there have been promising studies applying advanced NLP methodologies to summarize radiology findings and generate the "Impression" section, but so far measurements have not been extracted [19].

Therefore, the purpose of this work is to develop and evaluate an NLP engine that can extract measurements from narrative radiology reports together with their core descriptors: "temporality," "anatomical location and segment," "imaging observation," and scan-specific information ("image number" and "series number").

Methods

Dataset

Under an Institutional Review Board (IRB)–approved protocol, we used a dataset of 980 CT and 237 MR reports from our institution for training and testing purposes. In terms of anatomic locations, our dataset consists of 782 chest/abdomen/pelvis, 52 head/neck, 157 lung/thorax, 44 pancreas, and 182 other types of reports. We randomly selected 100 radiology reports (26 MRI and 74 CT) from our dataset to create our evaluation (test) set and, after setting aside a 17-report development set (described below), used the remaining 1100 reports as the training set for our conditional random field (CRF) model. In addition, we used a set of 25 mammography reports from our institution to evaluate the generalizability of our pipeline to other types of radiology reports.

Proposed Pipeline

In order to extract measurements from radiology reports with their descriptors, we created a rule-based NLP pipeline (Fig. 1) that includes automated named entity tagging using a CRF model. We also analyzed the label transition scores identified by the CRF model to explore relationships between descriptors. To quantify the benefit of our approach over a commonly used alternative, we created a baseline dictionary-based method that uses only the terms in a dictionary as its knowledge source.

Fig. 1 The proposed pipeline

  1. Pre-processing

    The reports were subjected to a boundary detection algorithm that recognizes sections and sentences in narrative radiology reports. The text was split into sections using regular expressions matched against a list of known section headers (commonly used in radiology) and segmented with respect to five sections of the report: "Comparison," "Technique," "Clinical History," "Findings," and "Conclusions." Since lesion measurements are recorded in the Findings section, it was used in our pipeline as the input for extracting descriptors of interest. All sections were decoded to 'utf-8' and split into sentences using the Natural Language Toolkit (NLTK) [20] library in Python. In addition, all text was lowercased and punctuation was removed after measurements were tagged. A minimal sketch of this step is given after the pipeline steps below.

  2. Measurement tagging

    In this phase, we aimed to tag measurements and their temporality, which indicates whether a measurement is seen on the current scan or listed as a reference to a prior measurement. Measurements were tagged using several regular expression patterns and pre-defined rules. For measurement and temporality tagging, the regex patterns defined by Sevenster et al. [14] were used with some modifications (Appendix 1). After measurement tagging, we only included the sentences containing measurements as the input for the following steps of the pipeline. Likewise, the complete textual description of all dimensions of a measurement in a sentence was targeted. In order to detect different measurements in the same sentence and tag temporality correctly as current or prior, we divided a sentence into sub-parts using the approach given as pseudocode in Appendix 2; the sub-parts were created based on the number of measurements and their temporality. For example, the measurement sentence in Fig. 2 was divided into two parts: (1) a current measurement part and (2) a historical measurement part (Fig. 2). Similarly, the other descriptors (the image number and series number on which the measurement is made, and the segment of the organ on which the measurement is made) were extracted using several regular expression patterns and pre-defined rules; the regular expressions for image, series, and segment tagging are given in Appendix 3. An illustrative sketch of measurement and temporality tagging is given after the pipeline steps below.

    Fig. 2 Example sentence to sub-sentence division to capture current and prior measures of the nodule

  3. Named entity (lesions and modifiers) tagging

    We used a CRF method to adopt a more generalizable approach for named entity tagging, since dictionary lookup might not be able to capture all the lexical and linguistic variants of a medical term, and radiology reports contain different writing styles depending on the preferences of radiologists and institutions. CRF is a probabilistic graphical model that discovers patterns given the context of a neighborhood, thus capturing many correlated features of the inputs. CRF also helps to investigate the sequential relationships among the descriptors. The CRF model was trained to achieve automatic named entity tagging for anatomical entities, imaging observations, and RadLex descriptors (as RadLex sub-classes associated with the measurement). We also analyzed the label transition scores identified by the CRF model in order to explore and visualize relationships between descriptors. Label transition scores are the conditional probabilities of possible next states given the current state and the observation sequence [21]. As input features to the CRF, we used part-of-speech tags and dictionary maps. We did not use any higher-level syntactic features such as NP chunks, and we used the Python sklearn-crfsuite library (http://www.chokkan.org/software/crfsuite/) with its default model parameters. A training sketch is given after the pipeline steps below.

  4. Rule-based measurement descriptor extraction

    We mainly focused on 7 descriptors that characterize a measurement in radiology: (1) temporality, (2) anatomical entity, (3) imaging observation, (4) RadLex descriptor, (5) image number and (6) series number on which the measurement is reported, and (7) segment number of an organ. Output was recorded as frames, in which the measurement is the target entity and all other entities in the report are assumed to be related to the target entity as its descriptors. Thus, a secondary entity's label encodes the type of the entity plus the type of its relation with the target entity. Each measurement was represented as a single frame object containing the numeric measure of the lesion size and its descriptors as output from our pipeline (Fig. 3; see also the frame sketch below).

Fig. 3 Example of the NLP system input and output where each measurement forms a frame object that summarizes all core properties
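
The following is a minimal sketch of the pre-processing step (step 1 above); the section header spellings and the regex are illustrative assumptions rather than the exact patterns used in our pipeline.

```python
import re
import nltk  # requires the "punkt" tokenizer models: nltk.download("punkt")

# Assumed header spellings; the actual list of known section headers is larger
SECTION_HEADERS = ["COMPARISON", "TECHNIQUE", "CLINICAL HISTORY",
                   "FINDINGS", "CONCLUSIONS"]
SECTION_RE = re.compile(r"^(%s)\s*:" % "|".join(SECTION_HEADERS),
                        re.MULTILINE | re.IGNORECASE)

def split_sections(report_text):
    """Split a report into sections keyed by their header names."""
    sections = {}
    matches = list(SECTION_RE.finditer(report_text))
    for i, m in enumerate(matches):
        end = matches[i + 1].start() if i + 1 < len(matches) else len(report_text)
        sections[m.group(1).upper()] = report_text[m.end():end].strip()
    return sections

def findings_sentences(report_text):
    """Return NLTK-segmented sentences from the Findings section only."""
    findings = split_sections(report_text).get("FINDINGS", "")
    return nltk.sent_tokenize(findings)
```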
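
Below is an illustrative sketch of measurement and temporality tagging (step 2). The actual patterns follow Sevenster et al. [14] with modifications (Appendix 1), and the cue words here are assumptions; the full pipeline splits the sentence into sub-parts (Appendix 2) rather than inspecting the left context.

```python
import re

# Matches e.g. "4 mm", "1.2 cm", and multi-dimensional forms like "3.1 x 2.4 cm"
MEASUREMENT_RE = re.compile(
    r"\d+(?:\.\d+)?(?:\s*x\s*\d+(?:\.\d+)?)*\s*(?:mm|cm)\b", re.IGNORECASE)
# Assumed cue words marking a historical reference to a prior measurement
PRIOR_CUES = ("previously", "prior", "before")

def tag_measurements(sentence):
    """Tag each measurement in a sentence with a coarse temporality label."""
    tagged = []
    for m in MEASUREMENT_RE.finditer(sentence):
        left_context = sentence[:m.start()].lower()
        temporality = "prior" if any(c in left_context for c in PRIOR_CUES) else "current"
        tagged.append({"text": m.group(), "span": m.span(),
                       "temporality": temporality})
    return tagged

# tag_measurements("Nodule measures 4 mm, previously 7 mm.") yields
# [{'text': '4 mm', ..., 'temporality': 'current'},
#  {'text': '7 mm', ..., 'temporality': 'prior'}]
```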
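
A minimal sketch of CRF training with sklearn-crfsuite (step 3), assuming token-level features (part-of-speech tag and dictionary-map flags) and BIO-style entity labels; ANATOMY_TERMS, OBSERVATION_TERMS, train_sents, test_sents, and y_train are hypothetical names, not the pipeline's actual resources.

```python
import sklearn_crfsuite

def token_features(sent, i):
    """Features for token i of a sentence given as (word, pos_tag) pairs."""
    word, pos = sent[i]
    return {
        "word.lower()": word.lower(),
        "postag": pos,
        # Dictionary-map features; the term sets are assumed to exist
        "in_anatomy_dict": word.lower() in ANATOMY_TERMS,
        "in_observation_dict": word.lower() in OBSERVATION_TERMS,
    }

def sent2features(sent):
    return [token_features(sent, i) for i in range(len(sent))]

# y_train holds one label sequence per sentence, e.g.
# ["O", "B-Anatomical_Entity", "I-Anatomical_Entity", "B-Imaging_Observation"]
crf = sklearn_crfsuite.CRF(algorithm="lbfgs")  # default model parameters
crf.fit([sent2features(s) for s in train_sents], y_train)
predicted = crf.predict([sent2features(s) for s in test_sents])
```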
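
Finally, a hypothetical sketch of the frame object output (step 4, Fig. 3); the field names are illustrative, mirroring the seven descriptors listed above.

```python
from dataclasses import dataclass, field
from typing import List, Optional

@dataclass
class MeasurementFrame:
    """One frame per extracted measurement, bundling its descriptors."""
    measurement: str                            # e.g. "4 mm" (the target entity)
    temporality: str                            # "current" or "prior"
    anatomical_entity: Optional[str] = None     # e.g. "liver"
    imaging_observation: Optional[str] = None   # e.g. "nodule"
    radlex_descriptors: List[str] = field(default_factory=list)
    image_number: Optional[str] = None
    series_number: Optional[str] = None
    segment: Optional[str] = None
```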

Manual Annotation of the Reports

In order to evaluate the accuracy of the measurement extraction pipeline, we created a development set (17 reports) and an evaluation set of 100 randomly selected radiology reports (26 MRI and 74 CT) and had them manually annotated by a domain expert. Reports were annotated to indicate measurements and their measurement descriptors (temporality, the image and series number, segment, anatomical entity, and imaging observation). Similarly, the 25 mammography reports were also manually annotated by an expert.

In order to annotate the larger training set of 1100 reports for the CRF model, given its size, we used a "light annotation" [22] strategy in which we first generated the annotations for entities and relationships automatically via dictionary lookup and sentence boundaries; those annotations were then manually corrected by experts to create the final set. This training set, being of potentially lower quality than our evaluation set annotations, was used only to train the CRF model and to create the rules of our pipeline.
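
The dictionary-lookup pre-annotation can be sketched as follows; the term lists and label names are illustrative assumptions.

```python
def pre_annotate(tokens, term_dicts):
    """Assign a provisional entity label to each token via dictionary lookup;
    experts then manually correct these draft labels."""
    labels = []
    for tok in tokens:
        label = "O"  # outside any entity
        for entity_type, terms in term_dicts.items():
            if tok.lower() in terms:
                label = entity_type
                break
        labels.append(label)
    return labels

draft = pre_annotate(
    ["enhancing", "liver", "lesion"],
    {"Anatomical_Entity": {"liver"},
     "Imaging_Observation": {"lesion", "nodule"}})
# draft == ["O", "Anatomical_Entity", "Imaging_Observation"]
```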

Statistical Evaluation

Using our evaluation set, we calculated precision, recall, and F scores for measurement extraction at the sentence level. The performance of extraction at the report level was assessed as "no match," "partial match," or "full match." If a measurement was extracted correctly with all of its descriptors, it was a "full match"; if even one descriptor was missed by the system, it was considered a "partial match." A "no match" occurred when the system failed to recognize any of the descriptors in addition to the measurement itself.
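
These match categories can be sketched as a simple comparison of an extracted frame against its gold-standard counterpart; the dictionary representation here is an assumption.

```python
def match_category(system_frame, gold_frame):
    """Classify an extracted measurement as full, partial, or no match."""
    descriptors = [k for k, v in gold_frame.items() if k != "measurement" and v]
    matched = [k for k in descriptors if system_frame.get(k) == gold_frame[k]]
    if len(matched) == len(descriptors):
        return "full match"     # the measurement and all gold descriptors agree
    if matched:
        return "partial match"  # at least one descriptor was missed
    return "no match"           # none of the descriptors were recognized
```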

Results

We compared the accuracy of our baseline and proposed pipelines against our manually annotated test set. The results of the proposed pipeline in terms of precision and recall are shown in Table 1.

Table 1 Evaluation of the NLP extraction pipelines for the measurement and its seven targeted descriptors

The gold standard set of 100 reports contained a total of 806 reported measurements. Our system extracted 784 (97%) of them, and there were 29 (4%) false positives with no false negatives. Among the 806 measurements, 258 were historical references to an earlier measurement, and our system correctly detected 206 (79.84%) of them as prior measurements.

For the goal of perfectly matched information frame extraction, we investigated the match percentages for combinations of descriptors. Figure 4 shows the final results for full and partial match cases with their frequencies and percentages based on the different descriptors of the measurements. The number of fully matched measurements was 465 (58%). Regarding partial matches, for example, 672 (83%) of the measurements were matched correctly with at least their anatomical entities. As can be clearly seen in Fig. 4, as the number of descriptors related to a measurement increases, the number of full matches decreases.

Fig. 4 Results of the proposed pipeline in terms of partial and full matches. AE, anatomical entity; IO, imaging observation; RD, RadLex descriptors; Image, image number

In order to explore and visualize relationships between descriptors using CRF-generated probability scores, all possible label transition scores among descriptors are illustrated in Fig. 5. We observed that a measurement is most likely to be followed by an imaging observation or anatomical entity, but for segment and image number it is difficult to determine the order of sequencing.

Fig. 5 Label transition scores calculated by CRF

Evaluation on Mammography Reports

Among the 25 mammography reports included in our second evaluation set, 14 (56%) included multiple measurements in the same report. Of a total of 305 sentences in the dataset, 51 (17%) included measurements, corresponding to 51 unique measurements since none of the sentences contained more than one measurement. Our measurement extraction pipeline extracted 49 (96%) of those measurements with their modifiers correctly (full match). The two false positive cases had an uncommon pattern, such as "increased in size (by 1 mm)," from which our system extracted "1 mm" as a measurement.

Discussion

In this paper, we describe an NLP system to extract measurements with their descriptors in a structured format from radiology reports. All of this information is necessary to track a lesion in a report over time: the measurement by itself is ambiguous, but the measurement together with its descriptors is sufficiently unique that it can be distinguished from other measurements in the report, enabling lesion tracking. The recall and precision of our system for measurement extraction were 100% and 96.43%, respectively, which are reasonably good results for MR and CT reports. Among the 784 (97%) correctly extracted measurements, only 465 (58%) were extracted as fully matched with all of their related descriptors, owing to very diverse and unstructured referencing in the text. On the other hand, in order to identify a measurement at any follow-up encounter, it is necessary to be able to distinguish measurements based on all of their descriptors. As opposed to previous studies, in which measurements were defined as quantitative descriptions of other entities [23], in this study we treated measurements as core concepts and defined other related entities as their descriptors. The errors we observed during the evaluation phase were primarily due to sentences expressing several measurements together with their previous measurements, and to insufficient description of the features of each measurement, as in the example below.

Example Sentence

“…reduced size of scattered enhancing nodules on series 11, for example 4 mm left frontal nodule (image 127, previously 7 mm), 4 mm right frontal nodule (image 124, previously 9 mm), 3 mm right frontal nodule (image 114; previously 7 mm), 2 mm lateral right frontal nodule (image 118, previously 5 mm), 2 mm left cerebellar nodule (image 53, previously 6 mm)….”

In this sentence, there are 10 different measurements (5 current and 5 prior) and their descriptors (imaging observations, RadLex descriptors, and image numbers). Our system finds each measurement correctly with its temporality, image/series numbers, and laterality. On the other hand, it detected "scattered|RadLex_Descriptor" and "enhancing|Imaging_Observation" only for the first measurement, since it is the closest and these modifiers are not repeated in the other sub-sentences. Therefore, we scored 9 of the 10 cases as "partial match," which decreased our system's performance. These kinds of problems stem from the lack of a separate description for each measurement; they might be solved by specific rules, but such rules could also increase false positive cases.

One important goal of any information extraction task is to reveal the relations between concepts. However, relationship extraction is a granular task that involves several modifiers related to the target measurement and requires detailed relationship labels, generated either for training a machine learning pipeline or for rule development. Moreover, as a unique challenge of radiology report parsing, the relationships between measurements and their characteristics are not obviously definable via adverbs or relational and qualitative adjectives for any of the entities in our corpus except the "Measure_of" anatomical entity relation. Therefore, we tried to learn entity sequencing using CRF models in order to gain insights for associating modifiers with measurements via rules, rather than using the CRF directly as a relationship extraction model.

CRF is among the most popular supervised machine learning algorithms for named entity tagging tasks. Being a statistical machine learning method, CRF analyzes the data to infer rules and patterns and uses sequence labeling to model relationships between neighbors [24, 25]. In this study, we trained a CRF model to label the named entities of interest automatically. We also aimed to mine relationships between measurements and their descriptors using the model's calculated transition probabilities, for example, that a measurement is most likely to be followed by an imaging observation or anatomical entity; however, we observed that it is difficult to decide on the order of sequencing with such a small training set. Therefore, we used the CRF model's output only for the named entity tagging phase.
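
In sklearn-crfsuite, these transition scores can be read off the fitted model; a minimal sketch, assuming the crf object trained earlier:

```python
from collections import Counter

# transition_features_ maps (from_label, to_label) pairs to learned weights
transitions = Counter(crf.transition_features_)
for (src, dst), score in transitions.most_common(10):
    print(f"{src:>25} -> {dst:<25} {score:.3f}")
# A high score for, e.g., ("Measurement", "Imaging_Observation") indicates that
# a measurement is likely to be followed by an imaging observation.
```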

For the generalizability evaluation, we tested our system on 25 mammography reports, and 96% of the measurements were extracted correctly with their modifiers. Although the performance was very high, it should be noted that, in those reports, a single sentence never included more than one measurement, and our system performs best on sentences containing only one measurement. On the other hand, this single-measurement pattern appears to be common in mammography reports. As future work, we plan to evaluate the pipeline on reports from other modalities.

The main limitation of this study was the small dataset from a single institution; in future experiments, we plan to increase the training and test set sizes with reports from multiple institutions, thereby increasing the generalizability of our system. Similarly, due to limited resources, we performed a "light annotation" [22] of the training set (1100 reports) for the CRF model with a single expert. On the other hand, the annotation of the test set was a completely manual effort, which we consider a valuable resource that we will use in future work to develop appropriate lesion tracking models. In the future, we also intend to adopt an attention-based convolutional neural network model for extracting relations between the entities. It should be noted that all of these methodologies require larger training sets, and manual annotation of training data is a very labor-intensive task.

Extracting measurements and their descriptors as a structured summary of the lesions from unstructured radiology reports might be quite valuable for lesion tracking purposes. That information might be used to disambiguate the lesions across studies to identify the baseline and follow-up measurements of the same lesion. For example, if a lesion in the fifth segment of the liver is identified in the baseline study and then, it is identified again in the follow-up study, the anatomical entity and the segment number can be used to associate the measurements as the measurement of the same lesion. Moreover, the historical references can be used to bind a measurement to the measurement of the same lesion in a prior study. This can help in generating automatic lesion tracking and tumor burden reports. In addition, another impact of automated text annotation, such as in our work, is large-scale data labeling to train models that automate image interpretation.
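
As a hypothetical sketch of this disambiguation logic (reusing the MeasurementFrame sketch above; the matching criteria are assumptions, not our implemented method):

```python
def same_lesion(baseline, followup):
    """Heuristically link two measurement frames to the same lesion across
    studies by their anatomical entity and organ segment."""
    return (baseline.anatomical_entity is not None
            and baseline.anatomical_entity == followup.anatomical_entity
            and baseline.segment == followup.segment)
```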

Conclusion

Notwithstanding the foregoing limitations and challenges, we believe our approach has the potential for clinical utility by enabling automatic measurement extraction and summarization from radiology reports. With further testing, the system may ultimately help to improve radiology practice by enabling automated lesion summaries, facilitating the assessment of changes in tumor burden, and improving the quality of patient care.