Abstract
The purpose of our study was to demonstrate the use of a natural language processing program (Leximer) combined with online analytic processing (NLP-OLAP) to extract trends in radiology findings in a large radiology practice. Prior studies have validated Leximer, a natural language processing (NLP) program, for classifying unstructured radiology reports based on the presence of positive radiology findings (F POS) or negative radiology findings (F NEG). F POS included new relevant radiology findings and any change in status from prior imaging. Electronic radiology reports from 1995–2002, together with the results of NLP-Leximer analysis of these reports, were saved in a data warehouse and exported to a multidimensional structure called the Radcube. Various relational queries on the data in the Radcube were performed using OLAP techniques. NLP-OLAP was thus applied to determine trends in F POS across radiology exams for different patient and examination attributes. Pivot tables were exported from the NLP-OLAP interface to Microsoft Excel for statistical analysis. The Radcube allowed rapid and comprehensive analysis of F POS and F NEG trends in a large radiology report database. Trends in F POS were extracted for patient attributes such as age group, gender, clinical indication, disease (by ICD code), and patient type (inpatient, ambulatory), and for imaging characteristics such as imaging modality, referring physician, radiology subspecialty, and body region. F POS rates differed substantially between imaging modalities, ranging from 23.1% (mammography, 49,163/212,906) to 85.8% (nuclear medicine, 93,852/109,374; p < 0.0001). In conclusion, NLP-OLAP can help analyze the yield of different radiology exams from a large radiology report database.
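As a worked illustration of the modality comparison reported above, the following minimal sketch recomputes the two F POS rates from the counts given in the abstract and compares them. The abstract does not name the statistical test used in Excel, so the Pearson chi-square test on a 2x2 table is an assumption made here for illustration, and the function and variable names are hypothetical.

```python
import math

# Counts quoted in the abstract: (reports with positive findings, total reports).
counts = {
    "mammography": (49_163, 212_906),
    "nuclear medicine": (93_852, 109_374),
}


def f_pos_rate(positive, total):
    """Share of reports classified as F POS, as a percentage."""
    return 100.0 * positive / total


def chi_square_2x2(a, b, c, d):
    """Pearson chi-square statistic (no continuity correction) and p-value
    for the 2x2 table [[a, b], [c, d]].

    Assumption: the source does not state which test was used; this one is
    chosen only as a plausible illustration.
    """
    n = a + b + c + d
    stat = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # With 1 degree of freedom, the chi-square survival function
    # equals erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(stat / 2.0))
    return stat, p_value


rates = {name: round(f_pos_rate(pos, tot), 1) for name, (pos, tot) in counts.items()}

(m_pos, m_tot) = counts["mammography"]
(n_pos, n_tot) = counts["nuclear medicine"]
# Rows: modality; columns: F POS reports, remaining (F NEG) reports.
stat, p_value = chi_square_2x2(m_pos, m_tot - m_pos, n_pos, n_tot - n_pos)

print(rates)
print(p_value < 0.0001)
```

The rounded rates reproduce the 23.1% and 85.8% quoted in the abstract, and under the assumed test the difference is significant at the reported p < 0.0001 level.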
Cite this article
Dang, P.A., Kalra, M.K., Blake, M.A. et al. Use of Radcube for Extraction of Finding Trends in a Large Radiology Practice. J Digit Imaging 22, 629–640 (2009). https://doi.org/10.1007/s10278-008-9128-x
DOI: https://doi.org/10.1007/s10278-008-9128-x