Information from Searching Content with an Ontology-Utilizing Toolkit (iSCOUT)

Lacson, Ronilda; Andriole, Katherine P.; Prevedello, Luciano M.; Khorasani, Ramin

doi:10.1007/s10278-012-9463-9

Information from Searching Content with an Ontology-Utilizing Toolkit (iSCOUT)

Published: 14 February 2012

Volume 25, pages 512–519, (2012)
Cite this article

Journal of Digital Imaging Aims and scope Submit manuscript

Ronilda Lacson^1,2,
Katherine P. Andriole¹,
Luciano M. Prevedello¹ &
…
Ramin Khorasani¹

275 Accesses
32 Citations
Explore all metrics

Abstract

Radiology reports are permanent legal documents that serve as official interpretation of imaging tests. Manual analysis of textual information contained in these reports requires significant time and effort. This study describes the development and initial evaluation of a toolkit that enables automated identification of relevant information from within these largely unstructured text reports. We developed and made publicly available a natural language processing toolkit, Information from Searching Content with an Ontology-Utilizing Toolkit (iSCOUT). Core functions are included in the following modules: the Data Loader, Header Extractor, Terminology Interface, Reviewer, and Analyzer. The toolkit enables search for specific terms and retrieval of (radiology) reports containing exact term matches as well as similar or synonymous term matches within the text of the report. The Terminology Interface is the main component of the toolkit. It allows query expansion based on synonyms from a controlled terminology (e.g., RadLex or National Cancer Institute Thesaurus [NCIT]). We evaluated iSCOUT document retrieval of radiology reports that contained liver cysts, and compared precision and recall with and without using NCIT synonyms for query expansion. iSCOUT retrieved radiology reports with documented liver cysts with a precision of 0.92 and recall of 0.96, utilizing NCIT. This recall (i.e., utilizing the Terminology Interface) is significantly better than using each of two search terms alone (0.72, p = 0.03 for liver cyst and 0.52, p = 0.0002 for hepatic cyst). iSCOUT reliably assembled relevant radiology reports for a cohort of patients with liver cysts with significant improvement in document retrieval when utilizing controlled lexicons.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Integrating terminologies into standard SQL: a new approach for research on routine data

Article Open access 24 April 2019

André Sander & Roland Wauer

KneeTex: an ontology–driven system for information extraction from MRI reports

Article Open access 07 September 2015

Irena Spasić, Bo Zhao, … Kate Button

Evaluation of Doc’EDS: a French semantic search tool to query health documents from a clinical data warehouse

Article Open access 08 February 2022

Thibaut Pressat-Laffouilhère, Pierre Balayé, … Julien Grosjean

References

Taira RK, Soderland SG, Jakobovits RM: Automatic structuring of radiology free-text reports. Radiographics 21(1):237–245, 2001
PubMed CAS Google Scholar
Mamlin BW, Heinze DT, McDonald CJ. Automated extraction and normalization of findings from cancer-related free-text radiology reports. AMIA Annu Symp Proc 420–424, 2003
Zingmond D, Lenert LA: Monitoring free-text data using medical language processing. Comput Biomed Res 26(5):467–481, 1993
Article PubMed CAS Google Scholar
Fiszman M, Haug PJ, Frederick PR. Automatic extraction of PIOPED interpretations from ventilation/perfusion lung scan reports. Proc AMIA Symp 860–864, 1998
Thomas BJ, Ouellette H, Halpern EF, Rosenthal DI: Automated computer-assisted categorization of radiology reports. AJR Am J Roentgenol 184(2):687–690, 2005
PubMed Google Scholar
Dreyer KJ, Kalra MK, Maher MM, Hurier AM, Asfaw BA, Schultz T, et al: Application of recently developed computer algorithm for automatic classification of unstructured radiology reports: validation study. Radiology 234(2):323–329, 2005
Article PubMed Google Scholar
Friedman C, Alderson PO, Austin JH, Cimino JJ, Johnson SB: A general natural-language text processor for clinical radiology. J Am Med Inform Assoc 1(2):161–174, 1994
Article PubMed CAS Google Scholar
Pines JM: Trends in the rates of radiography use and important diagnoses in emergency department patients with abdominal pain. Med Care 47(7):782–786, 2009
Article PubMed Google Scholar
Korley FK, Pham JC, Kirsch TD: Use of advanced radiology during visits to US emergency departments for injury-related conditions, 1998–2007. JAMA 304(13):1465–1471, 2010
Article PubMed CAS Google Scholar
Meystre SM, Haug PJ. Comparing natural language processing tools to extract medical problems from narrative text. AMIA Annu Symp Proc 525–529, 2005
Xu H, Fu Z, Shah A, Chen Y, Peterson NB, Chen Q, et al: Extracting and integrating data from entire electronic health records for detecting colorectal cancer cases. AMIA Annu Symp Proc 2011:1564–1572, 2011
PubMed Google Scholar
Uzuner O, South BR, Shen S, Duvall SL: 2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text. J Am Med Inform Assoc 18(5):552–556, 2011
Article PubMed Google Scholar
Meystre S, Haug PJ: Natural language processing to extract medical problems from electronic clinical documents: performance evaluation. J Biomed Inform 39(6):589–599, 2006
Article PubMed Google Scholar
Zeng QT, Goryachev S, Weiss S, Sordo M, Murphy SN, Lazarus R: Extracting principal diagnosis, co-morbidity and smoking status for asthma research: evaluation of a natural language processing system. BMC Med Inform Decis Mak 6:30, 2006
Article PubMed Google Scholar
Cunningham H, D Maynard, K Bontcheva, V Tablan. GATE: A framework and graphical development environment for robust NLP tools and applications. Proc 40th Assoc for Computational Linguistics, 2002
de Coronado S, Haber MW, Sioutos N, Tuttle MS, Wright LW: NCI Thesaurus: using science-based terminology to integrate cancer research results. Stud Health Technol Inform 107(Pt 1):33–37, 2004
PubMed Google Scholar
Langlotz CP: RadLex: a new method for indexing online educational materials. Radiographics 26(6):1595–1597, 2006
Article PubMed Google Scholar
Andriole KP, Khorasani R: Implementing a replacement PACS: issues to consider. J Am Coll Radiol 4(6):416–418, 2007
Article PubMed Google Scholar
Gershanik EF, Lacson R, Khorasani R: Critical finding capture in the impression section of radiology reports. AMIA Annu Symp Proc 2011:465–469, 2011
PubMed Google Scholar
National Cancer Institute. http://ncit.nci.nih.gov. 26 July 2010.
Hersh W: Evaluation of biomedical text-mining systems: lessons learned from information retrieval. Brief Bioinform 6(4):344–356, 2005
Article PubMed CAS Google Scholar
Su K, Ries JE, Peterson GM, Cullinan Sievert ME, Patrick TB, Moxley DE et al. Comparing frequency of word occurrences in abstracts and texts using two stop word lists. Proc AMIA Symp 682–686, 2001
Nadkarni PM, Ohno-Machado L, Chapman WW: Natural language processing: an introduction. J Am Med Inform Assoc 18(5):544–551, 2011
Article PubMed Google Scholar
Chapman WW, Bridewell W, Hanbury P, Cooper GF, Buchanan BG: A simple algorithm for identifying negated findings and diseases in discharge summaries. J Biomed Inform 34(5):301–310, 2001
Article PubMed CAS Google Scholar
Lindberg DA, Humphreys BL, McCray AT: The unified medical language system. Methods Inf Med 32(4):281–291, 1993
PubMed CAS Google Scholar
Loy P: International classification of diseases—9th revision. Med Rec Health Care Inf J 19(2):390–396, 1978
PubMed CAS Google Scholar
Cote RA, Robboy S: Progress in medical information management. Systematized nomenclature of medicine (SNOMED). JAMA 243(8):756–762, 1980
Article PubMed CAS Google Scholar
Rogers FB: Medical subject headings. Bull Med Libr Assoc 51:114–116, 1963
PubMed CAS Google Scholar
Cheng LT, Zheng J, Savova GK, Erickson BJ: Discerning tumor status from unstructured MRI reports—completeness of information in existing reports and utility of automated natural language processing. J Digit Imaging 23(2):119–132, 2010
Article PubMed Google Scholar
Cheng B, Titterington D: Neural networks: a review from a statistical perspective. Stat Sci 9(1):2–54, 1994
Article Google Scholar
Savova GK, Fan J, Ye Z, Murphy SP, Zheng J, Chute CG, et al: Discovering peripheral arterial disease cases from radiology notes using natural language processing. AMIA Annu Symp Proc 2010:722–726, 2010
PubMed Google Scholar
Warden GI, Lacson R, Khorasani R: Leveraging terminologies for retrieval of radiology reports with critical imaging findings. AMIA Annu Symp Proc 2011:1481–1488, 2011
PubMed Google Scholar

Download references

Acknowledgments

This work was partly funded by AHRQ grant 1R18HS019635.

Author information

Authors and Affiliations

Center for Evidence-Based Imaging, Department of Radiology, Brigham and Women’s Hospital, Harvard Medical School, 75 Francis Street, Boston, MA, 02115, USA
Ronilda Lacson, Katherine P. Andriole, Luciano M. Prevedello & Ramin Khorasani
20 Kent Street, Rm. 260A, Brookline, MA, 02445, USA
Ronilda Lacson

Authors

Ronilda Lacson
View author publications
You can also search for this author in PubMed Google Scholar
Katherine P. Andriole
View author publications
You can also search for this author in PubMed Google Scholar
Luciano M. Prevedello
View author publications
You can also search for this author in PubMed Google Scholar
Ramin Khorasani
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ronilda Lacson.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lacson, R., Andriole, K.P., Prevedello, L.M. et al. Information from Searching Content with an Ontology-Utilizing Toolkit (iSCOUT). J Digit Imaging 25, 512–519 (2012). https://doi.org/10.1007/s10278-012-9463-9

Download citation

Published: 14 February 2012
Issue Date: August 2012
DOI: https://doi.org/10.1007/s10278-012-9463-9

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Information from Searching Content with an Ontology-Utilizing Toolkit (iSCOUT)

Abstract

Access this article

Similar content being viewed by others

Integrating terminologies into standard SQL: a new approach for research on routine data

KneeTex: an ontology–driven system for information extraction from MRI reports

Evaluation of Doc’EDS: a French semantic search tool to query health documents from a clinical data warehouse

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Information from Searching Content with an Ontology-Utilizing Toolkit (iSCOUT)

Abstract

Access this article

Similar content being viewed by others

Integrating terminologies into standard SQL: a new approach for research on routine data

KneeTex: an ontology–driven system for information extraction from MRI reports

Evaluation of Doc’EDS: a French semantic search tool to query health documents from a clinical data warehouse

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation