Skip to main content

Finding Hidden Relationships Between Medical Concepts by Leveraging Metamap and Text Mining Techniques

  • Conference paper
  • First Online:
Advanced Data Mining and Applications (ADMA 2022)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13725))

Included in the following conference series:

  • 965 Accesses


Text is one of the most common ways to store data in this computerized world. At a glance, it may seem that those data are not interconnected. But in reality, data can have hidden connections. Therefore, in this research, a new model has been presented that can find hidden relationships between two medical concepts by using MetaMap and appropriate text-mining techniques. Specifically, the model creates a new comprehensive index structure and can find cross-document hidden links connecting topics of interest that most existing approaches have ignored. Experiments show the effectiveness of the proposed model in discovering new connections between topics.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others


  1. Belkin, N.J.: Interaction with texts: Information retrieval as information seeking behavior. In: Information Retrieval. p. 55–66 (1993).

    Google Scholar 

  2. Swanson, D.R.: Complementary structures in disjoint science literatures. In: Proceedings of the 14th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM Press, Chicago, IL, pp. 280–289 (1991).

  3. Aronson, A.R.: Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program. In: Proceedings of AMIA Annual Symposium, pp. 17–21 (2001).

  4. Kay Deeney. MetaMap - A Tool for Recognizing UMLS Concepts in Text. U.S. National Library of Medicine (2017).

  5. Chapman, W.W., Fiszman, M., , Dowling, J.N., Chapman, B.E., Rindflesch, T.C.: Identifying respiratory findings in emergency department reports for biosurveillance using MetaMap. Studies in Health Technology and Informatics, 107(Pt 1), pp. 487–91 (2004).

  6. Zuccon, G., Holloway, A., Koopman , B., Nguyen, A.: Identify disorders in health records using conditional random fields and metamap. In: Proceedings of the CLEF 2013 Workshop on Cross-Language Evaluation of Methods, Applications, and Resources for eHealth Document Analysis, pp. 1–8 (2013).

  7. Pratt, W., Yetisgen-Yildiz, M.: A study of biomedical concept identification: MetaMap vs. people. In: AMIA Annual Symposium Proceedings, pp. 529–33 (2003).

  8. Jin, W., Srihari, R.K.: Knowledge discovery across documents through concept chain queries. In: Proceedings of the Sixth IEEE International Conference on Data Mining – Workshops (ICDMW’06), pp. 448–452 (2006).

  9. Gopalakrishnan, V., Jha, K., Jin, W., Zhang, A.: A survey on literature based discovery approaches in biomedical domain. In: Journal of Biomedical Informatics, 93, 103141 (2019). doi:

  10. Philipps, J., Rumpe, B.: Refinement of pipe-and-filter architectures. In: Wing, J.M., Woodcock, J., Davies, J. (eds.) FM 1999. LNCS, vol. 1708, pp. 96–115. Springer, Heidelberg (1999).

    Chapter  Google Scholar 

  11. Sanscartier, M.J., Neufeld, E.: Identifying hidden variables from contextspecific independencies. In: Proceedings of the Twentieth International Florida Artificial Intelligence Research Society Conference, pp. 472–477 (2007)., Florida, USA

    Google Scholar 

  12. Prakash, D., Surendran, S.: Detection and analysis of hidden activities in social networks. International Journal of Computer Applications (0975–8887), 77(16), 34–38 (2013).

  13. Pividori, M., Cernadas, A., de Haro, L.A., Carrari, F., Stegmayer, G., Milone, D.H.: Clustermatch: discovering hidden relations in highly diverse kinds of qualitative and quantitative data without standardization. Bioinformatics 35(11), 1931–1939 (2019).

    Article  Google Scholar 

  14. Sawaf, M.B.A., Kawanisi, K., Jlilati, M.N., Xiao, C., Bahreinimotlagh, M.: Extent of detection of hidden relationships among different hydrological variables during floods using data-driven models. Environ. Monit. Assess. 193(11), 1–14 (2021).

    Article  Google Scholar 

  15. Rasekh, A.H., Arshia, A.H., Fakhrahmad, S.M., Sadreddini, M.H.: Mining and discovery of hidden relationships between software source codes and related textual documents. Digital Scholarship in the Humanities. 33(3), 651–669 (2018).

    Article  Google Scholar 

  16. Gopalakrishnan, V., Jha, K., Zhang, A., Jin, W.: Generating hypothesis: Using global and local features in graph to discover new knowledge from medical literature. In: Proceedings of the 8th International Conference on Bioinformatics and Computational Biology, Las Vegas, Nevada, USA. pp. 23–30 (2016). 978–1–943436–03–3

    Google Scholar 

  17. Hu, X., Zhang, X., Yoo, I., Zhang, Y.: A semantic approach for mining hidden links from complementary and non-interactive biomedical literature. In: Proceedings of the Sixth SIAM International Conference on Data Mining, Bethesda, MD, USA, pp. 200–209 (2006).

  18. Srinivasan, P., Libbus, B.: Mining MEDLINE for implicit links between dietary substances and diseases. In Bioinformatics. 20, i290–i296 (2004).

    Article  Google Scholar 

  19. Jha, K., Jin, W.: Mining novel knowledge from biomedical literature using statistical measures and domain knowledge. In: Proceedings of the 7th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics (BCB ‘16). Association for Computing Machinery, New York, NY, USA, pp. 317–326 (2016).

  20. Swanson, D.R.: Fish oil, raynaud’s syndrome, and undiscovered public knowledge. Perspect. Biol. Med. 30(1), 7–18 (1986).

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations


Corresponding author

Correspondence to Wei Jin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Yang, W., Chowdhury, S.M.M.H., Jin, W. (2022). Finding Hidden Relationships Between Medical Concepts by Leveraging Metamap and Text Mining Techniques. In: Chen, W., Yao, L., Cai, T., Pan, S., Shen, T., Li, X. (eds) Advanced Data Mining and Applications. ADMA 2022. Lecture Notes in Computer Science(), vol 13725. Springer, Cham.

Download citation

  • DOI:

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-22063-0

  • Online ISBN: 978-3-031-22064-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics