Skip to main content

Transcriptomic Data Mining and Repurposing for Computational Drug Discovery

  • Protocol
  • First Online:
Computational Methods for Drug Repurposing

Part of the book series: Methods in Molecular Biology ((MIMB,volume 1903))

Abstract

Conventional drug discovery in general is costly and time-consuming with extremely low success and relatively high attrition rates. The disparity between high cost of drug discovery and vast unmet medical needs resulted in advent of an increasing number of computational approaches that can “connect” disease with a candidate therapeutic. This includes computational drug repurposing or repositioning wherein the goal is to discover a new indication for an approved drug. Computational drug discovery approaches that are commonly used are similarity-based wherein network analysis or machine learning-based methods are used. One such approach is matching gene expression signatures from disease to those from small molecules, commonly referred to as connectivity mapping. In this chapter, we will focus on how publicly available existing transcriptomic data from diseases can be reused to identify novel candidate therapeutics and drug repositioning candidates. To elucidate these, we will present two case studies: (1) using transcriptional signature similarity or positive correlation to identify novel small molecules that are similar to an approved drug and (2) identifying candidate therapeutics via reciprocal connectivity or negative correlation between transcriptional signatures from a disease and small molecule.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Protocol
USD 49.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Kaitin KI (2010) Deconstructing the drug development process: the new face of innovation. Clin Pharmacol Ther 87(3):356–361. https://doi.org/10.1038/clpt.2009.293

    Article  CAS  PubMed  Google Scholar 

  2. Avorn J (2015) The $2.6 billion pill--methodologic and policy considerations. N Engl J Med 372(20):1877–1879. https://doi.org/10.1056/NEJMp1500848

    Article  CAS  PubMed  Google Scholar 

  3. Denis A, Mergaert L, Fostier C, Cleemput I, Simoens S (2010) A comparative study of European rare disease and orphan drug markets. Health Policy 97(2-3):173–179. https://doi.org/10.1016/j.healthpol.2010.05.017

    Article  PubMed  Google Scholar 

  4. Valdez R, Ouyang L, Bolen J (2016) Public health and rare diseases: oxymoron no more. Prev Chronic Dis 13:E05. https://doi.org/10.5888/pcd13.150491

    Article  PubMed  PubMed Central  Google Scholar 

  5. Margolis R, Derr L, Dunn M, Huerta M, Larkin J, Sheehan J, Guyer M, Green ED (2014) The National Institutes of Health’s Big Data to Knowledge (BD2K) initiative: capitalizing on biomedical big data. J Am Med Inform Assoc 21(6):957–958. https://doi.org/10.1136/amiajnl-2014-002974

    Article  PubMed  PubMed Central  Google Scholar 

  6. Barrett T, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, Marshall KA, Phillippy KH, Sherman PM, Holko M, Yefanov A, Lee H, Zhang N, Robertson CL, Serova N, Davis S, Soboleva A (2013) NCBI GEO: archive for functional genomics data sets--update. Nucleic Acids Res 41(Database Issue):D991–D995. https://doi.org/10.1093/nar/gks1193

    Article  CAS  PubMed  Google Scholar 

  7. Szklarczyk D, Morris JH, Cook H, Kuhn M, Wyder S, Simonovic M, Santos A, Doncheva NT, Roth A, Bork P, Jensen LJ, von Mering C (2017) The STRING database in 2017: quality-controlled protein-protein association networks, made broadly accessible. Nucleic Acids Res 45(D1):D362–D368. https://doi.org/10.1093/nar/gkw937

    Article  CAS  PubMed  Google Scholar 

  8. Hodos RA, Kidd BA, Shameer K, Readhead BP, Dudley JT (2016) In silico methods for drug repurposing and pharmacology. Wiley Interdiscip Rev Syst Biol Med 8(3):186–210. https://doi.org/10.1002/wsbm.1337

    Article  PubMed  PubMed Central  Google Scholar 

  9. Bajorath J (2017) Molecular similarity concepts for informatics applications. Methods Mol Biol 1526:231–245. https://doi.org/10.1007/978-1-4939-6613-4_13

    Article  CAS  PubMed  Google Scholar 

  10. Chavali AK, Blazier AS, Tlaxca JL, Jensen PA, Pearson RD, Papin JA (2012) Metabolic network analysis predicts efficacy of FDA-approved drugs targeting the causative agent of a neglected tropical disease. BMC Syst Biol 6:27. https://doi.org/10.1186/1752-0509-6-27

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Martinez V, Navarro C, Cano C, Fajardo W, Blanco A (2015) DrugNet: network-based drug-disease prioritization by integrating heterogeneous data. Artif Intell Med 63(1):41–49. https://doi.org/10.1016/j.artmed.2014.11.003

    Article  PubMed  Google Scholar 

  12. Yang L, Agarwal P (2011) Systematic drug repositioning based on clinical side-effects. PLoS One 6(12):e28025. https://doi.org/10.1371/journal.pone.0028025

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. Ye H, Liu Q, Wei J (2014) Construction of drug network based on side effects and its application for drug repositioning. PLoS One 9(2):e87864. https://doi.org/10.1371/journal.pone.0087864

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Chiang AP, Butte AJ (2009) Systematic evaluation of drug-disease relationships to identify leads for novel drug uses. Clin Pharmacol Ther 86(5):507–510. https://doi.org/10.1038/clpt.2009.103

    Article  CAS  PubMed  Google Scholar 

  15. Lamb J, Crawford ED, Peck D, Modell JW, Blat IC, Wrobel MJ, Lerner J, Brunet JP, Subramanian A, Ross KN, Reich M, Hieronymus H, Wei G, Armstrong SA, Haggarty SJ, Clemons PA, Wei R, Carr SA, Lander ES, Golub TR (2006) The Connectivity Map: using gene-expression signatures to connect small molecules, genes, and disease. Science 313(5795):1929–1935. https://doi.org/10.1126/science.1132939

    Article  CAS  PubMed  Google Scholar 

  16. Lamb J, Ramaswamy S, Ford HL, Contreras B, Martinez RV, Kittrell FS, Zahnow CA, Patterson N, Golub TR, Ewen ME (2003) A mechanism of cyclin D1 action encoded in the patterns of gene expression in human cancer. Cell 114(3):323–334

    Article  CAS  PubMed  Google Scholar 

  17. Gerald KB (1991) Nonparametric statistical methods. Nurse Anesth 2(2):93–95

    CAS  PubMed  Google Scholar 

  18. Subramanian A, Narayan R, Corsello SM, Peck DD, Natoli TE, Lu X, Gould J, Davis JF, Tubelli AA, Asiedu JK, Lahr DL, Hirschman JE, Liu Z, Donahue M, Julian B, Khan M, Wadden D, Smith IC, Lam D, Liberzon A, Toder C, Bagul M, Orzechowski M, Enache OM, Piccioni F, Johnson SA, Lyons NJ, Berger AH, Shamji AF, Brooks AN, Vrcic A, Flynn C, Rosains J, Takeda DY, Hu R, Davison D, Lamb J, Ardlie K, Hogstrom L, Greenside P, Gray NS, Clemons PA, Silver S, Wu X, Zhao WN, Read-Button W, Wu X, Haggarty SJ, Ronco LV, Boehm JS, Schreiber SL, Doench JG, Bittker JA, Root DE, Wong B, Golub TR (2017) A next generation connectivity map: L1000 platform and the first 1,000,000 profiles. Cell 171(6):1437–1452.e1417. https://doi.org/10.1016/j.cell.2017.10.049

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Brum AM, van de Peppel J, van der Leije CS, Schreuders-Koedam M, Eijken M, van der Eerden BC, van Leeuwen JP (2015) Connectivity Map-based discovery of parbendazole reveals targetable human osteogenic pathway. Proc Natl Acad Sci U S A 112(41):12711–12716. https://doi.org/10.1073/pnas.1501597112

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Sirota M, Dudley JT, Kim J, Chiang AP, Morgan AA, Sweet-Cordero A, Sage J, Butte AJ (2011) Discovery and preclinical validation of drug indications using compendia of public gene expression data. Sci Transl Med 3(96):96ra77. https://doi.org/10.1126/scitranslmed.3001318

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Liu C, Su J, Yang F, Wei K, Ma J, Zhou X (2015) Compound signature detection on LINCS L1000 big data. Mol Biosyst 11(3):714–722. https://doi.org/10.1039/c4mb00677a

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. Campillos M, Kuhn M, Gavin AC, Jensen LJ, Bork P (2008) Drug target identification using side-effect similarity. Science 321(5886):263–266. https://doi.org/10.1126/science.1158140

    Article  CAS  PubMed  Google Scholar 

  23. Ding H, Takigawa I, Mamitsuka H, Zhu S (2014) Similarity-based machine learning methods for predicting drug-target interactions: a brief review. Brief Bioinform 15(5):734–747. https://doi.org/10.1093/bib/bbt056

    Article  PubMed  Google Scholar 

  24. Bleakley K, Yamanishi Y (2009) Supervised prediction of drug-target interactions using bipartite local models. Bioinformatics 25(18):2397–2403. https://doi.org/10.1093/bioinformatics/btp433

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Gottlieb A, Stein GY, Ruppin E, Sharan R (2011) PREDICT: a method for inferring novel drug indications with application to personalized medicine. Mol Syst Biol 7:496. https://doi.org/10.1038/msb.2011.26

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Pharmacists TASoH-S (2015) Simvastatin. www.Drugs.com

  27. Wang T, Seah S, Loh X, Chan CW, Hartman M, Goh BC, Lee SC (2016) Simvastatin-induced breast cancer cell death and deactivation of PI3K/Akt and MAPK/ERK signalling are reversed by metabolic products of the mevalonate pathway. Oncotarget 7(3):2532–2544. https://doi.org/10.18632/oncotarget.6304

    Article  PubMed  Google Scholar 

  28. Yang LX, Heng XH, Guo RW, Si YK, Qi F, Zhou XB (2013) Atorvastatin inhibits the 5-lipoxygenase pathway and expression of CCL3 to alleviate atherosclerotic lesions in atherosclerotic ApoE knockout mice. J Cardiovasc Pharmacol 62(2):205–211. https://doi.org/10.1097/FJC.0b013e3182967fc0

    Article  CAS  PubMed  Google Scholar 

  29. Nair RP, Duffin KC, Helms C, Ding J, Stuart PE, Goldgar D, Gudjonsson JE, Li Y, Tejasvi T, Feng BJ, Ruether A, Schreiber S, Weichenthal M, Gladman D, Rahman P, Schrodi SJ, Prahalad S, Guthery SL, Fischer J, Liao W, Kwok PY, Menter A, Lathrop GM, Wise CA, Begovich AB, Voorhees JJ, Elder JT, Krueger GG, Bowcock AM, Abecasis GR, Collaborative Association Study of P (2009) Genome-wide scan reveals association of psoriasis with IL-23 and NF-kappaB pathways. Nat Genet 41(2):199–204. https://doi.org/10.1038/ng.311

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  30. Davis S, Meltzer PS (2007) GEOquery: a bridge between the Gene Expression Omnibus (GEO) and BioConductor. Bioinformatics 23(14):1846–1847. https://doi.org/10.1093/bioinformatics/btm254

    Article  CAS  PubMed  Google Scholar 

  31. Smyth GK (2004) Linear models and empirical bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol 3:3. https://doi.org/10.2202/1544-6115.1027

    Article  Google Scholar 

  32. Duan Q, Reid SP, Clark NR, Wang Z, Fernandez NF, Rouillard AD, Readhead B, Tritsch SR, Hodos R, Hafner M, Niepel M, Sorger PK, Dudley JT, Bavari S, Panchal RG, Ma’ayan A (2016) L1000CDS(2): LINCS L1000 characteristic direction signatures search engine. NPJ Syst Biol Appl 2. https://doi.org/10.1038/npjsba.2016.15

  33. Roberson ED, Liu Y, Ryan C, Joyce CE, Duan S, Cao L, Martin A, Liao W, Menter A, Bowcock AM (2012) A subset of methylated CpG sites differentiate psoriatic from normal skin. J Invest Dermatol 132(3 Pt 1):583–592. https://doi.org/10.1038/jid.2011.348

    Article  CAS  PubMed  Google Scholar 

  34. Schallreuter KU, Pittelkow MR (1987) Anthralin inhibits elevated levels of thioredoxin reductase in psoriasis. A new mode of action for this drug. Arch Dermatol 123(11):1494–1498

    Article  CAS  PubMed  Google Scholar 

  35. Chen J, Bardes EE, Aronow BJ, Jegga AG (2009) ToppGene Suite for gene list enrichment analysis and candidate gene prioritization. Nucleic Acids Res 37(Web Server Issue):W305–W311. https://doi.org/10.1093/nar/gkp427

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Napolitano F, Carrella D, Mandriani B, Pisonero S, Sirci F, Medina D, Brunetti-Pierri N, di Bernardo D (2017) gene2drug: a computational tool for pathway-based rational drug repositioning. Bioinformatics 34:1498. https://doi.org/10.1093/bioinformatics/btx800

    Article  Google Scholar 

  37. Kuleshov MV, Jones MR, Rouillard AD, Fernandez NF, Duan Q, Wang Z, Koplev S, Jenkins SL, Jagodnik KM, Lachmann A, McDermott MG, Monteiro CD, Gundersen GW, Ma’ayan A (2016) Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res 44(W1):W90–W97. https://doi.org/10.1093/nar/gkw377

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  38. Wang Z, Monteiro CD, Jagodnik KM, Fernandez NF, Gundersen GW, Rouillard AD, Jenkins SL, Feldmann AS, Hu KS, McDermott MG, Duan Q, Clark NR, Jones MR, Kou Y, Goff T, Woodland H, Amaral FM, Szeto GL, Fuchs O, Schussler-Fiorenza Rose SM, Sharma S, Schwartz U, Bausela XB, Szymkiewicz M, Maroulis V, Salykin A, Barra CM, Kruth CD, Bongio NJ, Mathur V, Todoric RD, Rubin UE, Malatras A, Fulp CT, Galindo JA, Motiejunaite R, Juschke C, Dishuck PC, Lahl K, Jafari M, Aibar S, Zaravinos A, Steenhuizen LH, Allison LR, Gamallo P, de Andres Segura F, Dae Devlin T, Perez-Garcia V, Ma’ayan A (2016) Extraction and analysis of signatures from the Gene Expression Omnibus by the crowd. Nat Commun 7:12846. https://doi.org/10.1038/ncomms12846

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Anil G. Jegga .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Science+Business Media, LLC, part of Springer Nature

About this protocol

Check for updates. Verify currency and authenticity via CrossMark

Cite this protocol

Wang, Y., Yella, J., Jegga, A.G. (2019). Transcriptomic Data Mining and Repurposing for Computational Drug Discovery. In: Vanhaelen, Q. (eds) Computational Methods for Drug Repurposing. Methods in Molecular Biology, vol 1903. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-8955-3_5

Download citation

  • DOI: https://doi.org/10.1007/978-1-4939-8955-3_5

  • Published:

  • Publisher Name: Humana Press, New York, NY

  • Print ISBN: 978-1-4939-8954-6

  • Online ISBN: 978-1-4939-8955-3

  • eBook Packages: Springer Protocols

Publish with us

Policies and ethics