Compilation of Custom Compound/Bioactivity Datasets from Public Repositories

  • Protocol
Chemogenomics

Part of the book series: Methods in Molecular Biology ((MIMB,volume 2706))

Abstract

Public repositories containing compound–bioactivity data for millions of small molecules are a valuable resource for identifying chemogenomic compound candidates. However, owing to nonuniform data mining, these databases are often incomplete, which argues for combining data from several repositories to increase target coverage and data accuracy. Here, we present a workflow to generate custom datasets from public databases for mining chemogenomic compound candidates. The compiled set flags differences in structural and bioactivity data and enables rapid extraction of potent and selective bioactive compounds.
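As a rough illustration of the idea summarized in the abstract, the following Python sketch merges bioactivity records from two sources, flags disagreements, and filters for potent and selective compounds. It is not the chapter's actual workflow: the record layout, the source names, the `structure_key` field (standing in for a standardized structure identifier), the function names, and all thresholds are hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical records: (source, compound_id, structure_key, target, pActivity).
# pActivity is a negative log10 potency value; higher means more potent.
RECORDS = [
    ("source_a", "CPD1", "KEY-A", "T1", 8.1),
    ("source_b", "CPD1", "KEY-A", "T1", 7.9),
    ("source_a", "CPD1", "KEY-A", "T2", 5.0),
    ("source_a", "CPD2", "KEY-B", "T1", 7.5),
    ("source_b", "CPD2", "KEY-C", "T1", 5.5),
]


def compile_dataset(records, max_spread=1.0):
    """Merge records per (compound, target) pair and flag disagreements.

    max_spread is an assumed tolerance (in log units) between sources.
    """
    grouped = defaultdict(list)
    for source, cid, structure_key, target, p_activity in records:
        grouped[(cid, target)].append((source, structure_key, p_activity))

    compiled = {}
    for key, entries in grouped.items():
        values = [p for _, _, p in entries]
        compiled[key] = {
            "sources": sorted({s for s, _, _ in entries}),
            "mean_p": mean(values),
            # Flag when reported activities differ by more than the tolerance.
            "activity_flag": max(values) - min(values) > max_spread,
            # Flag when sources report different structure keys.
            "structure_flag": len({k for _, k, _ in entries}) > 1,
        }
    return compiled


def potent_and_selective(compiled, target, min_p=7.0, min_window=2.0):
    """Return compounds that are potent on `target` and weaker elsewhere.

    Pairs with an activity flag are skipped; both thresholds are assumptions.
    """
    hits = []
    for (cid, tgt), row in compiled.items():
        if tgt != target or row["activity_flag"] or row["mean_p"] < min_p:
            continue
        off_target = [
            r["mean_p"] for (c, t), r in compiled.items()
            if c == cid and t != target
        ]
        if all(row["mean_p"] - p >= min_window for p in off_target):
            hits.append(cid)
    return sorted(hits)


if __name__ == "__main__":
    dataset = compile_dataset(RECORDS)
    print(potent_and_selective(dataset, "T1"))
```

In this toy data, the two sources disagree on both the structure key and the activity of CPD2, so that pair is flagged and excluded from the potent and selective hits. A real pipeline would additionally need structure standardization and unit harmonization before records from different repositories can be compared.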

Corresponding author

Correspondence to Daniel Merk.

Copyright information

© 2023 The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature

About this protocol

Cite this protocol

Isigkeit, L., Merk, D. (2023). Compilation of Custom Compound/Bioactivity Datasets from Public Repositories. In: Merk, D., Chaikuad, A. (eds) Chemogenomics. Methods in Molecular Biology, vol 2706. Humana, New York, NY. https://doi.org/10.1007/978-1-0716-3397-7_3

  • DOI: https://doi.org/10.1007/978-1-0716-3397-7_3

  • Publisher Name: Humana, New York, NY

  • Print ISBN: 978-1-0716-3396-0

  • Online ISBN: 978-1-0716-3397-7

  • eBook Packages: Springer Protocols
