Parsing Compound–Protein Bioactivity Tables

Brown, J. B.

doi:10.1007/978-1-4939-8639-2_4

J. B. Brown³

Part of the book series: Methods in Molecular Biology ((MIMB,volume 1825))

1192 Accesses

Abstract

With the availability of a multitude of databases that contain information on the bioactivity between compounds and proteins, several fundamental tasks arise. These include parsing of the original data in order to filter out unusable data, merging of multiple databases, identification of the sets of unique molecules, and selection of subsets of parsed data.

In this chapter, we address these issues by providing solutions to each of the problems. Solutions are presented using standardized and freely available data processing tools, as well as computer program code.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Protocol: USD 49.95; Price excludes VAT (USA)

eBook: USD 109.00; Price excludes VAT (USA)

Softcover Book: USD 139.99; Price excludes VAT (USA)

Hardcover Book: USD 199.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Gaulton A, Hersey A, Nowotka M et al (2017) The ChEMBL database in 2017. Nucleic Acids Res 45:D945–D954. https://doi.org/10.1093/nar/gkw1074
Article CAS Google Scholar
Kim S, Thiessen PA, Bolton EE et al (2016) PubChem substance and compound databases. Nucleic Acids Res 44:D1202–D1213. https://doi.org/10.1093/nar/gkv951
Article CAS Google Scholar
Wang Y, Bryant SH, Cheng T et al (2017) PubChem BioAssay: 2017 update. Nucleic Acids Res 45:D955–D963. https://doi.org/10.1093/nar/gkw1118
Article CAS Google Scholar
Chan WKB, Zhang H, Yang J et al (2015) GLASS: a comprehensive database for experimentally-validated GPCR-ligand associations. Bioinformatics 31:btv302. https://doi.org/10.1093/bioinformatics/btv302
Article CAS Google Scholar
Roth BL, Lopez E, Patel S, Kroeze WK (2000) The multiplicity of serotonin receptors: uselessly diverse molecules or an embarrassment of riches? Neuroscience 6:252–262. https://doi.org/10.1177/107385840000600408
Article CAS Google Scholar
Hewett M, Oliver DE, Rubin DL et al (2002) PharmGKB: the pharmacogenetics knowledge base. Nucleic Acids Res 30:163–165
Article CAS Google Scholar
Szklarczyk D, Santos A, von Mering C et al (2015) STITCH 5: augmenting protein-chemical interaction networks with tissue and affinity data. Nucleic Acids Res 44:gkv1277. https://doi.org/10.1093/nar/gkv1277
Article CAS Google Scholar
Kuhn M, Szklarczyk D, Pletscher-Frankild S et al (2014) STITCH 4: integration of protein-chemical interactions with user data. Nucleic Acids Res 42:D401–D407. https://doi.org/10.1093/nar/gkt1207
Article CAS Google Scholar
Tanabe M, Kanehisa M (2012) Using the KEGG database resource. Curr Protoc Bioinformatics. https://doi.org/10.1002/0471250953.bi0112s38
Kanehisa M, Sato Y, Kawashima M et al (2016) KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res 44:D457–D462. https://doi.org/10.1093/nar/gkv1070
Article CAS Google Scholar
Fabregat A, Sidiropoulos K, Garapati P et al (2016) The reactome pathway knowledgebase. Nucleic Acids Res 44:D481–D487. https://doi.org/10.1093/nar/gkv1351
Article CAS PubMed Google Scholar
Joshi-Tope G, Gillespie M, Vastrik I et al (2005) Reactome: a knowledgebase of biological pathways. Nucleic Acids Res 33(Database issue):D428–D432. https://doi.org/10.1093/nar/gki072
Article CAS PubMed Google Scholar
Shinbo Y, Nakamura Y, Altaf-Ul-Amin M et al (2006) KNApSAcK: a comprehensive species-metabolite relationship database. In: Plant metabolomics. Biotechnology in agriculture and forestry. Springer, Berlin, Heidelberg, pp 165–181
Google Scholar
Nakamura K, Shimura N, Otabe Y et al (2013) KNApSAcK-3D: a three-dimensional structure database of plant metabolites. Plant Cell Physiol 54(2):e4. https://doi.org/10.1093/pcp/pcs186
Article CAS PubMed Google Scholar

Download references

Author information

Authors and Affiliations

Life Science Informatics Research Unit, Laboratory of Molecular Biosciences, Kyoto University Graduate School of Medicine, Kyoto, Japan
J. B. Brown

Authors

J. B. Brown
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to J. B. Brown .

Editor information

Editors and Affiliations

Life Science Informatics Research Unit, Laboratory of Molecular Biosciences, Kyoto University Graduate School of Medicine, Kyoto, Japan
J.B. Brown

Rights and permissions

Reprints and permissions

Copyright information

About this protocol

Cite this protocol

Brown, J.B. (2018). Parsing Compound–Protein Bioactivity Tables. In: Brown, J. (eds) Computational Chemogenomics. Methods in Molecular Biology, vol 1825. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-8639-2_4

Download citation

DOI: https://doi.org/10.1007/978-1-4939-8639-2_4
Published: 18 October 2018
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-8638-5
Online ISBN: 978-1-4939-8639-2
eBook Packages: Springer Protocols

Publish with us

Policies and ethics