Abstract
Biological/bioinformatics databases are essential for medical and biological studies. They integrate and organize biologically related information in a structured format and provide researchers with easy access to a variety of relevant data. This review presents an overview of publicly available databases relevant to proteomics studies in cancer research. They include gene/protein expression databases, gene mutation and single nucleotide polymorphisms databases, tumor antigen databases, protein–protein interaction, and biological pathway databases. Automated information retrieval from these databases enables efficient large-scale proteomics data analysis.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
James P (1997) Protein identification in the post-genome era: the rapid rise of proteomics. Q Rev Biophys 30:279–331
Koomen JM, Haura EB, Bepler G, Sutphen R, Remily-Wood ER, Benson K, Hussein M, Hazlehurst LA, Yeatman TJ, Hildreth LT, Sellers TA, Jacobsen PB, Fenstermacher DA, Dalton WS (2008) Proteomic contributions to personalized cancer care. Mol Cell Proteomics 7:1780–1794
Gygi SP, Rochon Y, Franza BR, Aebersold R (1999) Correlation between protein and mRNA abundance in yeast. Mol Cell Biol 19:1720–1730
Ivanov SS, Chung AS, Yuan ZL, Guan YJ, Sachs KV, Reichner JS, Chin YE (2004) Antibodies immobilized as arrays to profile protein post-translational modifications in mammalian cells. Mol Cell Proteomics 3:788–795
Bamford S, Dawson E, Forbes S, Clements J, Pettett R, Dogan A, Flanagan A, Teague J, Futreal PA, Stratton MR, Wooster R (2004) The COSMIC (catalogue of somatic mutations in cancer) database and website. Br J Cancer 91:355–358
Kopf E, Zharhary D (2007) Antibody arrays – an emerging tool in cancer proteomics. Int J Biochem Cell Biol 39:1305–1317
Tao SC, Chen CS, Zhu H (2007) Applications of protein microarray technology. Comb Chem High Throughput Screen 10:706–718
Sherman BT, da Huang W, Tan Q, Guo Y, Bour S, Liu D, Stephens R, Baseler MW, Lane HC, Lempicki RA (2007) DAVID knowledgebase: a gene-centered database integrating heterogeneous gene annotation resources to facilitate high-throughput gene functional analysis. BMC Bioinformatics 8:426
da Huang W, Sherman BT, Tan Q, Kir J, Liu D, Bryant D, Guo Y, Stephens R, Baseler MW, Lane HC, Lempicki RA (2007) DAVID Bioinformatics Resources: expanded annotation database and novel algorithms to better extract biology from large gene lists. Nucleic Acids Res 35:W169–W175
Rajcevic U, Niclou SP, Jimenez CR (2009) Proteomics strategies for target identification and biomarker discovery in cancer. Front Biosci 14:3292–3303
Grantzdorffer I, Carl-McGrath S, Ebert MP, Rocken C (2008) Proteomics of pancreatic cancer. Pancreas 36:329–336
Sahin U, Tureci O, Schmitt H, Cochlovius B, Johannes T, Schmits R, Stenner F, Luo G, Schobert I, Pfreundschuh M (1995) Human neoplasms elicit multiple specific immune responses in the autologous host. Proc Natl Acad Sci USA 92:11810–11813
Boutet E, Lieberherr D, Tognolli M, Schneider M, Bairoch A (2007) UniProtKB/Swiss-Prot. Methods Mol Biol 406:89–112
Rebhan M, Chalifa-Caspi V, Prilusky J, Lancet D (1998) GeneCards: a novel functional genomics compendium with automated data mining and query reformulation support. Bioinformatics 14:656–664
Barrett T, Troup DB, Wilhite SE, Ledoux P, Rudnev D, Evangelista C, Kim IF, Soboleva A, Tomashevsky M, Marshall KA, Phillippy KH, Sherman PM, Muertter RN, Edgar R (2009) NCBI GEO: archive for high-throughput functional genomic data. Nucleic Acids Res 37:D885–D890
Ikeo K, Ishi-i J, Tamura T, Gojobori T, Tateno Y (2003) CIBEX: center for information biology gene expression database. C R Biol 326:1079–1082
Rocca-Serra P, Brazma A, Parkinson H, Sarkans U, Shojatalab M, Contrino S, Vilo J, Abeygunawardena N, Mukherjee G, Holloway E, Kapushesky M, Kemmeren P, Lara GG, Oezcimen A, Sansone SA (2003) ArrayExpress: a public database of gene expression data at EBI. C R Biol 326:1075–1078
Parkinson H, Kapushesky M, Kolesnikov N, Rustici G, Shojatalab M, Abeygunawardena N, Berube H, Dylag M, Emam I, Farne A, Holloway E, Lukk M, Malone J, Mani R, Pilicheva E, Rayner TF, Rezwan F, Sharma A, Williams E, Bradley XZ, Adamusiak T, Brandizi M, Burdett T, Coulson R, Krestyaninova M, Kurnosov P, Maguire E, Neogi SG, Rocca-Serra P, Sansone SA, Sklyar N, Zhao M, Sarkans U, Brazma A (2009) ArrayExpress update – from an archive of functional genomics experiments to the atlas of gene expression. Nucleic Acids Res 37:D868–D872
Kato K, Yamashita R, Matoba R, Monden M, Noguchi S, Takagi T, Nakai K (2005) Cancer gene expression database (CGED): a database for gene expression profiling with accompanying clinical information of human cancer tissues. Nucleic Acids Res 33:D533–D536
Penkett CJ, Bahler J (2004) Navigating public microarray databases. Comp Funct Genomics 5:471–479
Zeng ZY, Xiong W, Zhou YH, Li XL, Li GY (2006) [Advances in high-density whole genome-wide single nucleotide polymorphism array in cancer research]. Ai Zheng 25:1454–1458
Forbes SA, Bhamra G, Bamford S, Dawson E, Kok C, Clements J, Menzies A, Teague JW, Futreal PA, Stratton MR (2008) The catalogue of somatic mutations in cancer (COSMIC). Curr Protoc Hum Genet. Chapter 10, Unit 10 11
Stenson PD, Mort M, Ball EV, Howells K, Phillips AD, Thomas NS, Cooper DN (2009) The human gene mutation database: 2008 update. Genome Med 1:13
Shimizu N, Ohtsubo M, Minoshima S (2007) MutationView/KMcancerDB: a database for cancer gene mutations. Cancer Sci 98:259–267
Futreal PA, Coin L, Marshall M, Down T, Hubbard T, Wooster R, Rahman N, Stratton MR (2004) A census of human cancer genes. Nat Rev Cancer 4:177–183
Hernandez-Boussard T, Rodriguez-Tome P, Montesano R, Hainaut P (1999) IARC p53 mutation database: a relational database to compile and analyze p53 mutations in human tumors and cell lines. International Agency for Research on Cancer. Hum Mutat 14:1–8
Murphy JA, Barrantes-Reynolds R, Kocherlakota R, Bond JP, Greenblatt MS (2004) The CDKN2A database: integrating allelic variants with evolution, structure, function, and disease association. Hum Mutat 24:296–304
Gottlieb B, Trifiro M, Lumbroso R, Pinsky L (1997) The androgen receptor gene mutations database. Nucleic Acids Res 25:158–162
Gottlieb B, Beitel LK, Wu JH, Trifiro M (2004) The androgen receptor gene mutations database (ARDB): 2004 update. Hum Mutat 23:527–533
Jackson MA, Lea I, Rashid A, Peddada SD, Dunnick JK (2006) Genetic alterations in cancer knowledge system: analysis of gene mutations in mouse and human liver and lung tumors. Toxicol Sci 90:400–418
Hu Y, McDermott MP, Ahrendt SA (2005) The p53 codon 72 proline allele is associated with p53 gene mutations in non-small cell lung cancer. Clin Cancer Res 11:2502–2509
Packer BR, Yeager M, Burdett L, Welch R, Beerman M, Qi L, Sicotte H, Staats B, Acharya M, Crenshaw A, Eckert A, Puri V, Gerhard DS, Chanock SJ (2006) SNP500Cancer: a public resource for sequence validation, assay development, and frequency analysis for genetic variation in candidate genes. Nucleic Acids Res 34:D617–D621
Smigielski EM, Sirotkin K, Ward M, Sherry ST (2000) dbSNP: a database of single nucleotide polymorphisms. Nucleic Acids Res 28:352–355
Rhee H, Lee JS (2009) MedRefSNP: a database of medically investigated SNPs. Hum Mutat 30:E460–E466
van der Bruggen P, Traversari C, Chomez P, Lurquin C, De Plaen E, Van den Eynde B, Knuth A, Boon T (1991) A gene encoding an antigen recognized by cytolytic T lymphocytes on a human melanoma. Science 254:1643–1647
Scanlan MJ, Gure AO, Jungbluth AA, Old LJ, Chen YT (2002) Cancer/testis antigens: an expanding family of targets for cancer immunotherapy. Immunol Rev 188:22–32
Almeida LG, Sakabe NJ, deOliveira AR, Silva MC, Mundstein AS, Cohen T, Chen YT, Chua R, Gurung S, Gnjatic S, Jungbluth AA, Caballero OL, Bairoch A, Kiesler E, White SL, Simpson AJ, Old LJ, Camargo AA, Vasconcelos AT (2009) CTdatabase: a knowledge-base of high-throughput and curated data on cancer-testis antigens. Nucleic Acids Res 37:D816–D819
Donnes P, Hoglund A, Sturm M, Comtesse N, Backes C, Meese E, Kohlbacher O, Lenhof HP (2004) Integrative analysis of cancer-related data using CAP. FASEB J 18:1465–1467
Jongeneel V (2001) Towards a cancer immunome database. Cancer Immun 1:3
Novellino L, Castelli C, Parmiani G (2005) A listing of human tumor antigens recognized by T cells. Cancer Immunol Immunother 54:187–207
Van den Eynde BJ, van der Bruggen P (1997) T cell defined tumor antigens. Curr Opin Immunol 9:684–693
Strausberg RL, Buetow KH, Greenhut SF, Grouse LH, Schaefer CF (2002) The cancer genome anatomy project: online resources to reveal the molecular signatures of cancer. Cancer Investig 20:1038–1050
Schaefer C, Grouse L, Buetow K, Strausberg RL (2001) A new cancer genome anatomy project web resource for the community. Cancer J 7:52–60
Krupke D, Naf D, Vincent M, Allio T, Mikaelian I, Sundberg J, Bult C, Eppig J (2005) The Mouse Tumor Biology Database: integrated access to mouse cancer biology data. Exp Lung Res 31:259–270
Bader GD, Cary MP, Sander C (2006) Pathguide: a pathway resource list. Nucleic Acids Res 34:D504–D506
Bauer-Mehren A, Furlong LI, Sanz F (2009) Pathway databases and tools for their exploitation: benefits, current limitations and challenges. Mol Syst Biol 5:290
Kanehisa M, Goto S, Furumichi M, Tanabe M Hirakawa, M (2010) KEGG for representation and analysis of molecular networks involving diseases and drugs, Nucleic Acids Res 38:D355–D360
Matthews L, Gopinath G, Gillespie M, Caudy M, Croft D, de Bono B, Garapati P, Hemish J, Hermjakob H, Jassal B, Kanapin A, Lewis S, Mahajan S, May B, Schmidt E, Vastrik I, Wu G, Birney E, Stein L, D’Eustachio P (2009) Reactome knowledgebase of human biological pathways and processes. Nucleic Acids Res 37:D619–D622
Schaefer CF, Anthony K, Krupa S, Buchoff J, Day M, Hannay T, Buetow KH (2009) PID: the Pathway Interaction Database. Nucleic Acids Res 37:D674–D679
Chowbina SR, Wu X, Zhang F, Li PM, Pandey R, Kasamsetty HN, Chen JY (2009) HPD: an online integrated human pathway database enabling systems biology studies. BMC Bioinformatics 10(Suppl 11):S5
Keshava Prasad TS, Goel R, Kandasamy K, Keerthikumar S, Kumar S, Mathivanan S, Telikicherla D, Raju R, Shafreen B, Venugopal A, Balakrishnan L, Marimuthu A, Banerjee S, Somanathan DS, Sebastian A, Rani S, Ray S, Harrys Kishore CJ, Kanth S, Ahmed M, Kashyap MK, Mohmood R, Ramachandra YL, Krishna V, Rahiman BA, Mohan S, Ranganathan P, Ramabadran S, Chaerkady R, Pandey A (2009) Human Protein Reference Database – 2009 update. Nucleic Acids Res 37:D767–D772
Chen JY, Mamidipalli S, Huan T (2009) HAPPI: an online database of comprehensive human annotated and predicted protein interactions. BMC Genomics 10(Suppl 1):S16
Persico M, Ceol A, Gavrila C, Hoffmann R, Florio A, Cesareni G (2005) HomoMINT: an inferred human network based on orthology mapping of protein interactions discovered in model organisms. BMC Bioinformatics 6(Suppl 4):S21
Aranda B, Achuthan P, Alam-Faruque Y, Armean I, Bridge A, Derow C, Feuermann M, Ghanbarian AT, Kerrien S, Khadake J, Kerssemakers J, Leroy C, Menden M, Michaut M, Montecchi-Palazzi L, Neuhauser SN, Orchard S, Perreau V, Roechert B, van Eijk K, Hermjakob H (2009) The IntAct molecular interaction database in 2010. Nucleic Acids Res 38(Database issue):D525–31
Lehne B, Schlitt T (2009) Protein-protein interaction databases: keeping up with growing interactomes. Hum Genomics 3:291–297
Mathivanan S, Periaswamy B, Gandhi TK, Kandasamy K, Suresh S, Mohmood R, Ramachandra YL, Pandey A (2006) An evaluation of human protein–protein interaction data in the public domain. BMC Bioinformatics 7(Suppl 5):S19
Galperin MY, Cochrane GR (2009) Nucleic acids research annual database issue and the NAR online molecular biology database collection in 2009. Nucleic Acids Res 37:D1–D4
Brusic V, Zeleznikow J (1999) Knowledge discovery and data mining in biological databases. Knowledge Eng Rev 14:257–277
Jeong SK, Kwon MS, Lee EY, Lee HJ, Cho SY, Kim H, Yoo JS, Omenn GS, Aebersold R, Hanash S, Paik YK (2009) BiomarkerDigger: a versatile disease proteome database and analysis platform for the identification of plasma cancer biomarkers. Proteomics 9:3729–3740
Acknowledgments
We thank Dr. Catherine J. Wu for thoughtful reviews of this manuscript.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer Science+Business Media, LLC
About this protocol
Cite this protocol
Zhang, G.L., DeLuca, D.S., Brusic, V. (2011). Database Resources for Proteomics-Based Analysis of Cancer. In: Wu, C. (eds) Protein Microarray for Disease Analysis. Methods in Molecular Biology, vol 723. Humana Press. https://doi.org/10.1007/978-1-61779-043-0_22
Download citation
DOI: https://doi.org/10.1007/978-1-61779-043-0_22
Published:
Publisher Name: Humana Press
Print ISBN: 978-1-61779-042-3
Online ISBN: 978-1-61779-043-0
eBook Packages: Springer Protocols