Applied Bioinformatics

, Volume 3, Issue 1, pp 49–62 | Cite as

caGEDA: a web application for the integrated analysis of global gene expression patterns in cancer

  • Satish Patel
  • James Lyons-WeilerEmail author
Application Note


The explosion of microarray data from pilot studies, basic research and large-scale clinical trials requires the development of integrative computational tools that can not only analyse gene expression patterns but that can also evaluate the methods of analysis adopted and then provide a boost to post-analysis translational interpretation of those patterns. We have developed a web application called caGEDA (cancer gene expression data analyzer) that can: (1) upload gene expression profiles from cDNA or oligonucleotide microarrays; (2) conduct a diverse range of serial linear normalisations; (3) identify differentially expressed genes using a variety of tests — either threshold or permutation tests; (4) produce tables of literature references to papers reporting that specific genes (identified by accession numbers) are up- or down-regulated in specific cancers; (5) estimate the error of sample class prediction using the significant gene set for features; (6) perform low-bias and accurate validated learning using three computational validation techniques (leave-one-out validation, k-fold validation, random resampling validation); and (7) validate a classifier with a randomly selected or user-defined validation set. Significant genes are reported in a table of links to entries in the following databases: LocusLink, Genome View, UCSC, Ensembl, UniGene, dbSNP, AmiGO and OMIM. caGEDA is seamlessly integrated via embedded forms with UCSD’s (University of California at San Diego) 2HAPI server (for medical subject heading (MeSH) term exploration) and EZ-Retrieve (to identify common transcription factors located upstream of sets of genes that exhibit similar modes of differential expression). caGEDA offers a variety of previously described and novel tests for differentially expressed genes, most notably the permutation percentile separability test, which is most appropriate for identifying genes that are significantly differentially expressed in a subset of patients. caGEDA, which is open source and free to academic users, will soon be greatly enhanced by operating with the components of the National Cancer Institute’s new cancer bioinformatics grid (caBIG).


microarray analysis cancer genomics 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. ACME Laboratories. 2001. ACME Laboratories freeware [online]. Accessed 7 Apr 2004. URL:
  2. Alizadeh A, Eisen M, Davis RE et al. 2000. Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling. Nature, 403:503–11.PubMedCrossRefGoogle Scholar
  3. Allander SV, Illei PB, Chen Y et al. 2002. Expression profiling of synovial sarcoma by cDNA microarrays: association of ERBB2, IGFBP2, and ELF3 with epithelial differentiation. Am J Pathol, 161:1587–95.PubMedCrossRefGoogle Scholar
  4. Alon U, Barkai N, Notterman DA et al. 1999. Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. Proc Natl Acad Sci USA, 96:6745–50.PubMedCrossRefGoogle Scholar
  5. Beer DG, Kardia SL, Huang CC et al. 2002. Gene-expression profiles predict survival of patients with lung adenocarcinoma. Nat Med, 8:816–24.PubMedGoogle Scholar
  6. Bittner M, Meltzer P, Chen Y et al. 2000. Molecular classification of cutaneous malignant melanoma by gene expression profiling. Nature, 406:536–40.PubMedCrossRefGoogle Scholar
  7. Buetow KH, Klausner RD, Fine H et al. 2002. Cancer molecular analysis project: weaving a rich cancer research tapestry. Cancer Cell, 1: 315–18.PubMedCrossRefGoogle Scholar
  8. Bussey KJ, Kane D, Sunshine M et al. 2003. MatchMiner: a tool for batch navigation among gene and gene product identifiers. Genome Biol, 4:R27.PubMedCrossRefGoogle Scholar
  9. Chakrabarti R, Robles LD, Gibson J et al. 2002. Profiling of differential expression of messenger RNA in normal, benign, and metastatic prostate cell lines. Cancer Genet Cytogenet, 139:115–25.PubMedCrossRefGoogle Scholar
  10. Chan WC, Huang JZ. 2001. Gene expression analysis in aggressive NHL. Ann Hematol, 80(Suppl 3):B38–41.PubMedGoogle Scholar
  11. Chang JC, Wooten EC, Tsimelzon A et al. 2003. Gene expression profiling predicts therapeutic response to docetaxel in breast cancer patients. Lancet, 362:362–9.PubMedCrossRefGoogle Scholar
  12. Chopra A, Brown KM, Rood BR et al. 2003. The use of gene expression analysis to gain insights into signaling mechanisms of metastatic medulloblastoma. Pediatr Neurosurg, 39:68–74.PubMedCrossRefGoogle Scholar
  13. Chua MS, Sarwal MM. 2003. Microarrays: new tools for transplantation research. Pediatr Nephrol, 18:319–27.PubMedGoogle Scholar
  14. Cooper CR, Chay CH, Gendernalik JD et al. 2003. Stromal factors involved in prostate carcinoma metastasis to bone. Cancer, 97(Suppl 3):739–47.PubMedCrossRefGoogle Scholar
  15. Covitz PA, Hartel F, Schaefer C et al. 2003. caCORE: a common infrastructure for cancer informatics. Bioinformatics, 18:2404–12.CrossRefGoogle Scholar
  16. Cunningham MJ, Liang S, Fuhrman S et al. 2000. Gene expression microarray data analysis for toxicology profiling. Ann NY Acad Sci, 919:52–67.PubMedCrossRefGoogle Scholar
  17. Dan S, Tsunoda T, Kitahara O et al. 2002. An integrated database of chemosensitivity to 55 anticancer drugs and gene expression profiles of 39 human cancer cell lines. Cancer Res, 62:1139–47.PubMedGoogle Scholar
  18. Dangond F, Hwang D, Camelo S et al. 2004. Molecular signature of late-stage human ALS revealed by expression profiling of postmortem spinal cord gray matter. Physiol Genomics, 16:229–39.PubMedGoogle Scholar
  19. DePrimo SE, Wong LM, Khatry DB et al. 2003. Expression profiling of blood samples from an SU5416 phase III metastatic colorectal cancer clinical trial: a novel strategy for biomarker identification. BMC Cancer, 3:3.PubMedCrossRefGoogle Scholar
  20. DeRisi JL, Iyer VR, Brown PO. 1997. Exploring the metabolic and genetic control of gene expression on a genomic scale. Science, 278:680–6.PubMedCrossRefGoogle Scholar
  21. de Vos S, Hofmann WK, Grogan TM et al. 2003. Cell cycle alterations in the blastoid variant of mantle cell lymphoma (MCL-BV) as detected by gene expression profiling of mantle cell lymphoma (MCL) and MCL-BV. Diagn Mol Pathol, 12:35–43.PubMedCrossRefGoogle Scholar
  22. Dudoit S, Fridlyand J. 2002. A prediction-based resampling method to estimate the number of clusters in a dataset. Genome Biol, 3: 0036.1–0036.21.CrossRefGoogle Scholar
  23. Efron B. 1979. Bootstrap methods: another look at the jackknife. Ann Stat, 7:1–26.CrossRefGoogle Scholar
  24. Eyster KM, Lindahl R. 2001. Molecular medicine: a primer for clinicians. Part XII: DNA microarrays and their application to clinical medicine. SDJ Med, 54:57–61.Google Scholar
  25. Fink JL, Drewes S, Patel H et al. 2003. 2HAPI: a microarray data analysis system. Bioinformatics, 19:1443–5.PubMedCrossRefGoogle Scholar
  26. Furey TS, Cristianini N, Duffy N et al. 2000. Support vector machine classification and validation of cancer tissue samples using microarray expression data. Bioinformatics, 16:906–14.PubMedCrossRefGoogle Scholar
  27. Garber ME, Troyanskaya OG, Schluens K et al. 2001. Diversity of gene expression in adenocarcinoma of the lung. Proc Natl Acad Sci USA, 98:13784–9. [Erratum in: Proc Natl Acad Sci USA, 2002; 99:1098.]PubMedCrossRefGoogle Scholar
  28. Gildea JJ, Seraj MJ, Oxford G et al. 2002. RhoGDI2 is an invasion and metastasis suppressor gene in human cancer. Cancer Res, 62: 6418–23.PubMedGoogle Scholar
  29. Giles ED, Singh G. 2003. Role of insulin-like growth factor binding proteins (IGFBPs) in breast cancer proliferation and metastasis. Clin Exp Metastasis, 20:481–7.PubMedCrossRefGoogle Scholar
  30. Golub TR, Slonim DK, Tamayo P et al. 1999. Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science, 286:531–7.PubMedCrossRefGoogle Scholar
  31. Hawse JR, Hejtmancik JF, Huang Q et al. 2003. Identification and functional clustering of global gene expression differences between human age-related cataract and clear lenses. Mol Vis, 9:515–37.PubMedGoogle Scholar
  32. Hayden PS, El-Meanawy A, Schelling JR et al. 2003. DNA expression analysis: serial analysis of gene expression, microarrays and kidney disease. Curr Opin Nephrol Hypertens, 12:407–14.PubMedCrossRefGoogle Scholar
  33. Hedenfalk I, Duggan D, Chen Y et al. 2001. Gene-expression profiles in hereditary breast cancer. N Engl J Med, 344:539–48.PubMedCrossRefGoogle Scholar
  34. Inoue H, Matsuyama A, Mimori K et al. 2002. Prognostic score of gastric cancer determined by cDNA microarray. Clin Cancer Res, 8: 3475–9.PubMedGoogle Scholar
  35. Ji B, Chen XQ, Misek DE et al. 2003. Pancreatic gene expression during the initiation of acute pancreatitis: identification of EGR-1 as a key regulator. Physiol Genomics, 14:59–72.PubMedGoogle Scholar
  36. Kallioniemi OP. 2001. Biochip technologies in cancer research. Ann Med, 33:142–7.PubMedCrossRefGoogle Scholar
  37. Kaminski N, Allard J, Heller RA. 2000. Use of oligonucleotide arrays to analyze drug toxicity. Ann NY Acad Sci, 919:1–8.PubMedCrossRefGoogle Scholar
  38. Khatua S, Peterson KM, Brown KM et al. 2003. Overexpression of the EGFR/FKBP12/HIF-2alpha pathway identified in childhood astrocytomas by angiogenesis gene profiling. Cancer Res, 63: 1865–70.PubMedGoogle Scholar
  39. Kihara C, Tsunoda T, Tanaka T et al. 2001. Prediction of sensitivity of esophageal tumors to adjuvant chemotherapy by cDNA microarray analysis of gene-expression profiles. Cancer Res, 61:6474–9.PubMedGoogle Scholar
  40. Kikuchi T, Daigo Y, Katagiri T et al. 2003. Expression profiles of non-small cell lung cancers on cDNA microarrays: identification of genes for prediction of lymph-node metastasis and sensitivity to anti-cancer drugs. Oncogene, 22:2192–205.PubMedCrossRefGoogle Scholar
  41. Li S, Ross DT, Kadin ME et al. 2001. Comparative genome-scale analysis of gene expression profiles in T cell lymphoma cells during malignant progression using a complementary DNA microarray. Am J Pathol, 158:1231–7.PubMedCrossRefGoogle Scholar
  42. Listgarten J, Graham K, Damaraju S et al. 2003. Clinically validated benchmarking of normalisation techniques for two-colour oligonucleotide spotted microarray slides. Appl Bioinform, 2: 219–28.Google Scholar
  43. Lockhart DJ, Dong H, Byrne MC et al. 1996. Expression monitoring by hybridization to high-density oligonucleotide arrays. Nat Biotechnol, 13:1675–80.CrossRefGoogle Scholar
  44. Loring JF, Wen X, Lee JM et al. 2001. A gene expression profile of Alzheimer’s disease. DNA Cell Biol, 20:683–95.PubMedCrossRefGoogle Scholar
  45. Luo JH, Yu YP, Cieply K et al. 2002. Gene expression analysis of prostate cancers. Mol Carcinog, 33:25–35.PubMedCrossRefGoogle Scholar
  46. Lyons-Weiler J, Patel S, Bhattacharya S. 2003. A classification-based machine learning approach for the analysis of genome-wide expression data. Genome Res, 13:503–12.PubMedCrossRefGoogle Scholar
  47. Lyons-Weiler J, Patel S, Ngyuen T et al. 2004. UPITT Cancer Gene Expression Data Set Link Database, maintained by the University of Pittsburgh Benedum Oncology Informatics Center [online]. Accessed 12 Apr 2004. URL:
  48. MacDonald TJ, Brown KM, LaFleur B et al. 2001. Expression profiling of medulloblastoma: PDGFRA and the RAS/MAPK pathway as therapeutic targets for metastatic disease. Nat Genet, 29:143–52.PubMedCrossRefGoogle Scholar
  49. Mandel S, Weinreb O, Youdim MB. 2003. Using cDNA microarray to assess Parkinson’s disease models and the effects of neuroprotective drugs. Trends Pharmacol Sci, 24:184–91.PubMedCrossRefGoogle Scholar
  50. Minning TA, Bua J, Garcia GA et al. 2003. Microarray profiling of gene expression during trypomastigote to amastigote transition in Trypanosoma cruzi. Mol Biochem Parasitol, 131:55–64.PubMedCrossRefGoogle Scholar
  51. Miura K, Bowman ED, Simon R et al. 2002. Laser capture microdissection and microarray expression analysis of lung adenocarcinoma reveals tobacco smoking- and prognosis-related molecular profiles. Cancer Res, 62:3244–50.PubMedGoogle Scholar
  52. Miyachi H. 2002. Recent advances in molecular diagnostic tests. Rinsho Byori, 123(Suppl):19–29.PubMedGoogle Scholar
  53. Nagata M, Fujita H, Ida H et al. 2003. Identification of potential biomarkers of lymph node metastasis in oral squamous cell carcinoma by cDNA microarray analysis. Int J Cancer, 106:683–9.PubMedCrossRefGoogle Scholar
  54. Nocito A, Bubendorf L, Maria Tinner E et al. 2001. Microarrays of bladder cancer tissue are highly representative of proliferation index and histological grade. J Pathol, 194:349–57.PubMedCrossRefGoogle Scholar
  55. Ohno R, Nakamura Y. 2003. Prediction of response to imatinib by cDNA microarray analysis. Semin Hematol, 40(Suppl 3):42–9.PubMedCrossRefGoogle Scholar
  56. Okutsu J, Tsunoda T, Kaneta Y et al. 2002. Prediction of chemosensitivity for patients with acute myeloid leukemia, according to expression levels of 28 genes selected by genome-wide complementary DNA microarray analysis. Mol Cancer Ther, 1:1035–42.PubMedGoogle Scholar
  57. O’Reilly. 2002. com.oreilly.servlet [online]. Accessed Dec 2002. URL:
  58. Pasinetti GM. 2001. Use of cDNA microarray in the search for molecular markers involved in the onset of Alzheimer’s disease dementia. J Neurosci Res, 65:471–6.PubMedCrossRefGoogle Scholar
  59. Perou CM, Sorlie T, Eisen MB et al. 2000. Molecular portraits of human breast tumours. Nature, 406:747–52.PubMedCrossRefGoogle Scholar
  60. Ramaswamy S, Ross KN, Lander ES et al. 2003. A molecular signature of metastasis in primary solid tumors. Nat Genet, 33:49–54.PubMedCrossRefGoogle Scholar
  61. Rickman DS, Bobek MP, Misek DE et al. 2001. Distinctive molecular profiles of high-grade and low-grade gliomas based on oligonucleotide microarray analysis. Cancer Res, 61:6885–91.PubMedGoogle Scholar
  62. Sasaki H, Ide N, Fukai I et al. 2002. Gene expression analysis of human thymoma correlates with tumor stage. Int J Cancer, 101:342–7.PubMedCrossRefGoogle Scholar
  63. Schena M, Shalon D, Davis RW et al. 1995. Quantitative monitoring of gene expression patterns with a complementary DNA microarray. Science, 270:467–70.PubMedCrossRefGoogle Scholar
  64. Seftor EA, Meltzer PS, Kirschmann DA et al. 2002. Molecular determinants of human uveal melanoma invasion and metastasis. Clin Exp Metastasis, 19:233–46.PubMedCrossRefGoogle Scholar
  65. Shih JY, Yang SC, Hong TM et al. 2001. Collapsin response mediator protein-1 and the invasion and metastasis of cancer cells. J Natl Cancer Inst, 93:1392–400.PubMedCrossRefGoogle Scholar
  66. Shipp MA, Ross KN, Tamayo P et al. 2002. Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning. Nat Med, 8:68–74.PubMedCrossRefGoogle Scholar
  67. Susztak K, Sharma K, Schiffer M et al. 2003. Genomic strategies for diabetic nephropathy. J Am Soc Nephrol, 14(Suppl 3):S271–8.PubMedCrossRefGoogle Scholar
  68. Takahashi M, Sugimura J, Yang X et al. 2003. Gene expression profiling of renal cell carcinoma and its implications in diagnosis, prognosis, and therapeutics. Adv Cancer Res, 89:157–81.PubMedCrossRefGoogle Scholar
  69. Tusher VG, Tibshirani R, Chu G. 2001. Significance analysis of microarrays applied to the ionizing radiation response. Proc Natl Acad Sci USA, 98:5116–21.PubMedCrossRefGoogle Scholar
  70. Ueno S, Ohki R, Hashimoto T et al. 2003. DNA microarray analysis of in vivo progression mechanism of heart failure. Biochem Biophys Res Commun, 307:771–7.PubMedCrossRefGoogle Scholar
  71. Urbanowska T, Mangialaio S, Hartmann C et al. 2003. Development of protein microarray technology to monitor biomarkers of rheumatoid arthritis disease. Cell Biol Toxicol, 19:189–202.PubMedCrossRefGoogle Scholar
  72. van’t Veer LJ, Dai H, van de Vijver MJ et al. 2002. Gene expression profiling predicts clinical outcome of breast cancer. Nature, 415: 530–6.CrossRefGoogle Scholar
  73. Vrana KE, Freeman WM, Aschner M. 2003. Use of microarray technologies in toxicology research. Neurotoxicology, 24:321–32.PubMedCrossRefGoogle Scholar
  74. Welford SM, Gregg J, Chen E et al. 1998. Detection of differentially expressed genes in primary tumor tissues using representational differences analysis coupled to microarray hybridization. Nucleic Acids Res, 26:3059–65.PubMedCrossRefGoogle Scholar
  75. Welsh JB, Zarrinker PP, Sapinoso LM et al. 2001. Analysis of gene expression profiles in normal and neoplastic ovarian tissue samples identifies candidate molecular markers of epithelial ovarian cancer. Proc Natl Acad Sci USA, 98:1176–81.PubMedCrossRefGoogle Scholar
  76. Zhang H, Ramanathan Y, Soteropoulos P et al. 2002. EZ-Retrieve: a webserver for batch retrieval of coordinate-specified human DNA sequences and underscoring putative transcription factor-binding sites. Nucleic Acids Res, 30:E121.PubMedCrossRefGoogle Scholar

Copyright information

© Open Mind Journals Limited 2004

Authors and Affiliations

  1. 1.University of Pittsburgh Cancer InstituteUniversity of Pittsburgh Medical SchoolPittsburghUSA
  2. 2.Department of PathologyUniversity of Pittsburgh Medical SchoolPittsburghUSA
  3. 3.Cancer Biomarkers LaboratoryUniversity of Pittsburgh Medical SchoolPittsburghUSA
  4. 4.Center for Pathology InformaticsUniversity of Pittsburgh Medical SchoolPittsburghUSA
  5. 5.Benedum Center for Oncology InformaticsUniversity of Pittsburgh Medical SchoolPittsburghUSA
  6. 6.Center for Biomedical InformaticsUniversity of Pittsburgh Medical SchoolPittsburghUSA
  7. 7.Interdisciplinary Biomedical Graduate ProgramUniversity of Pittsburgh Medical SchoolPittsburghUSA

Personalised recommendations