Omic Data, Information Derivable and Computational Needs

  • Ying Xu
  • Juan Cui
  • David Puett


Cancer is probably the most complex class of human diseases. Its complexity lies in: (1) its rapidly evolving population of cells that drift away from their normal functional states at the molecular, epigenomic and genomic levels, (2) its growth and expansion to encroach and replace normal tissue cells; and (3) its abilities to resist both endogenous and exogenous measures for stopping or slowing down its growth. According to Hanahan and Weinberg, cancer cells, regardless of the type, tend to have eight hallmark characteristics (Hanahan and Weinberg 2011). As introduced in Chap. 1, these hallmarks are: (1) reprogrammed energy metabolism, (2) sustained cell-growth signaling, (3) evading growth suppressors, (4) resisting cell death, (5) enabling replicative immortality, (6) inducing angiogenesis, (7) avoiding immune destruction, and (8) activating cell invasion and metastasis. Other authors have suggested some additional hallmarks of cancer such as tumor-promoting inflammation (Colotta et al. 2009) and deregulated extracellular matrix dynamics (Lu et al. 2012). These recognized hallmarks have provided an effective framework for addressing cancer-related questions, having led to a deeper understanding of this disease. However, the reality is that our overall ability in curing cancer has not yet made substantive improvements, particularly in adult cancers that account for 99 % of all cancers since the start of the “War on Cancer” in 1971 (The-National-Cancer-Act 1971).


Transcriptomic Data Cancer Genome Omic Data Metabolomic Data Pathway Enrichment Analysis 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. Bairoch A (2000) The ENZYME database in 2000. Nucleic acids research 28: 304–305CrossRefGoogle Scholar
  2. Barkow S, Bleuler S, Prelic A et al. (2006) BicAT: a biclustering analysis toolbox. Bioinformatics 22: 1282–1283CrossRefGoogle Scholar
  3. Bentley J, Diggle CP, Harnden P et al. (2004) DNA double strand break repair in human bladder cancer is error prone and involves microhomology-associated end-joining. Nucleic acids research 32: 5249–5259CrossRefGoogle Scholar
  4. Bindra RS, Crosby ME, Glazer PM (2007) Regulation of DNA repair in hypoxic cancer cells. Cancer Metastasis Rev 26: 249–260CrossRefGoogle Scholar
  5. Bjorkhem I, Meaney S (2004) Brain cholesterol: long secret life behind a barrier. Arterioscler Thromb Vasc Biol 24: 806–815CrossRefGoogle Scholar
  6. Boshier PR, Marczin N, Hanna GB (2010) Repeatability of the measurement of exhaled volatile metabolites using selected ion flow tube mass spectrometry. J Am Soc Mass Spectrom 21: 1070–1074CrossRefGoogle Scholar
  7. Cancer-Genome-Atlas-Research (2008) Comprehensive genomic characterization defines human glioblastoma genes and core pathways. Nature 455: 1061–1068CrossRefGoogle Scholar
  8. Cancer-Genome-Atlas-Research (2011) Integrated genomic analyses of ovarian carcinoma. Nature 474: 609–615CrossRefGoogle Scholar
  9. Cancer-Genome-Atlas-Research (2012a) Comprehensive genomic characterization of squamous cell lung cancers. Nature 489: 519–525CrossRefGoogle Scholar
  10. Cancer-Genome-Atlas-Research (2012b) Comprehensive molecular characterization of human colon and rectal cancer. Nature 487: 330–337CrossRefGoogle Scholar
  11. Cancer-Genome-Atlas-Research (2012c) Comprehensive molecular portraits of human breast tumours. Nature 490: 61–70CrossRefGoogle Scholar
  12. Cancer-Genome-Atlas-Research (2013a) Comprehensive molecular characterization of clear cell renal cell carcinoma. Nature 499: 43–49CrossRefGoogle Scholar
  13. Cancer-Genome-Atlas-Research (2013b) Genomic and epigenomic landscapes of adult de novo acute myeloid leukemia. The New England journal of medicine 368: 2059–2074CrossRefGoogle Scholar
  14. Cancer-Genome-Atlas-Research, Weinstein JN, Collisson EA et al. (2013) The Cancer Genome Atlas Pan-Cancer analysis project. Nature genetics 45: 1113–1120CrossRefGoogle Scholar
  15. Cavalli-Sforza LL (2005) The Human Genome Diversity Project: past, present and future. Nat Rev Genet 6: 333–340Google Scholar
  16. Chen JS, Hung WS, Chan HH et al. (2013) In silico identification of oncogenic potential of fyn-related kinase in hepatocellular carcinoma. Bioinformatics 29: 420–427CrossRefGoogle Scholar
  17. Cheng Z, Ventura M, She X et al. (2005) A genome-wide comparison of recent chimpanzee and human segmental duplications. Nature 437: 88–93CrossRefGoogle Scholar
  18. Church GM (2005) The personal genome project. Mol Syst Biol 1: 2005 0030Google Scholar
  19. Colotta F, Allavena P, Sica A et al. (2009) Cancer-related inflammation, the seventh hallmark of cancer: links to genetic instability. Carcinogenesis 30: 1073–1081CrossRefGoogle Scholar
  20. Cotton RG, Auerbach AD, Axton M et al. (2008) GENETICS. The Human Variome Project. Science 322: 861–862CrossRefGoogle Scholar
  21. Croft D, O’Kelly G, Wu G et al. (2011) Reactome: a database of reactions, pathways and biological processes. Nucleic acids research 39: D691–697CrossRefGoogle Scholar
  22. Cui J, Chen Y, Chou WC et al. (2011) An integrated transcriptomic and computational analysis for biomarker identification in gastric cancer. Nucleic acids research 39: 1197–1207CrossRefGoogle Scholar
  23. D’Souza I, Poorkaj P, Hong M et al. (1999) Missense and silent tau gene mutations cause frontotemporal dementia with parkinsonism-chromosome 17 type, by affecting multiple alternative RNA splicing regulatory elements. Proceedings of the National Academy of Sciences of the United States of America 96: 5598–5603CrossRefGoogle Scholar
  24. Downing JR, Wilson RK, Zhang J et al. (2012) The Pediatric Cancer Genome Project. Nature genetics 44: 619–622CrossRefGoogle Scholar
  25. Edgar R, Domrachev M, Lash AE (2002) Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic acids research 30: 207–210CrossRefGoogle Scholar
  26. Emmert-Buck MR, Bonner RF, Smith PD et al. (1996) Laser capture microdissection. Science 274: 998–1001CrossRefGoogle Scholar
  27. Essaghir A, Toffalini F, Knoops L et al. (2010) Transcription factor regulation can be accurately predicted from the presence of target gene signatures in microarray gene expression data. Nucleic acids research 38: e120CrossRefGoogle Scholar
  28. Forbes SA, Bindal N, Bamford S et al. (2011) COSMIC: mining complete cancer genomes in the Catalogue of Somatic Mutations in Cancer. Nucleic acids research 39: D945–D950CrossRefGoogle Scholar
  29. Grabherr MG, Haas BJ, Yassour M et al. (2011) Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol 29: 644–652CrossRefGoogle Scholar
  30. Green RE, Krause J, Briggs AW et al. (2010) A draft sequence of the Neanderthal genome. Science 328: 710–722CrossRefGoogle Scholar
  31. Greenman C, Stephens P, Smith R et al. (2007) Patterns of somatic mutation in human cancer genomes. Nature 446: 153–158CrossRefGoogle Scholar
  32. Hanahan D, Weinberg RA (2011) Hallmarks of cancer: the next generation. Cell 144: 646–674CrossRefGoogle Scholar
  33. Huang D, Sherman BT, Tan Q et al. (2007) The DAVID Gene Functional Classification Tool: a novel biological module-centric algorithm to functionally analyze large gene lists. Genome Biol 8: R183CrossRefGoogle Scholar
  34. Huang DW, Sherman BT, Lempicki RA (2009) Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nature Protocols 4: 44–57CrossRefGoogle Scholar
  35. Huebert DJ, Kamal M, O’Donovan A et al. (2006) Genome-wide analysis of histone modifications by ChIP-on-chip. Methods 40: 365–369CrossRefGoogle Scholar
  36. International-HapMap (2003) The International HapMap Project. Nature 426: 789–796CrossRefGoogle Scholar
  37. Iyer VR, Horak CE, Scafe CS et al. (2001) Genomic binding sites of the yeast cell-cycle transcription factors SBF and MBF. Nature 409: 533–538CrossRefGoogle Scholar
  38. Kamakaka RT, Biggins S (2005) Histone variants: deviants? Genes & development 19: 295–310CrossRefGoogle Scholar
  39. Kandoth C, Schultz N, Cherniack AD et al. (2013) Integrated genomic characterization of endometrial carcinoma. Nature 497: 67–73CrossRefGoogle Scholar
  40. Kanehisa M, Goto S, Furumichi M et al. (2010) KEGG for representation and analysis of molecular networks involving diseases and drugs. Nucleic acids research 38: D355–360CrossRefGoogle Scholar
  41. Kanehisa M, Goto S, Sato Y et al. (2012) KEGG for integration and interpretation of large-scale molecular data sets. Nucleic acids research 40: D109–114CrossRefGoogle Scholar
  42. Kanehisa M, Goto S, Sato Y et al. (2014) Data, information, knowledge and principle: back to metabolism in KEGG. Nucleic acids research 42: D199–205CrossRefGoogle Scholar
  43. Lander ES, Linton LM, Birren B et al. (2001) Initial sequencing and analysis of the human genome. Nature 409: 860–921CrossRefGoogle Scholar
  44. Li G, Ma Q, Tang H et al. (2009) QUBIC: a qualitative biclustering algorithm for analyses of gene expression data. Nucleic acids research 37: e101CrossRefGoogle Scholar
  45. Liu R, Hannenhalli S, Bucan M (2009) Motifs and cis-regulatory modules mediating the expression of genes co-expressed in presynaptic neurons. Genome Biol 10: R72CrossRefGoogle Scholar
  46. Lu P, Weaver VM, Werb Z (2012) The extracellular matrix: a dynamic niche in cancer progression. The Journal of cell biology 196: 395–406CrossRefGoogle Scholar
  47. McKeehan W (1982) Glycolysis, glutaminolysis and cell proliferation. Cell Biology International Reports 6: 635–650CrossRefGoogle Scholar
  48. Nazarov PV, Muller A, Khutko V et al. Co-Expression Analysis of Large Microarray Data Sets using Coexpress Software Tool. In: Seventh International Workshop on Computational Systems Biology, WCSB 2010, 2010.Google Scholar
  49. Nishimura D (2001) BioCarta. Biotech Software & Internet Report 2:Google Scholar
  50. Nowell P, Hungerford D (1960) A minute chromosome in human chronic granulocytic leukemia. Science 132:Google Scholar
  51. Pearl J (2009) Causal inference in statistics: An overview. Statistics Surveys 3: 96–146CrossRefMATHMathSciNetGoogle Scholar
  52. Qian J, Lin J, Luscombe NM et al. (2003) Prediction of regulatory networks: genome-wide identification of transcription factor targets from gene expression data. Bioinformatics 19: 1917–1926CrossRefGoogle Scholar
  53. Ren B, Robert F, Wyrick JJ et al. (2000) Genome-wide location and function of DNA binding proteins. Science 290: 2306–2309CrossRefGoogle Scholar
  54. Scanlon S, Glazer P (2013) Genetic Instability Induced by Hypoxic Stress. In: Mittelman D (ed) Stress-Induced Mutagenesis. Springer New York, pp 151–181CrossRefGoogle Scholar
  55. Scheer M, Grote A, Chang A et al. (2011) BRENDA, the enzyme information system in 2011. Nucleic acids research 39: D670–676CrossRefGoogle Scholar
  56. Schellenberger J, Park JO, Conrad TM et al. (2010) BiGG: a Biochemical Genetic and Genomic knowledgebase of large scale metabolic reconstructions. BMC Bioinformatics 11: 213CrossRefGoogle Scholar
  57. Segal E, Yelensky R, Kaushal A et al. (2004) GeneXPress: A Visualization and Statistical Analysis Tool for Gene Expression and Sequence Data. Paper presented at the 11th International Conference on Intelligent Systems for Molecular Biology (ISMB),Google Scholar
  58. Service RF (2006) Gene sequencing. The race for the $1000 genome. Science 311: 1544–1546CrossRefGoogle Scholar
  59. Steel M, Penny D (2000) Parsimony, likelihood, and the role of models in molecular phylogenetics. Molecular biology and evolution 17: 839–850CrossRefGoogle Scholar
  60. Strahl BD, Allis CD (2000) The language of covalent histone modifications. Nature 403: 41–45CrossRefGoogle Scholar
  61. The-National-Cancer-Act (1971) The National Cancer Act of 1971.Google Scholar
  62. The-Tumor-Metabolome (2011) The tumor metabolome.Google Scholar
  63. Trapnell C, Williams BA, Pertea G et al. (2010) Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol 28: 511–515CrossRefGoogle Scholar
  64. Truong LN, Li Y, Shi LZ et al. (2013) Microhomology-mediated End Joining and Homologous Recombination share the initial end resection step to repair DNA double-strand breaks in mammalian cells. Proceedings of the National Academy of Sciences of the United States of America 110: 7720–7725CrossRefGoogle Scholar
  65. Valen LV (1973) A new evolutionary law. Evolutionary Theory 1: 1–30Google Scholar
  66. Varma A, Palsson BO (1994) Metabolic Flux Balancing: Basic Concepts, Scientific and Practical Use. Nature Biotechnology 12: 994–998CrossRefGoogle Scholar
  67. Venter JC, Adams MD, Myers EW et al. (2001) The sequence of the human genome. Science 291: 1304–1351CrossRefGoogle Scholar
  68. Vignali M, Hassan AH, Neely KE et al. (2000) ATP-dependent chromatin-remodeling complexes. Molecular and cellular biology 20: 1899–1910CrossRefGoogle Scholar
  69. Wang Z, Gerstein M, Snyder M (2009) RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet 10: 57–63CrossRefGoogle Scholar
  70. Wishart DS, Jewison T, Guo AC et al. (2013) HMDB 3.0–The Human Metabolome Database in 2013. Nucleic acids research 41: D801–807CrossRefGoogle Scholar
  71. Wishart DS, Knox C, Guo AC et al. (2009) HMDB: a knowledgebase for the human metabolome. Nucleic acids research 37: D603-610CrossRefGoogle Scholar
  72. Wishart DS, Tzur D, Knox C et al. (2007) HMDB: the Human Metabolome Database. Nucleic acids research 35: D521–526CrossRefGoogle Scholar
  73. Xu K, Mao X, Mehta M et al. (2012) A comparative study of gene-expression data of basal cell carcinoma and melanoma reveals new insights about the two cancers. PLoS One 7: e30750CrossRefGoogle Scholar
  74. Yang AS, Estecio MR, Doshi K et al. (2004) A simple method for estimating global DNA methylation using bisulfite PCR of repetitive DNA elements. Nucleic acids research 32: e38CrossRefGoogle Scholar
  75. Zhao M, Sun JC, Zhao ZM (2013) TSGene: a web resource for tumor suppressor genes. Nucleic acids research 41: D970–D976CrossRefMathSciNetGoogle Scholar

Copyright information

© Springer Science+Business Media New York 2014

Authors and Affiliations

  • Ying Xu
    • 1
  • Juan Cui
    • 2
  • David Puett
    • 1
  1. 1.Department of Biochemistry and Molecular BiologyUniversity of GeorgiaAthensUSA
  2. 2.Department of Computer Science and EngineeringUniversity of NebraskaLincolnUSA

Personalised recommendations