Abstract
We provide an overview on the state of the art for the Omics technologies, the types of omics data and the bioinformatics resources relevant and related to Omics. We also illustrate the bioinformatics challenges of dealing with high-throughput data. This overview touches several fundamental aspects of Omics and bioinformatics: data standardisation, data sharing, storing Omics data appropriately and exploring Omics data in bioinformatics. Though the principles and concepts presented are true for the various different technological fields, we concentrate in three main Omics fields namely: genomics, transcriptomics and proteomics. Finally we address the integration of Omics data, and provide several useful links for bioinformatics and Omics.
Key words
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Knasmüller, S. et al. (2008) Use of conventional and -omics based methods for health claims of dietary antioxidants: A critical overview. Br J Nutr 99, ES3–52.
Hillieret, L.W. et al. (2008) Whole-genome sequencing and variant discovery in C. elegans. Nat Methods 5, 183–88.
Johnson, D.S. et al. (2007) Genome-wide mapping of in vivo protein–DNA interactions. Science 316, 1441–42.
Mortazavi, A. et al. (2008) Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods 5, 621–8.
Rustici, G. et al. (2008) Data storage and analysis in ArrayExpress and Expression Profiler. Curr Protoc Bioinformatics 7, 7–13.
Whetzel, P.L. et al. (2006) The MGED Ontology: A resource for semantics-based description of microarray experiments. Bioinformatics 22, 866–73.
Burge, C., Birney, E., and Fickett, J. (2002) Top 10 future challenges for bioinformatics. Genome Technol 17, 1–3.
Havlak, P. et al. (2004) The Atlas genome assembly system. Genome Res 14, 721–32.
Batzoglou, S. et al. (2002) ARACHNE: A whole genome shotgun assembler. Genome Res 12, 177–89.
Myers, E.W. et al. (2000) A whole-genome assembly of Drosophila. Science 287, 2196–204.
Huang, X. et al. (2003) PCAP: A whole-genome assembly program. Genome Res 13, 2164–70.
Mullikin, J.C., and Ning, Z. (2003) The Phusion assembler. Genome Res 13, 81–90.
Pevzner, P.A., Tang, H., and Waterman, M.S. (2001) An Eulerian path approach to DNA fragment assembly. Proc Natl Acad Sci USA 14, 9748–53.
Hernandez, D. et al. (2008) De novo bacterial genome sequencing: Millions of very short reads assembled on a desktop computer. Genome Res 18, 802–9.
Idury, R., and Waterman, M. (1995) A new algorithm for DNA sequence assembly. J Comput Biol 2, 291–306.
Pevzner, P., and Tang, H. (2001) Fragment assembly with double-barrelled data. Bioinformatics 17, S225–33.
Chaisson, M.J., and Pevzner, P.A. (2008) Short read fragment assembly of bacterial genomes. Genome Res 18, 324–30.
Zerbino, D.R., and Birney, E. (2008) Velvet: Algorithms for de novo short read assembly using de Bruijn graphs. Genome Res 18, 821–9.
Ossowski, S. et al. (2008) Sequencing of natural strains of Arabidopsis thaliana with short reads. Genome Res 18, 2024–33.
Farrer, R.A. et al. (2009) De novo assembly of the Pseudomonas syringae pv. syringae B728a genome using Illumina/Solexa short sequence reads. FEMS Microbiol Lett 1, 103–11.
Wakaguri, H. et al. (2008) DBTSS: Database of transcription start sites, progress report. Nucleic Acids Res 36, D97–101.
Chen, X. et al. (2009) High throughput genome-wide survey of small RNAs from the parasitic protists Giardia intestinalis and Trichomonas vaginalis. Genome Biol Evol 1, 165–75.
Butler, J. et al. (2008) ALLPATHS: De novo assembly of whole-genome shotgun microreads. Genome Res 18, 810–20.
Chen, J., and Skiena, S. (2007) Assembly for double-ended short-read sequencing technologies. In ‘Advances in Genome Sequencing Technology and Algorithms’, edited by E. Mardis, S. Kim, and H. Tang. Artech House Publishers, Boston.
Simpson, J.T. et al. (2009) ABySS: A parallel assembler for short read sequence data. Genome Res 9, 1117–23.
Jackson, B.G., Schnable, P.S., and Aluru, S. (2009) Parallel short sequence assembly of transcriptomes. BMC Bioinformatics 10, S1–14.
Spudich, G., Fernandez-Suarez, X.M., and Birney, E. (2007) Genome browsing with Ensembl: A practical overview. Brief Funct Genomic Proteomic 6, 202–19.
Vizcaíno, J.A. et al. (2009) A guide to the Proteomics Identifications Database proteomics data repository. Proteomics 9, 4276–83.
Hunter, S. et al. (2009) InterPro: The integrative protein signature database. Nucleic Acids Res 37, 211–15.
Cesareni, G. et al. (2005) Comparative interatcomics. FEBS Lett 579, 1828–33.
Brazma, A. et al. (2001) Minimum information about a microarray experiment (MIAME)-toward standards for microarray data. Nat Genet 29, 365–71.
Levy, S. et al. (2007) The diploid genome sequence of an individual human. PLoS Biol 5, 2113–44.
Wheeler, D.A. et al. (2008) The complete genome of an individual by massively parallel DNA sequencing. Nature 452, 872–76.
Venter, J.C. et al. (2001) The sequence of the human genome. Science 291, 1304–51.
Spencer, C.C. et al. (2006) The influence of recombination on human genetic diversity. PLoS Genet 2, e148.
Brazma, A. et al. (2003) ArrayExpress – A public repository for microarray gene expression data at the EBI. Nucleic Acids Res 31, 68–71.
Edgar, R., Domrachev, M., and Lash, A.E. (2002) Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res 30, 207–10.
Ikeo, K. et al. (2003) CIBEX: Center for information biology gene expression database. C R Biol 326, 1079–82.
Parkinson, H. et al. (2009) ArrayExpress update – From an archive of functional genomics experiments to the atlas of gene expression. Nucleic Acids Res 37, 868–72.
Aranda, B. et al. (2009) The IntAct molecular interaction database. Nucleic Acid Res. 1–7 doi:10.1093/nar/gkp878.
Orchard, O. et al. (2007) The minimum information required for reporting a molecular interaction experiment (MIMIx). Nat Biotechnol 25, 894–8.
Kiemer, L., and Cesareni, G. (2007) Comparative interactomics: Comparing apples and pears? Trends Biotechnol 25, 448–54.
Kiemer, L. et al. (2007) WI-PHI: A weighted yeast interactome enriched for direct physical interactions. Proteomics 7, 932–43.
Joyce, A.R., and Palsson, B.Ø. (2006) The model organism as a system: Integrating ‘omics’ data sets. Nat Rev Mol Cell Biol 7, 198–210.
Akula, S.P. et al. (2009) Techniques for integrating -omics Data. Bioinformation 3, 284–6.
Haider, S. et al. (2009) BioMart Central Portal – Unified access to biological data. Nucleic Acids Res 1, W23–27.
Li, P. et al. (2008) Performing statistical analyses on quantitative data in Taverna workflows: An example using R and maxdBrowse to identify differentially-expressed genes from microarray data. BMC Bioinformatics 9, 334.
Acknowledgements
The authors would like to thank Dr. Gabriella Rustici and Dr. Daniel Zerbino for useful insights and information on transcriptomics and genome assembly respectively. The authors would also like to thank Dr. James Watson for useful comments to the manuscript.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer Science+Business Media, LLC
About this protocol
Cite this protocol
Schneider, M.V., Orchard, S. (2011). Omics Technologies, Data and Bioinformatics Principles. In: Mayer, B. (eds) Bioinformatics for Omics Data. Methods in Molecular Biology, vol 719. Humana Press. https://doi.org/10.1007/978-1-61779-027-0_1
Download citation
DOI: https://doi.org/10.1007/978-1-61779-027-0_1
Published:
Publisher Name: Humana Press
Print ISBN: 978-1-61779-026-3
Online ISBN: 978-1-61779-027-0
eBook Packages: Springer Protocols