Abstract
The term omics refers to a family of biology disciplines such as genomics, proteomics, and interactomics. The suffix -ome denotes the object of study of each discipline (the genome, proteome, or interactome) and usually refers to a totality of some sort. This paper introduces omics data and the main computational techniques for their storage, preprocessing, and analysis. The increasing availability of omics data, driven by high-throughput technologies, raises new data management and analysis problems that parallel and distributed storage systems and algorithms can help address. After surveying the main omics databases, preprocessing techniques, and analysis approaches, the paper describes some recent bioinformatics tools in genomics, proteomics, and interactomics that use a distributed approach.
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cannataro, M., Guzzi, P.H. (2012). Distributed Management and Analysis of Omics Data. In: Alexander, M., et al. Euro-Par 2011: Parallel Processing Workshops. Euro-Par 2011. Lecture Notes in Computer Science, vol 7156. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29740-3_6
DOI: https://doi.org/10.1007/978-3-642-29740-3_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-29739-7
Online ISBN: 978-3-642-29740-3
eBook Packages: Computer Science (R0)