Abstract
The identification of coexpressed genes is a challenging problem in microarray data analysis due to a very high number of genes and low number of samples normally available. This paper presents a shape-output clustering method which is engaged in the analysis of a real-world time series microarray data from the industrial microbiology area. The proposed approach uses the changes in gene expression levels to group genes based on their shape measured over time in several samples. Furthermore, these coexpression patterns are correlated with the measured outputs of production and growth available for each sample. Experiments are performed for time series microarray of a bacteria and an analysis from a biological perspective is carried out. The obtained results confirm the existence of relationships between output variables and gene expressions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
The microarray dataset obtained and used in this paper is available at request for academic purposes.
References
Chira C, Sedano J, Villar JR, Prieto C, Corchado E (2013) Gene clustering in time series microarray analysis. In: Proceedings of International joint conference SOCO’13-CISIS’13-ICEUTE’13 - Salamanca, Spain, pp 289–298, 11-13 Sept 2013
Coffey N, Hinde J (2011) Analyzing time-course microarray data using functional data analysis - a review. Stat Appl Genet Mol Biol 10: Article 23
Dharmadi Y, Gonzalez R (2004) DNA microarrays: experimental issues, data analysis, and application to bacterial systems. Biotechnol Prog 20(5):1309–1324
Ernst J, Bar-Joseph Z (2006) Stem: a tool for the analysis of short time series gene expression data. BMC Bioinformatics 7(1):191
Kang A, Chang M (2012) Identification and reconstitution of genetic regulatory networks for improved microbial tolerance to isooctane. Mol BioSyst 8:1350–1358
Larrañaga P, Calvo B, Santana R, Bielza C, Galdiano J, Inza I, Lozano JA, Armañanzas R, Santafé G, Pérez A, Robles V (2006) Machine learning in bioinformatics. Briefings Bioinf 7(1):86–112
Lee C-P, Leu Y (2011) A novel hybrid feature selection method for microarray data analysis. Appl Soft Comput 11:208–213
Liu T, Lin N, Shi N, Zhang B (2009) Information criterion-based clustering with order-restricted candidate profiles in short time-course microarray experiments. BMC Bioinformatics 10(1):146
Nieselt K, Battke F, Herbig A, Bruheim P, Wentzel A, Jakobsen O, Sletta H, Alam M, Merlo M, Moore J, Omara W, Morrissey E, Juarez-Hermosillo M, Rodriguez-Garcia A, Nentwich M, Thomas L, Iqbal M, Legaie R, Gaze W, Challis G, Jansen R, Dijkhuizen L, Rand D, Wild D, Bonin M, Reuther J, Wohlleben W, Smith M, Burroughs N, Martin J (2010) The dynamic architecture of the metabolic switch in streptomyces coelicolor. BMC Genomics 11(1):10
Pandey G, Yoshikawa K, Hirasawa T, Nagahisa K, Katakura Y, Furusawa C, Shimizu H, Shioya S (2007) Extracting the hidden features in saline osmotic tolerance in saccharomyces cerevisiae from dna microarray data using the self-organizing map: biosynthesis of amino acids. Appl Microbiol Biotechnol 75:415–426
Peddada SD, Lobenhofer EK, Li L, Afshari CA, Weinberg CR, Umbach DM (2003) Gene selection and clustering for time-course and doseresponse microarray experiments using order-restricted inference. Bioinformatics 19(7):834–841
Phan S, Famili F, Tang Z, Pan Y, Liu Z, Ouyang J, Lenferink A, O’connor MM-C (2007) A novel pattern based clustering methodology for time-series microarray data. Int J Comput Math 84:585–597
Pickens L, Tang Y, Chooi Y-H (2011) Metabolic engineering for the production of natural products. Annu Rev Chem Biomol Eng 2(1):211–236
Saeys Y, Inza I, Larrañaga P (2007) A review of feature selection techniques in bioinformatics. Bioinformatics 23(19):2507–2517
Smyth G, Speed T (2003) Normalization of cdna microarray data. Methods 31(4):265–273
Storey JD, Xiao W, Leek JT, Tompkins RG, Davis RW (2005) Significance analysis of time course microarray experiments. Proc Nat Acad Sci U S A 102(36):12837–12842
Tummala S, Junne S, Paredes C, Papoutsakis E (2003) Transcriptional analysis of product-concentration driven changes in cellular programs of recombinant clostridium acetobutylicumstrains. Biotechnol Bioeng 84(7):842–854
Acknowledgments
This research has been supported through Junta de Castilla y Len projects BIO/BU09/14, CCTT/10/BU/0002 and Fundacin Universidad de Oviedo project FUO-EM-340-13.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Chira, C., Sedano, J., Villar, J.R., Camara, M., Prieto, C. (2015). Shape-Output Gene Clustering for Time Series Microarrays. In: Herrero, Á., Sedano, J., Baruque, B., Quintián, H., Corchado, E. (eds) 10th International Conference on Soft Computing Models in Industrial and Environmental Applications. Advances in Intelligent Systems and Computing, vol 368. Springer, Cham. https://doi.org/10.1007/978-3-319-19719-7_21
Download citation
DOI: https://doi.org/10.1007/978-3-319-19719-7_21
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-19718-0
Online ISBN: 978-3-319-19719-7
eBook Packages: EngineeringEngineering (R0)