Abstract
Gene expressions have been commonly applied to understand the inherent underlying mechanism of known biological processes. Although the microarray gene expressions usually appear aperiodic, with proper signal processing techniques, its periodic components can be easily obtained. Thus, if expressions of interconnected (regulatory and regulated) genes are decomposed, at least one common frequency component will appear in these genes. Exploiting this novel concept, we propose a frequency decomposition approach for gene clustering to better understand the gene interconnection topology. This method, based on Hilbert Huang Transform (HHT) enables us to segregate every periodic component of the gene expressions. Next, a multilevel clustering is performed based on these frequency components. Unlike existing clustering algorithms, the proposed method assimilates a meaningful knowledge of the gene interactions topology. The information related to underlying gene interactions is vital and can prove useful in many existing evolutionary optimisation algorithms for genetic network reconstruction. We validate the entire approach by its application to a 15-gene synthetic network.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Smith, A.A.: Classification and alignment of gene-expression time-series data. Ph.D. thesis, UNIVERSITY OF WISCONSIN-MADISON (2009)
Aach, J., Church, G.M.: Aligning gene expression time series with time warping algorithms. Bioinformatics 17(6), 495–508 (2001)
Bar-Joseph, Z., Gerber, G.K., Gifford, D.K., Jaakkola, T.S., Simon, I.: Continuous representations of time-series gene expression data. J. Comput. Biol. 10(3–4), 341–356 (2003)
Ernst, J., Nau, G.J., Bar-Joseph, Z.: Clustering short time series gene expression data. Bioinformatics 21(suppl 1), i159–i168 (2005)
Déjean, S., Martin, P.G., Baccini, A., Besse, P.: Clustering time-series gene expression data using smoothing spline derivatives. EURASIP J. Bioinform. Syst. Biol. 2007(1), 70561 (2007)
Ooi, C.H., Chetty, M., Teng, S.W.: Differential prioritization in feature selection and classifier aggregation for multiclass microarray datasets. Data Min. Knowl. Disc. 14(3), 329–366 (2007)
Feng, J., Barbano, P.E., Mishra, B.: Time-frequency feature detection for time-course microarray data. In: Proceedings of the 2004 ACM symposium on Applied computing, pp. 128–132. ACM (2004)
Du, L., Wu, S., Liew, A.W.-C., Smith, D.K.: Spectral analysis of microarray gene expression time series data of plasmodium falciparum. Int. J. Bioinform. Res. Appl. 4(3), 337–349 (2008)
Rosa, B.A., Jiao, Y., Oh, S., Montgomery, B.L., Qin, W., Chen, J.: Frequency-based time-series gene expression recomposition using priism. BMC Syst. Biol. 6(1), 69 (2012)
Sussillo, D., Kundaje, A., Anastassiou, D.: Spectrogram analysis of genomes. EURASIP J. Appl. Sig. Process. 2004, 29–42 (2004)
Dodin, G., Vandergheynst, P., Levoir, P., Cordier, C., Marcourt, L.: Fourier and wavelet transform analysis, a tool for visualizing regular patterns in dna sequences. J. Theor. Biol. 206(3), 323–326 (2000)
Bar-Joseph, Z.: Analyzing time series gene expression data. Bioinformatics 20(16), 2493–2503 (2004)
Huang, N.E., Shen, Z., Long, S.R., Wu, M.C., Shih, H.H., Zheng, Q., Yen, N.-C., Tung, C.C., Liu, H.H.: The empirical mode decomposition and the hilbert spectrum for nonlinear and non-stationary time series analysis. In: Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences, vol. 454, no. 1971, pp. 903–995 (1998)
Huang, N.E., Shen, S.S.: Hilbert-Huang Transform and Its Applications, vol. 5. World Scientific, Singapore (2005)
Chowdhury, A.R., Chetty, M., Vinh, N.X.: Incorporating time-delays in s-system model for reverse engineering genetic networks. BMC Bioinform. 14(1), 196 (2013)
Ram, R., Chetty, M.: A markov-blanket-based model for gene regulatory network inference. IEEE/ACM Trans. Comput. Biol. Bioinform. 8(2), 353–367 (2011)
Mariani, T.J., Budhraja, V., Mecham, B.H., Gu, C.C., Watson, M.A., Sadovsky, Y.: A variable fold change threshold determines significance for expression microarrays. FASEB J. 17(2), 321–323 (2003)
Marbach, D., Schaffter, T., Mattiussi, C., Floreano, D.: Generating realistic in silico gene networksfor performance assessment of reverse engineering methods. J. Comput. Biol. 16(2), 229–239 (2009)
Stolovitzky, G., Monroe, D., Califano, A.: Dialogue on reverse-engineering assessment and methods. Ann. N. Y. Acad. Sci. 1115(1), 1–22 (2007)
Schaffter, T., Marbach, D., Roulet, G.: Genenetweaver user manual (2012)
Chowdhury, A.R., Chetty, M.: An improved method to infer gene regulatory network using s-system. In: 2011 IEEE Congress on Evolutionary Computation (CEC), pp. 1012–1019, IEEE (2011)
Xuan, N., Chetty, M., Coppel, R., Wangikar, P.P.: Gene regulatory network modeling via global optimization of high-order dynamic bayesian network. BMC Bioinform. 13(1), 131 (2012)
Jiang, D., Tang, C., Zhang, A.: Cluster analysis for gene expression data: a survey. IEEE Trans. Knowl. Data Eng. 16(11), 1370–1386 (2004)
Acknowledgment
This research is supported by the Australia-India strategic research fund (AISRF) having the reference number BF040037.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Rahman, M.A., Chetty, M., Bulach, D., Wangikar, P.P. (2015). Frequency Decomposition Based Gene Clustering. In: Arik, S., Huang, T., Lai, W., Liu, Q. (eds) Neural Information Processing. ICONIP 2015. Lecture Notes in Computer Science(), vol 9490. Springer, Cham. https://doi.org/10.1007/978-3-319-26535-3_20
Download citation
DOI: https://doi.org/10.1007/978-3-319-26535-3_20
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26534-6
Online ISBN: 978-3-319-26535-3
eBook Packages: Computer ScienceComputer Science (R0)