A Multi-gene-Feature-Based Genetic Algorithm for Prediction of Operon
The prediction of operons is critical to reconstruction of regulatory networks at the whole genome level. In this paper, a multi-approach guided genetic algorithm is developed to prediction of operon. The fitness function is created by using intergenic distance of local entropy-minimization, participation of the same metabolic pathway, log-likelihood of COG gene functions and correlation coefficient of microarray expression data, which have been used individually for predicting operons. The gene pairs within operons have high fitness value by using these four scoring criteria, whereas those across transcription unit borders have low fitness value. On the other hand, it is easy to predict operons and makes the prediction ability stronger by using these four scoring criteria. The proposed method is examined on 683 known operons of Escherichia coli K12 and an accuracy of 85.9987% is obtained.
KeywordsGenetic Algorithm Gene Pair Adjacent Gene Microarray Expression Data Intergenic Distance
Unable to display preview. Download preview PDF.
- 8.Craven, M., Page, D., Shavlik, J., Bockhorst, J., Glasner, J.: A probabilistic learning approach to whole-genome operon prediction. In: Proc. 8th International Conference on Intelligent Systems for Mol. Biol., pp. 116–127 (2000)Google Scholar
- 9.Chen, X., Su, Z.C., Xu, Y., Jiang, T.: Computational Prediction of Operons in Synechococcus sp.WH8102. Genome Informatics. Genome Informatics 15, 211–222 (2004)Google Scholar
- 10.Dam, P., Olman, V., Xu, Y.: Improving Operon Prediction in E. coli. In: 2005 IEEE Computational Systems Bioinformatics Conference - Workshops (CSBW), pp. 69–70 (2005)Google Scholar