Integration Analysis of Diverse Genomic Data Using Multi-clustering Results
In modern data mining applications, clustering algorithms are among the most important approaches, because these algorithms group elements in a dataset according to their similarities, and they do not require any class label information. In recent years, various methods for ensemble selection and clustering result combinations have been designed to optimize clustering results. Moreover, conducting data analysis using multiple sources, given the complexity of data objects, is a much more powerful method than evaluating each source separately. Therefore, a new paradigm is required that combines the genome-wide experimental results of multi-source datasets. However, multi-source data analysis is more difficult than single source data analysis. In this paper, we propose a new clustering ensemble approach for multi-source bio-data on complex objects. In addition, we present encouraging clustering results in a real bio-dataset examined using our proposed method.
KeywordsCluster Algorithm Chronic Fatigue Syndrome Cluster Result Cluster Ensemble Roulette Wheel Selection
Unable to display preview. Download preview PDF.
- 1.Alexander, P.T., Behrouz, M.-B., Anil, K.J., William, F.P.: Adaptive clustering ensembles. In: Proceedings of the International Conference on Pattern Recognition, vol. 1, pp. 272–275 (2004)Google Scholar
- 2.Alexander, S., Joydeep, G.: Cluster ensembles-A knowledge reuse framework for combining partitionings. Journal of Machine Learning 3, 583–617 (2002)Google Scholar
- 5.Everitt, B.: Cluster analysis. John Wiley and Sons, Inc., Chichester (1993)Google Scholar
- 6.Greene, D., Tsymbal, A., Bolshakova, N., Cunningham, P.: Ensemble clustering in medical diagnostics. In: Proceedings of the 17th IEEE Symposium on Computer-Based Medical Systems, pp. 576–581 (2004)Google Scholar
- 7.Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: A review. ACM Computing Surveys 31 (1999)Google Scholar
- 8.Kaufman, L., Rosseeuw, P.J.: Finding groups in data: An introduction to cluster analysis. John Wiley and Sons, Inc., Chichester (1990)Google Scholar
- 9.Larray, T.H.Y., Fu-lai, C., Stephen, C.F.: Using emerging pattern based projected clustering and gene expression data for cancer detection. In: Proceedings of the Asia-Pacific Bioinformatics Conference, vol. 29, pp. 75–87 (2004)Google Scholar
- 11.Qiu, P., Wang, Z.J., Liu, K.J.: Ensemble dependence model for classification and prediction of cancer and normal gene expression data. Bioinformatics and Bioengineering, 251–258 (2004)Google Scholar
- 12.Theodoridis, S., Koutroumbas, K.: Pattern recognition. Academic Press, London (1999)Google Scholar
- 13.Xiaohua, H., Illhoi, Y.: Cluster ensemble and its applications in gene expression. In: Proceedings of the Asia-Pacific Bioinformatics Conference, vol. 29, pp. 297–302 (2004)Google Scholar
- 14.Zhou, Z.-H., Tang, W.: Clustering ensemble. Knowledge-Based Systems (2006)Google Scholar