Hub Gene Selection Methods for the Reconstruction of Transcription Networks
Transcription control networks have a scale-free topological structure: While most genes are involved in a reduced number of links, a few hubs or key regulators are connected to a significantly large number of nodes. Several methods have been developed for the reconstruction of these networks from gene expression data, e.g. ARACNE. However, few of them take into account the scale-free structure of transcription networks. In this paper, we focus on the hubs that commonly appear in scale-free networks. First, three feature selection methods are proposed for the identification of those genes that are likely to be hubs and second, we introduce an improvement in ARACNE so that this technique can take into account the list of hub genes generated by the feature selection methods. Experiments with synthetic gene expression data validate the accuracy of the feature selection methods in the task of identifying hub genes. When ARACNE is combined with the output of these methods, we achieve up to a 62% improvement in performance over the original reconstruction algorithm. Finally, the best method for identifying hub genes is validated on a set of expression profiles from yeast.
KeywordsTranscription network ARACNE Automatic relevance determination Group Lasso Maximum relevance minimum redundancy Scale-free Hub
- 2.Stolovitzky, G., Califano, A.: Systems biology: Making sense of oceans of biological data. The New York Academy of Sciences Update Magazine, 20–23 (March/April 2006)Google Scholar
- 29.Davis, J., Goadrich, M.: The relationship between precision-recall and roc curves. In: ICML 2006, pp. 223–240 (2006)Google Scholar