A Multi-Label Classification Framework to Predict Disease Associations of Long Non-coding RNAs (lncRNAs)
In this paper, the automated detection of tissue specific disease association of long non-coding RNAs (lncRNAs) is modeled as a multi-label classification task, where a single lncRNA transcript may be associated with several diseases in a tissue specific manner. Four algorithms are evaluated and compared in this task. Furthermore, in this article we put emphasis on the fact that secondary structure and the composition features of the lncRNAs dictate their functions that led us to develop a new multi-label feature extraction scheme. Experiments are conducted on a set of 7,566 lncRNA transcripts with 22 tissue labels, and the results provide interesting insights into the quality of the discussed algorithms and the features.
KeywordsLong non-coding RNA Multi-label classification Association
- 1.Sacco LD, Baldassarre A, Masotti A (2012) Bioinformatics tools and novel challenges in long non-coding RNAs (lncRNAs) functional analysis. Int J Mol Sci 13(1):97–114Google Scholar
- 9.Santos AM, Canuto AMP, Neto AF (2011) A comparative analysis of classification methods to multi-label tasks in different application domains. Int J Comput Inf Syst Ind Manag Appl 3:218–227Google Scholar