Abstract
Disease diagnosis is one of the major data mining questions by the clinicians. The current diagnosis models usually have a strong assumption that one patient has only one disease, i.e. a single-label data mining problem. But the patients, especially when at the late stages, may have more than one disease and require a multi-label diagnosis. The multi-label data mining is much more difficult than a single-label one, and very few algorithms have been developed for this situation. Deep learning is a data mining algorithm with highly dense inner structure and has achieved many successful applications in the other areas. We propose a hypothesis that rectified-linear-unit-based deep learning algorithm may also be good at the clinical questions, by revising the last layer as a multi-label output. The proof-of-concept experimental data support the hypothesis, and the community may be interested in trying more applications.
References
Schmidhuber J (2015) Deep learning in neural networks: an overview. Neural Netw 61:85–117
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444
Fan XN, Zhang SW (2015) IncRNA-MFDL: identification of human long non-coding RNAs by fusing multiple features and using deep learning. Mol BioSyst 11:892–897
Liu F et al (2015) De novo identification of replication-timing domains in the human genome by deep learning. Bioinformatics
Heffernan R et al (2015) Improving prediction of secondary structure, local backbone angles, and solvent accessible surface area of proteins by iterative deep learning. Sci Rep 5:11476
Spencer M, Eickholt J, Jianlin C (2015) A deep learning network approach to ab initio protein secondary structure prediction. IEEE/ACM Trans Comput Biol Bioinf 12:103–112
Lusci A, Pollastri G, Baldi P (2013) Deep architectures and deep learning in chemoinformatics: the prediction of aqueous solubility for drug-like molecules. J Chem Inf Model 53:1563–1575
Nguyen SP, Shang Y, Xu D (2014) DL-PRO: A novel deep learning method for protein model quality assessment. In: Proceeding of International Joint Conference on Neural Networks, pp. 2071–2078
Zhang S et al (2015) A deep learning framework for modeling structural features of RNA-binding protein targets. Nucleic Acids Res 44(4):e32
Chakraborty C et al (2016) Rising strengths Hong Kong SAR in bioinformatics. Interdiscip Sci Comput Life Sci 1–13
Liu GP et al (2014) Deep learning based syndrome diagnosis of chronic gastritis. Comput Math Methods Med 2014:938350
Wang P et al (2016) hMuLab: a biomedical hybrid MUlti-LABel classifier based on multiple linear regression. IEEE/ACM Trans Comput Biol Bioinf
Lipton ZC et al (2015) Learning to diagnose with LSTM recurrent neural networks. arXiv preprint arXiv:1511.03677
Dudani SA (1976) The distance-weighted k-nearest-neighbor rule. IEEE Trans Syst Man Cybern SMC-6 325–327
Tsoumakas G, Katakis I, Vlahavas I (2011) Random k-labelsets for multilabel classification. IEEE Trans Knowl Data Eng 23(7):1079–1089
Tsoumakas G et al (2011) Mulan: a java library for multi-label learning. J Mach Learn Res 12:2411–2414
Tsoumakas G et al (2011) MULAN: a java library for multi-label learning. J Mach Learn Res 12:2411–2414
Zhang ML, Zhou ZH (2007) ML-KNN: a lazy learning approach to multi-label learning. Pattern Recogn 40:2038–2048
Read J et al (2011) Classifier chains for multi-label classification. Mach Learn 85:333–359
Tsoumakas G et al (2008) Effective and efficient multilabel classification in domains with large number of labels. In: Proceedings of ECML/PKDD 2008 workshop on mining multidimensional data. pp 1–15
Kocev D et al (2007) Ensembles of multi-objective decision trees. In Kok JN et al (eds) Proceedings of machine learning: ECML 2007: 18th European conference on machine learning, Warsaw, Poland, September 17–21, 2007. Springer, Berlin, Heidelberg, pp. 624–631
Madjarov G et al (2012) An extensive experimental comparison of methods for multi-label learning. Pattern Recogn 45:3084–3104
Min-Ling Z, Zhi-Hua Z (2006) Multilabel neural networks with applications to functional genomics and text categorization. IEEE Trans Knowl Data Eng 18(10):1338–1351
Jesse Read FP-C (2014) Deep learning for multi-label classification. arXiv eprint arXiv:1502.05988
Read J, Hollmen J (2014) A deep interpretation of classifier chains. In: Blockeel H, van Leeuwen M, Vinciotti V (eds) Advances in Intelligent Data Analysis XIII. Springer, Switzerland, pp 251–262
Acknowledgements
This work was supported by the China 863 program (SS2015AA020109), the Strategic Priority Research Program of the Chinese Academy of Sciences (XDB13040400), the startup grant from the Jilin University, the grants from the National Natural Science Foundation of China (No. 31560316, No. 31260273, No. 61462047 and No. 61261027), and the Department of Education of JiangXi Province (GJJ13641). Constructive comments from the anonymous reviewers are greatly appreciated.
Author information
Authors and Affiliations
Corresponding authors
Additional information
Pu Wang and Ruiquan Ge contributed equally to this study.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Wang, P., Ge, R., Xiao, X. et al. Rectified-Linear-Unit-Based Deep Learning for Biomedical Multi-label Data. Interdiscip Sci Comput Life Sci 9, 419–422 (2017). https://doi.org/10.1007/s12539-016-0196-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12539-016-0196-1