Diabetes-Related Topic Detection in Chinese Health Websites Using Deep Learning
With 98.4 million people diagnosed with diabetes in China, most of the Chinese health websites provide diabetes related news and articles in diabetes subsection for patients. However, most of the articles are uncategorized and without a clear topic or theme, resulting in time consuming information seeking experience. To address this issue, we propose an advanced deep learning approach to detect topics for diabetes related articles from health websites. Our research framework for topic detection on diabetes related articles in Chinese is the first one to incorporate deep learning in topic detection in Chinese. It can identify topics of diabetes articles with high performance and potentially assist health information seeking. To evaluate our framework, experiment is conducted on a test bed of 12,000 articles. The results showed the framework achieved an accuracy of 70% in detecting topics and significantly outperformed the SVM based approach.
Keywordsclassification topic detection diabetes Chinese deep learning
Unable to display preview. Download preview PDF.
- 2.Chinese Diabetes Society, http://cdschina.org/news_show.jsp?id=2121.html
- 3.Deep Learning Tutorials by LISA lab, http://www.deeplearning.net/tutorial/
- 4.Socher, R., Bengio, Y., Manning, C.D.: Deep learning for NLP (without magic).Tutorial Abstracts of ACL. p. 5. Association for Computational Linguistics (2012)Google Scholar
- 5.Gouws, S.: Deep unsupervised feature learning for natural language processing. In: Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics, pp. 48–53 (2012)Google Scholar
- 6.Lu, Y., Zhang, P., Liu, J., et al.: Health-related hot topic detection in online communities using text clustering. PloS One 8(2), e56221 (2013)Google Scholar
- 7.Weitzman, E.R., Cole, E., Kaci, L., et al.: Social but safe? Quality and safety of diabetes-related online social networks. JAMIA 18(3), 292–297 (2011)Google Scholar
- 10.Klemm, P., Nolan, M.T.: Internet cancer support groups: legal and ethical issues for nurse researchers. Oncology Nursing Forum 25(4), 673–676 (1998)Google Scholar
- 13.Lin, Y., Li, W., Chen, K., et al.: A document clustering and ranking system for exploring MEDLINE citations. JAMIA 14(5), 651–661 (2007)Google Scholar
- 14.Kandula, S., Curtis, D., Hill, B., et al.: Use of topic modeling for recommending relevant education material to diabetic patients. In: AMIA, vol. 2011, p. 674 (2011)Google Scholar
- 15.Brody, S., Elhadad, N.: Detecting salient aspects in online reviews of health providers. In: AMIA, vol. 2010, p. 202 (2010)Google Scholar
- 19.Hinton, G.: A practical guide to training restricted Boltzmann machines. Momentum 9(1), 926 (2010)Google Scholar