Diabetes-Related Topic Detection in Chinese Health Websites Using Deep Learning

Chen, Xinhuan; Zhang, Yong; Xing, Chunxiao; Liu, Xiao; Chen, Hsinchun

doi:10.1007/978-3-319-08416-9_2

Xinhuan Chen²⁰,
Yong Zhang²⁰,
Chunxiao Xing²⁰,
Xiao Liu²¹ &
…
Hsinchun Chen²¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8549))

Included in the following conference series:

International Conference on Smart Health

1548 Accesses
3 Citations

Abstract

With 98.4 million people diagnosed with diabetes in China, most of the Chinese health websites provide diabetes related news and articles in diabetes subsection for patients. However, most of the articles are uncategorized and without a clear topic or theme, resulting in time consuming information seeking experience. To address this issue, we propose an advanced deep learning approach to detect topics for diabetes related articles from health websites. Our research framework for topic detection on diabetes related articles in Chinese is the first one to incorporate deep learning in topic detection in Chinese. It can identify topics of diabetes articles with high performance and potentially assist health information seeking. To evaluate our framework, experiment is conducted on a test bed of 12,000 articles. The results showed the framework achieved an accuracy of 70% in detecting topics and significantly outperformed the SVM based approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Monnier, J., Laken, M., Carter, C.L.: Patient and Caregiver Interest in Internet-Based Cancer Services. Cancer Practice 10(6), 305–310 (2002)
Article Google Scholar
Chinese Diabetes Society, http://cdschina.org/news_show.jsp?id=2121.html
Deep Learning Tutorials by LISA lab, http://www.deeplearning.net/tutorial/
Socher, R., Bengio, Y., Manning, C.D.: Deep learning for NLP (without magic).Tutorial Abstracts of ACL. p. 5. Association for Computational Linguistics (2012)
Google Scholar
Gouws, S.: Deep unsupervised feature learning for natural language processing. In: Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics, pp. 48–53 (2012)
Google Scholar
Lu, Y., Zhang, P., Liu, J., et al.: Health-related hot topic detection in online communities using text clustering. PloS One 8(2), e56221 (2013)
Google Scholar
Weitzman, E.R., Cole, E., Kaci, L., et al.: Social but safe? Quality and safety of diabetes-related online social networks. JAMIA 18(3), 292–297 (2011)
Google Scholar
Shrank, W.H., Choudhry, N.K., Swanton, K., et al.: Variations in structure and content of online social networks for patients with diabetes. Archives of Internal Medicine 171(17), 1589–1591 (2011)
Article Google Scholar
Greene, J.A., Choudhry, N.K., Kilabuk, E., et al.: Online social networking by patients with diabetes: a qualitative evaluation of communication with Facebook. Journal of General Internal Medicine 26(3), 287–292 (2011)
Article Google Scholar
Klemm, P., Nolan, M.T.: Internet cancer support groups: legal and ethical issues for nurse researchers. Oncology Nursing Forum 25(4), 673–676 (1998)
Google Scholar
Basch, E.M., Thaler, H.T., Shi, W., et al.: Use of information resources by patients with cancer and their companions. Cancer 100(11), 2476–2483 (2004)
Article Google Scholar
Li, N., Wu, D.D.: Using text mining and sentiment analysis for online forums hotspot detection and forecast. Decision Support Systems 48(2), 354–368 (2010)
Article Google Scholar
Lin, Y., Li, W., Chen, K., et al.: A document clustering and ranking system for exploring MEDLINE citations. JAMIA 14(5), 651–661 (2007)
Google Scholar
Kandula, S., Curtis, D., Hill, B., et al.: Use of topic modeling for recommending relevant education material to diabetic patients. In: AMIA, vol. 2011, p. 674 (2011)
Google Scholar
Brody, S., Elhadad, N.: Detecting salient aspects in online reviews of health providers. In: AMIA, vol. 2010, p. 202 (2010)
Google Scholar
Tamilselvan, P., Wang, P.: Failure diagnosis using deep belief learning based health state classification. Reliability Engineering & System Safety 115, 124–135 (2013)
Article Google Scholar
Wang, B., Liu, B., Wang, X., et al.: Deep learning approaches to semantic relevance modeling for chinese question-answer pairs. TALIP 10(4), 21 (2011)
Article Google Scholar
Hinton, G.E., Osindero, S., Teh, Y.W.: A fast learning algorithm for deep belief nets. Neural Computation 18(7), 1527–1554 (2006)
Article MATH MathSciNet Google Scholar
Hinton, G.: A practical guide to training restricted Boltzmann machines. Momentum 9(1), 926 (2010)
Google Scholar
Salakhutdinov, R., Hinton, G.: Semantic hashing. International Journal of Approximate Reasoning 50(7), 969–978 (2009)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Research Institute of Information Technology, Tsinghua National Laboratory for Information Science and Technology, Department of Computer Science and Technology, Tsinghua University, Beijing, China
Xinhuan Chen, Yong Zhang & Chunxiao Xing
MIS Department, University of Arizona, United States
Xiao Liu & Hsinchun Chen

Authors

Xinhuan Chen
View author publications
You can also search for this author in PubMed Google Scholar
Yong Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Chunxiao Xing
View author publications
You can also search for this author in PubMed Google Scholar
Xiao Liu
View author publications
You can also search for this author in PubMed Google Scholar
Hsinchun Chen
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, 100190, Beijing, China
Xiaolong Zheng
The University of Arizona, Tucson, AZ, USA and Chinese Academy of Sciences,, Beijing, China
Daniel Zeng
The University of Arizona, Tucson, AZ, USA and Tsinghua University, Beijing,, China
Hsinchun Chen
Tsinghua University, Beijing, China
Yong Zhang & Chunxiao Xing &
Carnegie Mellon University, Pittsburgh, PA, USA
Daniel B. Neill

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chen, X., Zhang, Y., Xing, C., Liu, X., Chen, H. (2014). Diabetes-Related Topic Detection in Chinese Health Websites Using Deep Learning. In: Zheng, X., Zeng, D., Chen, H., Zhang, Y., Xing, C., Neill, D.B. (eds) Smart Health. ICSH 2014. Lecture Notes in Computer Science, vol 8549. Springer, Cham. https://doi.org/10.1007/978-3-319-08416-9_2

Download citation

DOI: https://doi.org/10.1007/978-3-319-08416-9_2
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08415-2
Online ISBN: 978-3-319-08416-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics