Abstract
Medical data in online groups and social media contain valuable information, which is provided by both healthcare professionals and patients. In fact, patients can talk freely and share their personal experiences. These resources are a valuable opportunity for health professionals who can access patients’ opinions, as well as discussions between patients. Recently, the data processing of the health community and, how to extract knowledge is a significant technical challenge. There are many online group and forums that users can discuss on healthcare issues. Therefore, we can examine these text documents for discovering knowledge and evaluating patients’ behavior based on their opinions and discussions. For example, there are many questions and answering groups on Twitter or Facebook. Given the importance of the research, in this paper, we present a semantic framework based on topic model (LDA) and Random forest(RF) to predict and retrieval latent topics of healthcare text-documents from an online forum. We extract our healthcare records (patient-questions) from patient.info website as a real dataset. Experiments on our dataset show that social media forums could help for detecting significant patient safety problems on healthcare issues.
Similar content being viewed by others
References
Leek S., Canning L., Houghton D.: Revisiting the task media fit model in the era of web 2.0: Twitter use and interaction in the healthcare sector. Ind. Mark. Manag. 54 (2016): 25–32, 2016
Liu S.S., Zakaria S., Vaidya D., Srivastava M.C.: Electrocardiogram training for residents: A curriculum based on Facebook and Twitter. J. Electrocardiol. 50 (5): 646–651, 2017
Wakamiya S., Morita M., Kano Y., Ohkuma T., Aramaki E.: Tweet classification toward Twitter-based disease surveillance: New data, methods, and evaluations. J. Med. Internet Res. 21 (2): e12783, 2019
Lu H.-M., Wei C.-P., Hsiao F.-Y.: Modeling healthcare data using multiple-channel latent Dirichlet allocation. J. Biomed. Inform. 60 (2016): 210–223, 2016
Nakhasi A., Bell S.G., Passarella R.J., Paul M.J., Dredze M., Pronovost P.J. (2018) The potential of Twitter as a data source for patient safety. Journal of Patient Safety
Pai R.R., Alathur S.: Assessing mobile health applications with twitter analytics. Int. J. Med. Inform. 113 (2018): 72–84, 2018
Pemmaraju N., Mesa R.A., Majhail N.S., Thompson M.A.: The use and impact of Twitter at medical conferences: Best practices and Twitter etiquette.. In: Seminars in Hematology, vol 54. Elsevier, 2017, pp 184–188
Peters M.E., Uible E., Chisolm M.S.: A Twitter education: Why psychiatrists should tweet. Curr. Psych. Rep. 17 (12): 94, 2015
Plachkinova M., Kettering V., Chatterjee S. (2018) Increasing exposure to complementary and alternative medicine treatment options through the design of a social media tool. Health Syst., 1–18
Subramani S., Wang H., Vu H.Q., Li G.: Domestic violence crisis identification from Facebook posts based on deep learning. IEEE Access 6 (2018): 54075–54085, 2018
Surian D., Nguyen D.Q., Kennedy G., Johnson M., Coiera E., Dunn A.G.: Characterizing Twitter discussions about HPV vaccines using topic modeling and community detection. J. Med. Internet Res. 18: 8, 2016
Tang C., Zhou L., Plasek J., Rozenblum R., Bates D.: Comment topic evolution on a cancer institution’s Facebook page. Appl. Clin. Inform. 8, 03: 854–865, 2017
Villota E.J., Yoo S.G.: An experiment of influences of Facebook posts in other users.. In: 2018 International conference on eDemocracy & eGovernment (ICEDEG), . IEEE, 2018, pp 83–88
Wu J.-Y., Hsiao Y.-C., Nian M.-W. (2018) Using supervised machine learning on large-scale online forums to classify course-related Facebook messages in predicting learning achievement within the personal learning environment. Interact. Learn. Environ., 1–16
Xing W., Goggins S., Introne J.: Quantifying the effect of informational support on membership retention in online communities through large-scale data analytics. Comput. Hum. Behav. 86 (2018): 227–234, 2018
Zou C. (2018) Analyzing research trends on drug safety using topic modeling. Expert Opin. Drug Saf., 1–8
Mohammed S., Mohammed S., Fiaidhi J., Li T., Fong S.: Experimenting with clojure on extracting medication information from clinical narratives.. In: Proceedings of the 2018 2nd International Conference on Big Data and Internet of Things. ACM, 2018, pp 119–122
Tang V., et al. (2019) An adaptive clinical decision support system for serving the elderly with chronic diseases in healthcare industry. Expert. Syst., e12369
Singhal S., Jain S., Rathi M., Sinha A.: Smart technologies to build healthcare models for vision impairment.. In: Advanced classification techniques for healthcare analysis. IGI Global, 2019, pp 259–285
Khor R.C., Nguyen A., O’Dwyer J., Kothari G., Sia J., Chang D., Foroudi F: Extracting tumour prognostic factors from a diverse electronic record dataset in genito-urinary oncology. Int. J. Med. Inform. 121: 53–57, 2019
Chen L., Song L., Shao Y., Li D., Ding K.: Using natural language processing to extract clinically useful information from Chinese electronic medical records. Int. J. Med. Inform. 124: 6–12, 2019
Pandey S.K., Janghel R.R. (2019) Recent deep learning techniques, challenges and its applications for medical healthcare system: A review. Neural. Process. Lett., 1–29
Kristina D.-H., Mowery D.L., Daniels C., Chapman W.W., Conway M.: Understanding patient satisfaction with received healthcare services: A natural language processing approach.. In: AMIA Annual Symposium Proceedings, vol 2016, 2016, p 524
Gupta S., Hanson C., Gunter C.A., Frank M., Liebovitz D., Malin B: Modeling and detecting anomalous topic access.. In: IEEE International Conference on Intelligence and Security Informatics (ISI). IEEE, 2013, pp 100–105
Hardjojo A., Gunachandran A., Pang L., Abdullah M.R.B., Wah W., Chong J.W.C., Goh E.H., Teo S.H., Lim G., Lee M.L., et al: Validation of a natural language processing algorithm for detecting infectious disease symptoms in primary care electronic medical records in Singapore. JMIR Med. Inform. 6: 2, 2018
Bian J., Topaloglu U., Yu F.: Towards large-scale twitter mining for drug-related adverse events.. In: Proceedings of the 2012 International Workshop on Smart Health and Wellbeing. ACM, 2012, pp 25–32
Coppersmith G., Dredze M., Harman C.: Quantifying mental health signals in Twitter.. In: Proceedings of the Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality, 2014, pp 51–60
Huh J., Yetisgen-Yildiz M., Pratt W: Text classification for assisting moderators in online health communities. J. Biomed. Inform. 46 (6): 998–1005, 2013
Ye Y., Zhao Y., Shang J., Zhang L.: A hybrid IT framework for identifying high-quality physicians using big data analytics. Int. J. Inf. Manag. 47 (2019): 65–75, 2019
Belobordov A., Braslavski P.: Does everybody lie? Characterizing answerers in health-related CQA.. In: 2016 International FRUCT Conference on Intelligence, Social Media and Web (ISMW FRUCT). IEEE, 2016, pp 1–6
Blei D.M., Ng A.Y., Jordan M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3: 993–1022, 2003
Griffiths T.L., Steyvers M.: Finding scientific topics. Proc. Nat. Acad. Sci. 101 (suppl 1): 5228–5235, 2004
Plummer M.: JAGS: A program for analysis of Bayesian graphical models using Gibbs sampling.. In: Proceedings of the 3rd International Workshop on Distributed Statistical Computing, vol 124, No. 125.10, 2003, pp 1–10
Prihatini P.M., Putra I., Giriantari I., Sudarma M.: Indonesian text feature extraction using Gibbs sampling and mean variational inference latent Dirichlet allocation.. In: 2017 15th International Conference on Quality in Research (QiR): International Symposium on Electrical and Computer Engineering. IEEE, 2017, pp 40–44
Hao T., Chen X., Li G., Yan J: A bibliometric analysis of text mining in medical research. Soft Comput. 22 (23): 7875–7892, 2018
Young I.J.B., Luz S., Lone N. (2019) A systematic review of natural language processing for classification tasks in the field of incident reporting and adverse event analysis. Int. J. Med. Inform., 103971
Rajput A.: Natural language processing, sentiment analysis, and clinical analytics.. In: Innovation in Health Informatics. Academic Press, 2020, pp 79–97
Cruz N.P., Canales L., Muñoz J.G., Pérez B., Arnott I.: Improving adherence to clinical pathways through natural language processing on electronic medical records. Studies Health Technol. Inform. 264: 561–565, 2019
Daniel J.E., Brink W., Eloff R., Copley C: Towards automating healthcare question answering in a noisy multilingual low-resource setting.. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 2019, pp 948–953
Van Vleck T.T., Chan L., Coca S.G., Craven C.K., Do R., Ellis S.B, Nadkarni G.N: Augmented intelligence with natural language processing applied to electronic health records for identifying patients with non-alcoholic fatty liver disease at risk for disease progression. Int. J. Med. Inform. 129: 334–341, 2019
Acknowledgements
This article has been awarded by the National Natural Science Foundation of China (61941113, 81674099, 61502233), the Fundamental Research Fund for the Central Universities (30918015103, 30918012204), Nanjing Science and Technology Development Plan Project (201805036), and “13th Five-Year” equipment field fund (61403120501), China Academy of Engineering Consulting Research Project(2019-ZD-1-02-02), National Social Science Foundation (18BTQ073).
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Conflict of interests
Hamed Jelodar, Yongli Wang, Mahdi Rabbani,Gang Xiao, Ruxin Zho declare no conflict of interest directly related to the submitted work.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Ethical Approval
All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards.
This article is part of the Topical Collection on Transactional Processing Systems
Rights and permissions
About this article
Cite this article
Jelodar, H., Wang, Y., Rabbani, M. et al. A Collaborative Framework Based for Semantic Patients-Behavior Analysis and Highlight Topics Discovery of Alcoholic Beverages in Online Healthcare Forums. J Med Syst 44, 101 (2020). https://doi.org/10.1007/s10916-020-01547-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10916-020-01547-0