Online discussion forums are common features of learning management systems; they allow teachers to engage students in topical discussions in environments beyond physical spaces. This study presents a novel approach to operationalizing the connections between social interaction and contextual topics by visualizing posts in an online discussion forum. Using the weak ties theory, we developed a prototype of a tool that helps visualize the text-based content in online discussion forums, specifically in terms of topic relationships and student interactions. This research unveils a nuanced picture of social and topic connectivity, the nature of social interactions, and the changes in the topics being discussed when serendipity occurs. Our implementation of the tool and the results from testing show that the visualization method was able to determine that the strongly connected major topics in the discussion were related to the intended course learning outcomes, whereas the weakly connected topics could yield insights into students’ unexpected learning. The proposed method of visualization may benefit both teachers and students by helping them to efficiently the learning and teaching process and thus may contribute to formative assessment design, a collaborative learning process, and unexpected learning.
This is a preview of subscription content, access via your institution.
Buy single article
Instant access to the full article PDF.
Tax calculation will be finalised during checkout.
The datasets used and/or analyzed in the current study are available from the corresponding author on reasonable request.
Interaction analysis model
Intended learning outcome
Latent Dirichlet allocation
Learning management system
Aggarwal, C. C., & Wang, H. (2011). Text mining in social networks. In C. C. Aggarwal (Ed.), Social network data analytics (pp. 353–378). Springer.
Arun, R., Suresh, V., Madhavan, C. V., & Murthy, M. N. (2010). On finding the natural number of topics with latent Dirichlet allocation: Some observations. Pacific-Asia conference on knowledge discovery and data mining (pp. 391–402). Springer.
Baer, M. (2010). The strength-of-weak-ties perspective on creativity: A comprehensive examination and extension. Journal of Applied Psychology, 95(3), 592–601.
Bakshy, E., Rosenn, I., Marlow, C., & Adamic, L. (2012). The role of social networks in information diffusion. In Proceedings of the 21st international conference on World Wide Web (pp. 519–528). ACM Press.
Beaudoin, M. F. (2002). Learning or lurking?: Tracking the ‘“invisible”’ online student. The Internet and Higher Education, 5(2), 147–155.
Burt, R. S. (2004). Structural holes and good ideas. American Journal of Sociology, 110(2), 349–399.
Cao, J., Xia, T., Li, J., Zhang, Y., & Tang, S. (2009). A density-based method for adaptive LDA model selection. Neurocomputing, 72(7–9), 1775–1781.
Caspi, A., Gorsky, P., & Chajut, E. (2003). The influence of group size on nonmandatory asynchronous instructional discussion groups. The Internet and Higher Education, 6(3), 227–240.
Chen, B., Chang, Y.-H., Ouyang, F., & Zhou, W. (2018). Fostering student engagement in online discussion through social learning analytics. The Internet and Higher Education, 37, 21–30.
Chen, C. M., Li, M. C., & Huang, Y. L. (2020). Developing an instant semantic analysis and feedback system to facilitate learning performance of online discussion. Interactive Learning Environments,. https://doi.org/10.1080/10494820.2020.1839505
Cheng, C. K., Paré, D. E., Collimore, L. M., & Joordens, S. (2011). Assessing the effectiveness of a voluntary online discussion forum on improving students’ course performance. Computers and Education, 56(1), 253–261.
Clouder, D. L. & Deepwell, F. (2004). Reflections on unexpected outcomes: Learning from student collaboration in an online discussion forum. In S. Banks, P. Goodyear, V. Hodgson, C. Jones, V. Lally, D. McConnell & C. Steeples (Eds.) Proceedings of the 2004 networked learning conference (pp. 429–435). Lancaster University
Constant, D., Sproull, L., & Kiesler, S. (1996). The kindness of strangers: The usefulness of electronic weak ties for technical advice. Organization Science, 7(2), 119–135.
Cutumisu, M., & Guo, Q. (2019). Using topic modeling to extract pre-service teachers’ understandings of computational thinking from their coding reflections. IEEE Transactions on Education, 62(4), 325–332. https://doi.org/10.1109/te.2019.2925253
Dawson, S. (2010). “Seeing” the learning community: An exploration of the development of a resource for monitoring online student networking. British Journal of Educational Technology, 41(5), 736–752. https://doi.org/10.1111/j.1467-8535.2009.00970.x
De Laat, M., & Lally, V. (2003). Complexity, theory and praxis: Researching collaborative learning and tutoring processes in a networked learning community. Instructional Science, 31(1–2), 7–39.
Deveaud, R., SanJuan, E., & Bellot, P. (2014). Accurate and effective latent concept modeling for ad hoc information retrieval. Document Numérique, 17(1), 61–84.
Dicheva, D., & Dichev, C. (2006). TM4L: Creating and browsing educational topic maps. British Journal of Educational Technology, 37(3), 391–404.
Dringus, L. P., & Ellis, T. (2005). Using data mining as a strategy for assessing asynchronous discussion forums. Computers and Education, 45(1), 141–160.
Fekete, J.-D., van Wijk, J. J., Stasko, J. T., & North, C. (2008). The value of information visualization. In A. Kerren, J. T. Stasko, J. D. Fekete, & C. North (Eds.), Information visualization lecture notes in computer science (pp. 1–18). Springer.
Figueira, Á. R., & Laranjeiro, J. B. (2007). Interaction visualization in web-based learning using igraph. In Proceedings of the 8th ACM conference on hypertext and hypermedia (pp. 45–46). ACM Press.
Foster, A., & Ford, N. (2003). Serendipity and information seeking: An empirical study. Journal of Documentation, 59(3), 321–340.
Garrison, D. R., Anderson, T., & Archer, W. (1999). Critical inquiry in a text-based environment: Computer conferencing in higher education. The Internet and Higher Education, 2(2–3), 87–105.
Garrison, D. R., Anderson, T., & Archer, W. (2001). Critical thinking, cognitive presence, and computer conferencing in distance education. American Journal of Distance Education, 15(1), 7–23.
Gibbs, W. J., Olexa, V., & Bernas, R. S. (2006). A visualization tool for managing and studying online communications. Journal of Educational Technology and Society, 9(3), 232–243.
Goodyear, P. (2002). Psychological foundations for networked learning. In C. Steeples & C. Jones (Eds.), Networked learning: Perspectives and issues (pp. 49–75). Springer.
Granovetter, M. (1983a). The strength of weak ties: A network theory revisited. Sociological Theory, 1(1), 201–233.
Granovetter, M. (1983b). The strength of weak ties: A network theory revisited. Sociological Theory, 1, 201–233.
Granovetter, M. S. (1973). The strength of weak ties. American Journal of Sociology, 78(6), 1360–1380.
Griffiths, T. L., & Steyvers, M. (2004). Finding scientific topics. Proceedings of the National Academy of Sciences of the United States of America, 101(Supplement 1), 5228–5235.
Gunawardena, C. N., Lowe, C. A., & Anderson, T. (1997). Analysis of a global online debate and the development of an interaction analysis model for examining social construction of knowledge in computer conferencing. Journal of Educational Computing Research, 17(4), 397–431.
Gundecha, P., & Liu, H. (2012). Mining social media: A brief introduction. In P. Mirchandani (Ed.), Informs tutorials in operations research (pp. 1–17). INFORMS.
Guo, Y., Barnes, S. J., & Jia, Q. (2017). Mining meaning from online ratings and reviews: Tourist satisfaction analysis using latent Dirichlet allocation. Tourism Management, 59, 467–483.
Han, J., Kamber, M., & Pei, J. (2011). Data mining: Concepts and techniques. Elsevier.
Hand, D., Mannila, H., & Smyth, P. (2001). Principles of data mining. MIT Press.
Hara, N., Bonk, C. J., & Angeli, C. (2000). Content analysis of online discussion in an applied educational psychology course. Instructional Science, 28(2), 115–152.
Havnes, A., & Prøitz, T. S. (2016). Why use learning outcomes in higher education? Exploring the grounds for academic resistance and reclaiming the value of unexpected learning. Educational Assessment, Evaluation and Accountability, 28(3), 205–223.
Haythornthwaite, C. (2000). Online personal networks. New Media and Society, 2(2), 195–226.
Haythornthwaite, C. (2002). Strong, weak, and latent ties and the impact of new media. The Information Society, 18(5), 385–401.
He, W. (2013). Examining students’ online interaction in a live video streaming environment using data mining and text mining. Computers in Human Behavior, 29(1), 90–102.
Hou, H.-T., Wang, S.-M., Lin, P.-C., & Chang, K.-E. (2015). Exploring the learner’s knowledge construction and cognitive patterns of different asynchronous platforms: Comparison of an online discussion forum and Facebook. Innovations in Education and Teaching International, 52(6), 610–620.
Jarvela, S., & Hakkinen, P. (2003). The levels of web-based discussions: Using perspective-taking theory as an analytical tool. In H. van Oostendorp (Ed.), Cognition in a digital world (pp. 77–95). Lawrence Erlbaum Associates.
Jeong, A. C. (2003). The sequential analysis of group interaction and critical thinking in online. The American Journal of Distance Education, 17(1), 25–43.
Johnson, D., & Johnson, R. (2008). Cooperation and the use of technology. In J. M. Spector, M. D. Merrill, J. van Merrienboer, & M. Driscoll (Eds.), Handbook of research on educational communications and technology (3rd ed., pp. 659–670). Routledge.
Jonassen, D., Davidson, M., Collins, M., Campbell, J., & Haag, B. B. (1995). Constructivism and computer-mediated communication in distance education. American Journal of Distance Education, 9(2), 7–26.
Jones, C. R., Ferreday, D., & Hodgson, V. (2008). Networked learning a relational approach: Weak and strong ties. Journal of Computer Assisted Learning, 24(2), 90–102.
Jyothi, S., McAvinia, C., & Keating, J. (2012). A visualisation tool to aid exploration of students’ interactions in asynchronous online communication. Computers and Education, 58(1), 30–42.
Kandakatla, R., Berger, E., Rhoads, J. F., & DeBoer, J. (2020). The development of social capital in an active, blended, and collaborative engineering class. International Journal of Engineering Education, 36(3), 1034–1048.
Keim, D., Andrienko, G., Fekete, J.-D., Görg, C., Kohlhammer, J., & Melançon, G. (2008). Visual analytics: Definition, process, and challenges. In A. Kerren, J. T. Stasko, J.-D. Fekete, & C. North (Eds.), Information visualization (pp. 154–175). Springer.
Kent, C., Rechavi, A., & Rafaeli, S. (2019). Networked learning analytics: A theoretically informed methodology for analytics of collaborative learning. Learning in a networked society (pp. 145–175). Springer.
Kitto, K., Bakharia, A., Lupton, M., Mallet, D., Banks, J., Bruza, P. et al. (2016). The connected learning analytics toolkit. In Proceedings of the 6th international conference on learning analytics and knowledge (pp. 548–549). ACM Press.
Kleinberg, J. M. (1999). Authoritative sources in a hyperlinked environment. Journal of the ACM (JACM), 46(5), 604–632.
Kop, R. (2012). The unexpected connection: Serendipity and human mediation in networked learning. Educational Technology and Society, 15(2), 2–11.
Krestel, R., Fankhauser, P., & Nejdl, W. (2009, October). Latent Dirichlet Allocation for tag recommendation. In Proceedings of the third ACM conference on Recommender systems (pp. 61–68).
Li, S. Y., & Wong, K. W. G. (2016). Educational data mining using chance discovery from discussion board. In Proceedings of the 20th global Chinese conference on computers in education 2016 (pp. 712–715). The Hong Kong Institute of Education.
Li, Y. K., & Wong, G. K. (2016, November). Visualizing the asynchronous discussion forum data with topic detection. In SIGGRAPH ASIA 2016 Symposium on Education: Talks (p. 17). ACM.
Lin, F.-R., Hsieh, L.-S., & Chuang, F.-T. (2009). Discovering genres of online discussion threads via text mining. Computers and Education, 52(2), 481–495.
Lu, H. M., Wei, C. P., & Hsiao, F. Y. (2016). Modeling healthcare data using multiple-channel latent Dirichlet allocation. Journal of Biomedical Informatics, 60, 210–223.
Macfadyen, L. P., & Dawson, S. (2010). Mining LMS data to develop an “early warning system” for educators: A proof of concept. Computers and Education, 54(2), 588–599.
Manning, C. D., Raghavan, P., & Schutze, H. (2008). Introduction to information retrieval. Cambridge University Press.
May, M., George, S., & Prevot, P. (2007). Tracking, analyzing and visualizing learners’ activities on discussion forums. In Proceedings of the 6th IASTED international conference on Web-Based Education (WBE) (pp. 649–656). WBE.
May, M., George, S., & Prevot, P. (2008). A closer look at tracking human and computer interactions in Web-based communications. Interactive Technology and Smart Education, 5(3), 170–188.
Mazza, R., & Dimitrova, V. (2007). Coursevis: A graphical student monitoring tool for supporting instructors in Web-based distance courses. International Journal of Human-Computer Studies, 65(2), 125–139.
Mazza, R. & Milani, C. (2004). ‘GISMO: A graphical interactive student monitoring tool for course management systems’, paper presented at The T.E.L.’04 Technology Enhanced Learning’04 International Conference, Milan, Italy (18–19 November).
McLoughlin, D., & Mynard, J. (2009). An analysis of higher order thinking in online discussions. Innovations in Education and Teaching International, 46(2), 147–160.
Merton, R. K., & Barber, E. (2006). The travels and adventures of serendipity: A study in sociological semantics and the sociology of science. Princeton University Press.
Moore, M. G. (1989). Editorial: Three types of interaction. American Journal of Distance Education, 3(2), 1–6.
Musabirov, I., & Bulygin, D. (2020). Prototyping text mining and network analysis tools to support netnographic student projects. International Journal of Emerging Technologies in Learning, 15(10), 223–232.
Newman, M. E. J., & Girvan, M. (2004). Finding and evaluating community structure in networks. Physical Review E, 69(2), 6113.
Ouyang, F., Chen, S., & Li, X. (2021). Effect of three network visualizations on students’ social-cognitive engagement in online discussions. British Journal of Educational Technology,. https://doi.org/10.1111/bjet.13126
Ponweiser, M. (2012). Latent Dirichlet allocation in R (Diploma Thesis). Vienna University of Economics and Business.
Poon, L. K. M., Kong, S.-C., Yau, T. S. H., Wong, M., & Ling, M. H. (2017). Learning analytics for monitoring students participation online: Visualizing navigational patterns on learning management system. In S. K. S. Cheung, L. Kwok, W. W. K. Ma, L.-K. Lee, & H. Yang (Eds.), Blended learning. New challenges and innovative practices (pp. 166–176). Springer International Publishing.
Rabbany, R., Elatia, S., Takaffoli, M., & Zaïane, O. R. (2014). Collaborative learning of students in online discussion forums: A social network analysis perspective. Educational data mining (pp. 441–466). Springer.
Ray, S., & Saeed, M. (2018). Applications of educational data mining and learning analytics tools in handling big data in higher education. In M. M. Alani, H. Tawfik, M. Saeed, & O. Anya (Eds.), Applications of big data analytics (pp. 135–160). Springer.
Romero, C., Ventura, S., & García, E. (2008). Data mining in course management systems: Moodle case study and tutorial. Computers and Education, 51(1), 368–384.
Ruef, M. (2002). Strong ties, weak ties and islands: Structural and cultural predictors of organizational innovation. Industrial and Corporate Change, 11(3), 427–449.
Ryberg, T., & Larsen, M. C. (2008). Networked identities: Understanding relationships between strong and weak ties in networked environments. Journal of Computer Assisted Learning, 24(2), 103–115.
Schrire, S. (2004). Interaction and cognition in asynchronous computer conferencing. Instructional Science, 32(6), 475–502.
Shinde, P. P., Oza, K. S., & Kamat, R. K. (2017, February). Big data predictive analysis: Using R analytical tool. In 2017 International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC) (pp. 839–842). IEEE.
Siemens, G. (2005). Connectivism: A learning theory for the digital age. International Journal of Instructional Technology and Distance Learning, 2(1), 1–8.
Sievert, C., & Shirley, K. (2014). LDAvis: A method for visualizing and interpreting topics. In Proceedings of the workshop on interactive language learning, visualization, and interfaces (pp. 63–70).
Slade, S., & Galpin, F. (2012). Learning analytics and higher education: Ethical perspectives. In Proceedings of the 2nd international conference on learning analytics and knowledge (pp. 16–17). ACM Press.
Stahl, G. (2006). Group cognition: Computer support for building collaborative knowledge. MIT Press.
Stahl, G., Koschmann, T., & Suthers, D. D. (2006). Computer-supported collaborative learning: An historical perspective. In R. K. Sawyer (Ed.), Cambridge handbook of the learning sciences (pp. 409–426). Cambridge University Press.
Sun, S., Luo, C., & Chen, J. (2017). A review of natural language processing techniques for opinion mining systems. Information Fusion, 36, 10–25.
Tawfik, A. A., Reeves, T. D., Stich, A. E., Gill, A., Hong, C., McDade, J., et al. (2017). The nature and level of learner–learner interaction in a chemistry massive open online course (MOOC). Journal of Computing in Higher Education, 29(3), 411–431.
Thomas, J. J., & Cook, K. A. (2006). A visual analytics agenda. IEEE Computer Graphics and Applications, 26(1), 10–13.
Tirunillai, S., & Tellis, G. J. (2014). Mining marketing meaning from online chatter: Strategic brand analysis of big data using Latent Dirichlet Allocation. Journal of Marketing Research, 51(4), 463–479.
Vieira, C., Parsons, P., & Byrd, V. (2018). Visual learning analytics of educational data: A systematic literature review and research agenda. Computers and Education, 122, 119–135.
Vygotsky, L. S. (1978). Mind in society: The development of higher psychological processes (Cole, V. John-Steiner, S. Scribner, E. Souberman, Trans.). Harvard University Press.
Weiss, S. M., Indurkhya, N., & Zhang, T. (2015). Fundamentals of predictive text mining. Springer.
Wei, L., Xu, H., Wang, Z., Dong, K., Wang, C., Fang, S., et al. (2016). Topic detection based on weak tie analysis: A case study of LIS research. Journal of Data and Information Science, 1(4), 81–101. https://doi.org/10.20309/jdis.201626
Wong, G. K., & Li, S. Y. (2016). Academic performance prediction using chance discovery from online discussion forums. In 2016 IEEE 40th annual computer software and applications conference (COMPSAC) (pp. 706–711). IEEE.
Wong, G. K., Li, S. Y., & Wong, E. W. (2016). Analyzing academic discussion forum data with topic detection and data visualization. In 2016 IEEE international conference on teaching, assessment, and learning for engineering (TALE) (pp. 109–115). IEEE.
Wu, J. Y., & Nian, M. W. (2021). The dynamics of an online learning community in a hybrid statistics classroom over time: Implications for the question-oriented problem-solving course design with the social network analysis approach. Computers and Education,. https://doi.org/10.1016/j.compedu.2020.104120
Wu, X., Zhu, X., Wu, G., & Ding, W. (2013). Data mining with big data. IEEE Transactions on Knowledge and Data Engineering, 26(1), 97–107.
Williams, C. B., & Murphy, T. (2002). Electronic discussion groups: How initial parameters influence classroom performance. Educause Quarterly, 25(4), 21–29.
You, J. W. (2016). Identifying significant indicators using LMS data to predict course achievement in online learning. The Internet and Higher Education, 29, 23–30.
Zhang, H., Qiu, B., Giles, C. L., Foley, H. C., & Yen, J. (2007, May). An LDA-based community structure discovery approach for large-scale social networks. In 2007 IEEE Intelligence and Security Informatics (pp. 200–207). IEEE.
Conflict of interest
The authors declare that they have no conflict of interests.
All participants were informed with their consent to participate in this project. All information in this paper is anonymous.
Research involving human participants and/or animals
Human participants were invited.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Wong, G.K.W., Li, Y.K. & Lai, X. Visualizing the learning patterns of topic-based social interaction in online discussion forums: an exploratory study. Education Tech Research Dev (2021). https://doi.org/10.1007/s11423-021-10040-5
- Social network analysis
- Topic modeling
- Weak ties
- Text mining