Skip to main content
Log in

Assessing heterogeneity in MOOC student performance through composite-based path modelling

  • Published:
Quality & Quantity Aims and scope Submit manuscript

Abstract

Massive open online courses (MOOCs) are potentially participated in by very many students from different parts of the world, which means that learning analytics is especially challenging. In this framework, predicting students’ performance is a key issue, but the high level of heterogeneity affects understanding and measurement of the causal links between performance and its drivers, including motivation, attitude to learning, and engagement, with different models recommended for the formulation of appropriate policies. Using data for the FedericaX EdX MOOC platform (Federica WebLearning Centre at the University of Naples Federico II), we exploit a consolidated composite-based path model to relate performance with engagement and learning. The model addresses heterogeneity by analysing gender, age, country of origin, and course design differences as they affect performance. Results reveal subgroups of students requiring different learning strategies to enhance final performance. Our main findings were that differences in performance depended mainly on learning for male students taking instructor-paced courses, and on engagement for older students (> 32 years) taking self-paced courses.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Notes

  1. https://www.federica.eu/en/partners/edx/

  2. Students from Africa and Oceania were merged in a single group labelled “Others” due to the small number of cases. This classification was justified by similar average performance values for students from those two geographical regions.

References

Download references

Funding

This work was supported by grant #870691-INVENT funded by the EU H2020 program.

Author information

Authors and Affiliations

Authors

Contributions

Authors contributed equally.

Corresponding author

Correspondence to Lamberti Giuseppe.

Ethics declarations

Conflict of interest

The authors report there are no competing interests to declare.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary file1 (PDF 450 kb)

Appendix

Appendix

Indicators are reported as violin plots (Hintze and Nelson 1998), as these simultaneously depict the full distribution and number of data considered. The height of each violin indicates the range of the detected values, while the width indicates the position of the peak. In the separate panels, the colours reflect the six subdimensions used to measure learning, engagement, and performance. Thus, indicators related to frequency-based actions, time-based actions, and interaction are coloured green, light blue, and red, respectively, while regularity, non-procrastination, and performance are coloured dark blue, yellow, and violet, respectively. All indicators are highly skewed, with long tails on the right of the distributions (Fig. 4).

Fig. 4
figure 4

Summary statistics for student learning, engagement, and performance indicators

Tables 6, 7, 8. Measurement model reliability. Crossloadings, bootstrap confidence intervals (CIs) calculated with 500 repetitions, composite reliability (CR), and average expected variance (AVE) results for the first-order constructs, as reported in Table 6. Results are acceptable, as, according to Esposito Vinzi et al. (2010), CR should be greater than 0.7, AVE should be greater than 0.5, and loadings are higher than 0.7 and significantly higher with respect to their own constructs. Loadings, bootstrap CIs, CR, and AVE results for the second-order constructs are reported in Table 7. Loading values for both constructs are lower than 0.7 (explained by the particular nature of the indicators used in the analysis) but are significant, CR is higher than 0.7 for both constructs, AVE is higher than 0.5 for engagement, and although lower than 0.5 for learning, is still close to the threshold. Given that CR is higher than 0.7, convergent validity can still be considered adequate (Fornell and Larcker, 1981). Results for the Fornell-Larcker matrix, reported in Table 8, indicate that discriminant validity is assured.

Table 6 First-order constructs
Table 7 Second-order constructs
Table 8 Fornell-Larcker matrix

Tables 9, 10, 11, and 12 MICOM testing of the invariance measurement model; steps (2) and (3). Each table reports the observed score correlation (SC), the 5% confidence interval (CI), and the observed score difference in means (SDM) between the compared groups, and the log-ratio of score variances (LSV) for groups with their corresponding 95% CIs (obtained by group permutation). Compositional invariance is verified when the SC value falls within the CI, and full measurement invariance is verified when SDM and LSV values fall within the CI. Note that in case of Table 12 (course design), the SC is lower than the threshold, although, in this case, the observed deviation occurs in the third decimal. According to Lamberti et al. (2022), the correlation is not too low for MGA and so the compositional invariance of the constructs is globally accepted.

Table 9 Gender
Table 10 Age
Table 11 Country of origin
Table 12 Course design

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Cristina, D., Giuseppe, L. & Domenico, V. Assessing heterogeneity in MOOC student performance through composite-based path modelling. Qual Quant 58, 2453–2477 (2024). https://doi.org/10.1007/s11135-023-01760-2

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11135-023-01760-2

Keywords

Navigation