Skip to main content
Log in

A Mixture Modeling Approach to Detect Different Behavioral Patterns for Process Data

  • Original Paper
  • Published:
Fudan Journal of the Humanities and Social Sciences Aims and scope Submit manuscript

Abstract

Process data recorded by computer-based assessments reflect how respondents solve problems and thus contain rich information about respondents as well as tasks. Considering that different respondents may exhibit different behavioral characteristics during problem-solving process, in this study, we propose a mixture one-parameter state response (Mix1P-SR) measurement model. This model assumes that respondents belong to discrete latent classes with different propensities towards responses to task states during the problem-solving process, and the varying response propensities are captured by different state parameters across classes. A Markov Chain Monte Carlo algorithm for the estimation of model parameters and classification of respondents is described. The simulation study shows that the Mix1P-SR model could recover parameters well on the premise that the average sequence length was not too short. Moreover, larger sample size, longer sequences, more uniform mixing proportions, and lower interclass similarity facilitated model convergence, model selection, and parameter estimation accuracy, with sequence length being particularly important. Based on the empirical data from PISA 2012, the Mix1P-SR model identified two latent classes of respondents. They had different patterns of state easiness parameters and exhibited different state response patterns, which affected their problem solving results. Implications for model application and future research directions are discussed.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Data and Code Availability

The data that support the findings of this study are openly available from the OECD website: http://www.oecd.org/pisa/pisaproducts/database-cbapisa2012.htm. The R code for simulating data and implementing the MCMC algorithm is available at https://osf.io/pqbxr/?view_only=d798407027424f95aae5ea9824259254.

Notes

  1. We also used WinBUGS to implement the MCMC algorithm for the Mix1P-SR model with three classes. Results showed that although the model converged, only two meaningful classes were identified, while no observations were assigned to the third class and its mixing proportion was zero. In fact, in the preliminary analysis, we found that for homogeneous data (with a single class), the Mix1P-SR model with more than one class did not converge at all. Also, we generated data with three classes and found that the three-class model with our algorithm could converge, but the four-class model failed. Therefore, we infer that the current mixture model with more than k classes using the current algorithm could not converge when fitted to data with k classes.

  2. Note that in the comparison of the two models, results of the one-class model were summarized based on only the replications where the two-class model converged successfully. Indeed, these results were very close to the average results of the one-class model over all 50 replications in each condition.

References

Download references

Acknowledgements

The authors thank the OECD-PISA (Program for International Student Assessment) team for granting access to the data source and instruments in this study. This work was supported by National Natural Science Foundation of China (Grant 32300938).

Funding

This work was supported by National Natural Science Foundation of China (Grant 32300938).

Author information

Authors and Affiliations

Authors

Contributions

Conceptualization: YX and HL; methodology: YX and HL; formal analysis and investigation: YX; writing—original draft preparation: YX; writing—review and editing: YX and HL; supervision: HYL.

Corresponding author

Correspondence to Hongyun Liu.

Ethics declarations

Conflict of interest

The authors claim there has no known conflict of interest to disclose.

Appendices

Appendix 1

See Appendix Tables 

Table 5 A list of 21 problem states for the first item of the TICKETS unit, the corresponding correct and incorrect reachable states

5,

Table 6 A list of 22 problem states for the second item of the TICKETS unit

6 and

Table 7 Correct and incorrect reachable states for each problem state in the second item of the TICKETS unit

7.

Appendix 2

See Appendix Tables 

Table 8 Model fit of one- and two-class models for the empirical example

8 and

Table 9 Two-class solution for the empirical example

9.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xiao, Y., Liu, H. A Mixture Modeling Approach to Detect Different Behavioral Patterns for Process Data. Fudan J. Hum. Soc. Sci. (2024). https://doi.org/10.1007/s40647-024-00405-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s40647-024-00405-4

Keywords

Navigation