Azzolini, D., Bazoli, N., Lievore, I., Schizzerotto, A., & Vergolini, L. (2019). Beyond achievement. A comparative look into 15-year-olds’ school engagement, effort and perseverance in the European Union. In E. Union (Ed.), Entrepreneurship as extreme experience. https://doi.org/10.2766/98129
CrossRef
Google Scholar
Chen, M. Y., & Zumbo, B. D. (2017). Ecological framework of item responding as validity evidence: An application of multilevel DIF modeling using PISA data. In Understanding and investigating response processes in validation research (pp. 53–68). Springer.
CrossRef
Google Scholar
Chen, Y., Li, X., Liu, J., & Ying, Z. (2019). Statistical analysis of complex problem-solving process data: An event history analysis approach. Frontiers in Psychology, 10, 486. https://doi.org/10.3389/FPSYG.2019.00486
CrossRef
Google Scholar
De Boeck, P., & Scalise, K. (2019). Collaborative problem solving: Processing actions, time, and performance. Frontiers in Psychology, 10, 1280. https://doi.org/10.3389/fpsyg.2019.01280
CrossRef
Google Scholar
Engelhardt, L., & Goldhammer, F. (2019). Validating test score interpretations using time information. Frontiers in Psychology, 10, 1131. https://doi.org/10.3389/fpsyg.2019.01131
CrossRef
Google Scholar
Goldhammer, F., Martens, T., Christoph, G., & Lüdtke, O. (2016). Test-taking engagement in PIAAC (No. 133; OECD Education Working Papers). https://doi.org/10.1787/5jlzfl6fhxs2-en
CrossRef
Google Scholar
Goldhammer, F., Martens, T., & Lüdtke, O. (2017). Conditioning factors of test-taking engagement in PIAAC: An exploratory IRT modelling approach considering person and item characteristics. Large-Scale Assessments in Education, 5, 18. https://doi.org/10.1186/s40536-017-0051-9
CrossRef
Google Scholar
Goldhammer, F., Naumann, J., Stelter, A., Tóth, K., Rölke, H., & Klieme, E. (2014). The time on task effect in reading and problem solving is moderated by task difficulty and skill: Insights from a computer-based large-scale assessment. Journal of Educational Psychology, 106(3), 608–626. https://doi.org/10.1037/a0034716
CrossRef
Google Scholar
Goldhammer, F., & Zehner, F. (2017). What to make of and how to interpret process data. Measurement: Interdisciplinary Research and Perspectives, 15(3–4), 128–132. https://doi.org/10.1080/15366367.2017.1411651
CrossRef
Google Scholar
Greiff, S., Wüstenberg, S., & Avvisati, F. (2015). Computer-generated log-file analyses as a window into students’ minds? A showcase study based on the PISA 2012 assessment of problem solving. Computers and Education, 91, 92–105. https://doi.org/10.1016/j.compedu.2015.10.018
CrossRef
Google Scholar
Hahnel, C., Goldhammer, F., Kröhne, U., & Naumann, J. (2017). Reading digital text involves working memory updating based on task characteristics and reader behavior. Learning and Individual Differences, 59(October), 149–157. https://doi.org/10.1016/j.lindif.2017.09.001
CrossRef
Google Scholar
Hahnel, C., Goldhammer, F., Kröhne, U., & Naumann, J. (2018). The role of reading skills in the evaluation of online information gathered from search engine environments. Computers in Human Behavior, 78, 223–234. https://doi.org/10.1016/j.chb.2017.10.004
CrossRef
Google Scholar
Hahnel, C., Goldhammer, F., Naumann, J., & Kröhne, U. (2016). Effects of linear reading, basic computer skills, evaluating online information, and navigation on reading digital text. Computers in Human Behavior, 55, 486–500. https://doi.org/10.1016/j.chb.2015.09.042
CrossRef
Google Scholar
Han, Z., He, Q., & von Davier, M. (2019). Predictive feature generation and selection from process data in PISA simulation-based environment: An implementation of tree-based ensemble methods. Frontiers in Psychology, 10, 2461. https://doi.org/10.3389/fpsyg.2019.02461
CrossRef
Google Scholar
He, Q., Borgonovi, F., & Paccagnella, M. (2019). Using process data to understand adults’ problem-solving behaviours in PIAAC: Identifying generalised patterns across multiple tasks with sequence mining (No. 205; OECD Education Working Papers). OECD.
Google Scholar
He, Q., & von Davier, M. (2015). Identifying feature sequences from process data in problem-solving items with N-grams. In Quantitative psychology research: The 79th annual meeting of the Psychometric Society (pp. 173–190). Springer.
CrossRef
Google Scholar
Herde, C. N., Wüstenberg, S., & Greiff, S. (2016). Assessment of complex problem solving: What we know and what we don’t know. Applied Measurement in Education, 29(4), 265–277. https://doi.org/10.1080/08957347.2016.1209208
CrossRef
Google Scholar
Hu, Y., Wu, B., & Gu, X. (2017). An eye tracking study of high-and low-performing students in solving interactive and analytical problems. Journal of Educational Technology & Society, 20(4), 300–311. https://doi.org/10.2307/26229225
CrossRef
Google Scholar
Hubley, A. M., & Zumbo, B. D. (2017). Response processes in the context of validity: Setting the stage. In Understanding and investigating response processes in validation research (pp. 1–12). Springer.
Google Scholar
Klotzke, K., & Fox, J. P. (2019). Bayesian covariance structure modelling of responses and process data. Frontiers in Psychology, 10, 1675. https://doi.org/10.3389/fpsyg.2019.01675
CrossRef
Google Scholar
Kroehne, U., & Goldhammer, F. (2018). How to conceptualize, represent, and analyze log data from technology-based assessments? A generic framework and an application to questionnaire items. Behaviormetrika, 45, 527–563. https://doi.org/10.1007/s41237-018-0063-y
CrossRef
Google Scholar
Kroehne, U., Hahnel, C., & Goldhammer, F. (2019). Invariance of the response processes between gender and modes in an assessment of reading. Frontiers in Applied Mathematics and Statistics, 5, 2. https://doi.org/10.3389/fams.2019.00002
CrossRef
Google Scholar
Krstić, K., Šoškić, A., Ković, V., & Holmqvist, K. (2018). All good readers are the same, but every low-skilled reader is different: An eye-tracking study using PISA data. European Journal of Psychology of Education, 33, 521–541. https://doi.org/10.1007/s10212-018-0382-0
CrossRef
Google Scholar
Lee, Y.-H., & Haberman, S. J. (2016). Investigating test-taking behaviors using timing and process data. International Journal of Testing, 16(3), 240–267. https://doi.org/10.1080/15305058.2015.1085385
CrossRef
Google Scholar
Liao, D., He, Q., & Jiao, H. (2019). Mapping background variables with sequential patterns in problem-solving environments: An investigation of United States adults’ employment status in PIAAC. Frontiers in Psychology, 10, 646. https://doi.org/10.3389/fpsyg.2019.00646
CrossRef
Google Scholar
Liu, H., Liu, Y., & Li, M. (2018). Analysis of process data of PISA 2012 computer-based problem solving: Application of the modified multilevel mixture IRT model. Frontiers in Psychology, 9, 1372. https://doi.org/10.3389/fpsyg.2018.01372
CrossRef
Google Scholar
Maddox, B. (2017). Talk and gesture as process data. Measurement: Interdisciplinary Research and Perspectives, 15(3–4), 113–127. https://doi.org/10.1080/15366367.2017.1392821
CrossRef
Google Scholar
Maddox, B., Bayliss, A. P., Fleming, P., Engelhardt, P. E., Edwards, S. G., & Borgonovi, F. (2018). Observing response processes with eye tracking in international large-scale assessments: Evidence from the OECD PIAAC assessment. European Journal of Psychology of Education, 33(3), 543–558. https://doi.org/10.1007/s10212-018-0380-2
CrossRef
Google Scholar
Mislevy, R. J. (1994). Evidence and inference in educational assessment. Psychometrika, 59(4), 439–483. https://doi.org/10.1007/BF02294388
CrossRef
Google Scholar
Mullis, I. V. S., & Martin, M. O. (2017). TIMSS 2019 assessment frameworks. Boston College, TIMSS & PIRLS International Study Center. http://timssandpirls.bc.edu/timss2019/frameworks/
Google Scholar
Naumann, J. (2015). A model of online reading engagement: Linking engagement, navigation, and performance in digital reading. Computers in Human Behavior, 53, 263–277. https://doi.org/10.1016/j.chb.2015.06.051
CrossRef
Google Scholar
Naumann, J. (2019). The skilled, the knowledgeable, and the motivated: Investigating the strategic allocation of time on task in a computer-based assessment. Frontiers in Psychology, 10, 1429. https://doi.org/10.3389/fpsyg.2019.01429
CrossRef
Google Scholar
Naumann, J., & Goldhammer, F. (2017). Time-on-task effects in digital reading are non-linear and moderated by persons’ skills and tasks’ demands. Learning and Individual Differences, 53, 1–16. https://doi.org/10.1016/j.lindif.2016.10.002
CrossRef
Google Scholar
OECD. (2010). PISA computer-based assessment of student skills in science. OECD Publishing. http://www.sourceoecd.org/education/9789264082021
CrossRef
Google Scholar
OECD. (2013). LogDataAnalyzer. PIAAC Log File Website. https://www.oecd.org/skills/piaac/log-file/
OECD. (2017). PISA 2015 technical report. OECD Publishing. https://doi.org/10.1787/9789264255425-en
CrossRef
Google Scholar
OECD. (2019). Beyond proficiency: Using log files to understand respondent behaviour in the survey of adult skills. OECD Publishing. https://doi.org/10.1787/0b1414ed-en
CrossRef
Google Scholar
Ouzzani, M., Hammady, H., Fedorowicz, Z., & Elmagarmid, A. (2016). Rayyan – A web and mobile app for systematic reviews. Systematic Reviews, 5(4). https://doi.org/10.1186/s13643-016-0384-4
Pejic, A., & Molcer, P. S. (2016). Exploring data mining possibilities on computer based problem solving data. SISY 2016 – IEEE 14th International Symposium on Intelligent Systems and Informatics, Proceedings, 171–176. https://doi.org/10.1109/SISY.2016.7601491
Pokropek, A. (2016). Grade of membership response time model for detecting guessing behaviors. Journal of Educational and Behavioral Statistics, 41(3), 300–325. https://doi.org/10.3102/1076998616636618
CrossRef
Google Scholar
Qiao, X., & Jiao, H. (2018). Data mining techniques in analyzing process data: A didactic. Frontiers in Psychology, 9, 2231. https://doi.org/10.3389/fpsyg.2018.02231
CrossRef
Google Scholar
Ramalingam, D., & Adams, R. J. (2018). How can the use of data from computer-delievered assessments improve the measurement of twenty-first century skills? In Assessment and teaching of 21st century skills (pp. 225–238). Springer. https://doi.org/10.1007/978-3-319-65368-6
CrossRef
Google Scholar
Reis Costa, D., & Leoncio, W. (2019). LOGAN: An R package for log file analysis in international large-scale assessments. R Package.
Google Scholar
Ren, Y., Luo, F., Ren, P., Bai, D., & Li, X. (2019). Exploring multiple goals balancing in complex problem solving based on log data. Frontiers in Psychology, 10, 1975. https://doi.org/10.3389/fpsyg.2019.01975
CrossRef
Google Scholar
Stone, J., & Zumbo, B. D. (2016). Validity as a pragmatist project: A global concern with local application. In V. Aryadoust & J. Fox (Eds.), Trends in language assessment research and practice (pp. 555–573). Cambridge Scholars.
Google Scholar
Teig, N. (2019). Scientific inquiry in TIMSS and PISA 2015: Inquiry as an instructional approach and the assessment of inquiry as an instructional outcome in science. University of Oslo. http://urn.nb.no/URN:NBN:no-74775
Google Scholar
Ulitzsch, E., von Davier, M., & Pohl, S. (2020a). A hierarchical latent response model for inferences about examinee engagement in terms of guessing and item-level non-response. British Journal of Mathematical and Statistical Psychology, 73(S1), 83–112. https://doi.org/10.1111/bmsp.12188
CrossRef
Google Scholar
Ulitzsch, E., von Davier, M., & Pohl, S. (2020b). Using response times for joint modeling of response and omission behavior. Multivariate Behavioral Research, 55(3), 425–453. https://doi.org/10.1080/00273171.2019.1643699
CrossRef
Google Scholar
Valdman, J. (2001). Log file analysis. In Department of Computer Science and Engineering (FAV UWB) (Technical Report No. DCSE/TR-2001-04) http://www.kiv.zcu.cz/vyzkum/publikace/technicke-zpravy/2001/tr-2001-04.pdf
Google Scholar
van Damme, D. (2014). How closely is the distribution of skills related to countries’ overall level of social inequality and economic prosperity? OECD Education Working Papers, 105(105), 1–23.
Google Scholar
Vista, A., Care, E., & Awwal, N. (2017). Visualising and examining sequential actions as behavioural paths that can be interpreted as markers of complex behaviours. Computers in Human Behavior, 76, 656–671. https://doi.org/10.1016/j.chb.2017.01.027
CrossRef
Google Scholar
Vörös, Z., & Rouet, J. F. (2016). Laypersons’ digital problem solving: Relationships between strategy and performance in a large-scale international survey. Computers in Human Behavior, 64, 108–116. https://doi.org/10.1016/j.chb.2016.06.018
CrossRef
Google Scholar
Xu, H., Fang, G., Chen, Y., Liu, J., & Ying, Z. (2018). Latent class analysis of recurrent events in problem-solving items. Applied Psychological Measurement, 42(6), 476–498. https://doi.org/10.1177/0146621617748325
CrossRef
Google Scholar
Zhan, P., Jiao, H., & Liao, D. (2018). Cognitive diagnosis modelling incorporating item response times. British Journal of Mathematical and Statistical Psychology, 71(2), 262–286. https://doi.org/10.1111/bmsp.12114
CrossRef
Google Scholar