Introduction

Digital literacy is an essential skill in our increasingly digital twenty-first-century society. Computational thinking (CT) emerged as part of digital literacy and led to many initiatives to embed CT in the K-12 curriculum (Yadav et al., 2016). CT was defined as a problem-solving approach, employing concepts originating from computer science (Wing, 2006). Wing argued that CT is a universally applicable skill set not just for computer scientists but also useful for everyone. More generally, computation can be seen as a lens for looking at the world (Denning, 2009). In light of this universal application, CT skills are acknowledged as relevant and important for citizens and professionals in today’s society. As a consequence, computational thinking is gaining attention in education.

Many dedicated courses for acquiring CT skills have been developed, for example, by programming robots (Atmatzidou & Demetriadis, 2016) or by programming in Scratch (Zhang & Nouri, 2019; Pérez-Marin et al., 2020). However, CT might also be addressed through existing school subjects’ curricula (Bocconi et al., 2016). For example, initiatives have been taken for integration in Science, Technology, Engineering, Arts and Mathematics (STEAM) fields (Lee & Malyn-Smith, 2020; Lee et al., 2020), music (Bell & Bell, 2018), biology (Arastoopour Irgens et al., 2020), and physics (Hutchins et al., 2020).

Because of the common grounds of mathematics and CT, mathematics is a natural candidate within school subjects to address CT aspects (Chan et al., 2022; Kallia et al., 2021). Research has been done on the integration of CT for the topics of geometry (Kynigos & Grizioti, 2018), early algebraic thinking (Bråting & Kilhamn, 2021), probability and statistics in an unplugged way (Costa et al., 2017), and with use of a wide range of tools, such as Scratch (Bigotte de Almeida et al., 2017; Calao et al., 2015), dynamic geometry software (Sinclair & Patterson, 2018), Logo (Kynigos & Grizioti, 2018), and programming in Python on a graphing calculator (Kranz et al., 2012).

Both in mathematics and computer science, data and statistics play a central role. Data analysis and data literacy (Tolboom, 2012) are becoming more important due to the omnipresence of “big” data in society and daily life. To improve data literacy, data science is an important topic to teach, covering the combination of statistical thinking, CT, and mathematics (Gould, 2021). Since statistics is a regular topic in many mathematics curricula (Franklin et al., 2007; Weiland, 2019), it provides opportunities to integrate CT into the mathematics curriculum.

Spreadsheets software is an example of software that can be used to foster computational thinking in a range of mathematical topics, including arithmetic, equations, trigonometry, and statistics (Sanford, 2018). In the case of statistics, the tool provides users the possibility to explore large data by means of calculations and thereby enables users to understand structures in the data and give meaning to what the data represents.

In this study, we research how to address CT in a statistics course within secondary mathematics education. We set up a study following a design-based research approach on the integration of CT concepts and practices in a statistics module for 16- to 17-year-old pre-university students, who enrolled in an applied mathematics course.

Theoretical Background

CT has generally been described as the “ability to engage in problem-solving, designing systems, and understanding human behavior” (Wing, 2006, p. 6). Seeing CT as an attitude and skill set for everyone and not only for computer scientists (Wing, 2006) made CT both highly relevant for educational researchers and, at the same time, a broad construct that is hard to define. Reaching a consensus for a single definition of CT appeared to be difficult in the literature (Haseski et al., 2018). Cuny et al. (2010) suggested the following definition focusing on thought processes: “Computational thinking is the thought processes involved in formulating problems and their solutions so that the solutions are represented in a form that can be effectively carried out by an information-processing agent” (p. 1).

A more specific and operational definition of CT was provided by the Computer Science Teachers Association and the International Society for Technology in Education and focused on CT as a problem-solving process. The definition included the following characteristics (CSTA & ISTE 2011, p. 1):

  • Formulating problems in a way that enables us to use a computer and other tools to help solve them;

  • Logically organizing and analyzing data;

  • Representing data through abstractions such as models and simulations;

  • Automating solutions through algorithmic thinking (a series of ordered steps);

  • Identifying, analyzing, and implementing possible solutions to achieve the most efficient and effective combination of steps and resources;

  • Generalizing and transferring this problem-solving process to a wide variety of problems.

Selby and Woollard (2013) developed the definition of CT further and argued that the definition should include the idea of a thought process, besides the concept of abstraction, and the concept of decomposition. This resulted in defining computational thinking as “an activity, often product oriented, associated with, but not limited to, problem-solving” (p. 5). They addressed computational thinking as a cognitive or thought process that reflects the ability to think in abstractions and generalizations, to think algorithmically, and to think in terms of decomposition and evaluations.

The perspective of seeing computational thinking as different thinking processes has been further developed and investigated. Kallia et al. (2021) concluded from a Delphi study among experts in mathematics and computer science education that CT includes the following thinking processes and can expand beyond this range: abstraction, decomposition, pattern recognition, algorithmic thinking, modeling, logical and analytical thinking, and generalization and evaluation of solutions and strategies.

In contrast to definitions that focus on thinking processes, Brennan and Resnick (2012) took a different stance, which aligns with our view that CT is a skill that is increasingly important in everyday life and work. To mark the multiple facets of CT, Brennan and Resnick developed a framework with three key dimensions of computational thinking in relation to design-based activities: computational concepts, practices, and perspectives. The framework was designed in the context of programming activities with a visual programming language, namely Scratch.

The first dimension, computational concepts, is related to the key principles that a programmer employs when programming. Originally focusing on Scratch programming, computational concepts involved sequences, loops, parallelism, events, conditionals, operators, and data. In the case of using an interactive tool such as Excel, the key concepts involve using Excel formulas with parameters and conditional statements, decomposition to break down a problem into smaller more manageable parts, and pattern recognition to interpret data.

The second dimension, computational practices, relates to practices developed when programming. In the context of programming in Scratch, Brennan and Resnick (2012) mention four main sets of practices: being incremental and iterative, testing and debugging, reusing and remixing, and abstracting and modularizing. In the context of data research using a spreadsheet tool, specific data practices come into play. Data practices have been defined as one of the four main categories in the taxonomy of computational thinking by Weintrop et al. (2016). In that taxonomy, data practices include collecting data, creating data, manipulating data, analyzing data, and visualizing data. From the perspective of modeling data and handling large, rich datasets, Erickson et al., (2019) introduced the term data moves to describe the actions needed to handle large data and research data, including filtering, manipulating data, merging data, creating new groupings or new measures to do analyses on datasets. In Excel, it is possible to automate procedures by programming macros. In the study described here, data moves mainly include data filtering, sorting, manipulation, and data representations in the form of charts.

Lastly, the third dimension in Brennan and Resnick’s framework concerns computational perspectives and relates to the perspectives programmers form about the world around them and themselves. Computational perspectives include expressing ideas, connecting to other people and their ideas, and questioning how CT makes sense of the world. Computational perspectives are also related to how students perceive technology and the use of digital tools. As Gillott et al. (2020) express, computational perspectives enable students to “(i) create rather consume media; (ii) use digital tools in innovative ways and (iii) question the role of technology in daily life based on an appreciation of the possibilities and limitations afforded by technology.” (p. 2). When computational thinking perspectives develop, the learner forms a “computational identity” as was found by Kong and Wang (2019). In the context of a programming course, Kong and Wang distinguished the following subcomponents of computational identity: (1) programming affiliation, (2) programming engagement, (3) programming actualization, and (4) programming goal setting. Furthermore, computational perspectives relate to problem-solving abilities. Chalmers (2018) found that “students developed a computational perspective, while, initially, some students had difficulty when their robot did not perform as planned; most students problem-solved as they identified and debugged issues with their robot build and program” (p. 98) In the context of data research using spreadsheets, students’ computational perspective includes seeing the relevance of the researched topic and related datasets. Students are confronted with how data and manipulating data using a computational tool can be used in daily or future life. In addition, students will experience the possibilities and limitations of the tool. The challenges of how to overcome difficulties in problem-solving and how to use the tool in other situations will inform the development of students’ computational perspective. In the study presented in this paper, therefore, computational perspectives include the perception of CT and its relevance for daily and future life, acknowledging the opportunities and limitations tools come with.

Many initiatives have been taken to research CT in the mathematics classroom (Lv et al., 2022). The main focus in empirical studies integrating CT and mathematics has been on primary education, and studies concentrate on the topics of geometry and number operations (Lv. et al., 2022). For secondary education, ideas have been developed about addressing CT in statistics, for example, data moves (Erickson et al., 2019), data-scientific thinking (Gould, 2021), and the use of spreadsheets (Valovičová et al., 2020). A link between CT and mathematics might be problem-solving (Finsterbach Kaup, 2022). A problem-solving approach for teaching spreadsheets to develop CT skills appeared to be effective and provided longer-lasting knowledge than the traditional surface, tool-centered approach (Csernoch et al., 2021). More empirical research is needed, however, to extend the knowledge about how to integrate CT in the mathematics classroom for the topic of statistics and data science at the secondary level.

For this study, we used a computational approach to statistics by applying the framework developed by Brennan and Resnick (2012), containing computational concepts, practices, and perspectives in the context of data research. As such, this study aims to add more insight into how to address CT in the high school topic of statistics using spreadsheets. Our research questions are:

  1. 1.

    How can CT concepts and practices be addressed in 11th-grade statistics lessons using spreadsheet software?

  2. 2.

    Concerning computational perspectives, how do students perceive CT and its relevance?

Method

This study followed a design-based research approach (Cobb et al., 2003) and explored the design of a lesson series about integrating CT into mathematics for the topic of statistics.

Research Context and Design

The lesson series was co-designed by the researchers and teachers in a larger project about computational and mathematical thinking. Part of the project investigated the topic of calculus using dynamic geometry software (Van Borkulo et al., 2021). The study reported in the present paper handles the topic of statistics using spreadsheets. This topic aligns with the subject of statistics, which is an important part of the program for the stream of applied mathematics in the Dutch curriculum that focuses on informal mathematics, modeling, and real-life applications, and is offered to students in the school education track that prepares for social sciences and humanities. The project aimed to explore ways to involve more students in CT activities by broadening the scope toward other school subjects and focusing on more diverse content. This is especially relevant in the Netherlands, since computing is not a mandatory part of the curriculum.

The activities address both computational and mathematical thinking in the mathematics classroom and focus on using a digital tool rather than coding or programming. We aimed to integrate CT in “conventional” mathematics curricula using commonly used and accessible mathematical tooling, such as spreadsheets, to convince teachers of the interest and to acknowledge time constraints. Since the development of CT skills seems to benefit from combining digital “plugged” computer approaches and non-digital “unplugged” approaches (Caeli & Yadav 2020), we sought a balance between plugged and unplugged activities in the design. The unplugged activities focus on defining the steps in the problem-solving process, i.e., how to analyze data to answer a question about the data, determine which mathematical concepts are needed, and how to use the tool to get the results and reflect on the outcomes. The plugged activities focus on using spreadsheets and their functions to explore and analyze data. We employ the tool Excel to make students use digital tools that are relevant for both mathematics and their future life and, at the same time, are accessible to students, teachers, and the school. Excel has a low threshold to use, because students and teachers generally have some experience in using Excel. Alternatives considered were R, which has many features but requires too much extra learning for the students and teachers, or Jupyter notebooks, which involve technology that is not commonly used in schools.

To help students with the use of the tool Excel, we added scaffolding introductory activities. In the iterative process of designing the materials, we decided to add even more scaffolding in the form of detailed stepwise instructions related to the use of Excel, as we initially overestimated students’ skills in using spreadsheets. However, the scaffolding appeared to be too fine-grained for students who already have some skills in Excel. Therefore, in the final version of the lesson series presented in this paper, we offered some scaffolding in the form of an Excel cheat sheet, in which they could look up information, examples, tips, and tricks that were relevant to the given tasks.

The focus of the lesson series is on the CT aspects of problem-solving, data moves, and algorithmic thinking. The aspect of problem-solving is employed by using Excel as an investigation tool. Data moves are implemented in sorting, selecting, filtering, recoding, and aggregating data. Algorithmic thinking is addressed by translating the data analysis steps into computational solutions and by using Excel formulas. Formulas in Excel are commands that help to compute data characteristics, for example, the mean of some set of numbers or the number of occurrences of a specific string among a data column. Formulas help students in the process of generalization in the sense that students see the outcome of a formula changing when the data changes or when parameters or conditions are changed. An example of a formula using parameters and conditions to determine the number of female adults in a dataset with columns gender (A) and age (B) is COUNTIFS (A:A, “female,” B:B, “ >  = 18”). Setting the parameters and conditions in this formula selects the relevant cases. In this case, the cases are counted for which column A contains the value “female,” and the number in column B is equal to or above 18.

The lesson series that the students worked on consisted of two parts (see lesson materials on https://surfdrive.surf.nl/files/index.php/s/y3IkEr45TFHuIC1). The first part was about the dataset of the Titanic (https://surfdrive.surf.nl/files/index.php/s/6pFgL7pKs6dEycE) containing passenger information (see Fig. 1) and asked students to investigate factors that influenced the survival rate. The leading question was “Women and children first?.” Students needed to investigate whether or not it was true that women and children were saved first during the Titanic disaster. The second part offered a choice of three different datasets related to school grades to set up an own research question and investigate the data. The second part concluded with writing a report on one’s own data research for the dataset of their choice. The Excel cheat sheet with tips and tricks guided the students in the possibilities of using formulas in Excel (see https://surfdrive.surf.nl/files/index.php/s/6pFgL7pKs6dEycE).

Fig. 1
figure 1

Snapshot of the Titanic dataset (picture by Francis Godolphin Osbourne Stuart—http://www.uwants.com/viewthread.php?tid=3817223&extra=page%3D1). Note: This figure demonstrates part of the Titanic dataset (with columns passenger class (pclass), survived, passenger name, sex, and age (in years, fractional if age less than 1)), and an example of how conditional statements can be used

In four lessons of 120 min each, the students worked in small groups of two to four students in the workbook and Excel files, providing plugged and unplugged activities. All students were asked to fill in the tasks in their workbooks. The final task in the workbook was to write a report. The tasks in Excel were group work. Some students worked alone if they preferred or, for example, because they worked at home in quarantine.

Participants

The participants in the study were 58 pre-university students aged 16 to 17 years from three classes with two teachers, recruited from one of the project’s partner schools, a middle-sized school in the Netherlands. The students had taken an Excel course in a previous school year in which they learned the basics of Excel, such as entering, sorting, and filtering data, creating columns, adding graphs and tables, and using basic statistical formulas, such as AVERAGE and MEDIAN. The course followed a directive approach and guided the students step by step in what they needed to do and where to click. Coming from different school education tracks, the specific moment of the Excel course differed slightly among the students.

Data Collection and Analysis

To evaluate the feasibility of our design, the workbooks and Excel files were analyzed with respect to the completeness of the answers and with respect to computational concepts and practices.

We studied the students’ learning experiences in the CT-embedded lesson series about using spreadsheets in statistics. To investigate students’ perceptions of computational practices, concepts, and perspectives, we conducted semi-structured interviews. In the last of the four lessons, we invited all students, and 27 students voluntarily participated individually in semi-structured interviews about their perception of the lesson series and learning CT and how they handled data moves in the tasks (see Table 1). Depending on the student’s perspective, which is hard to ask for directly, follow-up questions were asked to deepen emerging topics. All interviews were recorded and transcribed and then coded. The coding process took Brennan and Resnick’s framework of computational concepts, practices, and perspectives as a starting point and grouped codes into categories. For example, the quotation “if you make a small mistake in the Excel formulas, the whole answer was wrong and I found it hard to find the mistakes I made” was coded as “syntax problems,” grouped into the category “formulas” and organized under “computational concepts.” Thereby, we gave space for themes to emerge and to frame them into the framework of computational concepts, practices, and perspectives. The coding process was performed by two researchers in continuous collaboration and discussion to ensure agreement.

Table 1 Main questions in semi-structured student interviews

After the lesson series, we administered a 15-min questionnaire about previous experience with Excel and programming and the students’ perceptions of aspects of the lesson series, such as difficulty and importance (see Table 2). All students were invited to complete the questionnaire, and 49 students voluntarily participated.

Table 2 Questions in the student questionnaire

Overall, the data included digital workbooks (58 students), reports (38 students), Excel files (16 groups), questionnaires (49 students), both teachers’ logbooks, and interview transcriptions (27 students, see Table 3). The used code themes in the analysis of the interviews and their frequencies are listed in Table 4.

Table 3 Overview of data sources and relation with research questions
Table 4 Overview of interview code themes, categories, and frequencies in interview transcriptions

Results

The results of the study are organized into a general section about student achievement, then sections computational concepts and computational practices that both address the first research question, and finally a section about computational perspectives that address the second research question.

General Impression of Student Achievement

Overall, most students were able to complete the tasks in the lesson series, indicating the feasibility of the design of the lesson series. The average success rate for the Titanic tasks was 82%. The second part about grades had more missing data because of time limitations and had an average success rate of 66%. The last task of the grades part, writing a report, was completed by 38 students. The reports were mainly short essays of one or two pages (36 of 38 reports), and two reports were extensive (four to seven pages).

Computational Concepts

The computational concepts that students employed in solving problems using Excel as an investigation tool (see research question 1) were related to using formulas and, more specifically, to using parameters and conditional statements within formulas in Excel.

Formulas

To investigate the Titanic dataset and factors that influenced the survival rate, the relevant formula to be used was COUNTIFS with various criteria, for which they could find information in the cheat sheet. In the Titanic tasks, most students used Excel formulas to do calculations; 55% described formulas in the workbook, and 88% demonstrated the use of formulas in the Excel file, 75% correctly. The second part about grades gave students the freedom to choose their research question and led to less-frequent use of formulas. In the final report about their own investigations, 63% described the use of Excel formulas: 50% used AVERAGE, and 11% used COUNTIFS.

In the interviews, we inquired the students’ ideas about the formulas and how they used them. In general, the use of the formula COUNTIFS was discussed with 24 of the 27 students. The use of formulas was most frequently mentioned as a difficult part of the tasks (20), 8 times it was mentioned as easy. The challenges students encountered related to using Excel formulas were syntax problems (8 students), caused by typing errors or using wrong characters, for example, a wrong separator (“;” versus “:”), and the use of criteria (7 students). The cheat sheet helped in using the formula correctly (6 students mentioned this explicitly). Other resources mentioned were watching videos about how to use formulas (2 students) and asking the teacher (1 student). In addition to the interview data, in the questionnaire, more than 73% of the students reported that the teacher helped them sometimes or often, and almost half of the students (57%) indicated that they used the internet for help sometimes or often. Apparently, the lesson design and the teaching setting invited a collaborative learning approach.

Besides Titanic passengers’ gender and age combinedly influencing the survival rate (following from the task’s question “women and children first?”), other factors that were investigated during the Titanic tasks were age as a singular variable and passenger class, respectively. In investigating these additional factors, the workbooks showed that only 8 students used Excel formulas, 14 used a frequency table, and the rest used less-specific strategies to answer the questions, such as sorting or calculating percentages without using Excel formulas. It seemed that applying the previously frequently used formula COUNTIFS in a different situation was hard and not trivial for the students.

Parameters

In the Titanic tasks, students started with calculating the number of survivors using the formula COUNTIFS. In this formula, they had to indicate relevant column parameters to differentiate between adults and children, and survivors and non-survivors, respectively. In the questionnaire, more than half of the students (57%) reported that the use of parameters in formulas in Excel was difficult or very difficult. Nevertheless, despite this perceived difficulty of using parameters, in the Excel files, 12 of the 14 groups showed a correct use of COUNTIFS. Apparently, students used several resources, mostly the teacher and peers, to overcome the difficulties as indicated in the questionnaire (see Fig. 2). More specifically, in the interviews, 11 students mentioned using the cheat sheet in case they did not exactly know how to use a formula.

Fig. 2
figure 2

Used resources as indicated by the students in the questionnaire

Figure 3 below shows an excerpt from a student’s workbook, describing the use of COUNTIFS for investigating whether women and children were saved first. This student used the cells G8, G9, and G10 as variables to define the COUNTIFS formula’s criteria for the columns survived, gender, and age, respectively. The listed cells contain the values that are used in the comparison of the criteria. The use of parameters in COUNTIFS required the students to apply logical thinking and use conditional statements. In addition, this student created a more general solution by using variables.

Fig. 3
figure 3

Excerpt from the workbook of a student, Titanic task 5 (with columns B, survived; D, gender; and G, age)

Conditional Statements

To answer the tasks about the Titanic dataset, it was necessary to use multiple conditional statements. This was experienced as difficult by the students. In the questionnaire, almost a third of the students reported the use of conditional statements as difficult to very difficult (33%). In the interviews, seven students mentioned conditional statements as being difficult. For example, one student said: “But when you have to use multiple criteria, it gets a bit more complicated. Which column should you select then? Then you do from 1 to something, but do you have to take the same criteria from one column? Or can you take criteria from the other column? That’s where I got a little stuck. But eventually, I figured it out.

Computational Practices

The main computational practice that the students engaged in was the practice of data moves, including data exploration, manipulation, filtering, sorting, and representing data in tables and diagrams. The Excel files showed for the Titanic tasks that almost all students used sorting to explore the data (15 of the 16 Excel files included sorting) and used formulas to find the number of survivors for the different groups of passengers (14 of the 16 Excel files). The Excel files, furthermore, showed data manipulation in the form of the creation of a new column, defining categories for variables, and, for a small part, the use of graphics (see Fig. 4). The data practices reflected in the reports were the manipulation of data (60%) and the use of graphics (23%).

Fig. 4
figure 4

Number of student groups demonstrating data moves in Titanic Excel file (N = 16)

Data Moves: Filter

The use of data practices was also reflected in the workbooks and the interviews, especially for the Titanic part. For example, in the interviews, five students elaborated on the process of filtering, and nine students the process of sorting in finding the numbers. One student explained how she used filtering in investigating the influence of passenger class on the survival rate: “if you have a column, then there is this triangle you can click. I had clicked, for example, class and then you saw that more people had survived in the first class than in the third class.” Another student stated he encountered problems when using filters: “Then I tried to calculate the average for each year, but I kept finding the same answer.” For most students, filtering and sorting served as orientations on the dataset. As a student explained in the interview, filtering by clicking different values in the column header helped her gain insight into the data, and she used it to be able to use a formula correctly as a next step.

However, some students seemed to avoid the use of formulas and conditions and used filtering less efficiently. In the interviews, two students explicitly mentioned using techniques that hindered finding exact answers. One student explained the use of scrolling: “We hadn’t done that (use a conditional statement). We had just sorted and then looked at what is where.”, “We had just looked, first sorted, and then we had noted the numbers (…) and then how many on average survived.” Another student used estimation techniques to approximate the answer: “You can just look very quickly, if you see survivors, a lot of women who didn’t survive, you can just look that way. Not really counted per se.” Filtering is a useful way to gain insight into the structure of the data. Nevertheless, it appeared to be an insufficient technique to solve the problem at hand, leading this student to not find an exact answer.

Data Moves: Sort

A technique that was less efficient, but often resulted in a correct answer, was sorting and then copying and pasting a selection of the data to another data sheet, as described by a student: “sorted again and then… I don’t remember very precisely, but I think all the (passenger) classes are in another tab.” When asked if she would do it the same way next time, the student answered “I would rather use formulas for that though, like that COUNTIFS, but I didn’t really know how that worked yet.” In this case, the limitations of the sorting technique made the student realize that formulas are a more efficient way to do calculations.

Data Moves: Representations

In the Titanic tasks, the results were mainly represented in tables and numbers. Only two groups included graphs in their Excel sheets. In the grades tasks, students had more freedom to choose their own problem approach, and this inclined more students to use graphical representations of their results. More specifically, in the grades reports, students demonstrated what they have learned in the dataset of their own choice with their own research question, for which nine students used graphics.

Computational Perspectives

In the context of computational perspectives, we investigated the perceptions of the students with respect to the relevance of CT for their future life and their perceptions of the learning of CT in the lesson series and the use of Excel.

Students’ Perception of Relevance for Daily and Future Life

When talking about the added value of the lesson series and the topic of computational thinking and statistics, the relevance of using Excel for data science in daily or future life was mentioned by 14 of 27 students in the interviews. In the interviews, the reasons mentioned why they think it is relevant were: you might use it in future jobs, it helps you track personal finances, it is about real problems with real data, and might give insight into Corona infections, for example, it helps to handle large datasets. In the questionnaire, almost half of the students (49%) mentioned that what they learned in the lesson series about CT was useful for their future life or study.

About the use of Excel in mathematics lessons, the following advantages were mentioned in the questionnaire related to the usefulness of technology for problem solving and understanding: learning to handle large data, helping you calculate and compare, requiring less effort, and preparing you for the digital future. Disadvantages of using Excel in mathematics mentioned in the questionnaire were the tool is complicated, you need technical skills and knowledge of the tool, debugging can be a hassle, and it takes time from the exam topics.

In the interview, one student had a nuanced view that learning Excel is handy for future life, but that there must be a purpose for using it. In the questionnaire, students acknowledged that you need technical equipment in the class and that students need specific technical skills to use a tool such as Excel. Using technology might distract some, a student remarked.

In the second part of the lesson series, the relevance for the students’ daily file came into play when students chose a dataset and worked on a research question of their own choices. Choosing a dataset was valued as fun or interesting by 10 students. One student explained about the second part: “You could choose more yourself. You had more of your own influence. Normally, you just have to do what the book says and now you could think of your own questions. That was more fun, because you could explore following your own interests, like going out. Of course, we came up with those questions ourselves. That does make it more fun than just answering questions.” Another student explained how the second part about researching a question of their own choice increased motivation: “we wanted to look at the relationship between school level and free time. I thought that was kind of interesting. How much of an impact that had on your free time, so we’re going to look at that. Then you can see for yourself where your interest lies. If you do something, which you don’t like, then you’re not going to like doing it. Then you won’t put more energy into it.

Students' Views on CT

In general, students positively valued working on CT in the lesson series. In the questionnaire, 71% of the students were neutral or positive with respect to the lesson series. In the interviews, 14 students explicitly mentioned it was fun. The positive remarks include something new, eye opening, from real life, being more independent, thinking for yourself, and letting the computer work for you. Five students explicitly stated that they did not like the lesson series. The reasons they mentioned for their negative responses were feeling like a guinea pig instead of learning something, not having clear guidelines on how to solve the tasks, not liking mathematics at all, and having a preference to write by hand. One student noticed that she liked the topic of data research but just did not like changes so much.

The difference with regular mathematics tasks was positively valued by 15 students in the interviews. For example, one student really enjoyed the project and expressed in the interview that this lesson series gave her a new perspective on things and some “flair” to the learning. The new approach gave her a perspective on how to use mathematics and Excel to solve problems in realistic situations, what she called “perspective on reality. In the questionnaire, this student stated that the way Excel was used in this lesson series gave better preparation for future jobs than normal lessons.

With respect to the use of the tool Excel for learning CT, advantages were mentioned 44 times in the interviews by all except one student (26 students). The advantages mentioned in the interviews were that it is useful for gaining insight, having an overview, structuring large datasets, working faster, and having formulas. For example, one student expressed that Excel is useful for organizing large data, when it is impossible to calculate things by hand.

The disadvantages of using Excel were mentioned in the interviews 28 times by 15 students, for example, that you need the skills to use the tool, that you have to know the formulas, that there are so many formulas—it is hard to pick the correct one, and the disadvantage of lacking overview. A student mentioned “if you type in a formula Excel, (…) you cannot actually see what it’s doing. And so I think that’s why for me to work with workbooks is better.” Another student expressed that using Excel might cause frustration: “If you can do it, I think it’s helpful, but if you can’t, it’s super-frustrating.”

In the interviews, we asked students about their views on CT. The concept of CT was only shortly introduced in the lessons series and mainly addressed by practicing the approach and building understanding during the tasks. After the lesson series, the students appeared to have developed ideas on the concept of CT. Computational thinking was mainly perceived as a step-by-step problem-solving approach (mentioned by 12 students in the interviews), or, more generally, as a different kind of thinking (three students). For example, one student perceived CT as “logical, step-by-step thinking, in a way that you observe a problem, see how you could solve it, try to solve it, and then see how successful it is compared to the original.” She expressed that this is not new to her but more sophisticated or fine-grained than she was used to, i.e., looking accurately step-by-step.

Strategies were an important factor in computational problem solving. Students applied different strategies to solve the tasks in the lesson series. For example, in the interviews, breaking down the problem into easier pieces was mentioned by a student: “And then finally we just made it a little easier. Then it worked out.” Another relevant strategy that appeared to be a useful problem-solving strategy in the context of CT tasks was persistence. One student described an encountered difficulty in Excel: she had selected the wrong cell as a parameter in a formula in Excel, and she interpreted the result as apparently incorrect. To solve the problem, she applied the strategy to step back and try again: she went back to the data, started at the last point where things were still correct, and then tried again. This led to the correct answer. In the end, she thought it was fun to solve the issues and felt a satisfying “I did it!” when she succeeded. Furthermore, the same student thought it was not always clear what she was supposed to do during the tasks in the lesson series. Sometimes, she could not see the forest for the trees, but her strategy to persist and approach the problem differently appeared effective (“one way or the other”). Another student mentioned that the lessons were about “Thinking about the problem differently and finding a solution to it.

Conclusion and Discussion

Reflections on Results

In this study, following the framework of Brennan and Resnick (2012), we researched how computational concepts, practices, and perspectives could illustrate the learning of CT and the impact on students’ perspectives on CT. In general, the findings show that the lesson series for 16- to 17-year-old pre-university education students was feasible and triggered computational thinking learning processes related to data research. Moreover, the use of a common tool such as Excel in researching familiar, real-life data seemed fruitful and the lesson series triggered awareness of the relevance of CT in real life.

Concerning the first research question on how CT concepts and practices can be addressed in 11th-grade statistics lessons using spreadsheet software, we conclude from the results that students were able to combine data using formulas with parameters and conditions to extract information about the dataset and solve the problems. At first, students encountered difficulties, especially when using multiple conditional statements, which is a known issue in human reasoning (Byrne & Johnson-Laird, 2009). However, with the help of the workbook, the teacher, peers, and resources, such as the cheat sheet and online documentation, they succeeded for the most part. The duration of the intervention was limited and, therefore, also the extent to which students used the possibilities of the tool, e.g., the number of formulas used. Therefore, we conclude that the first steps in employing CT concepts and practices have been made.

Nevertheless, the efficiency of data practices could be improved for many students. We often saw students sort and calculate on a specific range, while, in the next step in problem-solving, they changed the sorting, and the calculations were out of date. This is a weak point in the spreadsheet calculation, a so-called smell (Hermans et al., 2012. In the example, one change in the spreadsheet often resulted in the need to make a lot of little changes in several places. More awareness of these pitfalls would improve the use of the tool and steer the computational thinking process. The assessment of the quality of a spreadsheet is a relevant issue and important in future professional life. This issue especially comes into play when working with different worksheets, as the students did in the second part choosing the dataset of their own interest. Therefore, an additional focus on the quality of the solution would have added value.

More in general, the technical skills to use a tool such as a spreadsheet tool in a proficient way are crucial for successful computational problem-solving, as appeared during the design process of the materials in our study and from the results of the intervention in the classroom. Although we live in a digital society where technology is omnipresent, digital skills are not easily acquired or trivial (Kirschner & De Bruyckere, 2017). Before being able to solve problems using a tool, students need a certain level of skills and knowledge of the tool. Our design with the scaffolding of the tasks and the cheat sheet enabled the students, who had a wide variety of prior knowledge, to connect to their level of understanding and to complete missing knowledge. The cheat sheet was used according to the individual needs of the students and appeared to be effective.

Concerning the second research question about computational perspectives on how students perceive CT and its relevance, the study revealed some interesting prospects. From the questionnaire and the interviews, when reflecting on the use of tools and addressing CT in the mathematics lessons, students showed a high level of awareness of the relevance of the use of spreadsheets and CT in mathematics and a positive perception of the benefits of CT and its importance for their (future) work and daily life. Indeed, there is an important relationship between mathematics and daily life in the sense that mathematics prepares for real-life challenges (Jawad, 2022). In addition, offering real-life problems is an effective pedagogical approach to developing CT skills (Bocconi et al., 2022). However, it was not always clear to the students what exactly it was that they were learning. On the one hand, students were familiar with the topic of data analysis and the tool Excel; on the other hand, the computational approach was new and required some experimentation with the tool to find solutions. This might have confused students with respect to the aimed learning goals. Despite these ambiguities, most students showed awareness of the importance of the CT approach.

An additional notable finding related to computational perspectives was that not everyone appreciated the new approach. One student felt like a “guinea pig,” but most students valued working on developing new skills on relevant topics from real life. Following one’s own interests in the second part by having some freedom in choosing one’s own dataset and related research question was very stimulating and seemed to connect the students’ learning to their own interests. This is in line with the finding that providing relevant contexts and personalized instruction for students’ interests is a powerful way to support learning and, thereby, motivate students (Walkington, 2013). Motivation affects persistence in learning (Vollmeyer & Rheinberg, 2000), and this is an interesting and relevant mechanism, since we found that the problem-solving process in the CT tasks required creativity and persistence to succeed. It would be worthwhile to investigate the relation between motivation and persistence to see how to enhance students’ computational problem-solving skills.

Limitations of the Study

Overall, the study showed the potential of using spreadsheets at secondary school for developing computational concepts, practices, and perspectives. However, the study also had some limitations. The study was performed in one school with three classes with two highly motivated teachers that voluntarily participated in the project. This situation is not representative of schools in the Netherlands. A second limitation was the Corona pandemic, which influenced the presence and participation of students. While it was possible to do in-depth, qualitative research in this smaller-scale setting, it would be interesting to perform the teaching experiments in more neutral circumstances and to investigate the learning gains more quantitatively over a longer period. Furthermore, with respect to the students’ perspectives, we only unveiled students’ initial views on CT and its relevance in their lives. Further research over a longer period of time is needed to investigate the development of students’ computational perspectives in more detail and inquire possible shifts in their perspectives.

Implications for Educational Practice

In reflection on these conclusions, we see some implications for educational practice. With the overarching goal to prepare all members of our society for the challenges related to our current digital age, an important aspect to take into account is the diversity among people and broadening participation in computing to different people. Diversity can be dealt with in education, for example, by paying attention to people with different interests, e.g., sciences- vs. humanities-oriented people as researched by Katai (2020), or computer science vs. non-computer science-oriented people (Hsu et al., 2021), and other groups are, of course, conceivable. In our study, we aimed to involve more students in CT activities by broadening the scope toward other school subjects such as mathematics and focusing on more diverse content. Different student interests were addressed in the open second part of the lesson series where students choose their own topics and research questions. This could be developed further by adding a larger variety of datasets including both sciences and humanities topics, among other topics. Students might also take a more active role and create their own datasets and work on more personally meaningful data. This might also address different people with different cognitive styles and different learning levels.

When integrating CT into learning activities in another subject such as mathematics, an important aspect is the assessment of the learning gains of CT activities (Bortz et al., 2020). Especially, when two fields of learning, CT and mathematics, are in play, it appears hard to measure learning gains. Bortz et al. (2020) emphasize the importance of assessment approaches that reflect “multiple ways of knowing and doing,” for example, by using integrated assessment tasks on the one hand and, on the other hand, also use items that assess multiple facets from each domain that are not fully integrated. Assessment issues are also addressed by Weintrop et al. (2014) who promote authentic assessment in the context of CT integrated into STEM classrooms. Further studies might put focus on efficient assessment methods by developing a variety of assessment items, in addition to reviewing student work from the instruction, such as workbooks and Excel files.

Outlook on Future Research

In conclusion, this study provides a better view on what to teach when learning CT by providing concrete starting points for the integration of computational concepts, practices, and perspectives in mathematics topics relating to authentic contexts. Since CT is a fundamental skill that requires continuous development, it is a time-consuming process that involves repeated practice. Further studies need to explore how CT skills in the context of spreadsheets can be further developed and how CT can be integrated into other parts of the mathematics curriculum touching different topics. It would be worthwhile to explore how relevant contexts with respect to student interests and the real world can be addressed in meaningful computational problem-solving within mathematics subjects, addressing the topics of motivation and diversity and thereby broadening participation in the field of computing.