1 Introduction

With rapidly developing technology, the number of children using mobile handheld devices has increased drastically (Rideout et al., 2010; Squire, 2006). Technologies and digital enhancements that use the internet have become a part of the daily life of school-age children (Kennedy et al., 2008), and education evolves in line with the changing technology. Rapidly changing innovation technologies have changed the characteristics of learners in the fields of knowledge, skills, and expertise that are valuable for society, and circumstances for teachers and students have changed over time (Yuen et al., 2011). Almost every school subject incorporates technological devices into the pedagogy to different extents, but science teachers are the most eager to use technological devices in science classes because of the nature of the content they are expected to teach.

The COVID-19 pandemic has had an important impact on educational systems worldwide. Due to the fast-spreading of that disease, the educators had to adapt their classes urgently to technology and distance learning (Dietrich et al., 2020), and schools have had to put more effort into adapting new technologies to teaching. Z generation was born into a time of information technology, but they did not choose distance courses that were not created for them so they are not motivated during the classes (Dietrich et al., 2020). Directing students’ interest in the course content is challenging, while their interest has changed by this technological development. The solution to this challenge emerges through creative pedagogies that integrate the instructional methods with new striking technology. Augmented reality has demonstrated high potential as part of many teaching methods.

2 Literature Review

2.1 Augmented Reality, Education, and Science Education

AR applications have important potential for many areas where rapid transfer of information is important. This is especially effective for education. Science education is among the disciplines where rapid information transfer is important. Taylor (1987, p. 1) stated that “the transfer of scientific and technological information to children and to the general public is as important as the search for information.” With the rapid change in science and technology and outdating of knowledge, learning needs rapid changes in transfer of information (Ploman, 1987). Technology provides new and innovative methods for science education and could be an effective media in promoting students’ learning (Virata & Castro, 2019). AR technology could be a promising teaching tool for science teaching in which AR technology is especially applicable (Arici et al., 2019).

Research shows that AR has great potential and benefits for learning and teaching (Yuen et al., 2011). The AR applications used in teaching and learning present many objects, practices, and experiments that students cannot obtain from the first-hand experience into many different dimensions because of the impossibilities in the real world, and it is an approach that can be applied to many science contents that are unreachable, unobtrusive, and unable to travel (Cai et al., 2013; Huang et al., 2019; Pellas et al., 2019). For example, physically unreachable phenomena such as solar systems, moon phases, and magnetic fields become accessible for learners through AR (Fleck & Simon, 2013; Kerawalla et al., 2006; Shelton & Hedley, 2002; Sin & Zaman, 2010; Yen et al., 2013). Through AR, learners can obtain instant access to location-specific information provided by a wide range of sources (Yuen et al., 2011). Location-based information, when used in particular contextual learning activities, is essential for assisting students’ outdoor learning. This interaction develops comprehension, understanding, imagination, and retention, which are the learning and cognitive skills of learners (Chiang et al., 2014). For example, an AR-based mobile learning system was used in the study conducted by Chiang et al. (2014) on aquatic animals and plants. The location module can identify the students’ GPS location, direct them to discover the target ecological regions, and provide the appropriate learning tasks or additional resources. When students explore various characteristics of learning objects, the camera and image editing modules can take the image from the real environment and make comment on the image of the observed things.

Research reveals that the use of AR technology as part of teaching a subject has the features of being constructivist, problem solving-based, student-centered, authentic, participative, creative, personalized, meaningful, challenging, collaborative, interactive, entertaining, cognitively rich, contextual, and motivational (Dunleavy et al., 2009). Despite its advantages and although the use of AR in science education is increasing, the integration of AR into science classes is still naive, and teachers still do not consider themselves as ready for use of AR in their class (Oleksiuk & Oleksiuk, 2020; Romano et al., 2020) and choose not to use AR technology (Alalwan et al., 2020; Garzón et al., 2019), because most of them do not have the abilities and motivation to design AR learning practices (Garzón et al., 2019; Romano et al., 2020). It is thought that the current study will contribute to the use of AR in science lessons and how science teachers will include AR technology in their lessons.

2.2 Argumentation, Critical Thinking, and Augmented Reality

New trends in information technologies have contributed to the development of new skills in which people have to struggle with a range of information and evaluate this information. An important point of these skills is the ability to argue with evidence (Jiménez -Aleixandre & Erduran, 2007) in which young people create appropriate results from the information and evidence given to them to criticize the claims of others in the direction of the evidence and to distinguish an idea from evidence-based situations (OECD, 2003, p. 132).

Learning with technologies could produce information and misinformation simultaneously (Chai et al., 2015). Misinformation has spread very quickly in public in COVID-19 pandemic, so the lack of the ability to interpret and evaluate the validity and credibility of them arose again (Saribas & Çetinkaya, 2021). This process revealed the importance of developing students’ critical thinking skills and argumentation abilities (Erduran, 2020) to make decisions and adequate judgments when they encountered contradicting information (Chai et al., 2015).

Thinking about different subjects, evaluating the validity of scientific claims, and interpreting and evaluating evidence are important elements of science courses and play important roles in the construction of scientific knowledge (Driver et al., 2000). The use of scientific knowledge in everyday life ensures that critical thinking skills come to the forefront. Ennis (2011, p. 1) defined critical thinking as “Critical thinking is reasonable and reflective thinking focused on deciding what to believe”. Jiménez-Aleixandre and Puig (2012) found this definition very broad, and they proposed a comprehensive definition of critical thinking that combines the components of social emancipation and evidence evaluation. It contains the competence to form autonomous ideas as well as the ability to participate in and reflect on the world around us. Figure 1 summarizes this comprehensive definition.

Fig. 1
figure 1

Argumentation levels by groups

Critical thinking skills that include the ability to evaluate arguments and counterarguments in a variety of contexts are very important, and effective argumentation is the focal point of criticism and the informed decision (Nussbaum, 2008). Argumentation is defined as the process of making claims about a scientific subject, supporting them with data, using warrants, and criticizing, refuting, and evaluating an idea (Toulmin, 1990). Argumentation as an instructional method is an important research area in science education and has received enduring interest from science educators for more than a decade (Erduran et al., 2015). Researchers concluded that learners mostly made only claims in the argumentation process and had difficulty producing well-justified and high-quality arguments (Demircioglu & Ucar, 2014; Demircioglu & Ucar, 2015; Cavagnetto et al., 2010; Erdogan et al., 2017; Erduran et al., 2004; Novak & Treagust, 2017). To improve the quality of arguments, students should be given supportive elements to produce more consistent arguments during argumentation. One of these supportive elements is the visual representations of the phenomena.

Visual representations could make it easier to see the structure of the arguments of learners (Akpınar et al., 2014) and improve students’ awareness. For example, the number of words and comments used by students or meaningful links in conversations increases with visually enriched arguments (Erkens & Janssen, 2006). Sandoval & Millwood (2005) stated that students should be able to evaluate different kinds of evidence such as digital data and graphic photography to defend their claims. Appropriate data can directly support a claim and allow an argument to be accepted or rejected by students (Lin & Mintzes, 2010). Enriched visual representations provide students with detailed and meaningful information about the subject (Clark et al., 2007). Students collect evidence for argumentation by observing enriched representations (Clark et al., 2007), and these representations help to construct higher-quality arguments (Buckingham Shum et al., 1997; Jermann & Dillenbourg, 2003). Visualization techniques enable students to observe how objects behave and interact and provide an easy-to-understand presentation of scientific facts that are difficult to understand with textual or oral explanations (Cadmus, 1990). In short, technological opportunities to create visually enriched representations increase students’ access to rich data to support their arguments.

Among the many technological opportunities to promote argumentation, AR seems to be the most promising application for instructing school subjects. AR applications are concerned with the combination of computer-generated data (virtual reality) and the real world, where computer graphics are projected onto real-time video images (Dias, 2009). In addition, augmented reality provides users with the ability to see a real-world environment enriched with 3D images and to interact in real time by combining virtual objects with the real environment in 3D and showing the spatial relations (Kerawalla et al., 2006). AR applications are thus important tools for students’ arguments with the help of detailed and meaningful information and enriched representations. Research studies using AR technology revealed that all students in the study engaged in argumentation and produced arguments (Jan, 2009; Squire & Jan, 2007).

Many studies focusing on using AR in science education have been published in recent decades. Research studies related to AR in science education have focused on the use of game-based AR in science education (Atwood-Blaine & Huffman, 2017; Bressler & Bodzin, 2013; Dunleavy et al., 2009; López-Faican & Jaen, 2020; Squire, 2006), academic achievement (Hsiao et al., 2016; Faridi et al., 2020; Hwang et al., 2016; Lu et al., 2020; Sahin & Yilmaz, 2020;, Yildirim & Seckin-Kapucu, 2020), understanding science content and its conceptual understanding (Cai et al., 2021; Chang et al., 2013; Chen & Liu, 2020; Ibáñez et al., 2014), attitude (Sahin & Yilmaz, 20200; Hwang et al., 2016), self-efficacy (Cai et al., 2021), motivation (Bressler & Bodzin, 2013; Chen & Liu, 2020; Kirikkaya & Başgül, 2019; Lu et al., 2020; Zhang et al., 2014), and critical thinking skills (Faridi et al., 2020; Syawaludin et al., 2019). The general trend in these research studies based on the content of “learning/academic achievement,” “understanding science content and its conceptual understanding,” “motivation,” “attitude,” and methodologically quantitative studies was mostly used in articles in science education. Therefore, qualitative and quantitative data to be obtained from studies investigating the use of augmented reality technology in education and focusing on cognitive issues, interaction, and collaborative activities are needed (Arici et al., 2019; Cheng & Tsai, 2013).

Instructional strategies using AR technology ensure interactions between students and additionally between students and teachers (Hanid et al., 2020). Both the technological features of AR and learning strategies should be regarded by the teachers, the curriculum, and AR technology developers to acquire the complete advantage of AR in student learning (Garzón & Acevedo, 2019; Garzón et al., 2020). Researchers investigated the learning outcomes with AR-integrated learning strategies such as collaborative learning (Baran et al., 2020; Chen & Liu, 2020; Ke & Carafano, 2016), socioscientific reasoning (Chang et al., 2020), student-centered hands-on learning activities (Chen & Liu, 2020), inquiry-based learning (Radu & Schneider, 2019), concept-map learning system (Chen et al., 2019), problem-based learning (Fidan & Tuncel, 2019), and argumentation (Jan, 2009; Squire & Jan, 2007) in science learning.

The only two existing studies using both AR and argumentation (Jan, 2009; Squire & Jan, 2007) focus on environmental education and use location-based augmented reality games through mobile devices to engage students in scientific argumentation. Studies combining AR and argumentation in astronomy education have not been found in the literature. In the current study, AR was integrated with argumentation in teaching astronomy content.

Studies have revealed that many topics in astronomy are very difficult to learn and that students have incorrect and naive concepts (Yu & Sahami, 2007). Many topics include three-dimensional (3D) spatial relationships between astronomical objects (Aktamış & Arıcı, 2013; Yu & Sahami, 2007). However, most of the traditional teaching materials used in astronomy education are two-dimensional (Aktamış & Arıcı, 2013). Teaching astronomy through photographs and 2D animations is not sufficient to understand the difficult and complex concepts of astronomy (Chen et al., 2007). Static visualization tools such as texts, photographs, and 3D models do not change over time and do not have continuous movement, while dynamic visualization tools such as videos or animations show continuous movement and change over time (Schnotz & Lowe, 2008). However, animation is the presentation of images on a computer screen (Rieber & Kini, 1991), not in the real world, and the users do not have a chance to manipulate the images (Setozaki et al., 2017). As a solution to this shortcoming, using 3D technology in science classes, especially AR technology for abstract concepts, has become a necessity (Sahin & Yilmaz, 2020). By facilitating interaction with real and virtual environment and supporting object manipulation, AR is possible to enhance educational benefits (Billinghurst, 2002). The students are not passive participants while using AR technology. For example, the animated 3D sun and Earth models are moved on a handheld platform that adjusts its orientation in accordance with the student’s point of view in Shelton’s study (2002). They found that the ability of students to manage “how” and “when” they are allowed to manipulate virtual 3D objects has a direct impact on learning complex spatial phenomena. Experimental results show that compared with traditional video teaching, AR multimedia video teaching method significantly improves students’ learning (Chen et al., 2022).

This study, which integrates argumentation with new striking technology “AR” in astronomy education, clarifies the relationship between them and examines variables such as critical thinking skills and argumentation abilities that are essential in the era we live, making this research important.

2.3 Research Questions

The purpose of this study was to identify the change in critical thinking skills and argumentation abilities through augmented reality–based argumentation activities in teaching astronomy content. The following research questions guided this study:

  • RQ1: How do the critical thinking skills of students who participated in both augmented reality and argumentation activities on astronomy change during the study?

  • RQ2: How do the argumentation abilities of students who participated in both augmented reality and argumentation activities on astronomy change during the study?

3 Method

In this case study, we investigated the change of critical thinking skills and argumentation abilities of middle school students. Before the main intervention, a pilot study was conducted to observe the effectiveness of the prepared lesson plans in practice and to identify the problems in the implementation process. The pilot study was recorded with a camera. The camera recordings were watched by the researcher, and the difficulties in the implementation process were identified. In the main intervention, preventions were taken to overcome these difficulties. Table 1 illustrates that the problems encountered during the pilot study and the preventions taken to eliminate these problems.

Table 1 The solutions to the problems in the pilot study

During the main intervention, qualitative data were collected through observations and audio recordings to determine the change in the critical thinking skills and argumentation abilities of students who participated in both augmented reality and argumentation activities on astronomy.

3.1 Context and Participants

The participants consisted of 79 7th middle school students aged between 12 and 13 from a private school in Southern Turkey. The participants were determined as students in a private school where tablet computers are available for each student and the school willing to participate in the study. Twenty-six students, including 17 females and 9 males, participated in the study. The students’ parents signed the consent forms (whether participating or refusing participation in the study). The researcher informed them about the purpose of the study, instructional process, and ethical principles that directed the study. The teachers and school principals were informed that the preliminary and detailed conclusions of the study will be shared with them. The first researcher conducted the lessons in all groups because when the study was conducted, the use of augmented reality technology in education was very new. Also, the science teachers had inadequate knowledge and experience about augmented reality applications. Before the study, the researcher attended the classes with the teacher and made observations to help students become accustomed to the presence of the researcher in the classroom. This prolonged engagement increased the reliability of the implementation of instructions and data collection (Guba & Lincoln, 1989).

3.2 Instructional Activities

The 3-week, 19-h intervention process, which was based on the prepared lesson plan, was conducted. The students participated in the learning process that included both augmented reality and argumentation activities about astronomy.

3.2.1 Augmented Reality Activities

Free applications such as Star Chart, Sky View Free, Aurasma, Junaio, Augment, and i Solar System were used with students’ tablet computers in augmented reality instructions. Tablet computers were provided by the school administration from their stock. Videos, simulations, and 3D visuals generated by applications were used as “overlays.” In addition, pictures, photographs, colored areas in the worksheets, and students’ textbooks were used as “trigger images.” Students had the opportunity to interact with and manipulate these videos, simulations, and 3D visuals while using the applications. With applications such as Sky View Free and Star Chart, students were provided with the resources to make sky observations.

A detailed description of the activities used in augmented reality is given in Appendix Table 8.

3.2.2 Argumentation Activities

Before the instruction, the students were divided into six groups by the teacher, paying attention to heterogeneity in terms of gender and academic achievement. After small group discussions, the students participated in whole-class discussions. Competing theories cartoons, tables of statements, constructing an argument, and argument-driven inquiry (ADI) frameworks were used to support argumentation in the learning process. Argument-driven inquiry consists of eight steps including the following: identification of the task, the generation and analysis of data, the production of a tentative argument, an argumentation session, an investigation report, a double-blind peer review, revision of the report, and explicit and reflective discussion (Sampson & Gleim, 2009; Sampson et al., 2011).

A detailed description of the activities used in argumentation is given in Appendix Table 9.

4 Data Collection

The data were collected through unstructured and participant observations (Maykut & Morehouse, 1994; Patton, 2002). The instructional intervention was recorded with a video camera, and the students’ argumentation processes were also recorded with a voice recorder.

Since all students spoke at the same time during group discussions, the observation records were insufficient to understand the student talks. To determine what each student in the group said during the argumentation process, a voice recorder was placed in the middle of the group table, and a voice recording was taken throughout the lesson. A total of 2653.99 min of voice recordings were taken in the six groups.

4.1 Data Analysis

The analysis of the data was conducted with inductive and deductive approaches. Before coding, the data were arranged. The critical thinking data were organized by day. The argumentation skills were organized by day and also on the basis of the groups. After generating codes during the inductive analysis of the development of critical thinking skills, a deductive approach was adopted (Patton, 2002). The critical thinking skills dimensions discussed by Ennis (2011) and Ennis (1991) were used to determine the relationship between codes. Ennis (2011) prepared an outline to distinguish critical thinking dispositions and skills by synthesizing of many years of studies. These critical skills that contain abilities that ideal critical thinkers have were used to generate codes from students’ talks. This skills and abilities were given in Appendix Table 10. Then “clarification skills, decision making-supporting skills, inference skills, advanced clarification skills, and other/strategy and techniques skills” discussed by Ennis (1991) and Ennis (2011) were used to determine the categories. The change in the argumentation abilities of the students was analyzed descriptively based on the Toulmin argument model (Toulmin, 1990) using the data obtained from the students’ voice recordings. The argument structures of each group during verbal argumentation were determined by dividing them into components according to the Toulmin model (Toulmin, 1990). The first three items (data, claim, and warrant) in the Toulmin model form the basis of an argument, and the other three items (rebuttal, backing, and qualifier) are subsidiary elements of the argument (Toulmin, 1990).

Some quotations regarding the analysis of the arguments according to the items are given in Appendix Table 11.

Arguments from the whole group were put into stages based on the argumentation-level model developed by Erduran et al. (2004) to examine the changes in each lesson and to make comparisons between the small groups of students. By considering the argument model developed by Toulmin, Erduran et al. (2004) created a five-level framework for the assessment of the quality of argumentation supposing that the quality of the arguments including rebuttals was high. The framework is given in Table 2.

Table 2 The framework for the assessment of the quality of argumentation (Erduran et al., 2004; pp. 928)

4.2 Validity and Reliability

To confirm the accuracy and validity of the analysis, method triangulation, triangulation of data sources, and analyst triangulation were used (Patton, 2002).

For analyst triangulation, the qualitative findings were also analyzed independently by a researcher studying in the field of critical thinking and argumentation, and then these evaluations made by the researchers were compared.

Video and audio recordings of intervention and documents from the activities were used for the triangulation of data sources. In addition, the data were described in detail without interpretation. Additionally, within the reliability and validity efforts, direct quotations were given in the findings. In this sense, for students, codes such as S1, S2, and S3 were used, and the source of data, group number, and relevant date of the conversation were included at the end of the quotations.

In addition, experts studying in the field of critical thinking and argumentation were asked to verify all data and findings. After the process of reflection and discussion with experts, the codes, subcategories, and categories were revised.

For reliability, some of the data randomly selected from the written transcripts of the students’ audio recordings were also coded by a second encoder, and the interrater agreement between the two coders, determined by Cohen’s kappa (Cohen, 1960), was κ = 0.86, which is considered high reliability.

5 Results

5.1 Development of Critical Thinking Ability

The development of critical thinking skills was given separately for the trend drastically changed on the day when the first skills were used by the students. All six dimensions of critical thinking skills were included in students’ dialogs or when there was a decrease in the number of categories of critical thinking skills.

The codes, subcategories, and categories of critical thinking skills that occurred on the first day (dated 11.05) are given in Table 3.

Table 3 The codes, subcategories, and categories of critical thinking skills that occurred on the first day

Clarification skills, inference skills, other/strategy and technical skills, advanced clarification skills, and decision-making/supporting skills occurred on the first day. The students mostly used decision-making/supporting skills (f = 55). Under the decision-making/supporting skills category, students mostly explained observation data (f = 37). S7, S1, and S20 stated the data they presented about their observations with the Star Chart and Sky View applications as follows:

S7: Venus is such a yellowish reddish colour.

S1: What was the colour? Red and big. The moon’s color is white.

S20: Not white here.

S1: How?

S20: It’s not white here. (Audio Recordings (AuR), Group 2 / 11.05).

Additionally, S19 mentioned the observation data with the words “I searched Saturn. It is bright. It does not vibrate. It is yellow and it’s large.” (AuR, Group 2 / 11.05).

Decision-making/supporting skills were followed by inference (f = 17), clarification (f = 13), advanced clarification (f = 5), and skills and other/strategy technical skills (f = 1).

In Table 4, the categories, subcategories, and codes for critical thinking skills that occurred on the fifth day (dated 18.05) are presented.

Table 4 The categories, subcategories, and codes for critical thinking skills that occurred on the fifth day

It was observed for the first time on the fifth day that all six dimensions of critical thinking skills were included in students’ dialogs. These are, according to the frequency of use, inference (f = 152), decision-making/support (f = 116), clarification (f = 43), advanced clarification (f = 8), other/strategy and technique (f = 3), and suppositional thinking and integrational (f = 2) skills.

On this date, judging the credibility of the source from decision-making/supporting skills (f = 1) was the skill used for the first time.

Unlike other days, for the first time, a student tried to prove his thoughts with an analogy in advanced clarification skills. An exemplary dialogue to this finding is as follows:

S19: Even the Moon remains constant, we will see different faces of the moon because the Earth revolves around its axis.

S6: I also say that it turns at the same speed. So, for example, when this house turns like this while we return in the same way, we always see the same face. (AuR, 18.05, Group 2).

Here, S6 tried to explain to his friend that they always see the same face of the moon by comparing how they see the same face of the house.

In Table 5, the categories, subcategories, and codes for critical thinking skills that occurred on the sixth day (dated 21.05) are included.

Table 5 The categories, subcategories, and codes for critical thinking skills that occurred on the sixth day

There is a decrease in the number of categories of critical thinking skills. It was determined that the students used mostly inference skills in three categories (f = 38). Additionally, students used decision-making/support (f = 34) and clarification (f = 9) skills. In inference skills, it is seen that students often make claims (f = 33) and rarely infer from the available data (f = 4).

Among the decision-making/support skills, students mostly used the skill to give reasons (f = 28). S24 accepted herself as Uranus during the activity, and she gave reason to make Saturn as an enemy like that: “No, Saturn would be my enemy too. Its ring is more distinctive, it can be seen from the Earth, its ring is more beautiful than me.” (AuR, 21.05, Group 3/).

The categories, subcategories, and codes for critical thinking skills that occurred on the ninth day (dated 28.05) are presented in Table 6.

Table 6 The categories, subcategories, and codes for critical thinking skills that occurred on the ninth day

In the course of the day dated 28.05, six categories of critical thinking skills were observed: clarification, inference, other/strategy and technique, advanced clarification, decision-making/support, suppositional thinking and integration skills. Furthermore, the subcategories under these categories are also very diverse.

There are 10 subcategories under clarification skills (f = 57), which are the most commonly used skills. The frequency of using these skills is as follows: asking his friend about his opinion (f = 15), asking questions to clarify the situation (f = 12), explaining his statement (f = 10), summarizing the solutions of other groups (f = 7), asking for a detailed explanation (f = 4), summarizing the idea (f = 3), explaining the solution proposal (f = 2), asking for a reason (f = 2), focusing on the question (f = 1), and asking what the tools used in experiment do (f = 1) skills. Explaining the solution proposal, asking what the tools used in the experiment do, and focusing on the question are the first skills used by the students.

When the qualitative findings regarding the critical thinking skills of the students were examined as a whole, it was determined that there was an improvement in the students’ critical thinking skills dimensions in the lessons held in the first 5 days (between 11.05 and 18.05). There was a decrease in the number of critical thinking skills dimensions in the middle of the intervention (21.05). However, after this date, there was an increase again in the number of critical thinking skills dimensions; and on the last day of the intervention, all the critical thinking skills dimensions were used by the students. In addition, it was determined that the skills found under these dimensions showed great variety at this date. Only in the middle (18.05) and on the last day (28.05) of the intervention did students use the skills in the six dimensions of critical thinking.

It was determined that students used mostly decision-making/support, inference, and clarification skills. According to the days, it was determined that the students mostly used inference skills (12.05, 15.05, 18.05, and 21.05) among these skills.

5.2 The Argumentation Abilities of the Students

5.2.1 Argument Structures in Students’ Verbal Argumentation Activities

Instead of the argument structures of all groups, only an example of one group is presented because of including both basic and subsidiary items in the Toulmin argument model. In Table 7, the argument structures in the verbal argumentation activities of the fourth group of students are presented due to the use of the “rebuttal” item.

Table 7 The argument structures in the verbal argumentation activities of the fourth group of students

When the argument structures in the verbal argumentation process of the six groups were examined, it was found that all groups engaged in the argumentation and produced arguments. In the activities, students mostly made claims. This was followed by data and warrant items. In the “the phases of the moon” activity, it was determined that only the second and fourth groups used rebuttal and the other groups did not.

The number of rebuttals used by the groups is lower in “the planets-table of statements” activity than in other activities. The rebuttals used are also weak. The use of rebuttals differs in the “who is right?” and “urgent solution to space pollution” activities. The number of rebuttal students used in these activities is higher than that in the other activities. The quality rebuttals are also higher.

When the structure of the warrants is examined, there are more unscientific warrants in the “urgent solution to space pollution” and “who is right” activities, while the correct scientific and partially correct scientific warrants were more frequently used in the “the phases of the moon” and “the planets table of statements” activities.

When the models related to the argument structures are examined in general, it was found that there is a decrease in the type of items used and the number of uses in the “the phases of the moon” and “the planets-table of statements” activities rather than the “urgent solution to space pollution” and “who is right” activities.

When the results were analyzed in terms of groups, it was determined that the argument structures of the second and fourth groups showed more variety than those of the other groups.

5.2.2 The Change of Argumentation Levels

The argumentation levels achieved by six groups created in the “who is right,” “ the planets-table of statements,” “phases of the moon,” and “urgent solution to space pollution” activities are shown in Fig. 2.

Fig. 2
figure 2

A characterization of the components of critical thinking (Jiménez-Aleixandre & Puig, 2012, p. 6)

In the first verbal argumentation activity, “who is right?,” the arguments achieved by the five of the six groups were at level 5. Additionally, the arguments achieved by one group, which was group 6, were at level 4.

In the second verbal argumentation activity “table of statements,” a decrease was determined at the levels of the argumentation of the other groups except group 1 and group 3. In the “the phases of the moon” activity, there was a decrease at the level of argumentation achieved by the other groups except for group 2 and group 4. In the last argumentation activity, “urgent solution to space pollution,” it was found that the arguments of all groups were at level 5.

6 Conclusions and Discussion

The critical thinking skills of the students developed until the middle of the intervention, and the frequency of using critical thinking skills varied after the middle of the intervention. When the activities in the lessons were examined, on the days when critical thinking skills were frequently used, activities including argumentation methods were performed. Based on this situation, it could be revealed that the frequency of using critical thinking skills by students varies according to the use of the argumentation method.

Argumentation is defined as the process of making claims about a scientific subject, supporting them with data, providing reasons for proof, and criticizing, rebutting, and evaluating an idea (Toulmin, 1990). According to the definition of argumentation, these processes are also in the subdimensions of critical thinking skills. The ability to provide reasons for critical thinking skills in decision-making/supporting skills is the equivalent of providing reasons for proof in the argumentation process using warrants in the Toulmin argument model. Different types of claims under inference skills are related to making claims in the argumentation process, and rejecting a judgment is related to rebutting an idea in the argumentation process. In this context, the argumentation method is thought to contribute to the development of critical thinking skills within AR.

Another qualitative finding reached in the study is that the skills most used in the subdimensions differ according to the days. This can be explained by the different types of activities performed in each lesson. For example, on the day when the ability to explain observation data was used the most, students observed the sky, constellations, and galaxies with the Star Chart or Sky View applications or observed the planets with the i-Solar System application, and they presented the data they obtained during these observations.

Regarding the verbal argumentation structure of the groups, the findings imply that all groups engaged in argumentation and produced arguments. This finding presented evidence with qualitative data to further verify Squire & Jan’s (2007) research conducted with primary, middle, and high school students to investigate the potential of a location-based AR game in environmental science concluding that all groups engaged in argumentation. Similarly, Jan (2009) investigated the experience of three middle school students and their argumentative discourse on environmental education using a location-based AR game, and it was found that all students participated in argumentation and produced arguments.

Another finding in the current study was that students mostly made claims in the activities. This situation can be interpreted as students being strong in expressing their opinions. Similar findings are found in the literature (Author, 20xxa; Cavagnetto et al., 2010; Erduran et al., 2004; Novak & Treagust, 2017). In addition, it was concluded that the students failed to use warrants and data, they could not support their claims with the data, and they did not use “rebuttal” in these studies. However, in this study in which both augmented reality applications and argumentation methods were used, students mostly made contradictory claims and used data and warrants in their arguments. This situation can be interpreted as students being strong in defending their opinions. Additionally, although it was stated in many of the studies that students’ argumentation levels were generally at level 1 or level 2 (Erdogan et al., 2017; Erduran et al., 2004; Venville & Dawson, 2010; Zohar & Nemet, 2002), it was found that most of the students’ arguments were at level 4 and level 5 in the current study. Arguments are considered to be high quality in line with the existence of rebuttals, and discussions involving rebuttals are characterized as having a high level of argumentation (Aufschnaiter et al., 2008; Erduran et al., 2004). Students used rebuttals in their arguments, and their arguments were at high levels, which indicates that students could produce quality arguments. The reason for these findings to differ from those of other studies may be due to the augmented reality technology used in the current study. Enriched representations make it easier to see the structure of arguments (Akpınar et al., 2014), helping students to improve their awareness, increase the number of words they use and comments they make (Erkens & Janssen, 2006), and provide important information about the subject (Clark et al., 2007). By observing enriched representations, students collect evidence for argumentation (Clark & Sampson, 2008) and explore different points of view to support their claim (Oestermeier & Hesse, 2000). AR technology, which includes enriched representations, may have increased the accessibility of rich data to support students’ arguments; and using these data has helped them to support their arguments and enabled them to discover different perspectives. For example, S4 explained that the statement in the table is incorrect because she observed Uranus, Jupiter, and Neptune having rings around them in the application “I-solar system” as Uranus. She used the data obtained in the AR application to support her claim.

When the models related to the argument structures are examined in general, it was concluded that the type of items, the number of items, and the rebuttals used in scientific activities were less than those in the activities involving socioscientific issues. The rebuttals used were also weak. There are also findings in the literature that producing arguments on scientific issues is more difficult than producing arguments on socioscientific issues (Osborne et al., 2004).

When the structure of the warrants in the students’ arguments was examined, it was seen that there are more nonscientific warrants in socioscientific activities, and the scientific and partially scientific warrants are more in the activities that contain scientific subjects. This shows that students were unable to combine what they have learned in science with socioscientific issues. Albe (2008) and Kolsto (2001) stated that scientific knowledge is very low in students’ arguments on socioscientific issues. Similarly, the results of the studies conducted in the related literature support this view (Demircioglu & Ucar, 2014; Sadler & Donnelly, 2006; Wu & Tsai, 2007).

When the argument structures in the activities are analyzed by groups, the argument structures of the two groups vary more than the other groups, and the argumentation levels of these groups are at level 4 and level 5. This might be because some students have different prior knowledge about subjects. Different studies have also indicated that content knowledge plays an important role in the quality of students’ arguments (Acar, 2008; Aufschnaiter et al., 2008; Clark & Sampson, 2008; Cross et al., 2008; Sampson & Clark, 2011). In many studies, it has been emphasized that the most important thing affecting the choice and process of knowledge is previous information (Stark et al., 2009). To better understand how previous information affects argumentation quality in astronomy education, investigating the relationship between middle school students’ content knowledge and argumentation quality could be a direction of future research.

7 Limitations and Future Research

There are some limitations in this study. First, this study was implemented in a private school. Therefore, the results are true for these students. Future research is necessary to be performed with the students in public schools. Second, the researcher conducted the lessons because the science teacher had no ability to design AR learning practices. Teachers and students creating their own AR experiences is an important way to bring the learning outcomes of AR available to a wider audience (Romano et al., 2020). Further research can be conducted in which the science teacher of the class is the instructor. Another limitation of the study is that the instruction with AR-based argumentation was time-consuming, and the time allocated for the “Solar System and Beyond” unit in the curriculum was not sufficient for the implementation, because students tried to understand to use AR applications, and they needed time to reflect on the activities despite prior training on AR before the instructional process. This situation may cause cognitive overload (Alalwan et al., 2020). The adoption and implementation of educational technologies are more difficult and time-consuming than other methods (Parker & Heywood, 1998). A longer period is needed to prepare student-centered and technology-supported activities.