1 Introduction

The growth in the number of educational policies that promote digital skills facilitates the application of increasingly varied teaching materials. The DigCompEdu Framework (European Commission. Joint Research Centre et al., 2017) encourages the use of digital media as a tool for innovation and improvement in education and training. In response to these educational policies, new textbooks incorporate more digital resources, many of which are multimedia documents. This may suggest that digital documents, whether single or multimedia, are helpful for learning.

Although the use of instructional videos is not new (see de Koning et al., 2018; Mayer et al., 2020), their popularity has grown since the early 2000s with the advent of massive open online courses (Oh et al., 2020; Yousef et al., 2014). In addition, the emergence of easy-to-use tools for editing and sharing videos has rendered this activity more accessible (Fiorella, 2021), thus facilitating the integration of videos into teaching sequences. Simultaneously, research into the use of instructional videos in formal teaching also increased significantly during the first decade of the twenty-first century (Alpert & Hodkinson, 2019; Giannakos, 2013).

From the learner’s perspective, engaging with digital tools, including videos, requires new skills and can complicate the learning process (Tricot & Chesné, 2020). In recent decades, researchers have investigated the effects of targeted design elements to allow students to benefit from the advantages of multimedia documents without generating a cognitive overload on working memory. Several elements, such as segmentation (Rey et al., 2019), signaling (Alpizar et al., 2020; Richter et al., 2016; Schneider et al., 2018), or the presence of generative activities (Fiorella & Mayer, 2021; Wilhelm-Chapin & Koszalka, 2016), have been identified as facilitators of the learning process. Research in the field proposes a set of 15 design principles to support learning from multimedia material. These principles are part of the cognitive theory of multimedia learning (CTML) (Mayer, 2001, 2009, 2020). The CTML is particularly suitable for videos since learners must simultaneously process auditory information (i.e., sounds, narration) and visual information (i.e., pictures, words) that come from two different media.

1.1 Instructional Video

Instructional videos can be defined in a number of ways. According to Ibrahim et al. (2012), videos are a means of presenting information in the form of a dynamic flow of visual and auditory content. Laduron and Rappe (2019) specify the modality of presentation and refer to the instructional videos as “videograms” because they are first recorded and then broadcast (which distinguishes them from videoconferencing or virtual reality, for example). Tricot and Chesné (2020) differentiate videos from animations. According to them, the word animation is used to designate virtual, schematized, and abstract representations, whereas the word video is used to designate more realistic representations based on the capture of reality. Fiorella (2021) identifies elements that clarify the role of the instructor: “Instructional videos are dynamic audiovisual presentations in which an instructor delivers oral explanations while presenting corresponding visual information (e.g., graphs, diagrams, animations, or models) on the screen” (p. 487). Finally, de Koning et al. (2018) define instructional videos according to viewing objectives: “Instructional videos differ from videos watched for entertainment in that they have the objective to help someone learn about specific concepts or procedures” (p.395).

Although these definitions cover multiple aspects of instructional videos, they do not include all the video resources available to teachers. For this reason, the term instructional video used in the present study includes not only the types of videos defined above but also moving images such as films, documentaries, tutorials, diagrams, modeling…

1.2 Cognitive Theory of Multimedia Learning

Research on multimedia learning (for a review see, Çeken & Taşkın, 2022; Noetel et al., 2022) has focused specifically on instructional material, arguing that it has to align with the way in which the human brain works for learning to occur. For this reason, the CTML (Mayer, 2020) is built on three assumptions based on cognitive science: (a) information is processed simultaneously through an auditory and a visual channel (Baddeley, 2000; Clark & Paivio, 1991) (b) working memory has a limited capacity (Baddeley, 2012; Baddeley & Hitch, 1974), and (c) generative learning is essential, because “comprehension and understanding result from the processes of generating relations both among concepts and between experience or prior learning and new information” (Wittrock, 1992, p. 532).

Each assumption provides specific insights into the functioning of the human brain, and particularly the role of working memory, which is central to the information processing system and therefore to learning. Understanding how the human brain processes new knowledge is essential to provide instructional material that promotes meaningful learning (Ausubel, 2000).

1.2.1 Multimedia Design Principles

According to the CTML (Mayer, 2020) and the cognitive load theory (Paas & Sweller, 2021; Sweller, 1994), it is essential that multimedia materials incorporate elements that support the process that leads to meaningful learning and avoids overloading learners’ working memory. As Paas and Sweller (2021) state:

The aim of instruction should be to reduce extraneous cognitive load caused by inappropriate instructional procedures. Reducing extraneous cognitive load frees up working memory capacity and so may allow for an increase in the working resources devoted to intrinsic cognitive load, resulting in an increase in germane cognitive load.

Thus, the CTML includes a series of multimedia design principles based on research evidence to facilitate and support learners’ cognitive processes when learning with multimedia material. All these design principles are based on a fundamental principle, namely the multimedia principle, which relates to the presentation of information in the form of images and text. Doing so allows for dual coding of information and supports the creation of a mental representation. In addition to this general principle, there are 14 other design principles, which are divided into three groups according to their impact on cognitive processes: reducing extraneous processing, managing essential processing, and fostering generative processing (Fiorella & Mayer, 2021; Mayer, 2020).

The first group of principles aims to reduce extraneous processing overload, where the unnecessary elements mobilize an excessive amount of cognitive resources. Consequently, there is not enough cognitive capacity to engage in essential or generative processing. This group is composed of five principles: the coherence principle, the signaling principle, the redundancy principle, the spatial contiguity principle, and the temporal contiguity principle.

The second group of principles focuses attention on the essential elements by highlighting the relevant and useful learning elements. The segmentation principle, the pre-learning principle, and the modality principle focus on avoiding overloading working memory.

The third group covers design principles intended to engage the learner in meaningful learning. The principles aid the learner in making sense of and organizing information into a coherent mental model that integrates new and prior knowledge. The group consists of the personalization principle, the voice principle, the image principle, the embodiment principle, the immersion principle, and the generative activity principle.

1.2.2 Boundary Conditions

Research on multimedia principles sometimes leads to contradictory, or at least nuanced, conclusions. According to Mayer (2020), this is due to the boundary conditions that underline the effectiveness of the principles, which depend on the individual characteristics of the learner as well as the content and context of learning.

Individual characteristics vary because each learner has different cognitive abilities, personal motivation, individual learning strategies, and heterogeneous prior knowledge. All these elements influence the learning process, which requires the selection, organization, and integration of relevant information. Therefore, the same multimedia material can lead to different results depending on the learner. For example, it would be more relevant to show a system-paced video to learners with limited prior knowledge than to more experimented learners (Biard et al., 2017). This type of segmentation would highlight the important steps of a procedure and reduce the cognitive load since the learners would not have to use part of their cognitive resources to manage the appropriate video breaks (Spanjers et al., 2012). Furthermore, a video could be successfully segmented by learners with good prior knowledge (user-paced), as it would be easier for them to identify the important steps and interrupt the video accordingly in line with their processing needs (Rey et al., 2019). This difference in learning outcomes is known as the expertise reversal principle (Kalyuga, 2014, 2021; Sweller et al., 2011), where “[…] design principles that are effective for novice learners may not be effective or even hinder learning for more knowledgeable learners” (Kalyuga, 2014, p. 576).

The complexity of learning content depends on the specificities of the discipline and the type of multimedia material (Bétrancourt & Benetos, 2018; Mayer, 2020). For example, the signaling effect is found to be more beneficial when learning content is complex (Mayer, 2020; Désiron et al. 2021) and the modality principle seems to be more beneficial when the pace of animation is fast (Ginns, 2005; Mayer, 2020), in which case the processing of verbal information in the auditory channel reduces the load of the visual channel with written text. Furthermore, according to Bétrancourt and Benetos (2018), videos that present procedures do not involve the same learning processes as those that present concepts. The authors thus conclude that, depending on the learning content, the multimedia design principles will not have the same impact.

The effectiveness of multimedia design principles also depends on the learning context and the way in which the multimedia material is presented. For example, the teacher’s involvement during the video screening or the setting in which it is displayed could moderate the effect of the design principles. However, such an effect cannot always be replicated (Tabbers & van der Spoel, 2011). Similarly, the involvement of the teacher in organizing information also impacts the learning outcome. Novice learners would benefit more from elements highlighted by teachers than from those embedded in videos (Castro-Alonso, de Koning et al., 2021; Castro-Alonso, Wong et al., 2021). Finally, the setting in which research is conducted (in the laboratory or in the classroom) and the timing of knowledge assessment (immediately afterwards or at a later time) are also elements that could influence the results related to the effect of the multimedia design principles (Mayer, 2020).

Examining the boundary conditions can aid in better defining when and under which conditions a principle has a favorable impact on learning.

2 Research Questions and Methods

Numerous experimental studies have investigated the effect of manipulating multimedia principles on learning and engagement and, to a lesser extent, of boundary conditions. However, with the exception of a few studies (Guo et al., 2014; Oh et al., 2020), there is a gap in the research regarding if and which of these design principles are implemented in authentic instructional videos. Therefore, the objective of the present qualitative study is to review authentic instructional videos to determine whether the material used in the classrooms follows multimedia design principles to support the learning processes. Gaining insights on the design of authentic instructional video through the scope of theoretically driven and empirically tested multimedia design principles will provide insights on how wide the research-practice gap is in this field.

2.1 Research Questions and Objectives

Through the use of an exploratory method (Swedberg, 2020), this qualitative study sought to determine whether instructional videos used in authentic contexts by secondary school teachers follow the multimedia principles recommended by the CTML. Therefore, the present study investigated whether videos screened in class follow multimedia design principles, regardless of the subject and the learning content.

As the principles have been categorized into three groups by Mayer (2020), in line with the different cognitive loads targeted, we analyzed the presence of the principles in the video according to these categories. In line with the boundary conditions presented with the design principles (Mayer, 2020), we also collected data on the context of the screening to determine whether teachers added principles to the videos. Furthermore, as previous research has shown that the combination of the principles of segmentation, signaling, and coherence leads to higher learning outcomes (Ibrahim et al., 2012), this group of principles, also known by the abbreviation SSW (segmenting, signaling, weeding), was analyzed separately.

2.2 Method

2.2.1 Sampling

In research on multimedia principles, the majority of videos analyzed explain procedural or conceptual content (Bétrancourt & Benetos, 2018) and are often limited to science or technology subjects (Çeken & Taşkın, 2022). However, in an authentic context, videos are also screened in other school subjects. As the aim of the present study is to provide a broad perspective on the use of multimedia principles in an authentic context three categories of content, based on Bloom’s revised taxonomy (Anderson & Krathwohl, 2001) were selected: explaining a procedure, explaining an event, and explaining a concept. Therefore, to ensure that these three categories were covered, two disciplines, namely history and science, were chosen, based on the following hypothesis: the videos used in science mainly pertain to procedures, whereas the videos used in history mainly explain events. The third category, the explanation of concepts, is likely to be found in both subjects.

To ensure that the videos had been used in class and that there would be sufficient material for a representative sample, the teachers were asked to only share videos that their students had watched during the 18 months preceding the study.

The videos, which are the main unit of analysis in this study, were collected with the use of the snowball sampling method (Parker et al., 2019). An email request to participate in the study was sent to eligible teachers, who then forwarded it to other teachers who also met the criteria. In turn, these teachers forwarded the email to more teachers who also met the criteria. In total, 12 teachers agreed to participate. An email with an individual link to the video submission form online platform (a Lime Survey questionnaire hosted on servers of the second author’s university) was sent to each of the 12 teachers. At the conclusion of the process, eight teachers had completed the form and provided valid data, and four had withdrawn from participation. Ninety-one videos were shared by the eight teachers, but 13 were excluded because they were not in the requested video format (e.g., a slide show), were unavailable, or were not instructional videos (e.g., the movie: Modern Times). The final sample for the analysis numbered 78 videos. All data and videos were collected over a period of 12 days.

2.2.2 Material and Data Collection

In addition to the video, two different types of data were collected: (a) metadata regarding the videos and their use in class and (b) demographic information regarding the participating teachers, their acceptance of the technologies, and their alignment with multimedia design principles and processing assumptions. All data were collected through an online form hosted on Lime Survey, where participating teachers uploaded the videos.

Video metadata The teachers were asked a series of five questions with each video upload. In line with the instructor-managed strategies developed by Castro-Alonso, de Koning  et al., (2021), two questions targeted multimedia principles added by teachers during screening (e.g., pausing the screening which would be considered a form of teacher-paced segmentation or insertion of generative activities). One question targeted the pre-training principle (At which point in your teaching sequence did you use the video?). Another question focused on boundary conditions asking about learners’ academic level (pre-vocational track or pre-general track). The fifth question aimed to identify the source of the video (e.g., official platform, teacher-made, found on the internet) for descriptive reasons.

Teachers’ demographic data The teachers’ acceptance of technology was measured with a version of the technology acceptance model (TAM) translated and adapted from Shen et al. (2019), with the addition of the technology use component (Scherer et al., 2019), which includes five components: perceived utility (four items), perceived ease of use (four items), attitude toward use (four items), behavioral intention (three items), use (four items). Finally, the teachers’ alignment with multimedia principles was measured with a questionnaire developed by Désiron and Petko. (2022) and translated by the author. The final part of the questionnaire focuses on demographic data (gender, age, school, teaching subjects, and number of years of teaching). This questionnaire was pre-tested by four teachers who did not participate in this study.

The primary unit of analysis in this study is the videos. The teachers’ demographic data are used only to describe the video providers.

2.2.3 Data Analyses

The videos were analyzed using a codebook (see Appendix 1) developed by the authors and based on the 15 multimedia principles of the CTML. Some of the principles, such as signaling, were further divided to provide a more fine-grained picture of their implementation in the videos (e.g., verbal signaling, visual gestural signaling, and visual graphic signaling). In addition, three other codes were included: video length (± 6 min) (Brame, 2016; Guo et al., 2014; Van Der Meij & Hopfner, 2022) camera viewpoints (1st or 3rd person) (Fiorella & Mayer, 2018), and video learning content (explanation of an event, a procedure, or a concept). A total of 23 deductive codes (Skjott Linneberg & Korsgaard, 2019) were used to analyze the videos.

During the coding process, additional codes were added to the coding scheme, following an inductive method (Skjott Linneberg & Korsgaard, 2019). Eight codes were related to the gross transgression of the multimedia principles in certain videos. We defined a transgression as an element that contradicts a multimedia principle (e.g., transgression of temporal contiguity coded when the text is not presented simultaneously with the visuals). The absence of a principle was coded when neither a presence nor a transgression was identified. Therefore, while a principle could be present or absent, a video could both follow the principle and transgress against it.

In the absence of consensus on the classification of videos (e.g., Bétrancourt & Benetos, 2018; Imhof et al., 2009; Laduron & Rappe, 2019; Mayer et al., 2020), the classification used in this research was established with the use of a deductive-inductive method inspired by Alpert and Hodkinson (2019) and based on commonly used definitions, which provided five codes related to the content: demonstrations, documentaries, science or historical popularization, reports, and modeling. At the conclusion of this process, the resulting codebook included a total of 13 inductive codes.

The codebook was validated with the use of Atlas.ti (version 22.0.2) through an iterative process of individual coding (by the first and second authors) on 14.11% of the videos (n = 11), a comparison of results, and discussions on the interpretation of certain codes, until a level of “inter-coder agreement” greater than 90% was obtained. The remaining videos were then independently coded by the first or second author (58.97% and 26.92%, respectively).

3 Results

Of the final sample, which consisted of a total of 78 videos, 29 were shown in history classes and 49 in science classes. None were produced by the teachers. One was recorded by students during a science course and used by the teacher as course material for other classes. Two were filmed in the first-person perspective (Boucheix et al., 2018).

Descriptive data from the qualitative coding were computed with Jamovi software (2.3.0.0) for quantitative analyses regarding the presence of multimedia design principles in the videos. An additional analysis of teacher profiles was also conducted.

3.1 Teachers’ Demographic Data

Half the participating teachers taught history (n = 4), and the other half taught sciences at the lower secondary school level. The teachers practiced in both the pre-general and pre-vocational tracks, where the teaching curricula overlap considerably; thus, they reported that the same video was often shown to learners in both tracks (42.3%). The teachers had been practicing for between nine and 23 years (M = 17.6 years, SD = 4.96). Based on answers to the TAM questionnaire (see Appendix 2), overall, the participating teachers perceived technology as useful in their practice [3.50; 5.00], used it often [3.67; 5.00], and were willing to use it in the future [3.50; 5.00]. Ease of use showed greater variability between teachers [2.75; 4.50]. Although overall lower, their attitudes toward technology were positive [3.25; 4.00].

Regarding the teachers’ alignment with multimedia principles, all disagreed with the coherence principle and referred explicitly to decorative details, yet half of them agreed with the dual coding and cognitive load theories that form the basis of the CTML (see Appendix 3).

3.2 Videos’ Overall Descriptive

The descriptive statistics showed that the different types of content were not equally represented in the two subjects. Modeling (n = 13) and demonstrations (n = 11) were used only in science classes. Documentaries were shown almost exclusively in history classes (history n = 23, science n = 1). Reports (history n = 1, science n = 1) and popularization programs (history n = 5, science n = 23) were used in both subjects.

Overall, the analyses of the learning content by discipline can be summarized as follows: explanations of procedures were used exclusively for science and explanations of events and explanations of concepts occurred in both teaching subjects, with a marked tendency for the use of one specific type of content in each discipline (see Fig. 1).

Fig. 1
figure 1

Learning content by discipline

The descriptive analyses also showed that few of the historical videos (10.3%) had multiple types of content, while more than a third of the videos in science (36.7%) did. For example, the video explaining the circulation of the blood begins with a historical overview of the development of medicine (event) and then illustrates the different steps involved in blood circulation (procedure).

The analyses of multimedia design principles by subject—including additions by teachers during the screening—showed no overall differences between the frequency of videos on scientific and historical subjects (see Fig. 2). However, multimedia, modality, and voice principles were more prevalent in history and coherence, signaling, and personalization principles were more prevalent in science.

Fig. 2
figure 2

Multimedia design principles by discipline

Cross-analyses of video length conducted according to the 6-min limit recommendation by Brame (2016), showed that the length did not appear to determine the presence of more than one type of learning content. Indeed, 54.4% of videos of 6 min or less have only one type of content, compared with 45.6% of videos longer than 6 min. Fisher’s exact test (p = 0.072) confirms that, in this sample, there was no significant correlation between video length and the presence of several types of learning content.

3.3 Multimedia Principles in Videos

The initial findings showed that all CTML principles, with the exception of the immersion principle specific to virtual reality, appear at least once in the analyzed sample. The multimedia principle was followed in 91% of videos, with 71 videos presenting information in the form of images combined with written or oral text. Seven of the videos did not follow the multimedia principle, as they contained images without any written or oral text. Finally, one video did not follow any multimedia design principles.

The different principles of multimedia design were divided into three cognitive processes: reducing extraneous processing, managing essential processing, and fostering generative processing. According to the CTML, the inclusion of principles from each group is essential to support learning from multimedia documents.

At least one principle from each group was present in 30.8% of the videos analyzed. When including the principles added by the teachers (segmentation and generative activities) when students watched the videos together in class (video screening) in the analysis, this percentage increases to 70.5%. In Sects. 3.3.2 and 3.3.3, we provide detailed analyses when principles added by teachers are explicitly stated.

3.3.1 Reducing Extraneous Processing

According to the CTML, five multimedia design principles aim to reduce extraneous processing: coherence, signaling, redundancy, spatial contiguity, and temporal contiguity. The analyses showed that at least one of these principles was present in 88.5% of the videos. However, the analysis by principle (see Fig. 3) also showed that, while the coherence principle was present in 33.3% of videos, it was clearly transgressed in 59% of the sample. In this subset, 90% involve background music. Overall, it was the principle with the highest rate of transgressions.

Fig. 3
figure 3

Distribution of design principles for reducing extraneous processing and their transgression in videos

The signaling principle was present in 64.1% of the videos. Detailed analysis during the coding process provided a clearer view of the actual implementation of this principle, which was divided into three types of signaling that could be found simultaneously in the same video: verbal signaling (21.8%), visual gesture signaling (19.2%), and graphic visual signaling (51.3%). Transgression of the signaling principle occurred in 2.6% of the videos, where visual gesture or visual graphic signaling did not correspond to the narrative.

The redundancy principle is respected when spoken text is not duplicated in printed form, which was the case in 51.3% of the videos. The transgression of the redundancy principle (5.1%) was coded when printed text was added to the narration, except for specific vocabulary words intended to help the learner (e.g., foreign language).

The spatial contiguity principle was present in 26.9% of the videos. This principle is applied when a written text clarifies an illustration. It does not conflict with the redundancy principle because only essential information is explained, mainly in the form of keywords.

More than half of the videos (64.1%) followed the temporal contiguity principle, and 9% included a transgression of the principle when corresponding words and images were time-lagged.

3.3.2 Managing Essential Processing

Three design principles of the CTML target the management of essential processing: segmenting, pre-training, and modality. This group of principles is the most widespread in the analyzed sample (93.6%). Even segmentation added by teachers during video screening had minimal influence on this result (94.9%) (see Fig. 4).

Fig. 4
figure 4

Distribution of design principles for managing essential processing and their transgression in videos

Based on video and practice data, the segmenting principle was divided into system-paced and teacher-paced segmenting. System-paced segmenting was only found in two videos, while teacher-paced segmenting—which refers to when a video is interrupted during viewing—was implemented for 44.9% of the videos. It is worth noting that the two system-paced videos were also segmented by the teachers. A chi-square test of independence was performed to examine the relationship between teacher-paced segmenting and video length. The relationship between these variables was not significant: X2 (1, N = 78) = 0.033, p = 0.856. Video length did not influence the addition of teacher-paced segmenting.

The pre-training principle was derived from the data collected in the questionnaire since it occurred before viewing. In this case, the principle was followed in 76.9% of the videos in the sample.

The modality principle was found in 67.9% of the videos. The principle was considered absent in 12.9% of the videos; these particular videos only featured a single medium (pictorial). Transgressions were found in 19.2% of the videos. The transgression of the modality principle was coded when the pictorial medium was accompanied by written text without auditory narration.

3.3.3 Fostering Generative Processing

Overall, 78.2% of the videos included at least one of the CTML design principles aiming to foster generative processing, which increased to 87.2% when generative activities added by teachers during the screening were included (Fig. 5).

Fig. 5
figure 5

Distribution of design principles for fostering generative processing and their transgression in videos

The personalization principle was present in 43.6% of the videos. Its transgression due to a formal style appeared in 33.3% of the videos. In 23.1% of the videos, the principle could not be assessed, as there was no auditory narration.

The voice principle was found in 76.9% of videos. However, where it was absent, the absence was due to a lack of auditory narration and not due to the use of a synthesized voice, which would have been coded as a transgression.

The low rate for the embodiment principle (32.1%) is due to the high number of videos without a tutor visible on the screen (64.1%). Transgressions of this principle were found in 3.8% of the videos.

The image principle was found in a single video, which shows an overall alignment with this principle since it requires to be avoided (Mayer, 2020).

The immersion principle, which is inherent to virtual reality (Mayer, 2020) and 3D, was absent from all the videos.

The generative activity principle was present in 10.3% of the videos through the questions asked by the narrator to guide observation. This principle was also added by teachers as part of the learning sequences (44.9%). Generative activities, which may appear simultaneously, were oral questions to students (51.4%), highlighting important elements (85.7%), discussion (74.3%), note-taking by students (22.9%), answering a questionnaire (17.1%), and other activities (42.9%). Overall, in five out of eight videos, both video-integrated and teacher-initiated generative activities were included.

3.3.4 SSW (Segmenting, Signaling, and Weeding)

While the CTML analyzes multimedia principles independently, one from the other, there is a dearth of research on learning with authentic videos that has investigated a combination of several principles (Brame, 2016; Ibrahim et al., 2012). Therefore, we also analyzed the concurrent presence of the segmentation, signaling, and coherence principles.

Combined, these three principles were found in 2.6% of the videos, and this percentage increased to 16.7% when teacher-paced segmentation was included.

3.3.5 Boundary Conditions

To analyze whether the boundary conditions of multimedia design principles related to students’ learning abilities were respected, the videos were divided into three groups based on the questionnaire data: videos used in the pre-vocational track (students with lower learning abilities), in the pre-general track (students with well-developed learning abilities), or in both levels. We conducted cross-tabulated analyses between the schooling tracks of the students and the presence of multimedia principles in the videos. The first analysis shows that 42.3% of videos were shown in both tracks, 28.2% only in the pre-general track, and 29.5% only in the pre-vocational track. Based on these results, further analysis was deemed unnecessary.

To investigate whether the need for prior knowledge for less expert students (Mayer, 2020) was fulfilled, grade-level was cross-referenced with when the videos were screened during the teaching sequence. We assumed that the students had prior knowledge of the topic when the videos were shown in the middle or at the end of the teaching sequence. This analysis showed that, in 82.6% of video screenings, pre-vocational track students had prior knowledge. For videos shown in the pre-general track, the result was 90.9%.

4 Discussion

Research into instructional videos has generally adopted an experimental approach to testing the effectiveness of the design principles derived from the CTML and the cognitive load theory. Most principles were found to benefit learning outcomes and, or reduce cognitive load (Noetel et al., 2022; Rey et al., 2019; Schroeder & Cenkci, 2018; Sundararajan & Adesope, 2020). Although some studies were conducted in authentic settings, the materials used themselves were mainly developed and manipulated for the purpose of the studies. Thus, the transfer to practice remains uncertain. The present study investigated the correspondence between multimedia document design principles – from the CTML, CLT, and empirical results – and authentic videos, screened by teachers in class.

4.1 Correspondence Between the Video Sample and Multimedia Design Principles

Looking at the overall picture, the results appear to be aligned with those of previous research on the use of design principles in authentic contexts, such as MOOCs. In their study, Oh et al. (2020) assessed the presence of design principles in MOOCs. However, a closer examination indicated that the principles are related only to facilitate some specific part of the learning process, which was not sufficient to enhance learning outcomes. Our research shows the same results with approximatively 90% of the analyzed sample following at least one of the multimedia design principles. A closer examination of the principles included by following the three categories defined by Mayer (Mayer, 2001, 2009, 2020) showed a similar picture within categories for principles that reduce extraneous processing (> 80%) or manage essential processing (> 90%), and, to a lesser extent, for principles that foster generative processing (> 70%). However, our qualitative analysis also identified that there was a cumulative presence of at least one principle per category, in only approximately 30% of the videos, although this should theoretically be the aim of instructional videos (Mayer, 2020). Finally, although previous research (Ibrahim et al., 2012; Noetel et al., 2022) has reported that the SSW principles were particularly efficient in supporting learning, we found that these were not often followed in our sample of videos.

We also reported gross transgressions of principles stated negatively (i.e., redundancy, coherence, temporal contiguity, modality and personalization principles) to provide an accurate assessment of the correspondence between videos and the multimedia design principles. Such transgressions were mainly found for the coherence and personalization principles, and, in particular, coherence was more often transgressed than respected.

The results from the present study regarding the correspondence of authentic instructional videos with multimedia design principles provide a good overview of the quality of the videos actually used by teachers of history and sciences. However, more research is required to assess whether the design of such authentic videos is favorable for or detrimental to learning outcomes. In the latter case, further experimental studies with videos could be conducted to improve the correspondence of the videos with the multimedia learning design principles.

4.2 Teacher-Implemented Multimedia Design Principles

Nonetheless, our results also show that teachers can play an active role in implementing multimedia design principles for videos screened in class. Indeed, on the basis of the strategies that instructors can use to optimize instructional materials identified by Castro-Alonso, de Koning et al. (2021), we collected data on how teachers apply the segmentation principle when showing a video in class. In the sample analyzed here, we found that, although only two videos included segmentation, the teachers interrupted the videos (i.e., temporary pausing during screening) in more than 40% of the cases. Interestingly, contingency analyses showed that although one of the purposes of segmenting is to divide long videos into parts to limit cognitive overload, we found that the recommended 6-min video length (Brame, 2016; Guo et al., 2014) did not affect teacher-paced segmenting.

Data provided by teachers to assess whether they followed the pre-training principle showed that this did occur in more than three quarters of the videos. The generative activity principle, however, was not often implemented by teachers and scarcely present in the videos.

Although research on learner-managed multimedia content interaction is a growing field, the fact that videos are viewed not only individually but also in a class setting is not considered. This study provides a first glimpse into teacher-managed multimedia content interaction to encourage learners to learn the intended content. Nevertheless, additional observational studies are required to get a better picture of how teachers actually implement multimedia principles in authentic videos they selected for class screening. For example, future research could investigate whether teachers’ training to use gestures as a communication tool (e.g., Alibali et al., 2013; Wakefield et al., 2018) is transferred to the use of gestures to signal elements during video screening. Furthermore, as software for editing videos is becoming more readily available and ergonomic, teachers might increasingly edit videos. Future research could therefore investigate the implementation of the multimedia principles by teachers both before and during the screening in class. Research pursuing such avenues should, however, also consider teachers’ misconceptions toward multimedia learning. Indeed, recent research by Eitel et al. (2021) has shown that a majority of both in-service and pre-service teachers hold common misconceptions regarding learning styles, hemispheric isolation, and naïve summation. Nonetheless, Prinz et al. (2022) have shown that these misconceptions could possibly be rebutted.

4.3 Limitations and Perspectives

The limitations of this study include the sample itself, with the generalization of the results being limited to the subjects taught by the participating teachers and the targeted grade levels. Although the teachers were involved in different tracks (pre-vocational and pre-general), the curricula for these are similar, so they often used a video for both tracks. Nonetheless, our analysis is based on a particular population and should be repeated with other teaching subjects as well as different schooling levels.

While this study provides an extensive review of the presence of or transgression of multimedia learning design principles in authentic videos, it does not question the validity of said principles in authentic materials. During the coding process, it was sometimes difficult to code with simple yes or no options. For example, the coherence principle is often presented as a negative principle with reference to seductive details (Moreno & Mayer, 2000a; Sundararajan & Adesope, 2020). Moreover, seductive details and the coherence principle are mostly investigated through pictorial or verbal representations, but the role of background music remains unclear. Based on the sample analyzed here, we would strongly recommend that when background music supports learning. Although studies by Lehmann and colleagues () found a positive effect of positive mood-inducing music on learning outcomes, it has not yet been investigated in authentic videos. In the sample of videos analyzed, we noted that music without any link to the content was sometimes used, or, in contrast, sometimes seemed to be used as a means of signaling.

Finally, as mentioned previously, one major focus of this study is the evaluation of the positive or detrimental effect of the adequacy of authentic video with multimedia learning design principles on learning outcomes and cognitive load. Video edited by teachers could also be investigated with the coding scheme developed in this study and the framework from Ring and Brahm (2022). More research is necessary regarding the effect of multimedia learning design principles in authentic video and in authentic contexts.

5 Conclusion

The present research offers a first insight into the potential application of multimedia design principles to instructional videos screened in authentic settings. Although the findings indicate that videos used in the classroom are overall not aligned with the CTML, they also suggest that multimedia design principles are added by teachers during the lesson. The analysis of the teaching scenarios that include videos provides a more comprehensive and detailed understanding of the integration of multimedia design principles in an authentic context. Future research could examine the impact of the principles, whether incorporated in videos or added by teachers, on students’ learning outcomes. Lastly, it would also be valuable to study the effect of group screening inherent to the use of instructional videos in authentic classroom settings.