1 Introduction

Driving automation technology is advancing quickly. It is associated with benefits concerning safety, reliability, and passenger comfort, as well as with reduced economic and environmental costs of mobility (Litman 2020; Schoitsch 2016). It is considered key to a fundamental shift in transportation away from individual mass motorization toward flexible on-demand mobility solutions, for instance, shuttle vehicles (Iclodean et al. 2020). However, highly automated driving (Society of Automotive Engineers level of automation 4; Society of Automotive Engineers 2021) in urban mixed-traffic environments will remain challenging for automated vehicles. At this level of automation, situations may occur that the vehicle's automated driving system cannot handle (Kalisvaart 2021). In the worst case, automated vehicles can cause situations in which goods or even people are harmed. For example, in one incident, automated vehicles operated by the robotaxi company Cruise blocked an ambulance, delaying a patient's urgent transportation to the hospital (New York Times 2023). In another incident, a Cruise vehicle hit a pedestrian and dragged them several meters, inflicting serious injuries (Guardian 2023). In some cases, human operators may be able to free automated vehicles from situations characterized by ambiguity and uncertainty, tackling even unforeseen situations with creativity and ingenuity and thereby helping avoid incidents like those described above. Such operators can be fruitfully included in automated transportation systems consisting of a highly automated vehicle (HAV) and a remote operator (RO).

In remote operation systems, an RO oversees vehicle operations from a control center. The RO overviews and analyzes the traffic situations that automated vehicles encounter and provides guidance to the vehicle automation on how to tackle difficult situations. However, since the interaction between RO and HAV is essentially a shared control problem between human and machine, potentially conflicting decisions between these two actors have to be identified and prevented (Abbink et al. 2018). A helpful approach to discovering conflicting decisions is the heuristic-based CAP method proposed by Vanderhaegen (2021). In a remote operation service, it needs to be clear at any time which actor is responsible for which tasks. To determine the RO's tasks, this paper refers to industry standards and legal frameworks (see Sect. 1.1).

Remote operation is conceivable for any vehicle with high driving automation (Society of Automotive Engineers Level 4 or higher), including shuttle buses, personal vehicles, transport vehicles such as vans and trucks, and larger buses. Remote operation could, therefore, help overcome situations that the automation alone cannot handle, resulting in safer and smoother operations of HAVs. A pivotal component of a safe and smooth HAV remote operation system is the RO's workplace. This paper describes the design of a conceptual prototype of a workplace for remote assistance, a variant of remote operation, and its user evaluation focusing on the central indicators of performance, situation awareness, and workload.

1.1 Workplaces for remote operators

ROs will be a core component of HAV remote operation systems. The human–machine interface (HMI) of the RO workplace is essential for safe, effective, and efficient operations. Remote operation can mainly be implemented in two ways. First, in the remote driving approach, also known as direct or teleoperated driving, the RO executes the dynamic driving task (DDT), including braking, steering, and accelerating, in real time (Society of Automotive Engineers 2021). The input given resembles manual driving and requires the RO's continuous attention. Second, the remote assistance, or indirect, approach is defined as the "event-driven provision, by a remotely located human, of information or advice to [… a] vehicle in driverless operation in order to facilitate trip continuation when the ADS [automated driving system] encounters a situation it cannot manage" (Society of Automotive Engineers 2021, p. 18). The HMI presented and evaluated in this paper aims to enable remote assistance at Level 4 automation (Society of Automotive Engineers 2021). Since remote assistance, unlike remote driving, does not include the execution of the DDT, i.e., the longitudinal and lateral control of the vehicle, the proposed HMI does not enable the remote operator to complete the DDT (Society of Automotive Engineers 2021, p. 18). Instead, the focus is on assisting the HAV in assessing a traffic situation and proposing how to proceed. In accordance with the definition of Level 4, the remote assistant who oversees an HAV does not serve as a fallback for the automation. The HAV must be able to transfer itself into a minimal-risk state, posing the least possible danger to itself, its passengers, and surrounding road users. To date, remote assistance as implemented here is the only permissible form of remote operation of vehicles on public roads in Germany (StVG § 1e, 2021/12.07.2021).

During remote assistance, the RO's main task is processing requests for assistance coming from the supervised HAV (see Fig. 2). According to the German Autonomous Driving Act (StVG § 1e, 2021/12.07.2021), ROs, specified as Technical Supervisors ("Technische Aufsicht"), are responsible for checking and assisting an HAV based on evidence that it requires support ("Evidenzkontrolle"). This means that an RO becomes involved only when the vehicle detects an event that it cannot handle autonomously and thus submits a request for assistance to the RO (StVG § 1e, 2021/12.07.2021). In this case, the HAV must be able to conduct a minimal-risk maneuver (MRM) independently, i.e., bring itself to a halt in a safe manner and at a safe position. The RO can intervene only after the successful completion of the MRM. The RO's intervention must not be time critical, i.e., it does not need to be completed within a specified amount of time. The RO has the following responsibilities: (1) giving clearance to alternative driving maneuvers, (2) deactivating the autonomous driving function, (3) assessing the HAV's signals regarding its functioning and initiating measures for ensuring safety, and (4) getting in contact with the HAV's passengers in the event of an MRM (StVG § 1f). In addition, the RO can propose driving maneuvers themselves if the HAV is unable to do so. The presented user study investigates some of these responsibilities using the proposed HMI for the RO's workplace, including giving clearance to driving maneuvers proposed by the HAV (Scenario 1), suggesting driving maneuvers by specifying waypoints that define a pathway the HAV needs to follow (Scenario 2), and selecting an alternative route (Scenario 3).

Even though German law demands that interventions by the RO must not be time critical, (a) task reaction time, i.e., the time passed from the request's appearance on the RO's workplace HMI to the RO's acceptance of the request, is still considered a key performance indicator: it is essential for efficient operations and therefore relevant for the economically feasible implementation of RO systems. In addition, (b) task completion time, i.e., the time passed from the RO's acceptance of the request to the resolution of the task, indicates how long the RO took to resolve a task.

The literature on workplace HMIs for remote operation is scarce. Following a human-centered design process, Kettwich et al. (2021) designed and evaluated a click prototype for a remote operation workplace HMI. It was tailored to the remote assistance of Society of Automotive Engineers Level 4 shuttle buses from a public transport control center. Apart from this research, although software and hardware solutions for the remote operation of vehicles already exist (e.g., DriveU.auto 2023; Herger 2023; T-Systems 2023; Vay 2022), to the authors' knowledge no systematic research has been conducted in a highly controlled laboratory environment to develop and evaluate a prototypical HMI for HAV remote assistance. Remote assistance here is defined in accordance with Society of Automotive Engineers J3016 as the "event-driven provision, by a remotely located human […], of information or advice to an ADS-equipped vehicle in driverless operation in order to facilitate trip continuation when the ADS encounters a situation it cannot manage." This definition is similar to the task of the Technical Supervisor according to the current German Autonomous Driving Act. In particular, there is a gap in research on workplace HMIs for the remote operation of vehicles in the contexts of public transport, logistics, and individual mobility that are tailored to the needs, expectations, and operation styles of control centers in these areas. Therefore, the goal of this work is the user-centered design of a prototypical workplace HMI for a concrete implementation of remote operation, namely remote assistance, and its evaluation regarding performance, situation awareness, and workload in routine remote assistance tasks. In addition, we want to capture the operators' subjective experience through their ratings of usability, user experience, and acceptance.

1.2 Situation awareness

Similar to a driver, a remote operator (RO) needs to perceive and identify the relevant elements of a traffic situation. They must integrate them into a coherent understanding of the situation and be able to predict how relevant elements will change in the future. These operator tasks can be described by situation awareness (SA). The hierarchical SA model of Endsley (1995) proposes three levels of SA, where a lower level needs to be fulfilled in order to reach a higher one. On SA Level 1, an RO has to perceive characteristics of the traffic environment such as road layout and condition, traffic signs, and other road users. On SA Level 2, the RO has to analyze and integrate these elements in accordance with their goals to "form a holistic picture of the environment, comprehending the significance of objects and events" (Endsley 1995, p. 37). For example, a pedestrian crossing an HAV's lane is relevant to the RO's goal of continuously driving on this lane. On SA Level 3, the RO predicts how the situation will unfold. A result of high SA is that the RO commands the HAV to change lanes in order to avoid the predicted collision.

In a remote setting, it may be difficult to achieve high levels of SA because ROs cannot perceive the elements of the driving situation directly and without delay (SA Level 1) or react immediately to them (based on SA Levels 2 and 3). Also, there is no direct link between an RO and the surrounding traffic environment. Information about the driving situation is sensed via technology, transmitted to the RO's workplace, and displayed to them through the interface. Similarly, the RO's reaction is mediated through data transmission, in-vehicle processes, and execution by actuators, causing delays between operator inputs and vehicle reactions as well as between vehicle actions and the status presented to the RO. Decoupling action, perception, decision, and reaction by inserting intermediate steps of deconstruction, transmission, and reconstruction into the process has important implications: distortions may occur in any of these steps, negatively impacting the RO's SA (Tittle et al. 2002). For instance, Darken et al. (2001) reported that participants performed poorly in spatial orientation and object identification tasks when video feedback was supplied to remote observers. Thus, the HMI design of the RO workplace, concerning the selection of information modes (visual, auditory, etc.) and the way information is displayed to the RO, affects their level of SA (Endsley 1995; Endsley et al. 2003; Hollands et al. 2019). As a result, the RO's workplace needs to ensure high levels of SA.

Specifically, the RO's tasks investigated in this study require the RO to generate and maintain SA on all three levels. In Scenario 1, for example, in order to give clearance to the HAV to conduct the proposed driving maneuver, the RO first needs to recognize the relevant objects in the scenario accurately, including the buildings along the street and the puddle on the road (SA Level 1). Second, the RO needs to integrate the perceived information, i.e., identify the buildings appearing in the puddle as mere reflections rather than actual obstacles (SA Level 2). Third, the RO needs to draw conclusions from the integrated information (SA Level 3). In this case, the RO can conclude that there is no obstacle on the street ahead, so they can give clearance to continue the HAV's ride on the planned pathway.

1.3 Workload

Workload is the experienced difference between required and supplied information processing capability (Hart and Staveland 1988). It is associated with task performance. Therefore, a workplace for remote operation should balance task requirements to avoid overload, which leads to stress, or underload, which is associated with boredom (Wickens 1984). A good overview of the tasks that need to be completed is therefore essential for the RO to balance their workload over time. The proposed HMI fulfills this requirement by presenting every request for assistance in a table with the most vital pieces of information, including its status, i.e., whether it still needs to be accepted, is currently being processed, or is already completed. This view helps the RO grasp the current situation, including the number of open requests and which ones need to be prioritized, and thereby facilitates balancing their workload.

In workplace design, all tasks, be they primary or secondary, need to be considered in workload assessment. For primary tasks, this study uses scenarios that are assumed to cause varying levels of workload: for instance, the task of giving clearance to a maneuver that the HAV proposed itself (Scenario 1) is expected to generate less workload than the task of determining waypoints on a map view (Scenario 2). Secondary tasks pose additional cognitive load on operators (Sweller 1988), thereby increasing perceived workload. These tasks can be (a) directly relevant to fulfilling the primary task, for example, when additional pieces of information need to be gathered from other sources. They can also be (b) indirectly relevant as part of an operator's other responsibilities, e.g., an incoming request for support by an HAV while the operator is already processing another HAV's request. However, they can also be (c) irrelevant to an operator's responsibilities, i.e., distractions. An example of the fatal consequences of being distracted from job-related tasks is the rail disaster of Bad Aibling in Bavaria, Germany: a train controller distracted himself from his rail traffic management task by playing a game on his phone, leading to a collision of two trains on a single-track stretch that killed twelve people (British Broadcasting Corporation 2016).

In Human Factors research, examining the impact of a secondary task on an operator's workload has a long tradition (e.g., Ogden et al. 1979). For an RO's task set, generic cognitive secondary tasks such as the n-back task (Kirchner 1958) can be used as proxies for the cognitive load that might result from additional tasks an RO could have, such as the parallel assistance of several HAVs. The n-back task is widely used in driving-related studies to systematically vary workload (Pfannmüller et al. 2015; Reimer and Mehler 2011; Wu et al. 2019).

Hence, workplaces for ROs should be designed so that primary and secondary tasks do not increase the operator's workload to a degree that severely deteriorates performance. This is especially important as processing multiple tasks at the same time affects both the operators' workload and their SA. In these situations, operators need to keep multiple pieces of information in their working memory, leaving fewer cognitive resources for gaining high levels of SA (cf. Baumann et al. 2008).

To summarize, an HMI for remote operation needs to be designed to enable effective and efficient operations, to balance the RO’s workload, and to ensure their SA. In addition, user-focused variables need to be considered.

1.4 Usability, user experience, and acceptance

The usability a user subjectively experiences is crucial for their smooth interaction with technical systems. Perceived usability is relevant because it determines how well the user is able to access information from the system and interact with it. High subjective usability is achieved when the interaction between user and system is effective, efficient, and satisfying (International Organization for Standardization 2018). User experience is a concept that assesses how satisfied users are when interacting with a system (Hassenzahl 2008; Minge et al. 2017). It is indispensable for developing successful user-centered products (Schrepp et al. 2017a). Finally, user acceptance is imperative for the success of newly introduced technology as it determines whether a new technology will be adopted by its designated user group (van der Laan et al. 1997).

All these concepts are of utmost importance in workplace design as they directly influence efficient, effective, and safe operations. The HMI for remote operation needs to be designed in a way that enables the RO to quickly obtain an overview of the HAVs' requests for assistance, to access all information needed to answer a request, and to enter the advice to the HAV on how to behave in the given situation. The quality, ease, and efficacy of the direct interaction with the HMI are captured by the construct of usability. Further, repeated interaction with the HMI shapes its emotional valence, as represented by user experience. We aimed to achieve this repeated interaction through an extensive training phase and the repetition of trials using a limited set of routine tasks. In order for participants to experience the interaction with the HMI under different levels of perceived workload, we administered a secondary task to simulate additional cognitive load. This paradigm ensures that the measured user experience is also valid in more cognitively challenging situations, capturing a more diverse range of interactions.

1.5 Human–machine interface (HMI)

The structure and components of the HMI for remote assistance strongly resemble the click prototype presented and positively evaluated in Kettwich et al. (2021), which, to the authors' knowledge, is the first workplace HMI for the remote assistance of HAVs in the literature. In particular, it follows the currently valid legal requirements for highly automated driving in Germany, rendering it a legally compliant approach to implementing remote assistance. To achieve this, the initial click prototype was further iterated following the user-centered design process, incorporating the qualitative feedback from its initial evaluation. In particular, a higher degree of immersion was achieved by translating the single-screen click prototype into a full prototypical setup using seven screens that is very close to the final setup of an RO's workplace. The resulting prototypical workplace for remote assistance is depicted in Fig. 1.

Fig. 1
figure 1

The HMI of the prototypical workplace for remote assistance

The workplace consisted of seven screens: six regular computer monitors (24″ Dell, 16:9 aspect ratio) arranged in two rows of three, and a seventh monitor with the same specifications but with a touch feature ("Touchscreen," see Fig. 1). The basic elements of the HMI as well as the interaction design are described in detail by Kettwich et al. (2021). The workplace comprises the following screens:

  • Video screens: On the three top screens, the live video stream from the supervised HAV is displayed. For the study, simulated video sequences were created in the Unreal Engine for each scenario.

  • Details screen: On this screen, the RO can view information on the status of the fleet of HAVs, the technical status of each HAV, and its exact position and schedule, and can select various camera configurations.

  • Notification screen: Here, the RO is shown new incoming requests (left column), the status of accepted requests (right column), and a communication bar to initiate a voice connection with actors of interest, including other departments of the remote operation center and the operator of the HAV service, police, and rescue services.

  • Map screen: A global map presents the currently assisted HAV in its center as well as the surrounding HAVs that are supervised by the remote operation center. Additionally, layers such as current road closures, stops, and other points of interest can be activated.

  • Touchscreen: It presents a highly detailed view of the immediate area around the HAV and enables the RO to interact with the vehicle by giving clearance to suggested driving maneuvers (Scenario 1), setting waypoints to create pathways for the HAV to follow (Scenario 2), and selecting alternative routes (Scenario 3).

The steps of the interaction between the RO and the supervised HAV are depicted in Fig. 2. In addition, they are described in detail for each scenario in Table 15 in the appendix.

Fig. 2
figure 2

Interaction between remote operator and the supervised HAV between phases of autonomous driving

In all three scenarios (see Fig. 3), the HAV drove in highly automated mode before noticing that it needed the RO's support. Subsequently, it submitted a request to the RO's workplace. The operator received the request for support on the central screen of the second row from the top, in the section for incoming notifications, which also included core information such as the HAV's ID, the issue requiring the RO's support, and the HAV's spatial position. By clicking on "Accept," the RO could allocate the task to themselves, transferring the request to a table containing current tasks. Here, further details, such as the latest video stream from the HAV, its position on a map, and details regarding its technical state, were displayed. Furthermore, a suggestion for an action, such as "Give Clearance," "Set Waypoints," or "Select Alternative Route," was provided. This information supported the RO's decision on how to assist the HAV. Finally, the RO's input was transmitted to the HAV and executed before the vehicle returned to the highly automated driving mode.
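The interaction just described can be summarized as a simple request lifecycle. The following sketch is purely illustrative and not part of the study software; all names (AssistanceRequest, RequestStatus, accept, resolve) are assumptions introduced here.

```python
# Illustrative sketch of the request lifecycle described above (not the study software).
from dataclasses import dataclass, field
from datetime import datetime
from enum import Enum, auto
from typing import Optional, Tuple


class RequestStatus(Enum):
    INCOMING = auto()   # shown in the notification screen's column for new requests
    ACCEPTED = auto()   # moved to the table of current tasks
    RESOLVED = auto()   # RO input transmitted; HAV resumes highly automated driving


@dataclass
class AssistanceRequest:
    vehicle_id: str                   # ID of the supervised HAV
    issue: str                        # e.g., "detected situation unclear"
    position: Tuple[float, float]     # spatial position of the stopped HAV
    suggested_action: str             # e.g., "Give Clearance", "Set Waypoints"
    status: RequestStatus = RequestStatus.INCOMING
    appeared_at: datetime = field(default_factory=datetime.now)
    accepted_at: Optional[datetime] = None
    resolved_at: Optional[datetime] = None

    def accept(self) -> None:
        """RO clicks 'Accept': the request is allocated to the RO."""
        self.status = RequestStatus.ACCEPTED
        self.accepted_at = datetime.now()

    def resolve(self) -> None:
        """The RO's input has been transmitted to and executed by the HAV."""
        self.status = RequestStatus.RESOLVED
        self.resolved_at = datetime.now()
```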

Fig. 3
figure 3

Screenshots from video clips of simulated scenarios created with the Unreal Engine and used in this study. The differences in illumination appear more pronounced here than on the prototypical workplace since separate screens were used to display the images, putting less emphasis on these differences. a Scenario 1: detected situation unclear, b Scenario 2: blocked lane, and c Scenario 3: rerouting

1.6 Research objectives and hypothesis

The goal of this work is the user-centered design of a novel HMI for remote assistance following established design guidelines and its evaluation regarding the key Human Factors outcome variables performance, situation awareness (SA), and workload. Also, we want to assess the participants' ratings of usability, user experience, and acceptance when interacting with the HMI. To achieve this goal, three research objectives were examined in this study.

The first objective examined whether participants show lower performance at increasing levels of cognitive demand in routine remote assistance tasks using the proposed workplace HMI for remote assistance. The overall hypothesis was as follows:

  • H1 (performance): When the level of induced cognitive demand increases while completing tasks using the designed workplace for remote assistance, participants’ performance decreases.

It separates into three sub-hypotheses:

  • H1.1 (task reaction time): When the level of induced cognitive demand increases, participants require more time to react to an incoming notification, which manifests in more time passing from the appearance to the acceptance of the notification.

  • H1.2 (task completion time): When the level of induced cognitive demand increases, participants require more time to process a task, which manifests in more time passing from the acceptance of the notification to the completion of the task.

  • H1.3 (number of correct n-back comparisons): When the level of induced cognitive demand increases, participants’ number of correct n-back comparisons decreases.

The second objective tested whether participants report lower SA at increasing levels of cognitive demand while processing routine remote assistance tasks using the proposed workplace HMI for remote assistance. The corresponding hypothesis was as follows:

  • H2 (subjective SA): When the level of induced cognitive demand increases while completing tasks using the designed workplace for remote assistance, participants’ reported SA ratings decrease.

The third objective examined whether participants report higher workload with increasing levels of cognitive demand while processing remote assistance tasks using the proposed workplace HMI for remote assistance. Here, the hypothesis was as follows:

  • H3 (subjective workload): The participants’ reported ratings of workload increase with increasing levels of induced cognitive demand while completing tasks using the designed workplace for remote assistance.

In addition to these objectives, we examined the participants' ratings of usability, user experience, and acceptance. Thereby, we wanted to gain first insights into the participants' subjective experience with the remote assistance workplace. Our analysis examined how participants assess the usability of the presented HMI for remote operation and how they rate their satisfaction.

2 Method

2.1 Sample

Participants were recruited through postings in buildings and on online platforms of engineering departments of universities and research centers in Germany. Participation was voluntary and compensated with 25 euros. This study was conceptualized and realized in accordance with the Declaration of Helsinki. The institutional review board of the research institution at which this study was conducted approved the study. Informed consent was obtained from all participants before the experiment. Participants were allowed to stop the study at any point without justification or consequence.

Of the N = 41 participants who took part in this study, seven had to be excluded because technical issues in the tools used for collecting questionnaire or performance data rendered their data unusable. Only participants with complete datasets were included in the analysis, resulting in a final sample of N = 34 (four female). Participants' ages ranged from 23 to 31 years (M = 26.2, SD = 2.31). 62% of the participants had experience in monitoring technical systems such as airplanes, automated vehicles, wind tunnels, agricultural robots, pumps, and machines. Their affinity for technology (Franke et al. 2019) was high (M = 4.94, SD = 0.48; scale poles 1: low to 6: high). All participants had normal or corrected-to-normal vision and possessed a valid driver's license for passenger vehicles. Only participants with a university or state-certified technician degree in one of the following disciplines were accepted: mechanical, automotive, electrical, aerospace, and aviation engineering. The reason for this criterion was our objective to closely model the participant group on the requirements posed to the Technical Supervisor, the German equivalent of the RO as specified in the German Autonomous Driving Act (StVG, 2021). This law demands that a Technical Supervisor hold a degree in one of the listed engineering disciplines. The criterion therefore ensured that only participants who were deemed qualified for this work by the law, at least regarding their educational background, were included in this study. 21 participants (62%) held a Bachelor's degree as their highest academic degree and thirteen a Master's degree. More than a third (35%) of the participants stated that they drive a vehicle multiple times per month, and about 29% reported driving several times a week. All participants had heard about HAVs in the past. 91% expressed interest or strong interest in HAV technology, indicated by responding with values of 4 or 5 on a Likert scale on interest in AVs (1: "not interested at all" to 5: "very interested"; M = 4.29, SD = 0.72). 28 participants (82%) indicated that they had not used HAVs so far.

2.2 Experimental design

The experimental design was a 3 × 3 within-subject design. The independent variables were the primary task (Scenarios 1–3) and the secondary task to induce additional cognitive load (none, 1-back, 2-back). Dependent variables were performance in primary and secondary task, workload, SA, usability, user experience, and acceptance (see Sects. 2.3.3 and 2.3.4).
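For illustration, the nine within-subject conditions can be enumerated as the Cartesian product of the two factors. This is a sketch only; the labels are taken from the text, the variable names are assumptions, and the actual order of conditions followed the counterbalanced procedure described in Sect. 2.4.

```python
# Sketch of the 3 x 3 within-subject design (illustrative).
from itertools import product

primary_tasks = ["Scenario 1 (give clearance)",
                 "Scenario 2 (set waypoints)",
                 "Scenario 3 (select alternative route)"]
secondary_tasks = ["none", "1-back", "2-back"]

# Every participant completed all nine factor combinations.
for scenario, n_back in product(primary_tasks, secondary_tasks):
    print(f"{scenario:<40s} | secondary task: {n_back}")
```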

2.3 Materials

2.3.1 Primary task (scenarios)

Three scenarios were used as primary tasks in this study. Figure 3 displays a screenshot of each scenario. The scenarios, which target highly automated driving (Society of Automotive Engineers Level 4), were extracted from a previously compiled catalog of scenarios in remote operation (Kettwich et al. 2022) because they were considered typical of routine tasks in remote assistance. This catalog is based on in-depth interviews, observation studies, and video analyses with control center employees. The scenarios are also representative in that similar scenarios are already handled by leading operators of HAV fleets on public roads. For instance, the automated vehicle operator Waymo utilizes remote operation when an automated vehicle encounters a closed road on its way, requiring rerouting (Amador et al. 2022). Similar tasks were also confirmed to be used by the robotaxi service Cruise in California, USA (CNBC 2023). The scenarios were implemented in the Unreal Engine (Epic Games 2019) and extracted as video clips. These video clips were played to the participants before and after they interacted with the remote operation workplace.

Detailed steps of the interaction between RO and workplace are listed comprehensively in Table 15 in the appendix.

Scenario 1: Detected Situation Unclear. In this scenario, the supervised HAV detects an obstacle on the road, stops, and reports the incident to the RO. The detected obstacle is a puddle on the road that reflects the surrounding buildings, so the automation is uncertain whether the vehicle can continue its ride. The RO observes the situation via the supervised HAV's on-board cameras (transmission of video images) and gives clearance so the vehicle can continue its journey. After assessing the situation, the RO's primary task therefore is giving clearance to continue driving. This task resembles the fulfillment of "confirmation requests" by ROs as reported for Cruise (CNBC 2023).

Scenario 2: Blocked Lane. A vehicle is parked in the lane that the supervised HAV uses, blocking the lane and preventing the HAV from continuing its ride. The HAV stops and sends a corresponding message to the RO. The RO checks the situation on site via the HAV's cameras and sets waypoints for an alternative trajectory that uses the lane for oncoming traffic to bypass the parked vehicle. The RO's primary task is to set waypoints from which a new trajectory is calculated. The task of setting waypoints resembles the ROs' "guiding the AV through tricky situations" as reported for Cruise (CNBC 2023).

Scenario 3: Rerouting. Because of a road closure, the supervised HAV needs to change its route. The RO views the road closure via the HAV's cameras, reviews the suggested alternative routes, and chooses one of them. Thus, the RO's primary task is selecting one of several proposed routes presented on the touchscreen. Rerouting is an RO's responsibility in the event of road closures, as confirmed by Waymo (Amador et al. 2022).

2.3.2 Secondary task

As a secondary task, the n-back task (Kirchner 1958) was included. Its purpose was to modulate the RO's cognitive load in order to simulate phases of elevated workload that are likely to occur in the RO's work. In this task, participants had to compare a presented digit with the digit presented n steps before the current one. The higher the n, the more digits had to be retained in the participants' working memory, increasing their workload. The n-back task was presented visually and auditorily on a tablet computer distinct from the investigated workplace HMI (Fig. 4, bottom right). However, participants were instructed to attend to the auditory presentation only and to give their responses verbally; the experimenter ensured that participants followed these instructions. Each trial used a list of 30 + n digits: every five seconds, a single digit from one to nine was played auditorily via the tablet computer's speaker and displayed on its screen, so that a total of 30 n-back comparisons had to be made per trial. The order of digits was determined for each trial by randomly assigning one of four lists, each containing a specific order of digits. Participants were instructed to respond verbally with "correct" if the digit presented n steps before the current one was identical to the current one, and with "incorrect" otherwise. Participants were asked to respond before the next digit was presented, i.e., within less than five seconds. The experimenter logged the participants' responses.
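The stimulus generation and scoring logic described above can be sketched as follows. The parameters (30 comparisons, digits one to nine) follow the text, whereas the function names and the use of an optional random seed are illustrative assumptions.

```python
# Sketch of n-back stimulus generation and scoring as described above (illustrative).
import random


def generate_digits(n: int, comparisons: int = 30, seed: int | None = None) -> list[int]:
    """Return a list of 30 + n digits so that exactly 30 n-back comparisons are possible."""
    rng = random.Random(seed)
    return [rng.randint(1, 9) for _ in range(comparisons + n)]


def expected_responses(digits: list[int], n: int) -> list[str]:
    """'correct' if the current digit equals the one presented n steps before, else 'incorrect'."""
    return ["correct" if digits[i] == digits[i - n] else "incorrect"
            for i in range(n, len(digits))]


def count_correct(responses: list[str], digits: list[int], n: int) -> int:
    """Number of correct comparisons (max. 30), as logged by the experimenter."""
    return sum(given == expected
               for given, expected in zip(responses, expected_responses(digits, n)))
```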

Fig. 4
figure 4

The prototypical workplace for remote assistance and a tablet computer to present the n-back task’s stimuli (small dark screen on the right-hand side of the touchscreen). The top row of screens shows the video streams from the on-board cameras of the simulation. The second row of screens presents information on the technical state of the supervised HAV (left screen), an overview of vehicle requests and their respective status (center), and a map of the environment surrounding the supervised vehicle (right). The touchscreen on the bottom is used by the RO to interact with the supervised vehicle, e.g., by setting waypoints or selecting alternative routes on a map, depending on the scenario

2.3.3 Objective measures

We collected three measures that quantified the participants’ performance, two of them regarding the primary task and one regarding the secondary task.

Measures of Primary Task. Regarding the primary task, the objective was to examine how fast participants were able to react to incoming notifications. Even though remote operation must not be time critical by law, from an economic point of view a speedy reaction is still favorable to enable a business case built on remote operation. In addition, the duration participants spent completing the task was measured to investigate whether the HMI is suitable for fulfilling the RO's task in a timely manner. Hence, we measured the participants' performance in the primary task using two variables: (a) the time that passed from the appearance to the acceptance of the support request in seconds, hereafter called task reaction time, and (b) the time that passed from acceptance to completion of the support request in seconds, called task completion time.
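As a minimal sketch, both measures can be computed directly from logged timestamps; the argument names are illustrative assumptions.

```python
# Sketch of the two primary-task performance measures (illustrative).
from datetime import datetime


def task_reaction_time(appeared_at: datetime, accepted_at: datetime) -> float:
    """Seconds from the request's appearance on the HMI to the RO's acceptance."""
    return (accepted_at - appeared_at).total_seconds()


def task_completion_time(accepted_at: datetime, resolved_at: datetime) -> float:
    """Seconds from the RO's acceptance of the request to the resolution of the task."""
    return (resolved_at - accepted_at).total_seconds()
```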

Measure of Secondary Task. To measure how much cognitive load was induced, we measured the participants’ performance in the secondary task using the number of correct n-back comparisons (max. 30) in the n-back task (see Sect. 2.3.2 for further details).

2.3.4 Questionnaires

In addition, we collected self-report data using six questionnaires.

NASA-TLX. The NASA Task Load Index (Hart and Staveland 1988) was used to measure subjective workload after each trial. It is an established multi-dimensional measure for participants to report how taxing they experienced a task to be. The questionnaire distinguishes six dimensions of workload: mental demand, physical demand, temporal demand, performance, effort, and frustration. Responses to each item were collected on a 21-point scale ranging from 1: "low" to 21: "high."

SART. The Situation Awareness Rating Technique (SART; Taylor 1990) assessed the participants' situation awareness (SA) after each trial. It was originally developed to determine pilots' SA and consists of three subscales: demands on attentional resources, supply of attentional resources, and understanding of the situation. Responses to each item were collected on a 7-point Likert scale. The poles depended on the specific item but always ranged from a low to a high degree of the respective construct. For instance, the poles of the item "instability of the situation" were 1: "The scenario is entirely stable" to 7: "The scenario is entirely unstable." An overall SART score was calculated by subtracting the difference between attentional demand and attentional supply from understanding.
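The verbal scoring rule translates into the following formula. The sketch assumes that each subscale is expressed as a single value from 1 to 7, which reproduces the range of −5 to 13 reported for the SART scores in Table 8; the argument names are illustrative.

```python
# SART scoring as described above (sketch).
def sart_score(demand: float, supply: float, understanding: float) -> float:
    """Overall SA = understanding - (attentional demand - attentional supply), Taylor (1990)."""
    return understanding - (demand - supply)
```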

SUS. The System Usability Scale (SUS; Brooke 1996) measures perceived usability. Originating from the need to quickly evaluate usability in software development, it is an economical instrument to assess the construct robustly and across a wide range of domains (Bangor et al. 2008). Responses to each item were collected on a 5-point Likert scale ranging from 1: "I do not agree at all" to 5: "I totally agree." A single indicator value between 0 and 100 summarizes the participants' impression of how well the investigated HMI was suited to execute a particular task. As a global assessment tool, the SUS was administered at the end of the study.
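The standard SUS scoring procedure (Brooke 1996) converts the ten 1–5 ratings into a single value between 0 and 100; the following sketch assumes this standard procedure was used.

```python
# Standard SUS scoring (sketch): odd items contribute (response - 1), even items (5 - response);
# the sum of contributions is multiplied by 2.5 to yield a value between 0 and 100.
def sus_score(responses: list[int]) -> float:
    """responses: the ten item ratings (1-5) in questionnaire order."""
    if len(responses) != 10:
        raise ValueError("The SUS comprises exactly ten items.")
    contributions = [(r - 1) if i % 2 == 0 else (5 - r)  # items 1,3,5,7,9 vs. 2,4,6,8,10
                     for i, r in enumerate(responses)]
    return sum(contributions) * 2.5
```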

UEQ-S. The User Experience Questionnaire Short Version (UEQ-S; Schrepp et al. 2017b) assesses user experience. It consists of two subscales, a pragmatic and a hedonic one. While the pragmatic subscale captures a construct that leans toward usability, the hedonic subscale focuses on the emotional quality of the interaction. The UEQ-S is a condensed version of the standard UEQ (Schrepp et al. 2017a), compressing the six subscales of the standard UEQ into the two aforementioned subscales. Using the UEQ-S ensured a more economical collection of subjective data on participants' emotional experience with a system. Responses to each item were collected on a 7-point Likert scale whose poles were semantically opposed statements on the respective construct, e.g., 1: "obstructive" to 7: "supportive."

Acceptance scale. The Acceptance Scale (van der Laan et al. 1997) was developed as a standard tool to measure driver acceptance of new technology. With nine items divided into two subscales, it measures the usefulness of a system, associated with usability, and the user's satisfaction with said system, similar to user experience. Thus, it is conceptually related to the structure of the UEQ-S but adds the dimension of user acceptance when both subscales are considered holistically. Responses to each item were collected on a 5-point Likert scale whose poles were semantically opposed statements on the respective construct, e.g., 1: "useful" to 5: "useless."

ATI. The Affinity for Technology Interaction Scale (ATI; Franke et al. 2019) assessed the participants' affinity for technology, i.e., a person's tendency to engage in interaction with technology. With its satisfactory psychometric properties, the ATI scale measures affinity for technology with nine items. In this study, the construct was used to describe the sample. Responses to each item were collected on a 6-point Likert scale ranging from 1: "I completely disagree" to 6: "I completely agree."

2.4 Procedure

An overview of the procedure is given in Fig. 5. First, the experimenter welcomed participants, briefed them about the objectives of the study, and asked them to sign an informed consent form, a non-disclosure agreement, and a data protection declaration. Subsequently, participants filled in the sociodemographic questionnaire and completed the ATI scale (Franke et al. 2019) before they received a detailed explanation of the research context. Participants were instructed to imagine being a remote operator who assists HAVs that operate as shuttle buses in public transport. An image of an exemplary HAV was presented to the participants. They were also informed about the setup and features of the RO's prototypical workplace for remote assistance. Next, the experimenter invited participants to take a seat in front of the workplace.

Fig. 5
figure 5

Overview of the study procedure

Subsequently, participants were asked to adjust their swivel chair so that they could see every screen well and reach all input devices, i.e., mouse, keyboard, and the tablet computer for administering questionnaires. The experimenter described the features of the workplace screen by screen. Afterward, participants were encouraged to familiarize themselves with the workplace HMI independently by closely looking at all the screens, clicking around, and learning about the implemented features. Once they were confident that they had acquired a general understanding of the structure and features of the workplace, they were instructed to notify the experimenter. All participants did so within 5 to 10 min. Next, they were guided through the task completion process of the three scenarios (primary tasks, see also Sect. 2.3.1) by the experimenter, who commented on each step. The experimenter ensured that participants understood the sequence of actions and possible interactions in all scenarios. Participants were invited to ask questions. Next, they were familiarized with the secondary task, the n-back task (Kirchner 1958). A tablet computer located right next to the touchscreen (see Fig. 4) visually and auditorily presented a digit from 1 to 9 in intervals of 5 s. Participants were given an example of each task variant (1-back and 2-back) and completed a practice trial for each until they felt confident with the task. A short break of approx. 5 min concluded the training block.

In the first experimental phase, which measured baseline performance in the secondary task, participants completed one trial of each secondary task variant (1-back, 2-back) in a balanced order. After each of the two trials, they filled in the NASA-TLX questionnaire. In the second experimental phase, which measured baseline performance in the primary task, participants completed each of the three scenarios of the primary task (1, 2, 3) in a balanced order. Before carrying out the tasks, participants were instructed to make the journey as smooth, quick, and seamless as possible for the passengers of the HAV. They were also reminded of their responsibility for the passengers' safety, which could only be fulfilled appropriately if they paid close attention to all information and recommendations presented on the screens. Participants completed the NASA-TLX and SART after each trial. Subsequently, participants completed the combined primary and secondary tasks: two blocks (1-back, 2-back) of three trials each (Scenarios 1, 2, 3) were administered in counterbalanced order. Again, participants completed the NASA-TLX and SART after each of the six trials. Participants were instructed to prioritize the primary task but, at the same time, not to neglect the secondary task because failure to do so would disable the supervised HAV, resulting in passenger dissatisfaction. In each trial, participants performed the secondary task alone during the first 25 s of automated driving before the primary task was presented and had to be resolved. After completion of the primary task, the secondary task continued until 30 n-back comparisons had been carried out.

Finally, participants filled in the questionnaires assessing usability, user experience, and acceptance, and were encouraged to provide remarks on the HMI and the study overall. The whole procedure took about 2.5 h.

3 Results

As this study was conducted using a within-subject design, a repeated measures analysis of variance (RM-ANOVA) was applied to determine the influence of primary and secondary task condition on the outcome variables presented above. Since the Mauchly (1940) sphericity test indicated a violation of sphericity in some cases, the Greenhouse–Geisser (1959) correction was applied in all reported RM-ANOVA results. In addition to the RM-ANOVA, post hoc pairwise comparisons with Bonferroni (1936) correction were performed to identify significant differences between specific groups.
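As an illustration of this analysis pipeline, the following sketch uses the pingouin package (an assumption; the authors do not report their analysis software) on a hypothetical long-format data file. It is shown for a one-way comparison of the secondary-task conditions on one dependent variable; the full 3 × 3 analyses proceed analogously.

```python
# Sketch of the repeated-measures analysis (illustrative; file and column names are assumptions).
import pandas as pd
import pingouin as pg

# Long format: one row per participant x scenario x secondary-task condition.
df = pd.read_csv("trials_long.csv")

# One-way RM-ANOVA on the secondary-task factor with Greenhouse-Geisser correction.
aov = pg.rm_anova(data=df, dv="task_completion_time", within="secondary_task",
                  subject="participant", correction=True, detailed=True)
print(aov)

# Post hoc pairwise comparisons with Bonferroni correction
# (called pairwise_ttests in older pingouin versions).
posthoc = pg.pairwise_tests(data=df, dv="task_completion_time", within="secondary_task",
                            subject="participant", padjust="bonf")
print(posthoc)
```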

3.1 Performance (H1)

To test Hypotheses 1.1 to 1.3, multiple statistical procedures were used. To test whether participants required more time to react to an incoming notification under varying levels of cognitive demand (H1.1), a 3 × 3 RM-ANOVA was computed. The descriptive statistics for task reaction time are presented in Table 1. The main effect of primary task condition (scenario) on task reaction time was not significant, F(2, 66) = 0.798, p = 0.448, η2 = 0.024 (Fig. 6). There was also no main effect of the secondary task condition on task reaction time, F(2, 66) = 3.178, p = 0.063, η2 = 0.088. Thus, induced cognitive load did not affect reaction times to incoming notifications, and the respective hypothesis (H1.1) could not be accepted. There was no significant interaction effect between primary task condition and secondary task condition on task reaction time either, F(4, 132) = 0.597, p = 0.612, η2 = 0.018. As shown in Table 2, post hoc pairwise comparisons yielded no significant differences.

Table 1 Descriptive statistics of task reaction time in seconds by primary task (scenario) and condition of secondary task
Fig. 6
figure 6

Means of task reaction time by condition of secondary task. Bars indicate 95% confidence intervals

Table 2 Pairwise comparisons of means between conditions of secondary task regarding task reaction time in seconds (Bonferroni correction applied)

Another 3 × 3 RM-ANOVA examined whether participants needed more time from the acceptance of the notification to the completion of the task at increasing levels of cognitive demand (H1.2). The descriptive statistics for task completion time are presented in Table 3. A significant main effect of primary task condition on task completion time was found, F(2, 66) = 82.814, p < 0.001, η2 = 0.715. Post hoc pairwise comparisons (Table 4) revealed that Scenario 1 took participants significantly less time to complete than Scenarios 2 and 3. This is a direct consequence of the task design, particularly the number of steps required to resolve each scenario and the kind of input the RO had to provide (see Sect. 4.1 for details). A significant main effect of secondary task condition on task completion time was also found, F(2, 66) = 7.663, p = 0.002, η2 = 0.188 (Fig. 7). The more cognitive load was induced by the secondary task, the longer it took participants to complete the task. Therefore, hypothesis H1.2 was accepted. There was no significant interaction effect between primary task condition and secondary task condition on task completion time, F(4, 132) = 1.784, p = 0.152, η2 = 0.051. As shown in Fig. 7, post hoc pairwise comparisons yielded a significant difference between the 2-back and the 1-back secondary task conditions (Mdiff = 6.78, p < 0.001) but not between 2-back and no secondary task. Refer to Tables 4 and 5 for all post hoc comparisons.

Table 3 Descriptive statistics of task completion time in seconds by primary task (scenario) and condition of secondary task
Table 4 Pairwise comparisons of means between primary tasks regarding task completion time in seconds (Bonferroni correction applied)
Fig. 7
figure 7

Means of task completion time by condition of secondary task. Bars indicate 95% confidence intervals. *p < 0.05, **p < 0.01, ***p < 0.001

Table 5 Pairwise comparisons of means between conditions of secondary task regarding task completion time in seconds (Bonferroni correction applied)

Finally, a third 3 × 3 RM-ANOVA examined whether participants' numbers of correct n-back comparisons decreased at increasing levels of cognitive demand (H1.3). The descriptive statistics for the number of correct n-back comparisons are presented in Table 6. As shown in Fig. 8 and Table 7, a significant main effect of secondary task condition on the number of correct n-back comparisons was found, F(1, 33) = 63.440, p < 0.001, η2 = 0.658. That means that in the secondary task condition that induced a higher cognitive load, significantly fewer correct n-back comparisons were made. Therefore, hypothesis H1.3 was accepted. There was no significant main effect of primary task condition on the number of correct n-back comparisons, F(2, 66) = 1.885, p = 0.160, η2 = 0.054. In addition, the interaction effect between primary task condition and secondary task condition on the number of correct n-back comparisons was not significant, F(2, 55) = 1.250, p = 0.283, η2 = 0.036.

Table 6 Descriptive statistics of numbers of correct n-back comparisons by primary task (scenario) and condition of secondary task
Fig. 8
figure 8

Means of number of correct n-back comparisons by condition of secondary task. Bars indicate 95% confidence intervals. *p < 0.05, **p < 0.01, ***p < 0.001

Table 7 Pairwise comparisons of means between conditions of secondary task regarding number of correct n-back comparisons (Bonferroni correction applied)

3.2 Situation awareness (H2)

Hypothesis 2 examined whether ratings of situation awareness (SA) on the SART scale decrease at increasing levels of cognitive demand. Again, a 3 × 3 RM-ANOVA was conducted. The descriptive statistics for SART scores are presented in Table 8. Primary task condition did not significantly affect participants' SART scores, F(2, 66) = 0.639, p = 0.531, η2 = 0.019. However, there was a main effect of secondary task condition on participants' SART scores, F(2, 66) = 27.819, p < 0.001, η2 = 0.457. Globally, a higher induced cognitive load was therefore associated with a lower SART score. Consequently, the hypothesis that subjective situation awareness degrades as workload increases was accepted (Fig. 9). The interaction effect between primary task condition and secondary task condition on SART score was not significant, F(4, 132) = 1.116, p = 0.349, η2 = 0.033. Post hoc pairwise comparisons yielded significant differences between the 2-back and the 1-back secondary task conditions as well as between the 2-back and the no secondary task condition, but not between 1-back and no secondary task (Table 9).

Table 8 Descriptive statistics of SART scores (Taylor 1990) for subjective situation awareness (low: − 5 to high: 13) by primary task (scenario) and condition of secondary task
Fig. 9
figure 9

Means of SART scores (Taylor 1990) for subjective situation awareness by condition of secondary task (low: − 5 to high: 13). Bars indicate 95% confidence intervals. *p < 0.05, **p < 0.01, ***p < 0.001

Table 9 Pairwise comparisons of means between conditions of secondary task regarding SART scores for subjective situation awareness (Bonferroni correction applied)

3.3 Workload (H3)

Hypothesis 3 tested whether participants' reported ratings of workload on the NASA-TLX questionnaire increase at increasing levels of induced cognitive demand. The descriptive statistics for NASA-TLX scores are presented in Table 10. A 3 × 3 RM-ANOVA revealed a main effect of primary task on NASA-TLX score, F(2, 64) = 3.748, p = 0.041, η2 = 0.105, meaning that subjective workload differed significantly among the primary tasks. Post hoc pairwise comparisons (Table 11) revealed, however, that workload was experienced significantly differently only between Scenario 1 and Scenario 2, not between any of the other pairs of scenarios. Additionally, the main effect of secondary task condition on NASA-TLX score reached significance, F(2, 64) = 72.767, p < 0.001, η2 = 0.695 (see Fig. 10). Hence, the higher the cognitive load induced by the secondary task, the higher the perceived workload. The hypothesis that elevated induced cognitive load leads to increased perceived workload was therefore accepted. Post hoc comparisons were significant between all conditions (Table 12). No interaction effect between primary task condition and secondary task condition on NASA-TLX score was found, F(4, 128) = 0.257, p = 0.877, η2 = 0.008.

Table 10 Descriptive statistics of NASA-TLX scores (Hart & Staveland 1988) for subjective workload (low: 1 to high: 21) by primary task (scenario) and condition of secondary task
Table 11 Pairwise comparisons of means between primary tasks (scenarios) regarding NASA-TLX scores for subjective workload (low: 1 to high: 21; Bonferroni correction applied)
Fig. 10
figure 10

Means of NASA-TLX scores for subjective workload (low: 1 to high: 21) by condition of secondary task. Bars indicate 95% confidence intervals. *p < 0.05, **p < 0.01, ***p < 0.001

Table 12 Pairwise comparisons of means between conditions of secondary task regarding NASA-TLX scores for subjective workload (low: 1 to high: 21; Bonferroni correction applied)

3.4 Questionnaires

In order to evaluate the workplace HMI overall, questionnaires on user-related variables including usability, user experience, and acceptance were administered after all task trials had been completed. This part of the study was exploratory in character, aiming to understand whether the designed HMI was user-friendly beyond the scope of the specific interactions investigated in the scenarios. Consequently, no hypotheses were stated a priori.

First, the System Usability Scale (SUS; Brooke 1996) yielded very good usability ratings (M = 76.25, SD = 11.87, on a scale from 0: very poor usability to 100: flawless usability). This score falls between the adjective ratings "good" (M = 72.8) and "excellent" (M = 85.6) resulting from Bangor et al.'s (2008) empirical validation of the verbal interpretation of SUS scores. The interaction design between users and the investigated HMI was therefore regarded as positive.

Second, user experience was measured with the User Experience Questionnaire Short Version (UEQ-S). The questionnaire consists of two subscales, the pragmatic and the hedonic scale. While the former pertains to a concept similar to usability, the latter focuses on the emotional component of user experience. Both subscales and the complete scale were tested against the arithmetic center of the scale, 0. This approach was based on the assumption that the center of a scale represents a conceptual average value, e.g., a medium extent of usability. Table 13 shows the descriptive and test results. User experience was rated significantly higher than the arithmetic scale center on both the complete scale and the pragmatic subscale. This finding indicates that participants were satisfied with their interactions with the workplace HMI. Results on the hedonic subscale did not differ significantly from the scale center, suggesting average emotional experiences with the HMI.
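Testing a subscale mean against the arithmetic center of the scale corresponds to a one-sample t-test against 0. The sketch below uses placeholder data and assumes the usual UEQ-S recoding of responses to the range −3 to +3; the variable names are illustrative.

```python
# Sketch: one-sample t-test of a UEQ-S subscale against the scale center 0 (placeholder data).
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
pragmatic_scores = rng.normal(loc=1.0, scale=0.8, size=34)  # one subscale mean per participant

t_stat, p_value = stats.ttest_1samp(pragmatic_scores, popmean=0.0)
print(f"t({len(pragmatic_scores) - 1}) = {t_stat:.2f}, p = {p_value:.3f}")
```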

Table 13 Subjective user experience of the prototypical remote operation workplace measured with UEQ-S

Third, acceptance of the HMI was measured with van der Laan et al.'s (1997) Acceptance Scale. Similar to the UEQ-S, it consists of two subscales: one focuses on usability, or usefulness in the questionnaire authors' terms, while the other centers on the emotional quality of the HMI. As shown in Table 14, both the overall scale mean and the means of the subscales usefulness and satisfaction were significantly higher (p < 0.001) than the center of the scale, 0, indicating that acceptance, usefulness, and satisfaction with the prototype were above average. These findings are very similar to the UEQ-S results, with the difference that satisfaction was rated more favorably on the satisfaction subscale of the Acceptance Scale than the pragmatic quality was in the UEQ-S.

Table 14 Subjective acceptance of the prototypical remote operation workplace measured with the Acceptance Scale

4 Discussion

The goal of this study was to evaluate a novel prototypical workplace for the remote assistance of highly automated vehicles (HAVs) regarding performance, situation awareness (SA), workload, and other user-related outcomes. To the authors' knowledge, this is the first study that not only designs a comprehensive HMI for the remote assistance of highly automated vehicles (Society of Automotive Engineers Level 4) but also systematically evaluates it in a controlled laboratory study, measuring outcome variables that are considered key in the field of Human Factors. Serving as representatives of the future user group of remote operators, participants used the prototypical workplace's HMI to resolve scenarios that are considered relevant routine tasks in HAV remote operation as listed by Kettwich et al. (2022). Furthermore, a secondary task was added to elevate the participants' workload, thus simulating the execution of parallel tasks or distractions. Three hypotheses were postulated regarding participants' performance, SA, and workload while resolving the three scenarios with the workplace HMI.

4.1 Results on H1 (performance)

The first hypothesis assumed that participants would show lower performance at increasing levels of cognitive demand. This hypothesis was partially accepted: a significant main effect of secondary task condition, which was used as a proxy to systematically vary cognitive demand, was found for task completion time but not for task reaction time. This finding suggests that induced cognitive load has a negative effect on processing a task in a timely manner (H1.2) but not on the response time that a participant needs to react to an incoming notification (H1.1). This finding can be explained with Wickens’ (2002) multiple resource theory: resolving even the relatively simple primary task places a considerable cognitive demand on the operator, consuming a share of the pool of cognitive resources available to them. Cognitive demand induced by the secondary task draws from the same pool and therefore competes with the primary task’s execution for cognitive resources. This diminishes the supply of cognitive resources for completing the primary task, resulting in a longer task completion time. Since this effect already occurred in the simple and well-trained routine tasks that participants were subjected to in this study, it is probable that the RO’s workload will increase as tasks and interactions become more complex and novel. It is therefore questionable whether additional tasks beyond remotely supporting HAVs can be assigned to ROs. It can be concluded that in order to design the RO’s workplace in a manner that does not create overload, the number and cognitive load of tasks that are to be executed simultaneously need to be kept at a minimum. It seems advisable to limit the RO’s responsibilities to tasks that are immediately related to providing remote assistance. Additional tasks, such as dispatching or passenger information beyond the level of driving maneuvers, might deplete the RO’s cognitive resources and thus diminish safety and performance. A sophisticated system for task allocation and prioritization could help balance the RO’s workload, particularly if the RO’s work is embedded in a remote operation center in which tasks can be divided among ROs.

In contrast to completing a task, accepting a task is assumed to be cognitively less demanding. Accepting does not deplete the pool of resources as strongly as processing the task because it is a simple procedure that does not require abundant cognitive resources. Moreover, the pattern for accepting a request is identical across tasks. Thus, instead of high-level cognitive mechanisms like working memory, more basal reaction times to visual stimuli might drive the results regarding task reaction time. These reactions do not depend on a common pool of cognitive resources but are separate cognitive processes that might be explained as phenomena of attention distribution, e.g., by Wickens et al.’s (2003) SEEV model. According to this model, both bottom-up and top-down mechanisms of cognitive processing come into play when directing attention to stimuli. Specifically, the SEEV model claims that a stimulus’s likelihood to attract attention is influenced by its salience, the effort it takes to perceive it, the expectancy of perceiving it, and the value the perceiver assigns to it. Furthermore, reaction time to a stimulus may be influenced by factors such as intensive training (Barrett et al. 2020). Consequently, reacting to incoming notifications is not compromised by inducing more cognitive load, resulting in similar reaction times between the secondary task conditions.
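One common additive reading of the SEEV components can make this argument more concrete: salience, expectancy, and value raise the likelihood of attending to a stimulus, while effort lowers it. The sketch below uses this reading with purely illustrative weights and ratings; neither the weights nor the element ratings are parameters reported by Wickens et al. (2003) or measured in this study.

```python
# Illustrative additive reading of the SEEV components:
# attention score = s*Salience - ef*Effort + ex*Expectancy + v*Value.
# Weights and element ratings are hypothetical assumptions.

WEIGHTS = {"salience": 1.0, "effort": 1.0, "expectancy": 1.0, "value": 1.0}

def seev_score(salience, effort, expectancy, value, w=WEIGHTS):
    return (w["salience"] * salience
            - w["effort"] * effort
            + w["expectancy"] * expectancy
            + w["value"] * value)

# Hypothetical ratings (0 = low, 1 = high) for two elements of an RO HMI.
incoming_notification = seev_score(salience=0.9, effort=0.1, expectancy=0.7, value=0.9)
secondary_task_prompt = seev_score(salience=0.5, effort=0.2, expectancy=0.8, value=0.4)
# A higher score means the element is more likely to attract attention.
print(incoming_notification, secondary_task_prompt)
```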

Concerning the secondary task, it was hypothesized that the number of correct n-back comparisons decreases at increasing levels of cognitive demand (H1.3). This hypothesis was accepted, demonstrating the successful induction of cognitive load that diminished the participants’ performance on the secondary task. The finding indicates that increasing the level of n does indeed impose a higher cognitive demand. The n-back task can therefore be considered valid at least as an approximation for additional work-related or unrelated tasks that occur in a real-world setting. An example of an additional task is being responsible for services other than core remote operation, such as communicating with passengers on board the assisted HAV. This finding speaks in favor of splitting core remote assistance tasks, such as ensuring the HAV’s onward travel, and surrounding tasks, such as passenger communication, into different roles. Separate roles could help avoid cognitive overload in ROs in situations where several tasks would have to be executed simultaneously, ensuring safe and efficient operations.
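For illustration, the sketch below shows how target trials in the n-back task are determined: a stimulus counts as a target whenever it matches the stimulus presented n positions earlier, which is the comparison participants had to make. The letter sequence is hypothetical.

```python
# Minimal sketch of n-back target detection: a stimulus is a target
# if it matches the stimulus presented n positions earlier.
# The letter sequence is hypothetical.

def n_back_targets(stimuli, n):
    """Return a list of booleans marking target positions (only possible for i >= n)."""
    return [i >= n and stimuli[i] == stimuli[i - n] for i in range(len(stimuli))]

sequence = list("KTKTLLQLQ")
print(n_back_targets(sequence, n=1))  # 1-back: compare with the previous letter
print(n_back_targets(sequence, n=2))  # 2-back: compare with the letter two steps back
```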

The main effect of primary task on task completion time results from the finding that Scenario 1 took participants significantly less time to complete than Scenarios 2 and 3. This effect can be traced back to the differing lengths of the displayed scenarios and the kind of interaction required between participant and workplace HMI. For instance, in Scenario 1 only clearance had to be given, whereas in Scenarios 2 and 3 longer, multi-step interactions were required to resolve the task.

4.2 Results on H2 (situation awareness)

The second hypothesis postulated that participants’ ratings of situation awareness (SA) would decrease at increasing levels of cognitive demand. This was also found in the collected data: the level of reported SA slightly deteriorated as cognitive load increased. The finding means that higher cognitive load degrades subjective SA, even in routine and well-trained tasks in which participants could be expected to maintain SA due to learning effects. The finding has implications for the design of the interaction between RO and HAV: keeping the RO’s workload at a manageable level may help them maintain a sufficient degree of SA. One way to ensure this could be for the RO to focus their cognitive capacity solely on resolving the HAV’s request, without attending to other tasks, at least while the request is being processed. Second, the system for task management could be further improved to help generate and maintain SA by providing a status display of the requests that are yet to be processed and those that are currently being processed. The action that the RO is required to take next could be highlighted to improve the RO’s overview of open tasks, boosting SA (see the sketch below). Third, visual aids could be added to the HMI design, particularly to the video screens, to draw the RO’s attention to important stimuli such as relevant road users that are likely to interact with the supervised HAV. This could improve not only the perception of elements (SA Level 1) and their integration into a coherent situational representation (SA Level 2) but also the prediction of how the situation will unfold (SA Level 3).
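The task-management improvement suggested above could, for instance, take the form of a request queue in which each HAV request carries a status and the next required RO action. The following sketch is a purely hypothetical illustration; none of the names or fields are part of the evaluated HMI.

```python
# Hypothetical sketch of a request queue supporting a status display:
# each HAV request carries a status and the next required RO action.
# Names and fields are illustrative, not part of the evaluated HMI.
from dataclasses import dataclass, field
from typing import List

@dataclass
class AssistanceRequest:
    vehicle_id: str
    scenario: str
    status: str = "pending"          # "pending" | "in_progress" | "resolved"
    next_action: str = "accept request"

@dataclass
class RequestQueue:
    requests: List[AssistanceRequest] = field(default_factory=list)

    def status_display(self):
        """Return the lines an RO-facing status panel could show."""
        return [f"{r.vehicle_id}: {r.scenario} [{r.status}] -> next: {r.next_action}"
                for r in self.requests if r.status != "resolved"]

queue = RequestQueue([
    AssistanceRequest("HAV-03", "blocked lane", "in_progress", "confirm proposed maneuver"),
    AssistanceRequest("HAV-07", "unclear right of way"),
])
print("\n".join(queue.status_display()))
```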

The result that executing other tasks in parallel to a main task lowers SA is consistent with existing literature. Merat et al. (2010) found in a driving simulator study that SA, measured by participants’ responses to critical incidents, was negatively affected by an auditory secondary task. In a similar vein, drivers who had to navigate the menu of a driver information system as a visual secondary task while driving in a simulator showed significantly lower SA (Wulf et al. 2013). Comparable findings linking workload and SA have also been made in other domains: in aviation, Endsley and Rodgers (1998) found a significant relationship between workload and several indicators of SA.

Regardless of the primary or secondary task condition, SA scores remained in a medium to high range. They were significantly above the arithmetic center of the scale at all levels. Therefore, it can be assumed that the HMI design is robust against SA degradation even when additional workload is induced, regardless of the scenario, at least for scenarios similar to the ones investigated in this study, i.e., routine and rather well-trained scenarios. Whether this will hold true in more complex, less trained scenarios is for future research to examine. Finally, it is noteworthy that no significant difference in SART scores was found between the 1-back condition and the condition without a secondary task. This result suggests a floor effect in the 1-back condition: this level of the secondary task did not impede SA more strongly than the condition without any secondary task.

4.3 Results on H3 (workload)

The third hypothesis assumed that reported ratings of workload would increase at rising levels of cognitive demand. The conducted RM-ANOVA confirmed this hypothesis. The finding indicates that the cognitive demand induced by the secondary n-back task fulfilled its purpose, actually increasing participants’ perceived workload. Thus, the task proved effective in reaching the intended goal of artificially elevating workload to emulate working on multiple tasks simultaneously. Workload means did not exceed the center of the scale in any condition, ranging between 6 and 11 on a scale from 1 to 21. This finding suggests a low-to-medium perceived workload in all conditions of the secondary task. The simplicity and routine character of the primary task may have contributed to a low workload baseline. Nevertheless, the effect of the additional cognitive load induced via the secondary task on perceived workload was statistically significant.
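For readers who wish to retrace the analysis, the following minimal sketch shows a repeated-measures ANOVA on workload ratings with the secondary task condition as a within-subject factor. The data frame and its values are hypothetical, and the actual analysis additionally included the primary task (scenario) factor.

```python
# Minimal sketch of a repeated-measures ANOVA on workload ratings with
# secondary task condition as a within-subject factor.
# The data are hypothetical, not the study data.
import pandas as pd
from statsmodels.stats.anova import AnovaRM

data = pd.DataFrame({
    "participant": [1, 1, 1, 2, 2, 2, 3, 3, 3, 4, 4, 4],
    "secondary_task": ["none", "1-back", "2-back"] * 4,
    "workload": [6, 8, 10, 7, 9, 11, 6, 7, 10, 8, 9, 12],
})

result = AnovaRM(data, depvar="workload", subject="participant",
                 within=["secondary_task"]).fit()
print(result)
```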

The identified effect of cognitive secondary task load is in line with automotive HMI research that used the n-back task as a secondary task to induce cognitive load. Liang and Pitts (2019), for instance, reported that workload measured with the NASA-TLX questionnaire was significantly elevated when the difficulty level of the n-back task was increased. Their study measured participants’ performance in a simulated driving task supported by a lane keeping system.

In addition to the significant main effect of secondary task, a significant main effect of primary task on reported workload was revealed. This finding is a logical consequence of the varying degrees of complexity and demand inherent in the different scenarios: when participants only had to give clearance to a maneuver suggested by the driving automation, as in Scenario 1, workload was low, whereas Scenarios 2 and 3 required more complex interactions, such as drawing waypoints and selecting a new route on a map, and resulted in higher workload. Designing an HMI for the RO therefore needs to take into account how the respective tasks are structured and how much complexity and workload they entail in order to determine whether and which additional tasks may be assigned to the RO. Since even simple tasks with a moderate level of induced additional cognitive load led to increased perceived workload in this study, it needs to be carefully and critically analyzed whether more workload is tolerable. It is advisable to follow a conservative approach, entrusting the RO with a small task set initially and adding new tasks incrementally and cautiously under constant observation of their impact on safety and performance until an ideally balanced task load is established. However, routine tasks such as those investigated in this study do not seem to be overly mentally taxing for the RO. From this finding, it can be cautiously concluded that a single RO assisting several HAVs might be possible as long as no simultaneous interaction with several HAVs is required. This conclusion is backed by experience in the industry: for instance, Cruise requires a single remote assistant for about 15–20 HAVs (CNBC 2023).
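To give a rough operational sense of the reported ratio, the following back-of-the-envelope sketch estimates how many ROs a fleet might require under an assumed vehicles-per-operator ratio. The fleet size is an illustrative assumption; the 15 to 20 range is taken from the Cruise figure cited above and says nothing about peak demand or simultaneous requests.

```python
# Back-of-the-envelope staffing estimate under an assumed
# vehicles-per-RO ratio; all numbers are illustrative.
import math

def required_operators(fleet_size, vehicles_per_ro):
    return math.ceil(fleet_size / vehicles_per_ro)

fleet_size = 100  # hypothetical fleet
for ratio in (15, 20):  # range reported for Cruise (CNBC 2023)
    print(f"{ratio} HAVs per RO -> {required_operators(fleet_size, ratio)} ROs")
```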

4.4 Results on questionnaires

In addition to the variables that were directly linked to hypotheses, the three user-related outcome measures usability, user experience, and acceptance were collected to evaluate the workplace HMI overall. Intentionally, no hypotheses were specified beforehand, as the goal was to capture the users’ impressions of and experiences with the HMI in an exploratory fashion.

First, usability was found to be in a good to excellent range as measured with the SUS. In a similar vein, both the usefulness dimension of the Acceptance Scale and the pragmatic scale of the UEQ-S showed results significantly above the arithmetic centers of the scales. Thus, all indicators suggest a good level of usability. This is in line with the objectives of the designed prototype: the workplace HMI was supposed to implement basic functionalities and interactions between the workplace and the RO, and these appear to be sufficient to yield positive usability ratings. The results show that participants valued the approach of focusing on the main features, keeping the HMI clear of an abundance of information that is not required for the tasks they had to execute. This finding underlines the need for a task-related HMI, meaning that the HMI is limited to what is needed to resolve an issue, which is the RO’s task, and does not provide information unnecessary for task completion. Hence, even though a large amount of data that is potentially accessible to the RO may be collected in the supervised HAV, in surrounding vehicles, and perhaps also in intelligent infrastructure units, one needs to constantly assess and reassess whether this information is actually instrumental for the RO’s specific task.

Second, user experience scales were administered to capture the emotional aspect of user interaction. Participants gave average assessments on the hedonic subscale of the UEQ-S but positive assessments on the satisfaction dimension of the Acceptance Scale. These moderate-to-positive ratings relative to usability suggest that the emotional quality of the interaction with the evaluated HMI is on a good track but may be refined further. Consequently, now that sufficient usability of the remote operation workplace has been demonstrated, future iterations of its HMI design should focus on improving user experience beyond the phase of the user’s immediate interaction with it. The somewhat mixed results regarding user experience are, however, comprehensible in the context of the development of the prototype, which did not prioritize outstanding user experience. The authors prioritized generating a prototype as a “proof of concept” to demonstrate its capability to process basic scenarios and tasks. Creating particularly positive experiences while interacting with the workplace remains a task for further refinement.

Third, the acceptance rating, operationalized as the overall Acceptance Scale score, was above average, suggesting that participants could imagine working with the prototypical workplace in general. This is an important finding as it demonstrates the participants’ willingness to use the workplace HMI, a fundamental prerequisite for its deployment.

4.5 Limitations

Even though the approach taken in this study pursued ecological validity, it comes with five limitations.

First, the sample consisted mostly of male participants, with 12 percent of participants reporting to be female. While this is an objectively low figure, it can be explained by the educational requirements of the Technical Supervisor role, which informed the inclusion and exclusion criteria of this study. As only engineers in certain disciplines were eligible to participate, the gender balance in these fields of engineering needs to be considered in order to compare the sample distribution with the relevant population. For example, among all 2020 Bachelor graduates in electrical engineering in Germany, one of the eligible fields for this study, only 12.5 percent were female (Kompetenzzentrum Technik-Diversity-Chancengleichheit 2023). Thus, the gender balance among study participants is close to that of the population of reference.

Second, the scenarios used in this study aimed at representing typical routine scenarios and the tasks related to resolving them from an RO’s perspective. They are limited in number (only three scenarios were used throughout this study), and so is the range of tasks included. Inevitably, participants habituated to the tasks, rendering potential effects of novelty on performance improbable. However, the objective of the reported study was to evaluate the basic features and interaction patterns between RO and workplace. This was achieved by training and performing a set of standard tasks and scenarios deemed likely to be executed on a daily basis. Since acquiring new skills is based on general learning mechanisms, honing the skill to effectively assist HAVs with the evaluated workplace HMI is assumed to be gradual, hierarchical, and based on gaining experience: only after basic core tasks, like those examined in this study, can be executed successfully may more complex and novel tasks become feasible for trained ROs. The authors considered it beneficial to pursue the same approach in user testing by initiating user studies with a limited set of routine scenarios and tasks and gradually extending this set. Relatedly, the particular HMI design used in this study influences the outcomes regarding performance, SA, workload, and other dependent variables. However, this is the case for any HMI evaluation. The examined HMI is based on a thorough user-centered design process and has been partially evaluated in a previous usability test, yielding positive results. It is also, to the knowledge of the authors, the first HMI of a prototypical workplace for remote assistance. Thus, the authors deemed it justifiable to use this HMI for an extensive evaluation as a proxy for a typical workplace for HAV remote assistance.

Third, only a specific variant of HAV remote operation, remote assistance, was investigated since the workplace HMI is tailored to it. Hence, the results reported in this study may not be directly transferable to other variants of remote operation, specifically remote driving. It can be assumed, however, that the Human Factors investigated here, workload and SA, will be equally or even more critical given the near-real-time situational representations and immediate interventions required for remote driving. If remote driving becomes technically feasible and legally permissible, assessing it from a Human Factors perspective and proposing HMIs for it will become more relevant. In a similar vein, combining different roles with divergent tasks within a remote operation center, as, for example, suggested by Schrank and Kettwich (2021), will require modifications of the evaluated workplace HMI as tasks are likely to change and diversify.

Fourth, the current legal situation in Germany demands that the Technical Supervisor’s interaction with the supervised HAV not be time critical. It could therefore be argued that the outcome variables task reaction time and task completion time, which both measure durations of RO–HMI interactions, are not adequate measures to examine the workplace HMI’s performance. However, even though these variables do not reflect the current legal status, they were deemed key performance indicators as efficiency is vital for systems to be economically viable. Only if handling an incident with an HAV via remote assistance is favorable time-wise may a business case involving this technology emerge.

Fifth, it can be argued that the secondary task used in this study, the n-back task, is somewhat artificial as it does not occur in the natural environment of an RO. Additionally, it does not directly interfere with the primary task because different sensory modalities are used, implying less interference (Wickens 2002): while the visual modality is used for processing the primary task at the workplace HMI, the auditory modality is used for the secondary task. However, the n-back task is a reliable and commonly used method that ensures internal validity and enables systematic variation of induced cognitive load, allowing robust conclusions about the effects of modulating cognitive load. Regarding the different modalities, presenting the primary and secondary tasks in dissimilar modalities may actually be an advantage for determining the lower threshold of workload that is to be expected while conducting remote assistance tasks: since differences in workload were measured even across different modalities, at rather low levels of induced cognitive load, and with simple routine tasks, it is to be expected, according to Wickens’ theory, that cognitive load induced via a secondary task in the same modality would impede performance on the primary task even more, particularly when complex and novel tasks come into play. The outcome observed here may therefore be considered a lower boundary of workload that can be increased further; what the upper boundary of workload may be is subject to future research. Following from these findings, imposing even more responsibilities on an RO is unlikely to be beneficial for their performance and ought to be considered with caution.

Nevertheless, in order to increase ecological validity on top of or in lieu of internal validity, future research could use more natural secondary tasks. An example of such a task is supporting passengers on board the assisted vehicle via intercom, involving auditory interfaces, to provoke same-modality interference. This can be expected to deplete ROs’ cognitive resources even more, making it less likely that they remain capable of resolving their tasks using the proposed workplace HMI.

4.6 Conclusions and future research

This study has shown that the presented user-centered HMI for a remote assistance workplace enables potential users to execute routine tasks, even when additional moderate cognitive load is induced via a secondary task. Thus, the remote assistance workplace HMI appears to be a feasible way to design the interaction between RO and HAV for supporting HAVs in routine scenarios, utilizing human cognitive skills. The proposed workplace HMI for remote operation proved capable of enabling a remote operator to provide support. This claim is supported by four central outcomes, relating to performance, SA, workload, and global evaluation measures.

First, even though induced cognitive load had a significant impact on one of the performance indicators (task completion time but not task reaction time), perceived workload did not surpass a medium level. This finding indicates that, at least for simple routine scenarios and related tasks, workload was maintained at a manageable level. It is for future research to examine whether the same holds true for more complex scenarios and interactions, particularly when they have not been encountered before.

Second, the degree of induced cognitive load had a negative main effect on the participants’ perceived SA when processing routine tasks using the presented workplace HMI. That means that in this context, SA degrades as cognitive load increases. It must be noted, however, that the absolute differences in SA scores between conditions were moderate, with all scores ranging at a medium to high level. Nevertheless, this finding bears significance as it gives rise to the expectation that in complex, scarcely trained, or entirely novel scenarios these effects may be more pronounced. Thus, higher cognitive load resulting from more challenging tasks is likely to impede SA even more. Indirectly, a diminished level of SA may affect performance negatively as the RO may make wrong assumptions about the current status of the traffic environment, which in turn might entail drawing wrong conclusions on how the situation will unfold. Future research may therefore shed light on the generation and maintenance of SA in more complex, novel scenarios. A systematic variation of scenario complexity may help elucidate how SA develops across varying levels of complexity.

Third, the main effect of the secondary task condition on perceived workload can be considered a successful “manipulation check”: inducing cognitive load indeed resulted in increased perceived workload. Thus, the n-back task in the 1-back and 2-back conditions proved to have an impact on how much workload participants experienced. However, similar to the SART scores, the differences between conditions were generally small, and workload means did not exceed the arithmetic center of the scale in any condition; the range of observed values was therefore narrow. Hence, a takeaway for future research may be to broaden the spectrum of the secondary task’s difficulty levels in order to investigate the effects of high induced cognitive load on primary task performance.

Fourth, the global evaluation of the workplace HMI produced favorable results. This is particularly true for usability, the variable that pertains to the direct interaction with the HMI. Across three questionnaires (SUS, UEQ-S, Acceptance Scale), this variable or related concepts consistently yielded above-average ratings. The hedonic-emotional quality of the interaction was assessed somewhat more modestly but still sufficiently, in an average-to-positive range. This finding aligns with the authors’ objective to prioritize a proof of concept that works for basic interactions before focusing on aspects related to user experience. Finally, the HMI’s acceptance ratings were positive. This result is meaningful because the sample was tech-savvy, receiving high scores on the Affinity for Technology Scale, and the HMI lived up to the potentially higher expectations posed by technologically experienced and invested participants. In addition, the sample fulfilled the educational requirements of the legally specified role of the Technical Supervisor. It is of utmost importance for the utilization of an HMI that the future group of users adopts it, and the results on the Acceptance Scale provide at least initial support for this. However, future evaluations of the workplace HMI need to examine critically whether groups of people beyond the Technical Supervisor may also be capable of remotely assisting HAVs. Including a wider group of participants may thus be a goal for future research.

To summarize, this paper presented the evaluation of a workplace HMI for the remote assistance of HAVs, using a secondary task to systematically modulate cognitive load. In accordance with the hypotheses, the results show that cognitive load prolonged task completion time, degraded perceived SA, and increased workload. This finding demonstrates the importance of considering cognitive load when designing an HMI for remote assistance. However, across all conditions, workload was low to medium, SA was medium to high, and the global evaluation of the workplace HMI regarding usability, user experience, and acceptance yielded favorable results as well. These results hint at the general suitability of the tested workplace HMI for fulfilling certain routine remote assistance tasks.

To conclude, when designing a workplace for the remote operator, bearing in mind the Human Factors involved and their interaction with the remote operation technology is essential to ensure the safety and feasibility of the system overall. Only by applying user-centered methods can workplaces for remote operation become a successful approach to bolster automated driving technologies and thus lay the groundwork for more sustainable mobility.