1 Introduction

As robotic research matures, there is increasing interest in using robotic agents in socially assistive applications, including senior care facilities [29], health centres [33], and classrooms [19]. For mobile robots to become integrated into these environments, they need to be competent at navigating in human-populated spaces. In addition to maneuvering safely around humans, robots operating in these shared environments need to be equipped with appropriate social skills and behave in a socially acceptable manner. Particularly in environments such as senior care facilities, where residents often lack adequate social stimulation [29], robots that can establish a social connection with surrounding humans through non-verbal cues offer an opportunity to improve the well-being of the people they interact with. Moreover, for humans to feel comfortable in a robot’s vicinity, it is important for the robot to act in a predictable manner so that it is perceived as a safe and trustworthy entity [30]. Humans typically navigate cooperatively when moving among other people [38]. A moving robot should similarly be capable of developing a mutual understanding with the humans in its vicinity, allowing both robot and human to accurately interpret each other’s actions and move predictably [22].

It is well known that humans use head and/or eye gaze (the direction in which one appears to be looking) in social situations to communicate intent and display mental states [11]. Extensive research on interpersonal communication suggests that more than half of the meaning in social situations is conveyed non-verbally [4]. It is therefore critical to integrate non-verbal cues such as eye gaze into existing navigation algorithms to improve the social competence of the robot.

To enable effective human-robot collaboration and teaming, the robot must be viewed by humans as a socially capable agent. In the Human-Robot Interaction (HRI) literature, this is referred to as the social presence of the robot, defined as the “sense of being together with another” [2]. With the goal of enabling social robots to operate more harmoniously and effectively around people, this paper investigates how different robot gaze behaviors affect the robot’s perceived social presence across navigation scenarios, through user studies in both real-world and simulated environments. We compare three types of gaze behaviors: a default behavior with no head movement, a gaze directed at the robot’s planned trajectory, and a gaze that briefly looks towards the user. The experiments consist of one navigation scenario in which the human and robot do not cross paths and two scenarios in which they do. Our results show that the preferred robot gaze behavior depends on the navigation scenario; in particular, gazes that glance at the human are generally preferred in scenarios in which the human and robot cross paths.

2 Background

In this section, we briefly review the existing literature on mobile robots and social cues in HRI, followed by a deeper look at gaze behaviors employed by autonomous mobile robots.

2.1 Mobile Robots in HRI

As opposed to robots with a fixed base, mobile robots have access to a global workspace and are therefore advantageous for tasks that require traversing between multiple areas. Fetch-and-carry tasks are a common application of mobile robots, typically contextualized as an assistive robot in household [7, 35] or warehouse environments [3]. In this context, both social navigation in human-populated spaces [8, 34] and direct social interactions with humans in the space, e.g., conversations [31] and handover/delivery actions [16, 26], are important fields of research. Trajectory planning in the presence of humans considers aspects such as safety and visibility to the human, while also abiding by social conventions and human preferences. Proxemics [27, 39] considers the distance a robot should maintain from a human, which is important for maximizing the robot’s perceived safety. In situations where the robot and human must give way to each other, effective communication and reactive planning are beneficial for cooperatively avoiding collisions [5].

2.2 Social Cues in HRI

Social cues are used by robots to communicate various types of information to nearby humans. Cid et al. [6] use a robotic head with mechanisms controlling neck, eye, eyebrow, and mouth motions to mimic human facial expressions. Similarly, Zecca et al. [43] combine facial expressions with full-body motions to recreate body language. However, these approaches require complex mechanisms to recreate recognizable expressions. Semantic-free audio cues have also attracted some interest in HRI but remain relatively unexplored [42]. Tatarian et al. [36] investigated the simultaneous use of multiple social cues, including proxemics, gaze, gestures, and dialogue, and found that each social cue had a distinct effect on how the robot was perceived.

Gaze cues are a particularly important type of social cue [1]. Gaze has been investigated in a variety of HRI contexts for communicating different types of information. Moon et al. [28] use gaze cues during robot-to-human handovers to communicate handover location and timing. Terziouglu et al. [37] utilize a variety of gaze cues, including gazes towards the task and target, as well as gazes towards the human collaborator to acknowledge completion of a task. In navigation scenarios, gaze can be used to convey navigational intent [15] or to establish social presence with humans [20, 39]. The latter is the primary purpose for which we employ gaze behaviors in the present study, while acknowledging that other interpretations of the gaze may distract from the robot’s true intention.

2.3 Simulation in HRI

Due to the recent COVID-19 pandemic, there has been renewed interest in simulated HRI user studies as a cheaper and easier alternative to real-world experiments that also permits remote participation. Although there is evidence that the results of simulated and real-world HRI experiments coincide, it is not conclusive. Video-based and real-world user studies were conducted to examine preferences in robot approach directions for a fetch-and-deliver task [40, 41] and robot politeness [23]. These studies concluded that there was a high level of agreement between the results of the two study modes, although users expressed a preference for participating in the real-world studies, and effects tended to be weaker in the video-based studies. Similarly, [13] compared real-world studies to studies in which the user interacted with the robot remotely; again, no significant differences were found between the two modes, although participants in the remote studies experienced a higher cognitive workload. On the other hand, [24] compared real-world and virtual reality studies of proxemics in HRI and found significant differences between the two sets of results.

2.4 Gaze in Navigational Scenarios

Existing studies have investigated different aspects of robot eye gaze during navigation, including different types of gaze behaviors and navigation scenarios. However, conflicting results have been reported regarding the effect of gaze behavior on the robot’s perceived social presence [20, 39]. The two studies differed both in the initiation timing of the gaze behavior and in the navigation scenario explored. In the study conducted by Wiltshire et al. [39], a robot and a person engage in a series of hallway interactions in which the robot crosses the person’s path perpendicularly and the person must give way. Across interactions, non-verbal cues of the robot, such as proxemics and gaze behavior, were altered. Their user study concluded that altering the robot’s gaze behavior did not increase its social presence, and that other non-verbal cues, such as the robot’s proxemic behavior, matter more. In contrast, the study conducted by Khambhaita et al. [20], which used very similar eye gaze behaviors, concluded that altering the gaze behavior did result in higher social presence. The gaze initiation timings in the two studies were not the same, which may have contributed to the difference in results, and neither were the navigation scenarios. It therefore remains unclear, and worthwhile to explore, whether the appropriate gaze behavior, as well as the timing of its execution, depends on the navigation scenario.

Another aspect of gaze during navigation is gaze fixation duration. Studies of adult gaze behavior during natural locomotion have found that people initiate gaze fixation 1.72 s before encountering an obstacle [18], with the fixation duration depending on the obstacle: obstacles designed to be salient (marked with stickers) were fixated for 0.53 s, while plain obstacles were fixated for 0.2 s. The appropriate fixation duration for robots during navigation, however, is still to be determined and is worth investigating.

3 Research Questions

As identified in the previous section, existing works on robot gaze during navigation have produced conflicting results. In particular, whether and how navigation scenarios, together with robot gaze behavior, affect the perceived social presence of a robot has remained largely unexplored. Hence, our current work aims to address the following research questions:

  1. How do different robot gaze behaviors during navigation affect people’s perception of the robot as a socially present entity?

  2. Does the appropriate gaze behavior vary depending on the navigation scenario?

To answer these research questions, we conducted simulated and real-world user studies to investigate people’s perceived social presence of a robot when different gaze behaviors are used in different navigation scenarios.

4 User Study Design

In this section, we detail the user study design used to answer our research questions. We conducted user studies measuring the perceived social presence of a robot as users interact with it across various navigation scenarios and robot gaze behaviors. We initially conducted a large simulated user study in which users were shown videos of a human–robot interaction in a navigation scenario from a first-person perspective. This was followed by a smaller in-person user study involving the same gaze behaviors and scenarios.

4.1 Independent Variables

To answer our research questions, our study consists of a two-factor design, examining four gaze behaviors and three navigation scenarios, for a total of twelve conditions. These variables are detailed in the following sections.

4.1.1 Gaze Behaviors

We tested four gaze behaviors in our study, visualized in Fig. 1:

Fig. 1: Different gaze behaviors employed in the user study. Two different gaze periods were tested for the human-oriented gaze (c)

  • Default (D): The robot always looks forwards, and does not move its head. This gaze serves as the control for the user study.

  • Path Oriented (PO): The robot looks at a point on the ground 1.5 m ahead along its planned path.

  • Human-Oriented Short (HO-S) and Long (HO-L): The robot “acknowledges” the human by briefly directing its gaze at the human when passing them. Two gaze fixation timings are tested, based on the timings used by humans during locomotion [18]: 0.2 s for HO-S and 0.53 s for HO-L. The robot uses the default gaze before and after it looks at the human (a minimal sketch of these behaviors follows the list).
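To make the behavior definitions concrete, the following minimal Python sketch selects a gaze target for each behavior. The function and variable names (gaze_target, t_pass) are illustrative rather than taken from our implementation, and the default gaze is approximated as a point ahead of the robot along its heading (assumed +x here).

```python
import math

# Parameters taken from Sect. 4.1.1
HO_SHORT_S = 0.2    # HO-S fixation duration
HO_LONG_S = 0.53    # HO-L fixation duration
LOOKAHEAD_M = 1.5   # PO look-ahead distance along the planned path

def gaze_target(behavior, t, t_pass, robot_xy, path_xy, human_xy):
    """Return the (x, y) point the robot head should face at time t.

    behavior: one of "D", "PO", "HO-S", "HO-L"
    t_pass:   time at which the human-oriented glance is triggered
    path_xy:  list of (x, y) waypoints of the planned path
    """
    if behavior in ("HO-S", "HO-L"):
        period = HO_SHORT_S if behavior == "HO-S" else HO_LONG_S
        # Glance at the human only during the fixation window; fall back
        # to the default gaze before and after.
        if t_pass <= t <= t_pass + period:
            return human_xy
        behavior = "D"
    if behavior == "PO":
        # Walk along the path until the accumulated distance reaches
        # the look-ahead, and return that waypoint.
        dist, prev = 0.0, robot_xy
        for wp in path_xy:
            dist += math.hypot(wp[0] - prev[0], wp[1] - prev[1])
            if dist >= LOOKAHEAD_M:
                return wp
            prev = wp
        return path_xy[-1] if path_xy else robot_xy
    # Default (D): a fixed point straight ahead of the robot.
    return (robot_xy[0] + LOOKAHEAD_M, robot_xy[1])
```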

4.1.2 Navigational Scenarios

Our study incorporates three different navigation scenarios, illustrated in Fig. 2. Hallway scenarios were chosen as they are typical of many buildings and are the most common setting in the existing literature on gaze behaviors during navigation [12, 20, 25, 39]. Furthermore, these scenarios permit direct comparisons with existing work.

Fig. 2: Illustrative layout of the navigation scenarios considered for the user study

  • Two way (TW): The person and robot navigate down a hallway starting from opposite ends. An obstacle is placed in the path of the robot to accentuate the difference between the default and path-oriented gaze behaviors.

  • Robot exits hallway (EXT): The robot moves down the hallway, then enters a room on the human’s left. The robot briefly pauses before entering, as if to give way to the human, but always decides to enter the room first because the human is too far away to yield to.

  • Robot enters hallway (ENT): The robot begins inside a room to the human’s left, where it is initially not visible to the human. The robot exits the room before turning right and moving down the hallway. The person is forced to give way to the robot so that it can make its turn.

4.2 Dependent Variables

Table 1: Statements which participants responded to in both the simulated and real-world user studies

Like [20, 39], to measure the social presence of the robot we measure along the dimensions outlined in the Social Presence Inventory (SPI) developed by Harms and Biocca [14]. Specifically, we use the following dimensions:

  • Co-Presence (CP): The SPI defines co-presence as “the degree to which the observer believes he/she is not alone and secluded, their level of peripheral or focal awareness of the other, and their sense of the degree to which the other is peripherally or focally aware of them”. Co-presence fundamentally measures subjective perception rather than the objective condition of being observable, which makes it well suited to self-report measures such as a Likert scale indicating the agents’ level of awareness of each other. The sensory awareness of agents in a given interaction can also be measured through observation, e.g., of eye fixations, proxemic behavior, or physiological responses. The primary challenge with such methods is the difficulty of data collection, especially in a remote user study.

  • Perceived Message Understanding (PMU): The SPI defines PMU as “the ability of the user to understand the message being received from the interactant as well as their perception of the interactant’s level of message understanding”. These measures can be used to evaluate how the legibility of the robot changes when gaze behavior is altered.

  • Perceived Behavioral Interdependence (PBI): The SPI defines PBI as the “extent to which a user’s behavior affects and is affected by the interactant’s behavior”. In a given interaction, the key sense of access to the other is based on the degree to which the agents appear to interact with each other. This can include explicit behaviors, such as waving to acknowledge each other’s presence, or non-explicit behaviors, such as eye contact.

In addition to the social presence metrics, we also measure the following factors:

  • Perceived Safety: How safe the person feels in the presence of the robot.

  • Naturalness: How similarly the robot behaves compared to a typical human.

All dependent variables are quantitative measures reflecting the subjective assessment of the human. They are acquired through post-experiment survey questions, summarized in Table 1, and measured on a Likert scale. A 3-point Likert scale was used for the online study, whereas a 5-point Likert scale was used for the real-world study. The different scales were chosen because we believed anonymous online participants would be less attentive than in-person participants and would therefore benefit from a smaller Likert scale, which demands a lower mental load [32].

Note that for the questions measuring social presence, like [20], we use only a subset of the questions from the original study: three questions for co-presence and one question each for PMU and PBI. Additionally, like [20, 39], we adapted the questions to refer to the “robot” rather than “my partner” to better suit the HRI context.

4.3 Hypotheses

We expect human-like gazes, such as human-oriented gazes that establish eye contact to acknowledge the human’s presence, to feel more socially conformant and socially present (Hypothesis 1). As a result, we also expect human-oriented gazes to feel safer and more natural (Hypothesis 2). We therefore consider the following hypotheses:

  • H1: Human-oriented gazes will have higher perceived social presence compared to other gazes in all scenarios.

  • H2: Human-oriented gazes will improve the perceived safety and naturalness of the robot in all scenarios.

4.4 Implementation

Fig. 3: Real-world (left) and simulated (right) Fetch Mobile Manipulator

The Fetch Mobile Manipulator robot, shown in Fig. 3, was chosen as the research platform. The robot has a head module capable of both pan and tilt, as well as a nonholonomic mobile base that can maneuver around a global workspace. We implemented the navigation and eye gaze behaviors described in Sect. 4.1 for both the simulated and real-world robots.

4.4.1 Simulation Environment

For the simulated study, the Gazebo simulator (Footnote 1) was used to generate videos of the Fetch robot performing each gaze behavior in each navigation scenario. The simulated camera adopted the first-person perspective of the human and followed a scripted trajectory for each scenario.

4.4.2 Real-World Implementation

Our real-world implementation is summarized in the system diagram shown in Fig. 4. We implemented our system using the Robot Operating System (ROS) (Footnote 2) framework. The Head Control Node commands the robot head motion to exhibit the different gaze behaviors. We use the Leg Detector ROS package (Footnote 3) to detect and track people around the robot using the robot’s laser scanner, and the move_base package from the ROS Navigation stack to plan the robot base motion. The Head Control Node receives the position of the detected person from the Leg Detector and the planned path of the robot from the Move Base node. The gaze behavior is executed by sending motion commands to the Point-Head Action Client, which controls the pan-tilt motions of the robot’s head.
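A condensed sketch of how such a head control node could tie these packages together is given below. This is illustrative rather than a verbatim excerpt of our code: the topic names (/move_base/GlobalPlanner/plan, /people_tracker_measurements) and the action server name depend on the robot and navigation configuration, and the glance-triggering logic is omitted.

```python
#!/usr/bin/env python
# Minimal head control node sketch (ROS 1 / rospy).
import rospy
import actionlib
from control_msgs.msg import PointHeadAction, PointHeadGoal
from geometry_msgs.msg import PointStamped
from nav_msgs.msg import Path
from people_msgs.msg import PositionMeasurementArray

class HeadControlNode(object):
    def __init__(self):
        # Fetch exposes its pan-tilt head through a point_head action server.
        self.client = actionlib.SimpleActionClient(
            "head_controller/point_head", PointHeadAction)
        self.client.wait_for_server()
        self.plan = None
        self.person = None
        # Planned base path from move_base; person detections from the
        # leg detector (topic names depend on configuration).
        rospy.Subscriber("/move_base/GlobalPlanner/plan", Path, self.plan_cb)
        rospy.Subscriber("/people_tracker_measurements",
                         PositionMeasurementArray, self.people_cb)

    def plan_cb(self, msg):
        self.plan = msg

    def people_cb(self, msg):
        if msg.people:
            self.person = msg.people[0].pos  # geometry_msgs/Point

    def look_at(self, x, y, z, frame="map", duration=0.3):
        # Command the head to face a point; min_duration bounds how
        # quickly the pan-tilt motion may complete.
        goal = PointHeadGoal()
        goal.target = PointStamped()
        goal.target.header.frame_id = frame
        goal.target.point.x, goal.target.point.y, goal.target.point.z = x, y, z
        goal.min_duration = rospy.Duration(duration)
        self.client.send_goal(goal)

if __name__ == "__main__":
    rospy.init_node("head_control_node")
    node = HeadControlNode()
    rospy.spin()
```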

To avoid collisions with obstacles or people, a safety feature was implemented whereby the robot stops moving if the laser scanner detects an obstacle within a fixed distance (0.6 m in our experiments). This feature was never triggered during the user studies.
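The stop check itself reduces to a threshold test over the laser scan, as in the following sketch (assuming the scan arrives via a standard sensor_msgs/LaserScan subscriber):

```python
from sensor_msgs.msg import LaserScan

STOP_DISTANCE_M = 0.6  # stop threshold used in our experiments

def should_stop(scan):
    """Return True if any valid laser return falls inside the stop radius.

    scan: a sensor_msgs/LaserScan message. Readings outside
    [range_min, range_max] are sensor artifacts and are ignored.
    """
    return any(scan.range_min <= r <= scan.range_max and r < STOP_DISTANCE_M
               for r in scan.ranges)
```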

Fig. 4: Head Control Node interacting with the existing ROS packages

4.5 User Study Procedure

Fig. 5: Snapshot of the simulation video used for the TW scenario

4.5.1 Simulated Video Study

Twelve videos (Footnote 4), one per condition, were generated using the Gazebo simulator. Each participant was shown the videos in a randomized order and instructed to view them in full screen. After watching each video, participants were given a survey asking them to rate a series of statements (shown in Table 1) on a 3-point Likert scale. We distributed the study through Amazon Mechanical Turk and compensated each participant who successfully completed it with $2 USD. To ensure data quality, we included a control question asking the participant to identify which direction the robot went (left/right) at the end of the video. Responses from participants who failed the control question, or which were incomplete, were rejected; 15 responses were rejected on these criteria. In total, we accepted responses from 233 participants (Male = 142, Female = 91).
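The acceptance criteria amount to a simple filter over the raw survey data; a minimal sketch follows, with hypothetical file and column names ("control_answer", "true_direction"):

```python
import pandas as pd

# One row per participant; file and column names are hypothetical.
responses = pd.read_csv("mturk_responses.csv")

# Drop incomplete submissions, then keep only participants who correctly
# reported the robot's final turn direction in the control question.
complete = responses.dropna()
accepted = complete[complete["control_answer"] == complete["true_direction"]]
print("accepted %d of %d responses" % (len(accepted), len(responses)))
```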

4.5.2 Real-World Studies

At the beginning of each experiment, the participant is shown simulated videos (the same as those used in the simulated studies) of the D gaze behavior for each of the three navigation scenarios, so that they understand how to behave in each scenario. Participants then experienced each of the 12 gaze behavior and navigation scenario combinations. These were run in a pseudo-randomized order using a Balanced Latin Square (Footnote 5) to minimize effects caused by the specific ordering of trials. Participants were informed of which navigation scenario they would face next, but not which gaze behavior the robot would exhibit. After each trial, participants responded to the same set of statements as in the simulated study (shown in Table 1) using a 5-point Likert scale. Trials in which the participant walked too quickly or the human detection algorithm failed were repeated. At the end of the experiment, participants were asked if they had any additional comments. Overall, we recruited 36 participants (Male = 27, Female = 9; 35 between ages 18–25, 1 aged 31–40) on University premises for the real-world user studies (Footnote 6). Participants did not receive any compensation for their participation.
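For reference, the classic balanced Latin square construction for an even number of conditions takes only a few lines of code; the sketch below is one standard way to generate the per-participant trial orders (condition indices 0–11 encode our 4 gaze x 3 scenario combinations).

```python
def balanced_latin_square(n):
    """Rows of condition indices forming a balanced Latin square (even n).

    Row 0 follows the 0, n-1, 1, n-2, ... zigzag; each later row shifts
    every entry by +1 mod n. For even n, each condition appears once per
    position and each ordered pair of adjacent conditions appears exactly
    once, balancing first-order carry-over effects.
    """
    assert n % 2 == 0, "the classic construction requires an even n"
    first, lo, hi = [], 0, n - 1
    for j in range(n):
        if j % 2 == 0:
            first.append(lo)
            lo += 1
        else:
            first.append(hi)
            hi -= 1
    return [[(c + r) % n for c in first] for r in range(n)]

# Participant r runs the 12 conditions in the order given by row r % 12.
orders = balanced_latin_square(12)
```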

Fig. 6: Results obtained for Co-Presence (CP) for simulated (top) and real-world (bottom) user studies, grouped by gaze behavior (left) and navigation scenario (right). Significant paired differences are indicated with (\(*\)) for \(p<0.05\) and (\(**\)) for \(p<0.001\)

Fig. 7: Results obtained for Perceived Message Understanding (PMU) for simulated (top) and real-world (bottom) user studies, grouped by gaze behavior (left) and navigation scenario (right). Significant paired differences are indicated with (\(*\)) for \(p<0.05\) and (\(**\)) for \(p<0.001\)

Fig. 8: Results obtained for Perceived Behavioral Interdependence (PBI) for simulated (top) and real-world (bottom) user studies, grouped by gaze behavior (left) and navigation scenario (right). Significant paired differences are indicated with (\(*\)) for \(p<0.05\) and (\(**\)) for \(p<0.001\)

Fig. 9: Results obtained for Safety for simulated (top) and real-world (bottom) user studies, grouped by gaze behavior (left) and navigation scenario (right). Significant paired differences are indicated with (\(*\)) for \(p<0.05\) and (\(**\)) for \(p<0.001\)

Fig. 10: Results obtained for Naturalness for simulated (top) and real-world (bottom) user studies, grouped by gaze behavior (left) and navigation scenario (right). Significant paired differences are indicated with (\(*\)) for \(p<0.05\) and (\(**\)) for \(p<0.001\)

5 Results

Figures 6, 7, 8, 9, and 10 show the results of the survey questions for each outcome variable. Two-way repeated measures ANOVA tests were first conducted between gaze behaviors and navigation scenarios to determine whether an interaction effect exists between the two variables. For measures with a statistically significant interaction effect, post-hoc analyses were performed to compare gazes (scenarios) within each scenario (gaze) independently. If the interaction effect was not statistically significant but the main effect of gaze (scenario) was, the data were collapsed along the scenario (gaze) variable before performing post-hoc analyses. Post-hoc analyses were performed using paired sample t-tests with Bonferroni corrections. Parametric tests were used as they have been shown to provide results similar to non-parametric tests for Likert items [10]. Post-hoc analysis results are summarized in Figs. 6, 7, 8, 9, and 10; full post-hoc results can be found in the appendices.
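In Python terms, the pipeline corresponds roughly to the sketch below (using statsmodels and scipy; the file and column names are hypothetical, and AnovaRM assumes balanced data with one observation per participant and cell):

```python
import pandas as pd
from itertools import combinations
from scipy.stats import ttest_rel
from statsmodels.stats.anova import AnovaRM

# Long-format data: one row per participant x gaze x scenario, with a
# "score" column holding the Likert response for one metric.
df = pd.read_csv("copresence_long.csv")

# Two-way repeated-measures ANOVA over the two within-subject factors.
print(AnovaRM(df, depvar="score", subject="participant",
              within=["gaze", "scenario"]).fit())

# Post-hoc paired t-tests between gazes within one scenario,
# Bonferroni-corrected for the number of pairwise comparisons.
sub = df[df["scenario"] == "ENT"]
pairs = list(combinations(sub["gaze"].unique(), 2))
for g1, g2 in pairs:
    a = sub[sub["gaze"] == g1].sort_values("participant")["score"]
    b = sub[sub["gaze"] == g2].sort_values("participant")["score"]
    t, p = ttest_rel(a, b)
    print(g1, g2, t, min(1.0, p * len(pairs)))  # Bonferroni correction
```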

5.1 Co-Presence

5.1.1 Simulated Studies

There was a significant interaction effect between gaze behavior and navigation scenario for CP (\(F(6,1404)=3.391\), \(p=2.53\times 10^{-3}\)); therefore, post hoc paired sample t-tests were performed for each gaze-scenario pair.

Comparing gaze behaviors within each navigation scenario (Fig. 6a), we see that, in general, one or both of the human-oriented gaze behaviors (HO-S and HO-L) yield significantly higher CP measures. HO-L was most favoured in the TW scenario, HO-S in the EXT scenario, and both equally in the ENT scenario. This supports Hypothesis 1.

Comparing navigation scenarios given each gaze behavior (Fig. 6b), we observe a common trend across all gaze behaviors: co-presence is highest in the ENT scenario, followed by EXT, then TW, although the difference between the TW and EXT scenarios is not always significant—specifically for the PO and HO-L gazes.

5.1.2 Real-World Studies

Significant interaction effects were found between gaze behavior and navigation scenario (\(F(6,210)=3.599\), \(p=2.04\times 10^{-3}\)); therefore, post hoc paired sample t-tests were performed for each gaze-scenario pair.

Comparing gaze behaviors within each navigation scenario (Fig. 6c), a common observation across all navigation scenarios is that the HO-S and HO-L gaze behaviors both have significantly better CP than the D and PO gaze behaviors. No significant difference was found between HO-S and HO-L, nor between the D and PO gazes except in the ENT scenario. This is similar to, but more consistent than, the simulated results, providing further support for Hypothesis 1.

Comparing navigation scenarios given each gaze behavior (Fig. 6d), user responses to each gaze behavior appear to be influenced differently by the navigation scenarios. Notably, while the ENT scenario has the highest CP when performing the PO or HO-S gazes, it is the scenario with the lowest CP for the D gaze.

5.2 Perceived Message Understanding

5.2.1 Simulated Studies

There was a significant interaction between gaze behavior and navigation scenario for the PMU metric (\(F(6,1404)=4.771\), \(p=8.00\times 10^{-5}\)); therefore, post hoc paired sample t-tests were performed for each gaze-scenario pair.

Comparing gaze behaviors within each navigation scenario (Fig. 7a), we observe that PMU is significantly higher for the D gaze than for the HO-S gaze in the TW scenario, in opposition to Hypothesis 1. However, for the EXT and ENT scenarios, the HO-S gaze is rated to have the best PMU, in support of Hypothesis 1.

Comparing navigation scenarios given each gaze behavior (Fig. 7b), a couple of observations can be made. For the D gaze, the TW scenario yielded a significantly better PMU compared to the EXT scenario. For the HO-S gaze however, the TW scenario yielded significantly worse PMU compared to the other two scenarios. For HO-L gazes, the ENT scenario yielded significantly better PMU compared to the EXT scenario.

5.2.2 Real-World Studies

Significant interaction effects were found between gaze behavior and navigation scenario (\(F(6,210)=5.370\), \(p=3.50\times 10^{-5}\)); therefore, post hoc paired sample t-tests were performed for each gaze-scenario pair.

Comparing gaze behaviors within each navigation scenario (Fig. 7c), for the EXT scenario, PO is rated to have the best PMU and is significantly better than the D gaze. For the ENT scenario, the D gaze also has the worst PMU; however, as in the simulated studies, the human-oriented gaze behaviors (HO-S and HO-L) are perceived to have the best PMU. Hypothesis 1 is therefore only supported for the ENT scenario.

Comparing navigation scenarios given each gaze behavior (Fig. 7d), we see that for the D gaze, the TW scenario had the best PMU compared to the other two scenarios, similar to the simulated results. For the PO gaze, the robot had the best PMU for the EXT scenario compared to the other two scenarios. For the HO-S gaze, the ENT scenario was found to have significantly better PMU compared to the EXT scenario.

5.3 Perceived Behavioral Interdependence

5.3.1 Simulated Studies

There was a significant interaction between gaze behavior and navigation scenario for the PBI metric (\(F(6,1404)=3.354\), \(p=2.76\times 10^{-3}\)); therefore, post hoc paired sample t-tests were performed for each gaze-scenario pair.

Comparing gaze behaviors within each navigation scenario (Fig. 8a), we found that PBI with human-oriented gaze behaviors (HO-S and HO-L) was significantly higher than with the D and PO gazes for the EXT and ENT scenarios. However, there is no statistically significant difference between any gaze behaviors for the TW scenario. This supports Hypothesis 1, which predicts improved social presence when human-oriented gaze behaviors are used, but only for the EXT and ENT scenarios.

Comparing navigation scenarios given each gaze behavior (Fig. 8b), there is a general trend where PBI is highest in the ENT scenario, followed by the EXT scenario, then the TW scenario, when path- or human-oriented gazes are used (PO, HO-S, HO-L). However, this trend is not observed with the D gaze behavior.

5.3.2 Real-World Studies

There was no significant interaction effect between gaze behavior and navigation scenario (\(F(6,210)=1.679\), \(p=0.13\)). There were significant main effects of gaze behavior (\(F(3,105)=29.041\), \(p=9.41\times 10^{-14}\)), but not of navigation scenario (\(F(2,70)=0.616\), \(p=0.54\)). Therefore, post hoc paired sample t-tests were performed to investigate the impact of gaze behaviors after collapsing the results along the scenario variable.

Comparing between gaze behaviors (Fig. 8c), a similar observation to the simulated study results is made: PBI is significantly higher for the human-oriented gaze behaviors (HO-S and HO-L) than for the D and PO gazes. However, unlike the simulated study results, this holds for all navigation scenarios, fully supporting Hypothesis 1.

5.4 Perceived Safety

5.4.1 Simulated Studies

There was a significant interaction between gaze behavior and navigation scenario for the Safety metric (\(F(6,1404)=9.979\), \(p=8.21\times 10^{-11}\)); therefore, post hoc paired sample t-tests were performed for each gaze-scenario pair.

Comparing gaze behaviors within each navigation scenario (Fig. 9a), we observe mixed results. For the TW scenario, the D gaze is found to be significantly safer than the HO-L gaze. For the EXT scenario, both the D and HO-L gazes are found to be significantly safer than the PO gaze. In contrast, for the ENT scenario, the D gaze is significantly less safe than all other gazes. Overall, Hypothesis 2 is only partially supported, for the EXT and ENT scenarios.

Comparing navigation scenarios given each gaze behavior (Fig. 9b), mixed observations are also made. For the D gaze, the TW scenario is perceived as the safest. For all other gazes, the ENT scenario is perceived as the safest. The EXT scenario is perceived as the least safe for the PO and HO-S gazes, while TW is the least safe for the HO-L gaze.

5.4.2 Real-World Studies

Unlike the simulated studies, no significant interaction effects were found between gaze behavior and navigation scenario (\(F(6,210)=1.842\), \(p=0.09\)). There were significant main effects of gaze behavior (\(F(3,105)=13.043\), \(p=2.64\times 10^{-7}\)), but not of navigation scenario (\(F(2,70)=1.812\), \(p=0.17\)). Therefore, post hoc paired sample t-tests were performed to investigate the impact of gaze behaviors after collapsing the results along the scenario variable.

Comparing between gaze behaviors (Fig. 9c), both human-oriented gaze behaviors (HO-S and HO-L) are perceived to be significantly safer than the D and PO gazes, fully supporting Hypothesis 2.

5.5 Naturalness

5.5.1 Simulated Studies

There was a significant interaction between gaze behavior and navigation scenario for the Naturalness metric (\(F(6,1404)=5.481\), \(p=1.30\times 10^{-5}\)); therefore, post hoc paired sample t-tests were performed for each gaze-scenario pair.

Comparing gaze behaviors within each navigation scenario (Fig. 10a), for the TW scenario, the D gaze is perceived as significantly more natural than the HO-L gaze, in contradiction to Hypothesis 2. In contrast, Hypothesis 2 is supported for the other two scenarios, where the D gaze is perceived as the least natural gaze and the HO-S gaze as the most natural.

Comparing navigation scenarios given each gaze behavior (Fig. 10b), both the D and PO gazes are perceived as most natural in the TW scenario. No significant difference is found between navigation scenarios for the two human-oriented gaze behaviors (HO-S and HO-L).

5.5.2 Real-World Studies

Significant interaction effects were found between gaze behavior and navigation scenario (\(F(6,210)=2.687\), \(p=1.56\times 10^{-2}\)); therefore, post hoc paired sample t-tests were performed for each gaze-scenario pair.

Comparing gaze behaviors within each navigation scenario (Fig. 10c), both human-oriented gaze behaviors (HO-S and HO-L) are perceived to be significantly more natural than the D and PO gazes for the ENT scenario, supporting Hypothesis 2. No significant differences are found in the other scenarios.

Comparing navigation scenarios given each gaze behavior (Fig. 10d), no significant differences are found for any gaze.

6 Discussion

In the following section, we discuss the results of the simulated and real-world studies separately before suggesting how they may guide future implementations of robot gaze in navigation scenarios. Although certain big-picture trends in user preferences were similar between the simulated and real-world studies, the two sets of results did not consistently agree. In particular, many of the statistically significant effects arising from gaze behaviors or navigation scenarios were different between the two study modes, which suggests that the study mode affects user preferences. However, this difference may also be due to the different Likert scales used.

6.1 Simulated Studies

The results for the SPI metrics are generally in support of Hypothesis 1. The only results to the contrary are those for the PMU metric, which suggest that the robot’s intentions are less clear when it performs a short human-oriented gaze cue in the TW scenario. This may be due to the scenario itself being simple. As suggested by [9, 17, 18, 21], observing potential hazards during navigation is a principal factor in human gaze patterns, and can be more important than eye contact with other humans. In scenarios where the human is moving parallel to the robot and is not a hazard, a gaze that instead scans the environment for other potential hazards may therefore be easier for users to understand. However, scenarios where a collision between the two parties is more likely will require some communication with the human to indicate what the robot will do. Another interesting observation is that the TW scenario has the lowest PBI for the path-oriented and human-oriented gazes. This is likely because the robot does not need to react to the human in this scenario, whereas in the other scenarios one party always gives way to the other and is therefore visibly reacting to the other’s presence.

For Hypothesis 2, we have mixed results: the human-oriented gazes are perceived as safer and more natural than the other gaze behaviors only in situations where the human and robot cross paths (EXT and ENT). Similar to the PMU results, this may again be because, in the TW scenario, it is more appropriate for the robot to be observing the environment for potential hazards. This suggests that gaze cues towards humans should be used to make the human feel acknowledged and safe when an interaction between the robot and human is required, but may otherwise be distracting or off-putting.

6.2 Real-World Studies

For the SPI metrics, the real-world study shares a similar but somewhat stronger trend with the simulated study. Hypothesis 1 is fully supported for the CP and PBI metrics. CP was also the metric participants commented on most, with 10 participants sharing sentiments similar to “The eye contact was essential to me, knowing the robot acknowledged that I was there” (Participant 5).

Regarding PMU, the results suggest that for the EXT scenario, the PO gaze conveys the robot’s intentions most clearly, while for the ENT scenario the human-oriented gazes were the clearest. As in the simulated studies, the first observation reflects how gaze patterns in humans are typically task-based [9, 17, 18, 21], which makes the intention of path-oriented gazes easy to understand. For the second observation, in the ENT scenario the human happens to be in the direction the robot wants to travel, so the human-oriented gazes benefit from the dual effect of both acknowledging the human and looking at the planned path.

Some participants commented that they would have preferred a combination of the path-oriented and human-oriented gazes, where the robot looks at where it will travel, quickly gazes at the human, and then returns to looking at its path. This would make the human-oriented gazes more predictable across all navigation scenarios. However, not every participant understood the intended meaning of the path-oriented gaze, attributing it to the robot being “rude” (Participants 16 and 22) or “depressed” (Participant 5), or being “[not] sure what the robot was doing” (Participant 35). A path-oriented gaze that does not look at the ground may therefore help convey the robot’s intentions more clearly.

The real-world results support Hypothesis 2. In all three navigation scenarios, the human-oriented gazes are perceived as safer than the non-human-oriented gazes, with a marginal preference for shorter gazes. In contrast to the simulated results, which suggest short human-oriented gazes feel more natural, both short and long human-oriented gazes felt equally more natural than the other gazes, but only in the ENT scenario. We received mixed opinions about which human-oriented gaze participants preferred. Some shared sentiments similar to “I preferred it when the robot looked at me for a shorter period. The longer period felt like it was glaring at me” (Participant 16), while others commented that “The short period felt like a glitch. The longer gaze felt better” (Participant 23). This suggests that an optimal gaze behavior should take user preference into account.

6.3 Design Implications

Overall, both hypotheses are supported to an extent, though whether they are supported can depend on the navigation scenario. The simulated studies suggest that in simple navigation scenarios such as TW, where the robot and human do not interact significantly, human-oriented gazes may actually be detrimental to the PMU, perceived safety, and naturalness of the robot compared to a simple default gaze, despite exhibiting stronger co-presence. In the other scenarios, the human-oriented gazes tend to have a positive effect on all metrics. This suggests that the use of human-oriented gazes should depend on the scenario: they should be used when an important interaction, such as giving way, will occur between the human and the robot.

The real-world experiments suggest that human-oriented gazes are equal or superior in the majority of metrics for all scenarios; only the PMU metric deviated from this pattern, in the EXT scenario. To improve PMU for these gazes, particularly when the human is not in the direction of travel, a PO behavior could be adopted before and after the robot gazes at the human. There were generally no significant differences between the two gaze lengths, although we believe this is due to differences in preferences between participants rather than the gazes themselves being negligibly different. Future work should examine whether user preferences in gaze period can be determined.

The real-world results also generally show much more muted differences between navigation scenarios than the simulated results. This may be due to the difference in sample sizes or an artifact of the differently sized Likert scales used in the simulated and real-world surveys.

The ENT scenario tended to draw out the most significant differences between gazes, particularly in the real-world results. We contrast this observation with Wiltshire et al. [39], who found that gaze had no statistically significant effect on social presence. A possible reason they offer is that, in the navigation scenario they studied, gaze had an effect at the automatic level of state attribution, whereas the SPI measures controlled and reflective processes. We speculate that the ENT scenario was the most stimulating experience for users, as the robot is initially unseen before emerging from the room, and was therefore more likely to trigger the latter type of mental process, making the effects of gaze on social presence more pronounced in this scenario. The ENT scenario is also the situation that most necessitates a cue towards the human. This suggests that robot cues to the human are more important in scenarios where a physical conflict is more likely.

7 Limitations and Conclusion

In this paper, we investigated the effects of varying the gaze behavior of a mobile robot across common hallway navigation scenarios. Our work addresses conflicting results from previous studies and establishes that the beneficial use of robot gaze is scenario-dependent. These results can guide future designs of non-verbal cues for robots in HRI environments, helping robots to be perceived as more socially present, safe, and natural, and thereby improving the well-being of the humans they interact with.

To answer our research questions, we performed simulated and real-world user studies across three navigation scenarios. Our results vary based on the scenario analyzed, showing that people’s perception of robot gaze cues is situation-dependent. Overall, we find that scenarios with more potential conflict benefit the most from human-oriented robot gaze cues, typically scoring higher in the Social Presence Inventory metrics, safety, and naturalness. However, using these gaze cues in simple scenarios requiring no interaction between the parties may yield no benefit, or, as the simulated results suggest, may even be detrimental to how socially present people perceive the robot to be.

While we have addressed multiple common hallway scenarios, which was a limitation of past studies, further investigation is required to understand how the scenario should shape the gaze behaviors the robot employs. Our results may not generalize to the wide variety of dynamic navigation scenarios likely to occur in the real world; for example, scenarios with two or more robots/people introduce an extra layer of complexity. Nonetheless, the results of this study should provide valuable insight for developing systems that can tackle more complex navigation scenarios. Regarding the survey design, we did not use the full set of items provided by [14] to measure social presence, so a partial re-evaluation of the survey scales may be required to ensure the survey still measures the underlying constructs properly. Moreover, due to the different Likert scales used in the simulated and real-world studies, we are unable to directly statistically compare the two sets of results.

We have identified possible avenues for future work, summarized here. First, we believe that a combined path-oriented and human-oriented gaze behavior could alleviate some of the issues the human-oriented gaze behavior had with the clarity of the robot’s intentions; improvements to the path-oriented gaze, such as looking forward instead of at the ground, are also suggested. Second, given our mixed findings about people’s preferences for longer or shorter gazes, a more in-depth investigation into preferred human-oriented gaze period lengths is recommended, with the aim of reactively adapting the gaze period to individual preferences.