Intergenerational Interaction with Avatars in VR: An Exploratory Study Towards an XR Research Framework

Karpowicz, Barbara; Masłyk, Rafał; Skorupska, Kinga; Jabłoński, Daniel; Kalinowski, Krzysztof; Kobyliński, Paweł; Pochwatko, Grzegorz; Kornacka, Monika; Kopeć, Wiesław

doi:10.1007/978-3-031-11432-8_23

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 440))

Included in the following conference series:

Conference on Multimedia, Interaction, Design and Innovation

4957 Accesses

Abstract

The dynamic development of solutions in the field of virtual and augmented reality poses challenges to designers. These challenges relate to both technical conditions, including hardware capabilities and software solutions, as well as psychophysical constructs conditioning the end users’ reception of the generated multimedia message. One of the key elements of the virtual and augmented reality experience is the interaction with the system through a virtual agent represented by an avatar, i.e. a reflection of the image of a participant in the virtual world, carrying on a conversation with the user. This paper presents a proposed software and hardware solution for conducting multifaceted research and comparative analysis of diverse interfaces and human-computer interaction in virtual and augmented reality. In the course of this research, statistically significant results were obtained indicating differences in perception between three types of virtual agents. Each of them represented by different avatars in a specially created research environment that allowed to conduct usability tests under reproducible conditions to study user interaction in virtual reality.

You have full access to this open access chapter, Download conference paper PDF

AR and VR – A Review on Recent Progress and Applications

Virtual Properties: Problems and Prospects

Article Open access 23 January 2024

Identifying Key Factors to Distinguish Artificial and Human Avatars in the Metaverse: Insights from Software Practitioners

Keywords

1 Introduction and Related Works

Recent years have brought a dynamic growth of diverse approaches and solutions to the novel modes user interaction and interfaces in the field of Human-Computer Interaction (HCI). One of the technologies undergoing significant evolution as new technical solutions and interfaces become available is Virtual Reality (VR), which challenges established paradigms of user interfaces, in particular well established WIMP paradigm (windows, icons, mouse, pointer).

This compels developers and academics to explore novel interfaces to facilitate effective human interaction with a three-dimensional virtual world, such as VR. There are multiple indicators of immersion in VR [14] in the field of applied psychophysiology [17], which may be used for the purpose of evaluating presence [5]. This aspect is key in evaluating interaction with avatars [2], or virtual agents [16], which are necessary in VR to engage people in social situations.

This study was a starting point for exploring various modes of voice interaction. It based on previous works of our XR Lab team in this field as a part of HASE research group (Human Aspects in Science and Engineering) by the Living Lab Kobo research activities on virtual reality rapid prototyping and development. These activities include end user engagement [10] and rapid content and software development [8] as well as alternative interfaces which were presented on major conferences, including CHI and also on previous MIDI conference, i.e. voice interfaces [7, 11] and brain-computer interaction and interfaces (BCI) [9].

Therefore, the primary objective of this work is to propose a hardware and software solution that enables repeated experimental research of user interaction with agents equipped with various types of avatars. Another objective is to determine the differences in perception of various forms of avatars representing virtual agents in virtual reality. The main research hypothesis is that regardless of the visual depiction of the avatar, i.e. the virtual “person” giving the information, there are no variations in the user’s perception of the identical content. In other words this research endeavour is in line with the concept of ecological validity.

The concept of ecological validity refers to experimental findings that can be generalized to real-life [1]. In research measuring emotional and cognitive processes, two approaches are often used – experimentally testing those processes in the laboratory or using retrospective recall with self-reported measures. Both of those methods are impairing the ecological validity of the study. First, laboratory settings are often very far from everyday life and thus the psychological processes measured in the lab might not fully reflect the everyday life of a given individual/group of individuals. Second, some of these processes, e.g. avoidance, can not be reliably measured through self-reported measures prone to retrospection biases. Thus, one of the main challenges in current research in the field of psychology and related disciplines is to assess and test psychological processes in the larger context of ecological validity, taking into account not only a given process but also the context of its development and maintenance [6]. Providing tools to study such psychological processes in the conditions of ecological validity is therefore a crucial research problem.

The results of this study coined the foundation for our XR framework for the development of advanced immersive environments and research tools providing ecological validity conditions with multimodal experimental data acquisition, including self-reported data (e.g. surveys) as well as objective psychophysiological data, related to eye movements, cardiac functions or skin conductance, described in the method section below. Therefore the results of this study paved the way for follow up studies and further research within the HASE group member labs, including Emotion Cognition Lab SWPS University and Institute of Psychology Polish Academy of Sciences.

2 Methods

2.1 Study Aims

To validate the study hypothesis while also evaluating the system’s usability, the following research variants of the virtual agent interaction are compared (see Fig. 1). They are embedded in the same omnidirectional visual environment:

1.
Avatar 1. High-fidelity model (rendered on the basis of photogrammetry) with scripted 3D animation,
2.
Avatar 2. Video recording of a real person,
3.
Avatar 3. VA (Voice Assistant), which is audio emitted from a virtual assistant model.

2.2 Mesures

As previously indicated, the study employed traditional research methodologies [3], both quantitative, including questionnaire surveys (conducted prior to, during, and after the VR session), and qualitative, in form of semi-structured interviews (prior to and after the VR session).

These methods were validated using objective psychophysiological markers, specifically:

1.
Eye Movement (EM) as a major sign of attention, measured by eye tracking,
2.
Synchronized signals from auxiliary source, namely:
1. (a)
  Cardiac function (PPG - PhotoPlethysmoGraphy, photoplethysmography, assessment of heart parameters based on blood flow analysis),
2. (b)
  Changes in skin conductivity (EDA/GSR - ElectroDermal Activity, Galvanic Skin Response).

The automated measurement of the aforementioned psychophysiological indicators within the proposed approach (research framework) was utilized to generate objective measures for evaluating the reliability of reception of the presented content. The objective of such verification was to eliminate inconsistencies in declarative data that are caused by natural human factors and are inherent in evaluating human-computer interaction, such as: the Hawthorne effect, [12, 13] which refers to the impact of the researcher’s presence and implicit expectations on the subject’s response, the desire to present a subject more proficiently than other subjects, and the possibility of obtaining insincere answers from the participants.

To test the research hypothesis, the results of the study participants’ declarative responses were compared to psychophysiological data on several dimensions relevant to assessing the immersion quality of the user’s interaction with virtual reality [5]. These dimensions include sense of immersion and co-presence, as well as the attribution of anthropomorphic features to agents, taking into account potential occurrence of the uncanny valley effect, which has been studied extensively in, and outside, of VR [15]. The last factor is especially pertinent when evaluating the quality of potential high-fidelity content, particularly humanoid avatar models [4].

2.3 Research Application

The research conducted for this work resulted in the creation of the dedicated solution depicted in Fig. 2, which was subsequently validated through an empirical survey with users, as detailed later. The research solution consisted of:

1.
Arduino - to mediate the Unity - Biopac communication.
2.
Unity - with necessary prefabs such as: GazeObjectManager, The EyetrackerMasnager, SMI_CameraWithEyeTracking and SceneSwitcher

The following tools were utilized to develop the software required for the study: Unity, the Arduino IDE, MS Visual Studio, and the HTC iViewHMD software.

2.4 Research Flow

Process name	Description
Baseline	Gathering the data from the Biopac sensors without the headset to serve as a baseline for evaluating the psychophysiological data gathered during the experiment proper. The participant for 5 min is alone in a room, sitting in front of a black wall
Survey settings	The first scene after turning on the application visible only to the researcher. Here, enter the prefix of the result files for the test subject and the port number to which the Arduino is connected. Additionally, you can select the data simulation mode for the eye tracker to facilitate testing the application
Startup scene/calibration	The first scene visible to the subject. This is where the eye tracker is calibrated and the order of the scenes presented is implicitly selected
Preliminary survey (training, warm-up)	A scene that allows the subject to become familiar with the questionnaire interface. Additionally, it will serve to establish the subject’s baseline mood
VR 360 scenes with an agent	The main scene of the application showing stages with different agents in VR: 3D animation, video recording and voice assistant in a random order for different participants
Follow-up questionnaire after each VR scene	Scene used for survey, after each stage with an assistant

2.5 Experimental Setup

The pilot experimental study was conducted in the Institute of Psychology of the Polish Academy of Sciences’ VR Lab. IP PAN’s VR Lab is equipped with the technological equipment fulfilling the requirements of the study, including a SMI eye tracker paired with a virtual reality headset, a system for psychophysiological assessments (Biopac), as well as statistical analysis capabilities and research hypothesis verification (Fig. 3).

2.6 Participants

The pilot study involved twenty-two Living Lab Kobo participants, including 18 from the experimental group, which included seniors over the age of 60, and 4 from the control group (under 50 years old). The experimental group consisted of 13 women and 9 men. The mean age for the entire study was 64.1 (standard deviation, SD = 15.52), with the experimental group averaging 70.8 (SD = 7.65) and the control group averaging 37.25 (SD = 8.31). The study’s youngest participant was 23 years old, while the oldest was 90 years old. The median age was 66 years, with equals 68 in the experimental group and 41 in the control group. 22 sets of measurements were taken throughout the study, which included 18 sets of measurements from the experimental group and 4 sets of measurements from the control group (Fig. 4).

3 Results

The results of declarative (ex-ante, control, and ex-post questionnaires) and psychophysiological (EM, PPG, and EDA) tests conducted during the analyses revealed statistically significant differences in perception of avatars, supporting the rejection of the hypothesis that no differences in perception of different types of avatars representing virtual agents in virtual reality exist.

The results of participants’ declarative responses to survey questions asked both before and after the study (on paper) and during the study: via a questionnaire module integrated into the research framework - were utilized to verify the research hypothesis. The questionnaire responses were examined in the context of psychophysiological data gathered using eye tracker EMs synchronized with Biopac signals (PPG and EDA/GSR).

Data analysis was conducted on several psychophysical dimensions identified in the formulation of the research problem that are relevant to assessing the immersion quality of a user’s interaction with virtual reality, specifically: a sense of immersion in virtual reality, a sense of co-presence, attributing anthropomorphic characteristics to agents, Belief in Human Nature Uniqueness (BHNU), and the uncanny valley effect. BHNU had a particularly strong link with the experience of co-presence in scenes with a humanoid avatar, with a correlation coefficient of 0.57 for video footage and 0.44 for rendered avatar. This demonstrates that the produced avatar lacked the sense of its human features present in the image from the video clip.

Moreover, additional extensive analyses of the anthropomorphic qualities assigned to avatars revealed further evident and statistically significant differences in perceptions of avatars. Fewer participants attributed human characteristics to the VA avatar than to a video recording and rendered avatar. The perceived sense of co-presence was most prominent for the video, decreased in case of rendered avatar, and was lowest for the VA.

Additionally, statistically significant variations were discovered in evaluations of the uncanny valley dimension: the phenomenon was observed the most for rendered avatar and occurred the least for the video recording.

The findings shown above, which are based on questionnaire and psychophysiological data, are consistent with the information gathered from in-depth qualitative interviews, as well as the analysis of eye tracker data.

4 Discussion

The results of the pilot study conducted in XR Lab PJAIT in cooperation with Emotion Cognition Lab SWPS University and Virtual Reality and Psychophysiology Lab of the Institute of Psychology of the Polish Academy of Sciences were deemed very promising by members of the HASE research group. As a result, work on the provided solution will resume, and the framework will undergo further development. Further waves of the study are planned to confirm the hypotheses by expanding the number of the experimental group of seniors and the control group of younger individuals.

With these objectives in mind, it is worth noting that the configuration of the connection between Arduino and Biopac, which is critical for synchronizing psychophysiological signals and correlating them to declarative questionnaire responses, proved to be effective and sufficient. However, due to the nature of the basic Biopac module (electrically unbuffered diagnostic ports), a more secure solution utilizing a specialized Biopac module for digital communication (STM type) or the use of an additional installation galvanically separating the electrical signal, such as an optocoupler, is recommended for the future.

5 Conclusions

The solution presented in this paper was validated through an experimental research procedure with users, demonstrating its efficacy and utility in resolving the primary research problem, which is the evaluation of interactions in virtual reality via new interfaces in the form of virtual agents with a variety of avatars. The experiment demonstrates that the numerous psychological measures used to assess users’ immersion in virtual reality reveal statistically significant variations in agents’ and their avatars’ perceptions. At the same time this study formed the basis for further work on the XR framework, which enables research teams to conduct XR experiments in the conditions of ecological validity, while at the same time verifying their qualitative findings through numerous psychophysiological measures. Such alignment of multimodal research measures in the immersive virtual reality enables the development of reproducible experiments providing more reliable, triangulated, results.

References

Andrade, C.: Internal, external, and ecological validity in research design, conduct, and evaluation. Indian J. Psychol. Med. 40(5), 498–499 (2018)
Article Google Scholar
Baylor, A.L.: Promoting motivation with virtual agents and avatars: role of visual presence and appearance. Philos. Trans. Roy. Soc. B Biol. Sci. 364(1535), 3559–3565 (2009)
Article Google Scholar
Brannen, J.: Mixing methods: the entry of qualitative and quantitative approaches into the research process. Int. J. Soc. Res. Methodol. 8(3), 173–184 (2005). https://doi.org/10.1080/13645570500154642
Article Google Scholar
Cheetham, M., Suter, P., Jäncke, L.: The human likeness dimension of the “uncanny valley hypothesis’’: behavioral and functional MRI findings. Front. Hum. Neurosci. 5, 126 (2011)
Article Google Scholar
Dillon, C., Keogh, E.: Aroused and immersed: the psychophysiology of presence. Citeseer (2000)
Google Scholar
Hayes, S.C., Hofmann, S.G., Wilson, D.S.: Clinical psychology is an applied evolutionary science. Clin. Psychol. Rev. 81, 101892 (2020)
Article Google Scholar
Jaskulska, A., Skorupska, K., Karpowicz, B., Biele, C., Kowalski, J., Kopeć, W.: Exploration of voice user interfaces for older adults—a pilot study to address progressive vision loss. In: Biele, C., Kacprzyk, J., Owsiński, J.W., Romanowski, A., Sikorski, M. (eds.) MIDI 2020. AISC, vol. 1376, pp. 159–168. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-74728-2_15
Chapter Google Scholar
Kopeć, W., et al.: VR hackathon with Goethe Institute: lessons learned from organizing a transdisciplinary VR hackathon. Association for Computing Machinery, New York (2021). https://doi.org/10.1145/3411763.3443432
Kopeć, W., et al.: Older adults and brain-computer interface: an exploratory study. Association for Computing Machinery, New York (2021). https://doi.org/10.1145/3411763.3451663
Kopeć, W., et al.: VR with older adults: participatory design of a virtual ATM training simulation. IFAC-PapersOnLine 52(19), 277–281 (2019). https://doi.org/10.1016/j.ifacol.2019.12.110. https://www.sciencedirect.com/science/article/pii/S2405896319319457. 14th IFAC Symposium on Analysis, Design, and Evaluation of Human Machine Systems, HMS 2019
Kowalski, J., et al.: Older adults and voice interaction: a pilot study with Google home. In: Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems, CHI EA 2019, pp. 1–6. Association for Computing Machinery, New York (2019). https://doi.org/10.1145/3290607.3312973
Macefield, R.: Usability studies and the Hawthorne Effect. J. Usability Stud. 2(3), 145–154 (2007)
Google Scholar
McCarney, R., Warner, J., Iliffe, S., van Haselen, R., Griffin, M., Fisher, P.: The Hawthorne Effect: a randomised, controlled trial. BMC Med. Res. Methodol. 7(1), 30 (2007). https://doi.org/10.1186/1471-2288-7-30
Article Google Scholar
Pugnetti, L., Meehan, M., Mendozzi, L.: Psychophysiological correlates of virtual reality: a review. Presence Teleoper. Virtual Environ. 10(4), 384–400 (2001)
Article Google Scholar
Seyama, J., Nagayama, R.S.: The uncanny valley: effect of realism on the impression of artificial human faces. Presence Teleoper. Virtual Environ. 16(4), 337–351 (2007). https://doi.org/10.1162/pres.16.4.337
Article Google Scholar
Wang, I., Smith, J., Ruiz, J.: Exploring virtual agents for augmented reality. In: Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, pp. 1–12 (2019)
Google Scholar
Wiederhold, B.K., Rizzo, A.: Virtual reality and applied psychophysiology. Appl. Psychophysiol. Biofeedback 30(3), 183–185 (2005)
Article Google Scholar

Download references

Acknowledgements

This study constitutes an example of a bottom-up participatory research initiative done in the spirit of transdisciplinary collaboration between scientists, practitioners and volunteers. It was conducted without a dedicated grant to further the understanding of key concepts in HCI in the context of immersive virtual environments (IVR) and it constitutes the birth of a dedicated framework for the development of immersive interactive VR and XR research tools.

Therefore, we would like to thank the many people and institutions gathered together by the Living Lab Kobo and HASE Research Group. First, we would like to thank all the members of HASE research group (Human Aspects in Science and Engineering) and Living Lab Kobo for their support of this research. In particular, the members of XR Lab Polish-Japanese Academy of Information Technology (PJAIT) and Emotion-Cognition Lab SWPS University (EC Lab) for controlling the experimental conditions and setup alongside with coordination and facilitation of the entire experiment, Kobo Association (special thanks to Anna Jaskulska) for supporting the construction of the framework incl. electronic engineering (Sebastian Zagrodzki), Living Lab Kobo community, especially older adults, for supporting recruitment and their participation in the lab studies, VR and Psychophysiology Lab of the Institute of Psychology Polish Academy of Sciences for the access to psychophysiology research tool, software and support for the lab experimental setup and conducting and the lab studies, as well as 3D Lab PJAIT (especially Jakub Tyszka, Roman Karowiec and Martyna Bihun from Krzysztof Kalinowski’s team) for contributing 3D content and Laboratory of Interactive Technologies of National Information Processing Institute for supporting multi-modal data analysis.

Author information

Authors and Affiliations

XR Lab, Polish-Japanese Academy of Information Technology, Warsaw, Poland
Barbara Karpowicz, Rafał Masłyk, Kinga Skorupska, Daniel Jabłoński, Krzysztof Kalinowski & Wiesław Kopeć
VR and Psychophysiology Lab, Institute of Psychology Polish Academy of Sciences, Warsaw, Poland
Grzegorz Pochwatko
Emotion Cognition Lab, SWPS University of Social Sciences and Humanities, Warsaw, Poland
Kinga Skorupska, Monika Kornacka & Wiesław Kopeć
Laboratory of Interactive Technologies, National Information Processing Institute, Warsaw, Poland
Paweł Kobyliński
Kobo Association Living Lab and HASE Research Group, Warsaw, Poland
Barbara Karpowicz, Rafał Masłyk, Kinga Skorupska, Daniel Jabłoński, Krzysztof Kalinowski, Paweł Kobyliński, Grzegorz Pochwatko, Monika Kornacka & Wiesław Kopeć

Authors

Barbara Karpowicz
View author publications
You can also search for this author in PubMed Google Scholar
Rafał Masłyk
View author publications
You can also search for this author in PubMed Google Scholar
Kinga Skorupska
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Jabłoński
View author publications
You can also search for this author in PubMed Google Scholar
Krzysztof Kalinowski
View author publications
You can also search for this author in PubMed Google Scholar
Paweł Kobyliński
View author publications
You can also search for this author in PubMed Google Scholar
Grzegorz Pochwatko
View author publications
You can also search for this author in PubMed Google Scholar
Monika Kornacka
View author publications
You can also search for this author in PubMed Google Scholar
Wiesław Kopeć
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kinga Skorupska .

Editor information

Editors and Affiliations

National Research Institute, National Information Processing Institute, Warsaw, Poland
Cezary Biele
Systems Research Institute, Polish Academy of Sciences, Warsaw, Poland
Janusz Kacprzyk
Polish-Japanese Academy of Information Technology, Warsaw, Poland
Wiesław Kopeć
Systems Research Institute, Polish Academy of Science, Warsaw, Poland
Jan W. Owsiński
Institute of Applied Computer Science, Faculty of Electrical, Electronic, Computer and Control Engineering, Łódż University of Technology, Łódź, Poland
Andrzej Romanowski
Department of Applied Informatics in Management, Faculty of Management and Economics, Gdańsk University of Technology, Gdańsk, Poland
Marcin Sikorski

Rights and permissions

Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

Reprints and permissions

Copyright information

About this paper

Cite this paper

Karpowicz, B. et al. (2022). Intergenerational Interaction with Avatars in VR: An Exploratory Study Towards an XR Research Framework. In: Biele, C., Kacprzyk, J., Kopeć, W., Owsiński, J.W., Romanowski, A., Sikorski, M. (eds) Digital Interaction and Machine Intelligence. MIDI 2021. Lecture Notes in Networks and Systems, vol 440. Springer, Cham. https://doi.org/10.1007/978-3-031-11432-8_23

Download citation

DOI: https://doi.org/10.1007/978-3-031-11432-8_23
Published: 27 July 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-11431-1
Online ISBN: 978-3-031-11432-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics