Virtual environments (VEs) hold great promise for applications such as training simulations (Grantcharov, Bardram, Funch-Jensen, & Rosenberg, 2003), rehabilitation (Jack et al., 2001), and entertainment (Badique et al., 2002). However, one major hindrance to the usefulness of VEs is the tendency for viewers to underperceive depth extents, sometimes by as much as 50 % of the intended depth (e.g., Waller & Richardson, 2008). Underperception of depth characterizes judgments from oneself to an object (egocentric distance judgments; Loomis & Knapp, 2003) and also judgments of the distance between two objects separated in depth (exocentric distance judgments in the depth plane; Geuss, Stefanucci, Creem-Regehr, & Thompson, 2012). Such underperception of depth extents is in sharp contrast to results from real-world studies, where action-based judgments of egocentric distance such as blind walking are often reported to be approximately 100 % of actual distance,Footnote 1 on average, out to 20 m (Loomis & Philbeck, 2008).

In contrast to the underperception of depth extents in VEs, a recent study by Geuss et al. (2012) reported surprisingly accurate judgments (approximately 100 % of intended distance) of exocentric extent between objects arranged in a frontal plane (i.e., two objects equidistant from the viewer). Based on this finding the authors conclude that “distance underestimation in the [VE] generalizes to intervals in the depth plane, but not to intervals in the frontal plane.” The goal of the current project was to further evaluate their conclusion.

The basis for the conclusion by Geuss et al. (2012), that distance underperception in VEs characterizes depth extents but not frontal extents, is well demonstrated by their second experiment. Participants viewed two objects on the ground plane and made judgments of the distance separating the two objects. Objects were arranged either in a depth plane or in a frontal plane. Furthermore, participants viewed the objects in a real environment or in a virtual environment carefully designed to mimic the real environment. After viewing an object pair, participants performed a pantomime walking task (Philbeck, O’Leary, & Lew, 2004) in which they turned a small amount and then walked a distance that they believed was equivalent to the distance separating the two objects. Judgments of depth extents in the VE were smaller than those in the real world and also smaller than judgments of frontal extents, whereas judgments of frontal extents in the VE did not differ from those in the real world and also did not differ from veridical. These findings were replicated in their third experiment using a different measure of perceived distance.

Although it is possible that distance underperception in VEs only generalizes to depth extents and does not characterize frontal extents, Li, Sun, Strawser, Spiegel, Klein, and Durgin (2013) proposed an alternative interpretation of the Geuss et al. (2012) findings. According to Li et al., accurate judgment of frontal extents in the VE could arise from a fortunate balance of underperception caused by the VE and overperception of frontal relative to depth extents (Kudoh, 2005; Levin & Haber, 1993; Loomis, Da Silva, Fujita, & Fukusima, 1992; Loomis, Philbeck, & Zahorik, 2002). Li et al. (2013) provide evidence for this alternative hypothesis by showing that pantomime walking judgments were larger for frontal compared to depth extents in a real environment and in a virtual environment, but that all extents were perceived as shorter in the virtual compared to real environment, resulting in relatively accurate walking judgments of frontal extents in the VE.

The goal of the current project was to further evaluate Li et al.’s (2013) hypothesis that the relatively accurate judgment of frontal extents in VEs (Geuss et al., 2012; Li et al.) is due to a balance of overperception of frontal relative to depth extents and overall underperception of distance in VEs. Anecdotal evidence from our laboratory suggests that egocentric distance perception varies considerably across VEs. Studies in the real world indicate that the environmental context influences perceived egocentric distance (Witt, Stefanucci, Riener, & Proffitt, 2007). The current project compared the relationship between frontal and depth extents in two VEs: the indoor classroom environment used by Geuss et al. (2012) and a barren outdoor environment consisting of a ground plane covered in a grass texture, which has produced relatively large underperception of egocentric distance in our laboratory (Kelly, Hammel, Siegel, & Sjolund, 2014). Similar outdoor VEs have been used in numerous experiments testing distance perception (Kelly, Donaldson, Sjolund, & Freiberg, 2013; Li et al., 2013; Messing & Durgin, 2005; Richardson & Waller, 2005; Waller & Richardson, 2008; Wu, He & Ooi, 2007; Ziemer, Plumert, Cremer & Kearney, 2009). We expected that manipulation of the VE (room vs. grass) would affect judgments of depth extents, with smaller judgments in the grass VE compared to the room VE. If frontal extents are accurately perceived in VEs, as proposed by Geuss et al., then judgments of frontal extents should be unaffected by manipulation of the VE. However, if the accurate judgments of frontal extents reported by Geuss et al. were due to a balance of overperception of frontal relative to depth extents and underperception of distance in VEs, as proposed by Li et al., then judgments of frontal and depth extents should be affected by manipulation of the VE.

When viewed under full-cue conditions in the real world, judgments of exocentric extents containing both frontal and depth components have been described as an approximately linear combination of judgments of pure frontal and pure depth extents (Foley, Ribeiro-Filho, & Da Silva, 2004; Levin & Haber, 1993; Toye, 1986; Wagner, 1985). In light of the possibility that depth extents in the VEs will be underperceived and frontal extents will be accurately perceived (replicating results of Geuss et al.), we also included stimuli containing both frontal and depth components to evaluate whether the shift from underperception to accurate perception is graded or discrete. The experimental procedure and the virtual reality equipment were very similar to those used by Geuss et al.

Methods

Participants

Forty undergraduate students with normal or corrected-to-normal vision participated in exchange for course credit.

Stimuli and design

Participants judged the exocentric distance separating two virtual objects presented on the ground plane. The virtual environment was either an endless ground plane covered in a grass texture (grass VE; Fig. 1, top) or the room environment (room VE; Fig. 1, bottom) used by Geuss et al. (2012). Objects were 2-D shapes (triangles and pentagons of varying color) separated by exocentric distances of 2, 3, or 4 m. The center of each extent was positioned 5 m in front of the participant. Stimulus orientation relative to a fronto-parallel plane was 0° (frontal), 45°, or 90° (depth). All three independent variables (VE, exocentric distance, and stimulus orientation) were manipulated within participants. Each combination of exocentric distance and stimulus orientation was repeated three times in each VE, resulting in 54 total trials. VE was blocked and order was counterbalanced across participants. Stimuli were presented randomly within each block.

Fig. 1
figure 1

Perspective views of the grass virtual environment (VE; top) and room VE (bottom)

The VE was presented on a head-mounted display (HMD; nVisor ST50), which provided binocular images at 1280×1024 resolution that spanned a 40° horizontal × 32° vertical field of view. Head orientation was tracked using a three-axis orientation sensor (InertiaCube2+ by Intersense), and head position was tracked optically in three dimensions (PPTX4 by WorldViz). Graphics were rendered using Vizard software (WorldViz). Images in the HMD were dynamically updated based on sensed head position and orientation.

Procedure

Participants were instructed on the pantomime walking task while performing four practice trials in the real lab environment. No feedback was provided during practice or test. After practice, the participant donned the HMD and stood at the viewing location. On each trial, the ground plane and object pair appeared for 10 s during which the participant was free to make head and trunk movements. After 10 s the environment was completely occluded (i.e., the display turned black). The participant then turned 30° to the right and attempted to walk a distance equal to the previously viewed extent. The experimenter then pressed a button that saved the participant’s terminal location and the participant was guided without vision back to the viewing location for the next trial.

Results and discussion

Distance judgments were converted into ratios of walked-to-actual distance prior to analysis. The VE variable interacted with environment order, indicating the presence of carryover effects across environments. Therefore, we only interpret data from the first VE (in other words, VE was treated as a between participant variable using data from the first block only; data from the second block were not analyzed further). Judgment ratios (Fig. 2) were analyzed in a mixed-model ANOVA with terms for exocentric distance (2, 3, or 4 m), stimulus orientation (0°, 45°, or 90°), and VE (room or grass). The main effect of exocentric distance was significant, F(2,74)=54.95, p<.001, η p 2=.60, with larger distances producing smaller judgment ratios. The main effect of stimulus orientation was significant, F(2,74)=54.74, p<.001, η p 2=.60, with 0° stimuli producing larger judgment ratios than 45° stimuli, and 45° stimuli producing larger judgment ratios than 90° stimuli. The main effect of VE was significant, F(1,37)=17.14, p<.001, η p 2=.32, with the room VE producing larger judgment ratios than the grass VE. No interactions were significant.

Fig. 2
figure 2

Average distance judgment ratios as a function of actual exocentric distance, virtual environment, and viewing angle. Error bars represent ±1 SEM. The dotted line at 1.0 represents veridical performance

Depth extents (90° stimuli) were underperceived relative to frontal extents (0° stimuli), regardless of the environment. This result replicates numerous real world studies showing that stimulus orientation affects perceived exocentric distance (Kudoh, 2005; Levin & Haber, 1993; Li et al., 2013; Loomis et al., 1992, 2002; but see the real world conditions of Geuss et al., 2012, and Kelly, Loomis, & Beall, 2004). Furthermore, judgments of both frontal and depth extents were smaller in the grass VE than the room VE. In the grass VE, underperception was severe enough that both depth and frontal extents were underperceived relative to veridical. But in the room VE, only depth extents were underperceived, and frontal judgments did not differ from veridical.

The results from the room VE replicate those reported by Geuss et al. (2012), who concluded that distance underperception in VEs characterizes depth extents but not frontal extents. However, the addition of the grass VE condition in the current project makes it clear that their conclusion only characterizes responses in the room VE. In the grass VE, both depth and frontal extents were underperceived. As proposed by Li et al. (2013), the room VE caused underperception in near-perfect balance with the overperception of frontal relative to depth extents, leading to veridical judgments of frontal extents. However, such veridical judgments of frontal extents in the room VE should not be considered representative of perceived frontal extents in all VEs. Rather, a more general description of exocentric distance perception in VEs is that both depth and frontal extents are subject to underperception, which is more severe for depth than frontal extents.

Distance judgments were smaller in the grass VE than in the room VE, regardless of stimulus orientation. The current project was not designed to identify the factors leading to underperception of distance in VEs, but much previous work has addressed this topic. It is possible that underperception in virtual, compared to real, environments is due to missing or degraded distance cues. Potential cues that have received research attention include graphics quality (Kunz, Wouters, Smith, Thompson, & Creem-Regehr, 2009; Thompson et al., 2004), field of view (Knapp & Loomis, 2004; Willemsen, Colton, Creem-Regehr & Thompson, 2009), stereoscopic cues (Willemsen, Gooch, Thompson, & Creem-Regehr, 2008), and the mass and inertia of the HMD (Willemsen et al., 2009). However, none of those cues is singularly responsible for the underperception that characterizes distance perception in VEs. Recent research using newer HMDs has reported near-veridical judgments of egocentric distance (Li, Zhang, & Kuhl, 2014; Young, Gaylor, Andrus, & Bodenheimer, 2014), but the reason for the reported differences between HMDs is unknown. Distance judgments in the VE conditions of Li et al. (2013; experiment 2) were larger than those in the current experiment. The VE used by Li et al. is quite similar to the grass VE in the current experiment, but differences in overall judgment accuracy might be attributable to the wide field of view HMD used by Li et al. (126° compared to 40° horizontal; but see Knapp & Loomis, 2004). We can therefore only speculate as to why distance judgments in the current experiment were smaller in the grass VE than in the room VE.

It is possible that the effect of VE was due to environmental context. For example, Witt et al. (2007) reported that egocentric distance judgments were approximately 10 % larger when there was little open space beyond the target object, compared to when there was considerable open space beyond the target. Therefore, the effect of the VE manipulation in the current project could have been partially caused by the amount of open space beyond the targets. However, distance judgments were 44 % larger in the room than the grass VE, which exceeds the ~10 % effect reported by Witt et al., indicating that other factors may also be responsible for the difference between VEs.

Both the room VE and the grass VE contained numerous distance cues, including declination angle, texture gradient, binocular disparity, binocular convergence, and motion parallax. However, only the room VE contained familiar size cues (e.g., the doors and whiteboard visible in Fig. 1, bottom) and linear perspective (lines formed by intersections between the walls and floor/ceiling planes), and either cue could have been responsible for the differences between VEs. Experimental manipulation of linear perspective cues affects perceived distance, perhaps by influencing perceived eye level (Wu, Zhou, Shi, He, & Ooi, 2015). It is therefore possible that the linear perspective cue was responsible for judgment differences across the two VEs. Future work could manipulate the presence of familiar objects and walls within the grass VE and compare distance judgments to those made in the room VE to evaluate the influences of familiar size and linear perspective.

The conclusion that the room VE caused distance underperception that was offset by overperception of frontal extents relative to depth extents contrasts somewhat with the real world data reported by Geuss et al. (2012). In their real-world classroom condition, judgments of frontal and depth extents did not differ from one another. There is no obvious explanation as to why the real classroom would produce equivalent judgments of depth and frontal extents whereas the replica classroom VE would produce underperception of depth extents relative to frontal extents. Using a similar pantomime walking task, Li et al. (2013) reported overperception of frontal relative to depth extents in real and virtual outdoor environments, with overall larger judgments in the real compared to virtual environment, which stands in contrast to the Geuss et al. results using indoor environments. Although the real world data from Geuss et al. may complicate a general description of exocentric distance perception, they do not bear directly on the conclusions from the current study. In particular, the manipulation of VE (grass vs. room) in the current study clearly demonstrates that both frontal and depth extents in virtual environments can be subject to underperception, and that this underperception is more pronounced for depth extents than frontal extents.

The 45° stimulus orientation was included to determine whether there was a gradual or discrete shift from underperception of depth extents to accurate perception of frontal extents. However, this secondary research question was rendered less interesting in light of the primary finding that frontal extents were not accurately perceived across all virtual environments. Instead, the finding that extent judgments of the 45° stimuli were in between judgments of the 0° and 90° stimuli (in both VEs) corroborates other research indicating a linear relationship between stimulus orientation and perceived exocentric extent (Foley et al., 2004; Levin & Haber, 1993; Toye, 1986; Wagner, 1985). In both the grass VE and the room VE, the relationship between distance judgment ratio and stimulus orientation was well described by a linear function (grass: slope = −.072, R2 = .98; room: slope = −.077, R2 = .97). Similar results were obtained using a finer sampling of stimulus orientation in a separate experiment measuring perceived exocentric extent in the grass VE.Footnote 2

The results of this project are based on pantomime walking to reproduce exocentric extents, and it is unclear whether the results would generalize to other response types. Whereas Geuss et al. (2012) found that the results of their pantomime walking task generalized to a blind throwing task, Li et al. (2013) report differences between pantomime walking and a visual matching task. Furthermore, Philbeck et al. (2004) report differences between verbal, direct blind walking, and pantomime walking. Some of those differences might be due to recalibration of perceived distance traveled when walking without visual feedback.

In summary, frontal extents were overperceived relative to depth extents, regardless of whether they were viewed in a sparse grass VE or a more articulated room VE. This overperception of frontal relative to depth extents appears to be in near-perfect balance with the underperception caused by the room VE, leading to veridical judgments of frontal extents. However, results from the grass VE make it clear that frontal extents are not immune to underperception of distance in virtual environments, and that exocentric distance perception depends on both the stimulus orientation as well as characteristics of the VE.