Processing of haptic texture information over sequential exploration movements

Lezkan, Alexandra; Drewing, Knut

doi:10.3758/s13414-017-1426-2


Published: 03 October 2017

Volume 80, pages 177–192, (2018)

Attention, Perception, & Psychophysics


Alexandra Lezkan¹ &
Knut Drewing¹


Abstract

Where textures are defined by repetitive small spatial structures, exploration covering a greater extent will lead to signal repetition. We investigated how sensory estimates derived from these signals are integrated. In Experiment 1, participants stroked with the index finger one to eight times across two virtual gratings. Half of the participants discriminated according to ridge amplitude, the other half according to ridge spatial period. In both tasks, just noticeable differences (JNDs) decreased with an increasing number of strokes. Those gains from additional exploration were more than three times smaller than predicted for optimal observers who have access to equally reliable, and therefore equally weighted, estimates for the entire exploration. We assume that the sequential nature of the exploration leads to memory decay of sensory estimates. Thus, participants compare an overall estimate of the first stimulus, which is affected by memory decay, to stroke-specific estimates during the exploration of the second stimulus. This was tested in Experiments 2 and 3. The spatial period of one stroke across either the first or second of two sequentially presented gratings was slightly discrepant from periods in all other strokes. This allowed calculating weights of stroke-specific estimates in the overall percept. As predicted, weights were approximately equal for all strokes in the first stimulus, while weights decreased during the exploration of the second stimulus. A quantitative Kalman filter model of our assumptions was consistent with the data. Hence, our results support an optimal integration model for sequential information given that memory decay affects comparison processes.


Textures are preferably judged by touch. Heller (1982, 1989) reported a greater contribution from touch compared with vision to texture perception. Given that textures are defined by repetitive small spatial structures on an object’s surface, exploration covering a greater extent results in repetitive, redundant intake of the same stimulus signals. Texture perception can benefit from integrating sensory information over time. Current models of information integration mostly refer to simultaneously presented redundant signals (Ernst & Banks, 2002; Drewing et al., 2008), e.g., holding a pen in the hand simultaneously results in both tactile and kinesthetic information about its diameter. In the present study, we investigated information integration for sequentially gathered signals in texture perception. In three experiments, we challenge predictions from models on simultaneous information and develop and test a more general Kalman filter model that accounts for specific observations in the integration of sequential information (Knill & Pouget, 2004) through comparison processes affected by memory decay.

To describe the integration of simultaneous redundant information, the Maximum Likelihood Estimation (MLE) model is well-established (overview in Ernst & Bülthoff, 2004). Jacobs (2002) suggested that integration uses all signals available for a property. First, signal-specific estimates $ s_i $ for the property are derived from each signal i. Second, all estimates are combined into a coherent percept P by weighted averaging:

$$ P=\sum_i w_i s_i \quad \text{where} \quad \sum_i w_i=1 \quad \text{and} \quad w_i\in\left[0,1\right]. \tag{1} $$

Estimates derived from each signal are prone to noise $ {\sigma}_i^2 $. Averaging different estimates can decrease the perceptual variance ($ {\sigma}_{\widehat{s}}^2 $) of the combined percept (Landy, Maloney, Johnston, & Young, 1995). According to the maximum likelihood estimation (MLE) model, the variance ($ {\sigma}_{\widehat{s}}^2 $) of a percept is lowest and the weights ($ w_i $) are optimal if the weights are proportional to the inverse variances of the signal-specific estimates ($ 1/{\sigma}_i^2 $):

$$ w_j=\frac{1/\sigma_j^2}{\sum_{i=1}^{N}1/\sigma_i^2} \quad \text{with} \quad \sigma_{\widehat{s}}^2=\frac{1}{\sum_{i=1}^{N}1/\sigma_i^2}. \tag{2} $$

Weighted averaging (Eq. 1) describes the percept of a property when stimuli are created whose signals slightly conflict in their information on this property (Ernst & Banks, 2002). Experimental data also quantitatively confirm the predicted reduction of perceptual variance (measured via discrimination thresholds) in multi-estimate compared with single-estimate situations (Eq. 2), and even the predicted optimal weights, e.g., for the case of visuo-haptic and visuo-auditory integration of size and location (Alais & Burr, 2004; Ernst & Banks, 2002). Recent studies found neurophysiological correlates of optimal multisensory integration (Fetsch, DeAngelis, & Angelaki, 2013; Helbig et al., 2012).
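Equations 1 and 2 can be illustrated with a minimal numerical sketch. The function names and the example values (two estimates of 10 and 12 with variances 1 and 4) are hypothetical and chosen only for illustration; they are not taken from any of the cited studies.

```python
def mle_weights(variances):
    """Inverse-variance weights for the signal-specific estimates (Eq. 2, left)."""
    inverse = [1.0 / v for v in variances]
    total = sum(inverse)
    return [x / total for x in inverse]


def combined_variance(variances):
    """Variance of the optimally combined percept (Eq. 2, right)."""
    return 1.0 / sum(1.0 / v for v in variances)


def combined_percept(estimates, variances):
    """Weighted average of the signal-specific estimates (Eq. 1)."""
    weights = mle_weights(variances)
    return sum(w * s for w, s in zip(weights, estimates))


# Hypothetical example: the second estimate is four times noisier than the first.
estimates = [10.0, 12.0]
variances = [1.0, 4.0]

weights = mle_weights(variances)                   # approximately [0.8, 0.2]
percept = combined_percept(estimates, variances)   # approximately 10.4
variance = combined_variance(variances)            # approximately 0.8
```

In this example the combined variance is lower than the variance of either single estimate, which is the reduction that discrimination thresholds are used to test.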

Within haptic perception, observers use multiple redundant signals that are simultaneously available and integrate them in agreement with MLE predictions (Drewing & Ernst, 2006; Drewing, Wiecki & Ernst, 2008). However, in haptic perception, the integration of information over time is at least as important as integration over different sensory sources (Henriques & Soechting, 2005). Typical haptic exploratory procedures extend over time and space and can be decomposed into several exploration segments. For specific object dimensions, such as surface orientation or texture, exploratory behavior comes along with a systematic repetition of the same stimulus information. In texture exploration, individual exploration segments refer to scans of the finger over the same spatial region. Thereby, extending the exploration by repeating exploration segments increases the amount of redundant information. To formulate a model for such sequential and not simultaneous information, a Kalman filter (Kalman, 1960) may be better suited than the MLE model. The Kalman filter takes a more general approach to optimal information integration. It is able to describe how a series of sequential estimates is used for estimating a property in a way that the variance of the final estimate is minimized. The Kalman filter uses Bayesian inference, combining prior with present information, and can account for changes in the estimates over time. For example, a Kalman filter approach can model whether memorized information from sequentially gathered signals gets noisier over time. First empirical studies observed correlates of fundamental Kalman filter characteristics, prediction and updating, in the brain activity of mice (Funamizu, Kuhn, & Doya, 2016). The MLE model and its predictions are captured within the Kalman filter framework as a (simple) special case with noninformative prior information and estimates that are stable over time (Battaglia, Jacobs, & Aslin, 2003; Ernst & Bülthoff, 2004).

The present study was designed to challenge predictions from the MLE model and to develop a better-suited Kalman filter model for the sequential integration of texture information. The exploratory procedure for textures includes several lateral strokes in different directions (Lederman & Klatzky, 1987). We define an exploration segment as a single unidirectional stroke across the texture. Then, a segment-specific estimate for a property is derived from the information gathered during a single stroke. We assume that each exploration segment i yields an estimate with equal variance ($ {\sigma}_i^2 $ = $ {\sigma}_0^2 $, with $ {\sigma}_0^2 $ being a constant value). The assumptions underlying the MLE model predict that all estimates are weighted equally in the percept (Eq. 2, left) and the final variance of the percept ($ {\sigma}_{\widehat{s}}^2 $) can be computed by $ {\sigma}_{\widehat{s}}^2={\sigma}_0^2/N $ (Eq. 2, right) with N being the number of redundant estimates. Given that the discrimination threshold ($ {t}_{\widehat{s}}^2 $) assesses the percept’s variance ($ {\sigma}_{\widehat{s}}^2 $) with $ {t}_{\widehat{s}}^2=2{\sigma}_{\widehat{s}}^2 $ (Jovanovic & Drewing, 2014; Lezkan et al., 2016), it follows for discrimination thresholds:

$$ t_{\widehat{s}}=\sqrt{2\sigma_0^2/N} \quad \text{and} \quad \log\left(t_{\widehat{s}}\right)=-\frac{1}{2}\log(N)+\mathrm{const}. \tag{3} $$

That is, discrimination thresholds should depend on the number of exploration segments in a well-defined fashion and a linear fit on log-log scales should have a slope of −1/2. Previous research on sequential integration of extended haptic stimulation seems not to support these predictions. Quick (1974) had already suggested in his model that visual thresholds linearly decrease with increasing stimulation on a log-log scale, but with diverse slopes. For haptic detection thresholds, the observed slope in Quick’s model was close to −1 (Gescheider, Berryhill, Verrillo, & Bolanowski, 1999; Gescheider, Bolanowski, Pope, & Verrillo, 2002; Gescheider, Güçlü, Sexton, Karalunas, & Fontana, 2005; Louw, Kappers, & Koenderink, 2005) and thus clearly below the slope of −1/2 predicted from the assumptions underlying the MLE model. However, performance in detection tasks might not be relevant, because detection does not require perceiving the magnitude of a stimulus property (Louw et al., 2005). In a discrimination task on felt surface orientation, thresholds decreased with increasing length of exploration, and the decrements were smaller the longer the explored surface was (Giachritsis, Wing & Lovell, 2009). This is qualitatively in line with the threshold predictions but was not quantitatively analyzed and thus is not conclusive. Importantly, results from Metzger, Lezkan, and Drewing (2017) are at odds with the prediction of equal weights in the integration of sequential haptic information. The authors investigated softness discrimination, where people typically indent a soft stimulus repeatedly, and determined the weights of indentation-specific softness estimates for the first and the second stimulus in a trial. While a rather equal weighting was visible for the indentations of the first stimulus, during the exploration of the second stimulus weights decreased for later indentations.

Thus, Metzger et al.’s (2017) results cast the assumptions of the MLE model into doubt and call for a more complex model of the processes of sequential integration during discrimination tasks. These results seem to agree with a model of the comparison process between first and second stimulus that can be derived from single-cell measurements in monkeys. In a vibrotactile discrimination task, Romo and colleagues (Romo, Hernández, Zainos, Lemus, & Brody, 2002; Romo & Salinas, 2003) found that neuronal responses in area SII are different for the first and the second stimulus in a trial. Whereas the response to the first stimulus was only associated with the first stimulus’ characteristics, the response to the second stimulus also included information about the first remembered stimulus. This is to say, neural responses during the second stimulus reflected the comparison between the two stimuli, which was the task of the monkey. Hernández et al. (2010) measured the monkey’s cortical activity during vibrotactile discrimination. The activity of frontal lobe circuits was associated with the result of the sensory decision about which of the two stimuli had the higher frequency as well as with the past information about the stimuli. Most importantly, cortical areas that receive inputs from area SI were reported to combine present sensory information from SI with sensory representations stored in working memory. Overall, the results suggested that comparison processes take place during the presentation of the second stimulus after the first stimulus has been captured and memorized as a reference.

This can explain the data from Metzger et al. (2017) on decreasing weight of sequential estimates during the exploration of the second stimulus in softness discrimination, as follows. During the exploration of the second stimulus, present sensory signals are continuously compared with the remembered estimate. Within this comparison process, the variance of the estimate of the remembered first stimulus increases due to memory decay. Hence, information gathered sooner after the first stimulus may lead to a more precise judgment on the difference between the two stimuli than later information and therefore is weighted higher. Such a process will not be captured by the rather simple assumptions underlying the MLE model but requires a Kalman filter model that can additionally account for changes in the estimates’ variance.
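The qualitative argument above can be made concrete with a deliberately simplified sketch. It is not the Kalman filter model developed in this study. It assumes, purely for illustration, that the comparison made at stroke k of the second stimulus has a variance equal to the stroke noise plus a memory noise that grows by a fixed amount per stroke, and it treats the comparisons as independent, ignoring that they share the same remembered estimate. All names and values are hypothetical.

```python
def comparison_weights(n_strokes, stroke_var, memory_var, decay_per_stroke):
    """Inverse-variance weights when memory noise grows with every stroke.

    Assumed variance of the comparison at stroke k (k = 0, 1, ...):
    stroke_var + memory_var + k * decay_per_stroke.
    """
    variances = [
        stroke_var + memory_var + k * decay_per_stroke
        for k in range(n_strokes)
    ]
    inverse = [1.0 / v for v in variances]
    total = sum(inverse)
    return [x / total for x in inverse]


# Hypothetical values: with decay, earlier strokes receive higher weights.
weights_decay = comparison_weights(3, 1.0, 1.0, 1.0)

# Without decay, the sketch reduces to the equal weights of the MLE assumptions.
weights_no_decay = comparison_weights(3, 1.0, 1.0, 0.0)
```

Under these assumptions, weights fall from the first to the last stroke of the second stimulus, which is the pattern reported by Metzger et al. (2017).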

In the first experiment of the present study, we investigated for texture discrimination how the (spatio-temporal) extension of exploratory movements, i.e., the number of strokes across the texture, affects discrimination thresholds. The assumptions underlying the MLE model predict that the reduction of thresholds follows a power function of the number of strokes with exponent −1/2, whereas the outlined model on the comparison process with memory decay predicts less reduction (i.e., a larger exponent). In the second experiment, we tested whether stroke-specific estimate weights are unequal and follow the pattern predicted from the outlined model on the comparison process. Finally, in Experiment 3 we tested quantitative predictions for the estimate weights that stem from a Kalman filter model of optimal integration, given that memory decay affects the comparison process.
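The threshold prediction of Eq. 3 can be checked numerically. The sketch below computes the predicted thresholds for one to eight strokes and the slope of a least-squares line on log-log scales. The single-stroke variance of 1.0 is a hypothetical value, and the helper names are illustrative.

```python
import math


def predicted_threshold(sigma0_sq, n_strokes):
    """Threshold predicted by Eq. 3 for N equally reliable stroke estimates."""
    return math.sqrt(2.0 * sigma0_sq / n_strokes)


def loglog_slope(x_values, y_values):
    """Least-squares slope of log10(y) regressed on log10(x)."""
    log_x = [math.log10(x) for x in x_values]
    log_y = [math.log10(y) for y in y_values]
    mean_x = sum(log_x) / len(log_x)
    mean_y = sum(log_y) / len(log_y)
    numerator = sum((a - mean_x) * (b - mean_y) for a, b in zip(log_x, log_y))
    denominator = sum((a - mean_x) ** 2 for a in log_x)
    return numerator / denominator


strokes = list(range(1, 9))   # 1 to 8 strokes, as in Experiment 1
sigma0_sq = 1.0               # hypothetical single-stroke variance
thresholds = [predicted_threshold(sigma0_sq, n) for n in strokes]

slope = loglog_slope(strokes, thresholds)   # close to -0.5
```

The slope does not depend on the chosen variance, which only shifts the constant term in Eq. 3.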

Experiment 1

We created haptic texture stimuli by using a PHANToM force-feedback device. The device is attached to a finger via a thimble. It simulates objects by monitoring 3D-finger position and by applying an appropriate reaction force. We used virtual gratings that consisted of sinusoidal ridges on an otherwise planar surface. Different grating stimuli differed in ridge height or the distance between adjacent ridges (= period). On each trial, participants explored one of the two possible standard gratings and one comparison grating. Afterwards, half of the participants decided which grating had felt higher (amplitude judgment), the other half decided about grating period (period judgment). Participants were instructed to explore with back and forth movements having a defined finger velocity and force to avoid confounds. As a consequence, participants had to simultaneously focus on the discrimination task and their exploratory movement. To reduce the attention needed for movement control, the movement was guided by intuitive visual feedback and participants initially practiced the instructed force and velocity.

The experiment started with this “practice phase.” Afterwards, in the “exploration phase,” we varied the number of strokes (1…8) that participants used to explore each stimulus. We measured just-noticeable differences (JNDs; assessing discrimination thresholds) for either task by using the adaptive staircase procedure called BestPEST (Lieberman & Pentland, 1982). We expected that JNDs would decrease with the number of strokes conducted following a power function. Furthermore, we tested the exponent of the power function against −1/2, which is the value predicted by the assumptions underlying the MLE model.

Participants

A total of 16 healthy participants, students from Giessen University, were tested (mean age: 22 years, range: 19-26 years; 9 females, 7 males). All participants had normal or corrected-to-normal visual acuity, were right-handed, and none of them reported cutaneous or motor impairments. Participants were naïve to the purpose of the study. They participated for course credit. Methods and procedures of both experiments were approved by the local ethics committee LEK FB06 at Giessen University, and they were in accordance with the ethical standards laid down in the 1964 Declaration of Helsinki. Participants gave written, informed consent.

Apparatus and Stimuli

The apparatus can be seen in Fig. 1a. Participants sat in front of a custom-made visuo-haptic workbench, which comprised a PHANToM 1.5A haptic force feedback device and a 22" computer screen (120 Hz, 1024 x 1280 pixels). The right index finger was connected to the PHANToM via a thimble-like holder, which allows for free finger movements with all six degrees of freedom in a 38 x 27 x 20 cm³ workspace. Simultaneously, the participants looked through stereo glasses (CrystalEyes™) and via a mirror onto the screen (40-cm viewing distance). The mirror prevents participants from seeing their hand and enables spatial alignment of the 3D-visual with the haptic display. The participants’ heads were stabilized by a chinrest. The devices were connected to a PC. Custom-made software controlled the experiment, collected responses, and recorded finger positions and reaction forces (from PHANToM, every 2 ms). Noise presented via headphones and ear plugs masked sounds generated by the PHANToM.

The two stimuli were presented one after the other in front of the participants. The stimuli were virtual gratings covering an area of approximately 30-mm width (x-axis) x 15-mm depth (z-axis). Gratings consisted of ridges (width 1 mm; extending over the entire depth) on an otherwise planar surface. Ridge height was a sine-function (within 0 to π) of x-position. Programmed peak amplitudes of the ridges varied between 0.16 and 0.74 mm; the peak-to-peak period between ridges varied between 2 and 9 mm. In each single stimulus, ridge amplitudes and periods were constant. Strokes started left or right from the grating. Haptic grating stimuli were created using the PHANToM force feedback device. The device simulates objects by applying reaction forces $ {\overset{\rightharpoonup }{F}}_p $ as a function of the 3D-finger position P. Force magnitude linearly increases with the indentation depth of the finger into a virtual object ($ i_p $) and force direction is normal to the object’s surface ($ {\overset{\rightharpoonup }{n}}_p $: normal vector, D: spring constant):

$$ {\overset{\rightharpoonup }{F}}_p={\overset{\rightharpoonup }{n}}_p\cdot \left|{\overset{\rightharpoonup }{F}}_p\right| \quad \text{and} \quad \left|{\overset{\rightharpoonup }{F}}_p\right|=D\cdot {i}_p \tag{4} $$

The spring constant D was replaced by the variable K to keep object indentation constant under differing finger forces. The variable K was defined such that for the target indentation I (set to 1 mm) the magnitudes of finger force and reaction force were (approx.) equal. Vertical finger force was estimated from the device’s reaction forces in y-direction $ F_y(j) $ (y-axis = height) in the previous device cycles j = 1 … n (~previous 300 ms):

$$ K=\frac{\frac{1}{n}\sum_{j=1}^{n}{F}_y(j)}{I} \tag{5} $$
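The force rendering of Eqs. 4 and 5 can be sketched as follows. This is an illustration of the two equations only, not the software used in the experiment. The function names, the flat surface normal, and the buffer of 150 force samples are assumptions made for the example.

```python
def adaptive_stiffness(recent_vertical_forces, target_indentation_mm=1.0):
    """Eq. 5: mean recent vertical reaction force divided by target indentation I."""
    mean_force = sum(recent_vertical_forces) / len(recent_vertical_forces)
    return mean_force / target_indentation_mm


def reaction_force(unit_normal, stiffness, indentation_mm):
    """Eq. 4: force along the surface normal, magnitude = stiffness * indentation."""
    magnitude = stiffness * indentation_mm
    return tuple(magnitude * component for component in unit_normal)


# Hypothetical buffer: a finger pressing with a steady 1.5 N (the instructed force).
recent_forces = [1.5] * 150

stiffness = adaptive_stiffness(recent_forces)            # 1.5 N/mm
force = reaction_force((0.0, 1.0, 0.0), stiffness, 1.0)  # 1.5 N along the y-axis
```

With K defined this way, the reaction force matches the finger force at the target indentation of 1 mm, so a harder press raises the stiffness rather than the indentation depth.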

Design and Procedure

Participants successively explored two gratings. Between participants we varied the Judged Dimension (Amplitude, Period). Half of the participants judged which of the two gratings had felt higher (Amplitude); the other half judged which grating had higher spatial period (Period). We further varied the Number of strokes (1, 2, 3, 4, 5, 6, 7, 8) that participants used to explore each of the two stimuli (within-participant variable). A single stroke was defined by a single unidirectional exploratory movement across the grating. We measured 75% discrimination thresholds (JNDs) for two standard stimuli. The standard stimuli in the Amplitude group had amplitudes of 0.4 or 0.5 mm and periods of 5 mm. In the Period group, the standard stimuli had periods of 5 or 6 mm and amplitudes of 0.4 mm.

JNDs were determined using the BestPEST adaptive staircase procedure combined with the two-interval forced-choice task. In the BestPEST method (Lieberman & Pentland, 1982), before each stimulus presentation, the likelihood distribution of possible thresholds is calculated by using the sigmoid-shaped psychometric function with a slope of one, on the basis of all previous responses of the participant. The value with the maximum likelihood of being the threshold value is then chosen as the comparison stimulus. This method is an optimum strategy for fast threshold determination. In effect, the procedure raises the difference between the values of comparison and standard after a wrong response and lowers it after a correct response. We terminated the procedure after 26 trials per staircase, estimating the 75% threshold (JND) by the final maximum-likelihood estimate. For each Number of strokes and each standard stimulus, two up and two down staircases measured the upper and lower JNDs, respectively. In the Amplitude condition, initial amplitudes of the comparison stimuli were given by the standard’s amplitude ±0.35 mm; the comparisons’ period was always 5 mm. In the Period condition, initial periods of the comparisons were the standard’s period ±3 mm; the comparisons’ amplitude was always 0.4 mm. Trials from all staircases were randomly interleaved in the measurement phase. Overall, the measurement phase consisted of 2 [standards] * 2 [staircases] * 26 [staircase length] * 2 [repetitions] * 8 [Number of strokes] = 1,664 trials. The entire experiment consisted of five sessions lasting approximately 2 hours each.

Before the experiment, participants were trained for approximately 30 min to execute exploratory movements with constant instructed finger velocity (15 cm/s) and force (1.5 N). The training consisted of two parts. In the first part, participants trained on a virtual plane without ridges. In the second part of the training, movements were performed on virtual gratings. Each part ended after participants had performed 20 trials in a sequence with maximally 3 movement errors. We defined movement errors as a deviation of actual velocity or force values by more than 60% from the target velocity and force.
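The maximum-likelihood step of the adaptive procedure described above can be sketched as follows. This is an illustration only, not a reproduction of the Lieberman and Pentland (1982) procedure or of the experiment software. The exact form of the psychometric function (a logistic with slope one rising from a guess rate of 0.5, so that the threshold marks 75% correct), the grid of candidate thresholds, and all names are assumptions.

```python
import math


def p_correct(difference, threshold, slope=1.0, guess_rate=0.5):
    """Assumed psychometric function: 75% correct when difference == threshold."""
    logistic = 1.0 / (1.0 + math.exp(-slope * (difference - threshold)))
    return guess_rate + (1.0 - guess_rate) * logistic


def ml_threshold(history, candidates, slope=1.0):
    """Return the candidate threshold with the highest likelihood.

    history holds (difference, correct) pairs from all previous trials.
    """
    best_threshold, best_loglik = candidates[0], -math.inf
    for threshold in candidates:
        loglik = 0.0
        for difference, correct in history:
            p = p_correct(difference, threshold, slope)
            p = min(max(p, 1e-9), 1.0 - 1e-9)  # guard against log(0)
            loglik += math.log(p if correct else 1.0 - p)
        if loglik > best_loglik:
            best_threshold, best_loglik = threshold, loglik
    return best_threshold


# Hypothetical grid of candidate thresholds (stimulus differences in mm).
candidates = [i * 0.1 for i in range(31)]

# A correct response at a large difference lowers the next difference ...
after_correct = ml_threshold([(3.0, True)], candidates)
# ... and a wrong response at that lower difference raises it again.
after_wrong = ml_threshold([(3.0, True), (after_correct, False)], candidates)

# Trial count of the measurement phase as stated in the text.
total_trials = 2 * 2 * 26 * 2 * 8
```

The two example calls show the behavior summarized in the text: the presented difference goes down after a correct response and up after a wrong one.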

Each trial started with a visual representation of the upcoming stimulus and start point (left or right of the grating, balanced). Participants initiated the trial with a button press at the start point location. Then, participants stroked across a first grating back and forth. The computer program stopped the stimulus presentation, when the required number of strokes had been conducted. Afterwards, a second grating was explored using the same number of strokes as for the first grating. Finally, participants had to decide by a button press (done with the PHANToM), which grating had felt higher in amplitude/had higher spatial period. During the strokes, a vertical line that moved forth or back along the exploratory axis indicated the prescribed finger velocity (15 cm/s) and stroke direction. A stationary horizontal line indicated prescribed force (1.5 N). Participants monitored their current velocity and force by further feedback lines, which were displayed while the finger was outside the grating area. A vertical line displayed the current 1D-finger position on the x-axis; a horizontal line moved up and down with exerted force. Trials were repeated later in the session when a movement error was detected.

Data Analysis

We calculated individual JNDs per Number of strokes condition by averaging across the two upper and the two lower JNDs for each standard stimulus (8 JND values). These values were log-transformed (base 10) before analyses. According to the predictions it is the log JNDs that should linearly decrease with the log Number of strokes. In addition, the log-transformation allows comparing gain ratios in the amplitude and the period conditions. It transforms the ratios between JNDs for different Number of strokes into differences, which can be directly analyzed by an ANOVA.

Results

Individual log JND values entered an ANOVA with the within-participant variable Number of strokes (1…8) and the between-participant variable Judged Dimension (Amplitude, Period). For the variable Number of strokes, we calculated linear contrasts, which provide a targeted test of our hypotheses. The linear contrast of Number of strokes was significant, F(1,14) = 15.326, p < 0.001 (one-tailed), confirming the predicted decrease of JNDs with an increasing Number of strokes. The interaction Number of strokes (linear contrast) × Judged Dimension failed to reach significance, F(1,14) = 0.350, p = 0.563, which may suggest that both amplitude and period JNDs depend in a similar manner on the Number of strokes. To be more precise, the lack of effects on log values suggests that the ratios between the JNDs of different Number of strokes conditions are similar. Finally, the main effect of Judged Dimension was significant, F(1,14) = 584.050, p < 0.001. This effect is of no theoretical interest, however, because it only reflects the fact that (log) amplitude and period JNDs differ in scale. Figure 2 shows a log-log plot of the JNDs.

We fit a power function separately to the amplitude JNDs and to the period JNDs. To achieve this, we linearly regressed log-transformed JNDs on log-transformed stroke numbers. As a consequence, the slope of the fitted line corresponds to the exponent of a power function fitted to the non-logarithmized data. In both cases, the fitted line described the data well. For the Amplitude group, the regression line explained r² = 88% of the variance. For the Period group, the explained variance was r² = 80%. According to the MLE predictions, the slope is expected to be −0.5. In contrast, the slopes of the fitted lines reached values of −0.148 for the Amplitude group and −0.112 for the Period group. By fitting regression lines to the individual log-log data, we were able to calculate a t test against the predicted slope of −0.5. In the Amplitude group (M = −0.148, SD = 0.151) and the Period group (M = −0.112, SD = 0.066), the slopes differed significantly from the MLE prediction, t(7) = 6.580, p < 0.001 and t(7) = 16.673, p < 0.001.
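The reported test statistics can be approximately recomputed from the summary values given above, assuming eight participants per group (consistent with the 7 degrees of freedom). The sketch also converts the slopes into the JND ratio between eight strokes and one stroke. The function names are illustrative, and small deviations from the reported t values are to be expected because M and SD are rounded.

```python
import math


def one_sample_t(mean, sd, n, reference):
    """One-sample t statistic of a sample mean against a reference value."""
    return (mean - reference) / (sd / math.sqrt(n))


def threshold_ratio(exponent, n_strokes):
    """JND with n strokes relative to one stroke under a power function."""
    return n_strokes ** exponent


# Slopes tested against the MLE prediction of -0.5 (n = 8 per group assumed).
t_amplitude = one_sample_t(-0.148, 0.151, 8, -0.5)   # about 6.6
t_period = one_sample_t(-0.112, 0.066, 8, -0.5)      # about 16.6

# Predicted and fitted JND ratios for eight strokes versus one stroke.
ratio_mle = threshold_ratio(-0.5, 8)          # about 0.35
ratio_amplitude = threshold_ratio(-0.148, 8)  # about 0.74
ratio_period = threshold_ratio(-0.112, 8)     # about 0.79
```

In these terms, the MLE assumptions predict that eight strokes reduce the JND to roughly a third of its single-stroke value, whereas the fitted power functions correspond to a reduction of only about a quarter (Amplitude) or a fifth (Period).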

Discussion Experiment 1

In Experiment 1, we found that participants discriminate grating stimuli more precisely the longer they explore them. Such redundancy gains were smaller than predicted by the assumptions underlying the MLE model. According to these assumptions, each single estimate is weighted according to its inverse variance. In case of repeated strokes across the same stimulus, estimates from each single stroke should have equal variance and, hence, each estimate should obtain equal weight. The present results disprove the MLE predictions, and thus extend the previous evidence (Metzger et al., 2017), suggesting that the assumptions underlying the MLE model do not apply to sequential integration.

As outlined in the introduction, an alternative model, which may explain the present and previous observations on sequential integration, links to memory decay during the comparison process of the discrimination task. There is evidence that discrimination performance is based on a continuously ongoing comparison process between a remembered estimate from the first stimulus and present sensory signals from the second stimulus (Romo et al., 2002; Romo & Salinas, 2003; Hernández et al., 2010). During the comparison process, i.e., during exploration of the second stimulus, the memory trace of the first stimulus might diminish from stroke to stroke, and thus the variance of the remembered estimate increases. Memory decay and the resulting increase in variance will lead to higher overall estimate variance and, as observed, to lower redundancy gains than predicted from the MLE assumption of equal variance. An optimality model that includes these factors in sequential presentation would further predict that the weights of strokes within the second stimulus are not equal but decrease for later strokes. We designed further experiments to test whether information from different strokes during the exploration is unequally weighted in the grating percept.

Experiment 2

In Experiment 2, participants discriminated a standard and a comparison stimulus according to grating period using a two-interval forced choice task combined with the method of constant stimuli. They stroked three times across each stimulus. While participants explored the standard stimulus, we presented slightly discrepant period information in one of the strokes. That is, the grating period of each stroke in the standard stimulus could take one of two values. The stroke with the deviant period in the standard stimulus is the discrepant stroke. We defined several standard stimuli by varying the Position [1, 2, 3] of the discrepant stroke within the presentation of the standard. Additionally, the standard was either presented as the first or second stimulus of the trial, which is represented in the variable Stimulus order [first vs. second]. Each standard stimulus was combined with 14 comparisons. The comparisons differed in their periods, but for the strokes across each single comparison stimulus the period was kept constant. For each of the standards, we determined the point of subjective equivalence (PSE) with the comparison. Based on this, we calculated the weight of the discrepant information in the standard stimulus for each combination of Position and Stimulus order. We predicted an interaction between those two variables. Weights were expected to decrease with higher Position in the second but not in the first stimulus.
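How a weight can be derived from a PSE follows from Eq. 1 and can be sketched as below. The sketch assumes that, at the PSE, the comparison period equals the percept of the standard, and that this percept is a weighted average of the discrepant period (weight w) and the period of the remaining strokes (weight 1 − w). The function name and the period values are hypothetical and are not the values used in the experiment.

```python
def discrepant_weight(pse, base_period, discrepant_period):
    """Weight of the discrepant stroke implied by Eq. 1.

    Solves pse = w * discrepant_period + (1 - w) * base_period for w.
    """
    return (pse - base_period) / (discrepant_period - base_period)


# Hypothetical example: two strokes at 5.0 mm, one discrepant stroke at 5.6 mm.
weight = discrepant_weight(pse=5.2, base_period=5.0, discrepant_period=5.6)
# A PSE of 5.2 mm implies a weight of about 1/3, i.e., equal weighting of three strokes.
```

A PSE that stays closer to the base period would indicate a lower weight of the discrepant stroke, which is the outcome predicted for later strokes of the second stimulus.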