Method
Participants
In Experiment 1, 64 female students from Leiden University participated in exchange for a monetary reward (€4.50) or course credit. The sample size was determined by a power analysis (G*Power; Faul, Erdfelder, Lang, & Buchner, 2007) with an effect size of g = 0.34 (based on a meta-analysis by Phaf, Mohr, Rotteveel, & Wicherts, 2014) and a power of 90%. Only female participants were chosen both to reduce noise due to gender differences and because earlier research has suggested that women have more pronounced approach-avoidance tendencies than men (Rotteveel & Phaf, 2004; Solarz, 1960). Nine participants had to be excluded because of too few valid trials either in the mobile or in the computerized AAT (for criteria, see the data exclusions section, preregistration, and the project’s Open Science Framework page; https://osf.io/y5b32/). A follow-up analysis indicated that some of these exclusions were due to a sensor error which caused implausibly short reaction times in the mobile AAT (see the project’s Open Science Framework page; https://osf.io/y5b32/). The final sample of Experiment 1 included 55 participants between the age of 18 and 29 years (M = 21.6, SD = 2.6). The study was approved by the institutional ethics board (3832757848) and informed consent was obtained from all participants.
Procedure.
After filling in the informed consent, half of the participants first completed the mobile AAT and half first completed the joystick AAT. Instructions on how to complete each AAT were given verbally by the experimenter. Participants were instructed to stand during the mobile AAT. The experimenter remained in the room during the practice trials to ensure all movements were performed correctly, but left the room during the experimental trials. After the first AAT, participants completed the filler task before completing the second AAT. Finally, they completed the stimulus rating task, were debriefed, and rewarded for their participation.
Mobile AAT
In Experiment 1, the mobile AAT was completed on an iPhone 3GS provided by the experimenter. This first version of the mobile AAT was programmed in Objective-C using Xcode, which made it usable only on iOS devices (note, however, that the iPhone version of the mobile AAT is currently no longer developed as we switched to Android to speed up deployments; see Experiment 2). During the mobile AAT, pictures of happy and angry faces were presented on a smartphone that participants were instructed to either pull toward themselves or push away from themselves. Participants completed two blocks—a congruent and an incongruent one. The order in which these blocks were completed was counterbalanced between participants. In the congruent block, they were instructed to pull happy faces toward themselves and to push angry faces away from themselves. In the incongruent block, the instructions were reversed. This means that participants were instructed to attend to the stimulus feature, based on which the approach-avoidance effect was later calculated (feature-relevant instructions). During each block, five happy and five angry face stimuli (taken from Rotteveel & Phaf, 2004) were presented four times each (repeated within but not between blocks), yielding a total of 80 trials. Throughout the task, participants were instructed to hold the phone in a horizontal orientation and, between trials, to move the phone to a starting position from which they could easily pull it toward themselves or push it away from themselves (see Fig. 1). Before each block, they were instructed which stimuli to pull and which to push. They were also instructed to react as quickly and accurately as possible. Each stimulus was preceded by a fixation cross, which remained on screen for 1.5 seconds. During each response, the phone’s accelerometers and gyroscopes tracked the gravity- and rotation-corrected acceleration of the movement in the direction perpendicular to the face of the screen (100Hz sampling rate). Based on this acceleration the responses, reaction times, and response forces were calculated (see Fig. 2). If no response was given within two seconds, a clock was displayed on the screen to inform participants that the trial had timed out. Before each block, participants were presented with an additional ten practice trials, which unlike experimental trials were followed by a response feedback (a green screen for a correct response and a red screen for an incorrect response). The source code of the mobile AAT is available on the project’s Open Science Framework page (https://osf.io/y5b32/).
Control tasks
Joystick AAT
To investigate whether our design could successfully evoke approach-avoidance tendencies, participants also completed a computerized version of the AAT (the joystick AAT; (Wiers, Rinck, Dictus, & van den Wildenberg, 2009). The joystick AAT was similar to the mobile AAT except that participants were presented with stimuli on a laptop screen and instructed to approach or avoid these by pulling or pushing a joystick. Approach was simulated by increasing the stimulus size during pull movements and avoidance was simulated by decreasing the stimulus size during push movements (Rinck & Becker, 2007). Also, as is common practice in joystick AATs, reaction times were recorded at 30% of the maximum joystick extension. To reduce learning effects, stimuli were not repeated between the two AATs, but their presentation was counterbalanced so that, across the whole sample, each stimulus appeared equally often in the mobile and in the joystick AAT. To further reduce learning effects, the two AATs were separated by an unrelated filler task (an associative priming lexical decision task; de Groot, 1984; Matzke et al., 2015).
Stimulus ratings
As an additional manipulation check, we included a stimulus rating task in which participants were asked to rate each stimulus’s valence and approachability on seven-point scales ranging from 1 = “not positive/approachable at all” to 7 = “very positive/approachable”.
Data preprocessing
To detect the direction, reaction time (RT), and response force (RF) of each reaction, we analyzed the acceleration of the phone in the direction perpendicular to the ventral axis of the participant (see Fig. 2). Raw acceleration measures were first interpolated. Next, the first peak was detected based on a zero-crossing derivative, with the condition that the amplitude of detected peaks should be at least 30% of the maximum amplitude of the response and that peaks should be at least 10 ms apart from each other. These cutoffs were chosen based on visual inspection of a random selection of trials and preregistered in Experiment 2. Responses were categorized based on the sign of the first peak (an initial positive peak indicates an approach response, whereas an initial negative peak indicates an avoidance response). Next, RTs were detected based on an absolute acceleration cutoff of 0.5 meters per second squared (m/s^2) on the left side of the first peak. This cutoff was chosen based on visual inspection of a random sample of response curves. We chose this cutoff to get to the earliest detectible change in acceleration, while at the same time preventing the algorithm from picking up sensor noise as responses. The cutoff was preregistered for Experiment 2. To detect RFs, we used the magnitude of the first acceleration peak as a proxy. As the mass of the phone was constant throughout the experiment, force should be directly related to acceleration and all standardized differences in acceleration should exactly reflect standardized differences in force. Data preprocessing was performed using Python (version 3.5.5). All preprocessing scripts are available on the project’s Open Science Framework page (https://osf.io/y5b32/).
Data exclusions
Practice trials, error trials, trials with missing sensor data, trials with implausibly short reaction times (< 200 ms), and trials with low absolute maximum forces (< 1 m/s^2; indicating non-responses) were excluded before analysis. Participants with less than 80% valid experimental trials were also excluded. All data, including those of excluded participants, are available on the project’s Open Science Framework page (https://osf.io/y5b32/).
Analysis strategy
Statistical analyses were performed using R (version 3.4.3). Following recommendations of Baayen and Milin (2010), we analyzed our data using linear mixed effects models (LMMs; using the lmerTest package). LMMs have the advantage over more commonly used repeated measures ANOVAs that they do not require data aggregation and avoid the resulting loss of information. The primary effects of interest in this study were the interaction effects between response direction (approach vs. avoidance) and stimulus type (happy vs. angry) on RT and RF. Because RTs tend to be non-normally distributed, they need to either be transformed or analyzed using generalized LMMs. In our analyses, we used the inverse transformation (1/RT) to normalize RTs while at the same time keeping values interpretable. Inverted RTs can be interpreted as the number of reactions a participant can perform in one second. We followed recommendations by Pek and Flora (2018) and reported only unstandardized effect sizes, which means that all estimated mean RTs and RT effect sizes were reported in reactions per second (1/s) and all estimated mean RFs and RF effect sizes were reported in meters per second squared (m/s^2). Initial model comparisons based on data from Experiment 1 indicated that both by-participant random intercepts and by-participant random slopes were best supported by our data. These comparisons were based on Aikake Information Criteria (Akaike, 1998; Matuschek, Kliegl, Vasishth, Baayen, & Bates, 2017). Including random slopes has the advantage that it increases generalizability compared to including only random intercepts (Barr, Levy, Scheepers, & Tily, 2013) and also allows us to calculate the split-half reliabilities of approach-avoidance effects based on mixed models. To calculate split-half reliabilities, we split each dataset into odd and even trials (balanced by response direction and stimulus type) and ran a separate mixed model for each split. We then extracted the resulting by-participant random slopes for each model and correlated them. Finally, we applied the Spearman-Brown correction to the correlations to acquire split-half reliabilities. The necessity of random slopes also indicates the presence of possibly explainable between-participant variance. To investigate this possibility, we added between-participant moderator variables in Experiment 2. To decrease collinearity and to allow us to interpret simple effects as main effects, we dummy coded response direction (is_pull) and stimulus type (is_happy) and mean-centered all predictor variables. The complete models tested in Experiment 1 were defined as:
$$ \boldsymbol{1}/\boldsymbol{RT}\sim \boldsymbol{is}\_\boldsymbol{pull}\ast \boldsymbol{is}\_\boldsymbol{happy}+\left(\boldsymbol{1}+\boldsymbol{is}\_\boldsymbol{pull}\ast \boldsymbol{is}\_\boldsymbol{happy}\ |\ \boldsymbol{participant}\right)\boldsymbol{force}\sim \boldsymbol{is}\_\boldsymbol{pull}\ast \boldsymbol{is}\_\boldsymbol{happy}+\left(\boldsymbol{1}+\boldsymbol{is}\_\boldsymbol{pull}\ast \boldsymbol{is}\_\boldsymbol{happy}\ |\ \boldsymbol{participant}\right) $$
All analysis scripts, including additional robustness checks, analyses with modeled autocorrelation, and analyses of response accuracy/error data are available on the project’s Open Science Framework page (https://osf.io/y5b32/).
Results
Mobile AAT
Of the experimental trials, 7.2% were excluded from the analysis (for criteria, see the data exclusions section, preregistration and the project’s Open Science Framework page; https://osf.io/y5b32/). We analyzed the mobile AAT data in two separate mixed models, one with inverted RT and one with RF as the dependent variable. In the RT mixed model, there was a significant main effect of response direction (b = 0.068 [0.032, 0.118], t = 3.09, p = .003) as participants, generally, reacted faster when approaching (M = 1.90, SE = 0.052) compared to avoiding stimuli (M = 1.84, SE = 0.049). There was no main effect of stimulus type. More importantly, there was a significant interaction effect between response direction and stimulus type on RT (b = 0.241 [0.138, 0.346], t = 4.61, p <.001). As hypothesized, participants reacted faster when approaching happy (M = 1.98, SE = 0.053) compared to angry faces (M = 1.83, SE = 0.055), and reacted faster when avoiding angry faces (M = 1.88, SE = 0.048) compared to happy faces (M = 1.79, SE = 0.053; see Figs. 3 and 4). In the RF mixed model, there was a significant main effect of response direction (b = -1.868 [-2.615, -1.084], t = -5.00, p < .001), as well as a main effect of stimulus type (b = -0.601 [-0.930, -0.306], t = -3.55, p = .001) on RF. On average, participants used less force in approach movements (M = 13.47, SE = 0.236) compared to avoidance movements (M = 15.34, SE = 0.443) and less force when reacting to happy faces (M = 14.10, SE = 0.319) compared to angry faces (M = 14.70, SE = 0.308). More importantly, there was a significant interaction between response direction and stimulus type on RF (b = 1.262 [0.438, 2.207], t = 2.62, p = .012; see Figs. 3 and 4). As hypothesized, participants used more force to approach happy faces (M = 14.72, SE = 0.501) compared to angry faces (M = 13.45, SE = 0.274) and more force to avoid angry faces (M = 15.95, SE = 0.452) compared to happy faces (M = 14.72, SE = 0.501). The Spearman-Brown split-half reliability was high for both RT-based (r = .77) and RF-based approach-avoidance effects (r = .84).
Control tasks
Joystick AAT
Of the experimental trials, 7.0% were excluded from the analysis (for criteria, see the preregistration and the project’s Open Science Framework page; https://osf.io/y5b32/). In the joystick AAT, there was a significant main effect of response direction on RT (b = 0.083 [0.057, 0.110], t = 7.10, p < .001) as participants, generally, reacted faster when approaching (M = 1.65, SE = 0.031) compared to avoiding (M = 1.57, SE = 0.027) stimuli. There was no main effect of stimulus type on RT. More importantly, there was a significant interaction effect between response direction and stimulus type on RT (b = 0.102 [-0.065, 0.260], t = 2.14, p = .037). As hypothesized, participants initiated approach responses toward happy faces faster (M = 1.68, SE = 0.036) than toward angry faces (M = 1.62, SE = 0.032), and they initiated avoidance responses toward angry faces faster (M = 1.59, SE = 0.029) than toward happy faces (M = 1.55, SE = 0.031; see Fig. 4). The Spearman-Brown split-half reliability of the approach-avoidance effect was high (r = .81).
Stimulus ratings
To confirm that happy faces were perceived as more positive and more approachable than angry faces, we performed two mixed model analyses with stimulus type as predictor and valence and approachability ratings as outcome variables. Results of these analyses indicated that stimulus type indeed successfully predicted valence (b = 4.064 [3.875, 4.259], t = 40.48, p < .001) and approachability ratings (b = 4.010 [3.875, 4.259], t = 31.16, p < .001), with happy faces being rated both as more positive (M = 6.80, SE = 0.095) and more approachable (M = 6.61, SE = 0.137) than angry faces (valence: M = 2.74, SE = 0.101; approachability: M = 2.60, SE = 0.111).
Comparison of effect sizes
The analyses of the mobile and joystick data indicated that the approach-avoidance effect detected by the mobile AAT was larger than that detected by the joystick AAT. To test this difference in an exploratory analysis, we added task type (mobile vs. joystick) to the RT mixed model. The resulting three-way interaction between task type, response direction, and stimulus type was indeed significant (b = 0.133 [-0.005, 0.281], t = 3.92, p < .001), confirming that the approach-avoidance effect detected by the mobile AAT was larger than that detected by the joystick AAT.
Correlation between effects
In another exploratory analysis, we tested the correlations between RT-based approach-avoidance effects in the mobile AAT, RF-based approach-avoidance effects in the mobile AAT, and RT-based approach-avoidance effects in the joystick AAT. To do so, we extracted random slopes for each participant from each of the models and correlated these random slopes. None of these correlations were significant (rs < .05, ps > .800; see discussion).
Discussion
Together, the results of Experiment 1 indicate that the mobile AAT successfully detected approach-avoidance effects, based both on reaction times and response forces. As predicted, participants reacted faster and with more force when having to approach happy or avoid angry faces compared to when they had to avoid happy or approach angry faces. In an exploratory analysis, we found no correlation between effects detected by the mobile and the joystick AAT. This lack of association is surprising, but others have likewise not observed such a correlation (Krieglmeyer & Deutsch, 2010). We also found no correlation between RT-based and RF-based approach-avoidance effects in the mobile AAT. Although this could mean that the two effects are driven by separate processes, further research is necessary to test this hypothesis. In a final exploratory analysis, we found larger approach-avoidance effects with the mobile compared to the joystick AAT. This finding indicates that the mobile AAT might be more sensitive than other versions of the AAT, although this too requires further examination.
In summary, Experiment 1 provided a promising first test of the mobile AAT. Yet, Experiment 1 also had some limitations. First, it was based on a relatively small sample of only female participants. Second, although one of the main advantages of the mobile AAT is that it can easily be deployed in the field, Experiment 1 tested it in the laboratory. Finally, although model comparison indicated between-participant differences that might be explained by between-participant variables, no such variables were included in Experiment 1. Experiment 2 addresses these limitations.