Introduction

Interoception is the process by which the nervous system senses, interprets, and integrates signals originating from within the body [1].These signals include physical sensations related to internal organ functions such as heartbeat, respiration, satiety and the autonomic nervous system activity related to emotions [2]. Many of these remain unconscious, but some are conscious or are potentially accessible to consciousness. Interoceptive awareness refers to this conscious level of interoception [3].

Interoceptive awareness helps the organism to maintain homeostasis and is thus crucial for survival [4]. Moreover, it is thought to be connected to self-regulation and emotion-regulation [2, 5]. It is suggested that interoceptive awareness is often problematic in patients with mental disorders, such as mood disorders [5], posttraumatic stress disorder [6], somatic symptom disorders [7], and eating disorders [8, 9]. While most studies have been conducted in patients with specific diagnoses, problems in interoceptive awareness might be regarded as a transdiagnostic factor, a common vulnerability across multiple disorders [10]. Awareness of physical sensations may be upregulated or downregulated in patients with mental disorders [11].

Interoceptive awareness has been measured by different self-report questionnaires, such as the Porges Body Perception Questionnaire (BPQ) [12], the Scale of Body Connection (SBC) [13, 14], the Somatic Awareness Questionnaire (SAQ) [15, 16], and the Multidimensional Assessment of Interoceptive Awareness (MAIA), version 1 and 2 [3, 17]. Interoceptive awareness is closely related to interoceptive sensitivity, described by Forkman et al. [18] as a dispositional tendency to be internally focused. This term thus captures beliefs about body sensations usually assessed with self-report questionnaires. However, the term interoceptive sensibility does not differentiate clearly an anxiety and hypervigilance-driven attention style versus a more mindful, adaptive and potentially healthy attention style. The MAIA-2 has been designed to differentiate between anxiety-driven and mindful attention styles towards interoceptive cues, a distinction of key clinical importance [3, 19]. Presently, the MAIA-2 is the most used self-report questionnaire, covering different components of interoceptive awareness. The MAIA-2 has been translated and psychometrically evaluated in several languages, including French [20], Turkish [21], Chinese [22], Norwegian [23], Ukrainian [24], and German [5]. These studies used non-clinical samples except for the study conducted by Eggart et al. [5], who provided psychometric values for the MAIA-2 in a group of people with Major Depressive Disorder. The above-mentioned studies, as well as studies using the first version of the MAIA, revealed varying results regarding factor structure and internal consistency of the subscales.

To date, the MAIA-2 has not been translated and psychometrically evaluated in Dutch. The SAQ [15, 16], as well as the SBC [13, 14], have been used in studies in the Netherlands [25, 26], but both lack thorough psychometric evaluation. Furthermore, these questionnaires do not cover the multidimensional concept of interoceptive awareness as applied in the MAIA-2, which we deem preferable for an instrument to be used in a broad array of mental disorders. Therefore, we refrained from further evaluation of the SAQ and SBC and selected the MAIA-2 for psychometric evaluation in a convenience sample of Dutch adults.

The central aim of this article is to evaluate the psychometric properties of the Dutch version of the MAIA-2 (MAIA-2-NL) in a non-clinical sample. This step may be regarded as the first necessary step, after which psychometric evaluation in different target groups should follow.

Method

Participants

Data were collected in a non-clinical sample of Dutch adults (n = 1054) aged between 18 and 83. The sample consisted of 331 (31.4%) men and 723 (68.6%) women. Descriptive statistics are shown in Table 1.

Table 1 Demographic characteristics of the sample

Procedure

Data collection took place in agreement with the Medical Ethical Committee of the University Medical Center Groningen. Students enrolled in a bachelor psychomotor therapy research course in 2019 collected data from their personal network, resulting in a convenience sample of Dutch adults. Participation was voluntary, and data were analysed anonymously. No participatory incentives were offered. All participants were informed of the study’s purpose, method, and voluntary and anonymous nature of participation before they entered the study. Data was collected using computerized questionnaires through a secure online survey platform (Formdesk). To assess temporal reliability 109 respondents (Men = 30, Women = 79, Mean age = 43.78 (SD = 17.86), Mean BMI = 24.79 (SD = 4.31)) completed the questionnaire again after 14 days.

Measures

The MAIA-2 is a self-report questionnaire developed by Mehling et al. [3] (see Table 2). It is a 37-item scale that consists of eight subscales: Noticing, Not-distracting, Not-worrying, Attention regulation, Emotional awareness, Self-regulation, Body listening and Trusting. Subscale scores are calculated by taking the arithmetic mean of the items on each scale. Items are scored on a 6-point Likert scale, ranging from 0 (never) to 5 (always). Higher scores indicate more interoceptive awareness [3].

A forward-backward translation based on the original English version of the MAIA-2 was made following a recommended five-step procedure [27]: (1) Two forward translations were performed independently by two native Dutch speakers, with proficiency in English and with knowledge of the concept of interoceptive awareness and its meaning in practice; (2) The two versions were then compared item-by-item, and discrepancies were resolved. (3) Two independent professional bilingual translators, naive to the construct of interoceptive awareness, performed a back-translation into English; (4) Discrepancies between the back-translations and the original English version were discussed and resolved; (5) This resulted in a pre-final Dutch version. In the next step, a pilot study was conducted among 41 respondents to examine the usability and comprehensibility. This led to minor adjustments in the wording of some items, resulting in the final version used in the present study (see Table S1).

Table 2 Multidimensional Assessment of Interoceptive Awareness, English version*

Data analysis

SPSS 28 for Windows was used for statistical analyses. Factor structure was examined by exploratory factor analysis (EFA) and confirmatory factor analysis (CFA). The sample was randomly split into two halves, one of which was analysed with EFA and the other with CFA. Reliability was assessed using both McDonald’s omega and Cronbach’s α. McDonald’s omega was used because the assumption of tau-equivalency is not accepted [28]. Cronbach’s α was reported to allow comparisons with other validation studies. In addition, there is no universally accepted guideline for acceptable levels of omega reliability, but they need to meet the same standards as Cronbach’s α [29]. Thus, an omega ≥ 0.70 is regarded as good [30]. Test-retest reliability was established by intraclass correlation (ICC). A two-way mixed model for absolute agreement was used [31]. An ICC between 0.50 and 0.75 was considered moderate, between 0.75 and 0.90 good, and ≥ 0.90 excellent [32].

For the EFA, Maximum Likelihood with oblique rotation was used as factor extraction method. Kaiser’s criterion, requiring factors with eigenvalues of at least 1, was adhered to [33]. The number of factors retained was based on interpreting the scree plot [34] and parallel analysis [35]. Cross-loadings were defined as an item that loads at > 0.32 on two or more factors [36].

The other half of the sample (n = 527) was used for confirmatory analysis (CFA) to evaluate the adequacy of the proposed factor structure following EFA [37,38,39]. Mplus Version 8.0 was used [40]. Each index type provides different information about model fit [37]. Therefore, a broad range of indices was reported, including root mean square error of approximation (RMSEA) [41, 42], standardized root mean square residual (SRMR) [43], Comparative Fit Index (CFI), and Tucker Lewis index (TLI) [37, 40, 44].

Results

Table 3 shows the descriptive statistics forthe subscales, along with internal consistency, both from the entire sample and separately for men and women. Table 4 shows the descriptive statics and internal consistency for both the Exploratory Factor Analysis (EFA) and Confirmatory Factor Analysis (CF) samples, as well as the test-hertest sample. The internal consistency (ω) of all subscales of the MAIA-2 NL was moderate to good with ω = 0.67 to 0.89. The MAIA-2 subscales showed moderate to good reliability over time, with intraclass correlation coefficients (ICC) varying from 0.67 to 0.79.

Table 3 Descriptive Statistics, internal consistency, and sex differences
Table 4 Descriptive Statistics EFA and CFA samples, and test-retest

EFA revealed a six-factor structure as best solution, with the items from the original subscales Noticing (items 1 to 4) and Emotional awareness (items 23 to 27) as well as the subscales Self-regulation (items 28 to 31) and Body listening (items 32 to 34) taken together as one factor. The other items are loaded at four factors, which is in accordance with the subscales as proposed by Mehling et al. [3], see Table 5.

Table 5 Exploratory Factor Analysis

CFA on the six factors resulting from the EFA (see Table 5) provided an acceptable fit. However, CFA with the original eight factors resulted in a better fit (see Table 6).

Table 6 Goodness of fit index values for CFA

Discussion

The central aim of this study was to evaluate the psychometric properties of the Dutch version of the MAIA-2 (MAIA-2-NL) in a large non-clinical sample of Dutch adults. Our analyses show good internal consistency and reliability over time for all eight MAIA-2-NL subscales. EFA and CFA points towards a six-factor structure instead of the eight-factor structure from the original English version [3]. However, the CFA reveals better fit indices for an eight factor solution than for a six factor solution.

A six-factor structure was also established as best solution in the French [20] and Turkish [21] versions of the MAIA-2. However, in the French version, the subscales Not-worrying and Not-distracting were excluded, making a comparison with our results impossible. In the Turkish version, a six-factor solution was only reached after removing items 1, 6, 19, 20, and 35. We refrained from excluding items to be able to make adequate comparisons between the MAIA-2-NL and the original MAIA-2. In our opinion adjustments to existing questionnaires should be made with caution and parsimony because they negatively influence the comparison between studies.

A possible explanation for the aggregation of the two original subscales Noticing and Emotional awareness in our factor analysis could be that the subscales both measure awareness, although differing in the cognitive appraisal being measured. Several items of the subscale Noticing and Emotional awareness show some overlap in sentence formulation, for example, item 1, subscale Noticing (“When I am tense, I notice where the tension is located in my body”) and item 23, subscale Emotional Awareness (“I notice how my body changes when I am angry”).

The original MAIA-2 subscales Self-regulation and Body listening are also considered one factor in our analysis. This could be explained by the fact that both subscales are described by Mehling et al. [17] as belonging to an overarching dimension of Mind-Body Integration, referring to an overall felt sense of an embodied self. This implies that one has access to more developed levels of body awareness, to a sense of “the interconnectedness of mental, emotional, and physical processes as opposed to a disembodied sense of alienation and of being disconnected from one’s body” [17, p.3].

Although we retained six factors in the EFA for the MAIA-2-NL versus eight factors in the original MAIA-2 [17], it should be noted that the two ‘new’ factors consist of four ‘old’ factors taken together two by two. More important, the CFA based on the original eight factors shows a better fit than the CFA based on six factors. These considerations lead to the conclusion that, with the present knowledge, maintaining eight factors isacceptable. Especially when comparing study results on the MAIA-2-NL with other studies, using eight factors, as in the original English scale, seems preferable. It would ofcourse be advisable to replicate the EFA and CFA in another large population sample.

This study has a couple of unique strengths. First, it benefits from a large sample size, setting it apart from other studies in the field. This large sample size offered the possibility of randomly splitting the sample in half and conducting EFA on one half of the sample and CFA on the other, as recommended by Swami and Barron [39]. It should be noted that, unfortunately, a considerable amount of the studies on the psychometric properties of the MAIA-2 were conducted in relatively small samples. For example, the study on the Turkish version of the MAIA-2 [21] was conducted in a total sample of 400 respondents, which is on the small side when taking into account the recommended participant-to-item ratio of 10:1 for EFA [36, 39]. The same goes for the French validation of the MAIA-2 [20], based on an EFA in 154 respondents and a CFA in another 154 respondents.

Another strong point lies in the insight provided by this study into the temporal reliability of the MAIA-2-NL, which ranges from moderate to good. As test-retest reliability is one of the components of the Reliable Change Index, this is essential knowledge in view of future studies on the evaluation of therapeutic interventions addressing interoceptive awareness.

Lastly, due to the digital administration of the questionnaire, no missing data were present. Additionally, this method eliminates the possibility of input errors that could occur with pen and paper administration, which could have implications for data accuracy.

The study has some limitations that need to be addressed in future research. First, construct validity for the MAIA-2-NL has not yet been investigated. Furthermore, as mentioned by Todd et al. [45], men and older adults often remain underrepresented in this kind of studies. This is, regrettably, also the case in our study. The mean age in our convenience sample was 35 years and men comprised one-third of the sample. It is important to note that the gender distribution is not equal and thus does not reflect the gender composition of the general Dutch population. Nevertheless, compared with other studies with much smaller male datasets [e.g., 20, 21, 23, 24], this study includes a substantial number of male participants (n = 331), thus contributing valuable comparative data. It is evident that more data from an elderly group and male respondents need to become available in order to speak not of a convenience sample but of a sample representing the Dutch population.

More importantly, future studies need to be conducted in clinical samples. This is, of course, an essential step in order to be able to evaluate problems in interoceptive awareness in different groups of patients and in order to measure the effect of interventions targeting these problems. However, confirmatory factor analyses in these samples should be performed before drawing conclusions on scores obtained with the MAIA-2 in clinical samples [19]. Such a strict psychometric procedure is worthwhile and should preferably be followed because it provides a sound base for further usage of an instrument. Unfortunately, obtaining the required sample size in a clinical setting is often challenging. An exception is the German version of the MAIA-2, validated in a sample of severely depressed patients (n = 110). Importantly, The MAIA-2 was found reliable and sensitive to change in this sample [5].

To summarize, our findings indicate that the MAIA-2-NL is a reliable and valid instrument to measure interoceptive awareness in a non-clinical sample of Dutch adults.