Introduction

Traditionally, over time and across cultures, pregnant women have been surrounded by knowledgeable women, family members and close friends who supported the transition to motherhood. Although pregnancy and childbirth entailed more risks than today, women were considered “experts” in their own pregnancy and in the care of their babies [1]. However, since the emergence of obstetrics and the explosion of new biomedical technologies that have facilitated a preventive approach, pregnancy and childbirth have begun to be treated as risk events, which should be controlled and managed by “experts” [2]. Women have lost their status as an expert in their own pregnancies and have lost psychological confidence. Against this background, Maternal Education (ME) has emerged as a resource for guidance and support for women [3]. ME today is a complex health intervention, which should not only be aimed at teaching a combination of skills and knowledge, but also at providing comprehensive support to foster self-management and self-care during pregnancy, childbirth, puerperium and parenting [4]. Women today demand continuous, accessible, rigorous and personalised ME [5, 6].

In accordance with these requirements, our team has designed an e-Health tool to support Maternal Education, through a collaborative research process [5,6,7,8]. The resulting tool has been conceived as a complement to ME, with resources that facilitate its accessibility, continuity and adaptation to the needs of each woman. This tool, EMAehealth, includes: (1) an information area, which is systematically updated according to clinical practice guidelines, (2) a communication area, which allows women to interact with other users and the midwife, (3) a self-management health area, which includes tools for women to self-assess their own health needs, as a basis for informed and/or shared decision-making, and (4) a clinical data area, with access to their own clinical records [7].

In the search for valid, reliable instruments for self-assessment of health needs during pregnancy, we have found that multidimensional instruments aimed at self-assessment of psychosocial needs are practically non-existent. This is despite a growing number of studies reporting an association between mothers' prenatal emotional status and support and their children's socio-emotional and developmental outcomes [9,10,11,12], as well as women's emotional health in the postpartum period, or the likelihood of breastfeeding [13]. Poor maternal mental health has also been shown to be more strongly associated with smoking, alcohol and other substance abuse in pregnancy than other known risk factors, including socioeconomic status or maternal age [14].

Most of the instruments found in the literature within the scope of psychological and social needs are aimed at the specific evaluation of one clinical entity such as depression [15, 16], anxiety [17,18,19,20], stress [21, 22] and fear of childbirth [23, 24]. A few evaluate social support [25,26,27] or that of the partner [28, 29], as well as the coping strategies used by the woman to deal with the stress that the imminent childbirth may cause [30,31,32,33], the locus of control style [34] or childbirth self-efficacy [35]. In addition, many of them lack psychometric analyses carried out with a Spanish sample.

However, if the objective is for a woman to take control of her own health and that of her family, it is essential that there should be a simple instrument that allows the woman to self-assess how prepared she is to face the future birth and maternity in a global way. In this respect, a valid, reliable tool, adapted to our cultural and social situation, will allow her to take stock of her situation, assess whether she has the necessary psychological and social resources, and work on the areas that turn out to be more deficient or request information, help or advice; in short, to move from having a role of passive recipient of health services to taking an active role, making informed and well-founded decisions [36].

Therefore, given the scarcity of complete, specific questionnaires with proven metric qualities for assessing perinatal psychosocial needs, the objective of this study was to design an appropriate measurement questionnaire for our context in digital format which meets these requirements.

Method

Design

This study is part of a broader piece of research in which the perceptions and needs of women during pregnancy, childbirth and postpartum have been analysed, as well as the resources available to them to adapt to each moment of the process. The protocol developed is now available for consultation [37].

This is a cross-sectional study to evaluate the metric characteristics of a digital tool for detecting needs during pregnancy which was carried out between January 2019 and December 2020 in the Basque Public Health Service (Osakidetza). It is a service that provides healthcare to a population of just over two million inhabitants and currently has 7 hospitals where deliveries are performed. Each hospital coordinates with a set of primary health care centres to monitor pregnancy, labour and postpartum.

Procedure

The questionnaire for Detection of Needs during Pregnancy was created in four steps:

  • 1) Focused review of the existing scientific literature for the initial development of the questionnaire.

    Two bibliographical reviews were carried out in the following English and Spanish databases: PubMed, Web of Science, Embase, CINAHL, PsychLIT, PsycINFO, PsicoDoc, IBECS, Cochrane library plus and Google Scholar.

    In the first review, articles related to women’s needs during pregnancy were searched for in order to identify relevant constructs and define them. This work was reviewed by a group of midwives belonging to the research team, and they evaluated its suitability for the context where the study is being carried out.

    In the second review, instruments were searched for which measured the previously identified constructs, with a double purpose: 1) to operationalise them by generating a pool of items and 2) to choose the gold standard. To assess the quality of the metric properties of the existing tools, the “consensus-based standards for the selection of health measurement instruments (COSMIN)” checklist was used. This checklist describes the validity [content validity, construct validity (structural validity, hypothesis testing, cross-cultural validity), and criterion validity], reliability (internal consistency, test–retest reliability, measurement error), and responsiveness of a questionnaire [38].

  • 2) Review of the constructs and items by a committee of experts.

    For the initial elaboration of the questionnaire, a multidisciplinary team was formed, with 7 primary care midwives, 5 hospital care midwives, 2 paediatricians, 3 psychologists, 3 methodologists (1 psychometrist and 2 primary health care researchers) and 2 pregnant women. In the content validity evaluation, 6 experts participated to whom was sent a cover letter and the questionnaire explaining why were invited to participate, along with clear and concise instructions on how to rate each item. The purpose of the tool to be developed and the definition of the aspects to be evaluated were explained in writing, in order to avoid bias. They experts individually evaluated each of the 128 items in the initial pool with respect to (1) the relevance of each question in the tool (how important the question is) to a positive experience of delivery; (2) the clarity of each question (how clear the wording is); (3) the essentiality of each question (how necessary the question); (4) adequacy for the population to which it is addressed; (5) relevance of the response scale; giving them a score from 0 to 10 in each aspect. In order to calculate the index of content validity (CVI) and the content validity ratio (CVR), scores of 9 or 10 were considered adequate (very relevant or clear or essential). Item-CVI (I-CVI) is the most widely reported approach for content validity in instrument development. I-CVI is computed as the number of experts giving a rating of “very relevant” for each item divided by the total number of experts. Values range from 0 to 1 where I-CVI > 0.79, the item is relevant, between 0.70 and 0.79, the item needs revisions, and if the value is below 0.70 the item is eliminated. The second type of empirical analysis was CVR, which measures the essentiality of an item. CVR varies between 1 and − 1, and a higher score indicates greater agreement among panel members. The formula for the CVR is CVR = (Ne – N/2)/(N/2), where Ne is the number of experts indicating an item as “essential” and N is the total number of experts. The experts could also report (6) recommendations for improvement of each question and (7) if they consider it necessary to add any question not previously collected. The resulting questionnaire was piloted with a sample of 12 women who reported on their perception of the relevance, adequacy and clarity of each of the items.

  • 3) Preliminary analysis of the properties of the instrument.

    In this phase, the web layout of the pilot questionnaire and the gold standards selected for each construct were carried out, and administered to a sample of 100 pregnant women.

    The women were recruited by their midwives at a low-risk pregnancy check-up or through peer group information. They were offered the possibility of receiving the link to a questionnaire in digital format. They were also encouraged to share the link with other women in the same situation. All pregnant women over 18 years of age, fluent in Spanish to understand and respond to the questions presented, could be included. When the woman accessed the link, she received information about the characteristics of the study, the type of use that would be made of the data (for research purposes only) and the ability to withdraw from the study at any time without this compromising their standard of care. The questionnaire was only given if the informed consent was accepted.

    Once the information was collected, an analysis was carried out to evaluate the presence and patterns of missing values and outliers. Next, the descriptive statistics of each of the items were calculated (% cases that are chosen in each option, mean, standard deviation, asymmetry and kurtosis) and compliance or not with the basic assumptions underlying the general linear model (GLM). For the analysis of the internal structure, decision-making techniques were applied regarding the optimal number of factors to be extracted within each construct, and exploratory factor analyses (EFA), using the polychoric correlation matrices, were performed. The internal consistency (ordinal alpha) of each dimension was also calculated, as well as how much this indicator would vary if each item were removed. If items had low saturations in their factor of belonging, and their removal increased the internal consistency of the dimensions, they were eliminated.

  • 4) Administration and analysis of the metric properties of the final version

    The findings in the pilot test (high communalities, no cross-loadings, strong primary loadings per factor, the high number of indicators per factor and the absence of missing values), and the moderate length of the questionnaire (62items), suggest that a size sample greater than 200 offers adequate statistical power for the CFA of data [39, 40]. In addition, the possible effect of other variables was considered, such as age, parity, nationality (Spanish/immigrant), educational level (low/medium/high), work outside the home (yes/no) and the presence of certain previous risk factors (such as obesity, negative obstetric history), to guarantee the representativeness of the sample. Following the same procedure as in the pilot test for capturing and collecting information, the enrolled health professionals (gynaecology specialists and midwives) asked 341 women to take part in the study, while another 80 were included by other health professionals or by informal contact between participants.

    A preliminary analysis of the information gathered was carried out in order to clean-up the data and verify compliance with the basic assumptions of the GLM. Afterwards, the fit of the models resulting from the EFAs of step 3 was tested by means of confirmatory factor analysis (CFA). Given the ordinal nature of the items, the estimation method used was diagonal weighted least squares (DWLS) on the polychoric correlation matrix. The evaluation of fit of the model to the data was based on the value of the Chi-square/df ratio, together with information provided by the comparative fit index (CFI), the root mean square error of approximation (RMSEA) and its standardization (SRMR). Models with Chi-square/df ratio values ​​less than 5, equal to or greater than 0.90 in CFI, and equal to or less than 0.08 in RMSEA and 0.10 in SRMR were considered acceptable, models with values equal to or greater than 0.95 in CFI, and equal to or less than 0.05 in RMSEA and 0.08 in SRMR were considered good [41, 42]. The pattern of associations with other variables to obtain evidence of external convergence was analysed using the coefficient of Spearman's correlation (rs). The degree of agreement in the classification as a normal or problematic score was calculated using the kappa index of agreement (K) with the classification made with the gold standard questionnaires. Finally, the analysis of the internal consistency of the dimensions was carried out using the coefficients ordinal alpha (ordinal α) and McDonald's omega (ω). The statistical program R (v.4.0.2) was used.

Results

The main results of the four phases which were completed before reaching the optimized version of the questionnaire are described below (see Fig. 1 for a summary).

Fig. 1
figure 1

Flowchart of the development process and analysis of the metric properties of the questionnaire

Phase 1. Review focused on the existing scientific literature for the initial creation of the questionnaire

After the first review, it was established that the variables that determine the experience of childbirth could be classified into three groups: A) Personal characteristics of the woman (weeks of gestation, physical state, psychological state (screening for anxious-depressive symptoms) and previous or vicarious personal experiences) [43], b) Preferences/desires in relation to the childbirth process [44], and c) Psychological variables and beliefs regarding childbirth, including: (Coping skills/strategies; Need for control/locus of control/beliefs about responsibility (personal or professional) [45]; Self-confidence for the birth [46]; Beliefs about the danger of childbirth [47]; and Fear of the birth [48]). This focussed the objective of the study on the creation of a perinatal psychosocial needs assessment tool.

The second review centred on the search for tools for the evaluation of these psychosocial aspects. As a result of this process, it was concluded that there was a need to create a new, complete, updated tool that is appropriate to our health and digital context.

Table 1 presents the dimensions that were established as most relevant, the questionnaires that would be used as gold standards and the evolution of the number of items remaining to operationalize each dimension.

Table 1 Development of the psychosocial needs self-assessment scale during pregnancy. Evolution of the number and distribution of the items

Phase 2. Review of the constructs and items by a committee of experts

After the experts committee’s review, the 66 items that obtained an I-CVI < 0.70 were eliminated, leaving almost all the scales with a lower number of items than initially (see columns 3 and 4 Table 1). The I-CVIs ranged from 0.00 to 1.00. Thirty-six items had an I-CVI = 1.00, twenty-six a score of 0.83, twenty-two a score of 0.67, twenty-one a score of 0.50, sixteen a score of 0.33, five a score of 0.17 and two a score of 0.00. Sixty-two items (48.43%) were marked as relevant. The CVR was generated for each item. Items that were marked not essential had a CVR < 0.99 (this value is based on the total number of experts, N = 6). Seventeen (13.2%) items had a CVR of 1.00, thirty-six a score of 0.67, thirty-one a score of 0.33, twenty-two a score of 0.00, sixteen a score of -0.33, six a score of − 0.67 and one a score of − 1.00. Nonessential items can be eliminated, but in this case were not due to the loss of information that this would entail. The complete suppression of the dimension ‘Attitude towards the birth’ is to be noted, as it was considered that its content was already covered by other dimensions. In addition, the content of the dimension ‘Fear of pain in the birth’ was expanded by 3 items to include the influence of previous or vicarious experiences, leaving a total of 64 items. Finally, the content of 6 items was reformulated or qualified to increase their clarity and the number of possible responses to each item was homogenized, providing a Likert scale with five alternatives.

After the administration of this version to the group of 12 women, 1 question was added to the ‘Anxious-depressive symptoms’ dimension in order to cover whether or not difficulties are generated in daily activity, and 3 items were eliminated from the ‘Locus of control’ dimension. Additionally, the research team carried out a second review of the eliminated items in case they considered it necessary to reinstate any, reaffirming the decisions made previously.

Phase 3: Preliminary analysis of the properties of the instrument

To evaluate the comprehensibility, readability, duration and initial properties of the final questionnaire (which was eventually made up of 48.44% of the initial items), it was formatted and a pilot test was carried out with 100 pregnant women.

The results showed that it was a test that the women found easy to use, as it takes around 15 min to complete. In addition, most of the respondents found it easy to understand and interesting.

Based on the findings in the exploratory and internal consistency factor analyses, it was decided that7 items would be eliminated: the 4 items from the external locus of control sub-dimension, 1 item from the positive coping skills dimension, and 2 from the perception of need for medicalization of childbirth dimension. As a result, the optimised version of the questionnaire was left with 55 items.

Phase 4: Administration and analysis of the metric properties of the final version

Characteristics of the participants

Of the 341 women invited to participate by the research team, 183 (54%) actually did so, to which the 80 women attracted by other means were added. Finally, 263 women with a mean age of 33.55 (SD = 4.73) gave their consent and answered all the questionnaires between weeks 8 and 41 of gestation (28.87; SD = 7.34). The characteristics of the participants are distributed in a similar way as in the population of pregnant women [53] and can be seen in Table 2.

Table 2 Characteristics of the participants

Once checked there was no missing or impossible numbers ​​in the database, we proceeded to make a formal description of each of the items (Arithmetic mean (M) and its 95% confidence interval (95% CI), standard deviation (SD), asymmetry index (AI) and Kurtosis index (Ku.)), and evaluate the internal structure, reliability and convergent validity of the questionnaire.

As can be derived from the CFA results for the scale as a whole (55 items, so the factors being specified as the sub-scale items and run within a correlated measurement model), the overall fit of the questionnaire is good (χ2 = 2,610.09, df = 1229, p < 0.001, χ2/df = 2.12, CFI = 0.95, TLI = 0.95, RMSEA (90% CI) = 0.06 (0.06 -0.07), SRMR = 0.08). However, the scale-by-scale analyses are presented below since we are interested that each of them separately also shows good fit so that they can be used in isolation.

Anxious-depressive symptoms

This is a one-dimensional scale (χ2 = 7.12, df = 4, p = 0.129, χ2/df = 1.778, CFI = 0.99, TLI = 0.99, RMSEA (90% CI) = 0.05 (0.00 -0.11), SRMR = 0.05) made up of 5 items (see Table 3), with high internal consistency (ordinal α = 0.74; ω = 0.78), high association with the scores obtained in the gold standard (rs = . 70 with the STAI and rs = 0.61 with the EPDS) and a good agreement index in the classification of women as being at risk of suffering from anxiety (K = 0.58) and/or depressive problems (K = 0.33).

Table 3 Characteristics of the items that measure anxiety-depressive symptoms

Acceptance of the pregnancy

This is also a one-dimensional scale (χ2 = 8.76, df = 2, p = 0.003, χ2/df = 4.38, CFI = 0.92, TLI = 0.91, RMSEA (90%CI) = 0.11 (0.08-. 15), SRMR = 0.06) made up of 3 items (see Table 4), with high internal consistency (ordinal α = 0.76; ω = 0.69), high association with the scores obtained in the gold standard (rs = 0.40) and an acceptable agreement index according to the classification made with the gold standard (K = 0.30).

Table 4 Characteristics of the items that measure Acceptance of pregnancy

Perception of support from the partner

This is a scale made up of 6 items (χ2 = 8.23, df = 9, p = 0.512χ2/gl = 0.91, CFI = 0.99, TLI = 0.99, RMSEA (90% CI) = 0.01 (0.00-0.06), SRMR = 0.02) (Table 5), whose internal consistency (ordinal α = 0.93; ω = 0.90), association (rs = 0.81) and agreement index of the classification with the gold standard were excellent (K = 0.66).

Table 5 Characteristics of the items that measure Partner support

Coping

This is made up of two subscales, one of 7 items that measures positive coping skills (χ2 = 18.98, gl = 12, p = 0.089, χ2/gl = 2.71, CFI = 0.99, TLI = 0.99, RMSEA (90% CI) = 0.05(0.00-0.85), SRMR = 0.04, ordinal α = 0.75; ω = 0.79,rs = 0.77, K = 0.50) and another of 5 that measures avoidance (χ2 = 5.84, df = 5, p = 0.322, χ2/df = 1.17, CFI = 0.96, TLI = 0.98, RMSEA (90% CI) = 0.05 (0.00-0.11), SRMR = 0.03, ordinalα = 0.71, ω = 0.67,rs = 0.86, K = 0.69) (Table 6), both with excellent metric properties.

Table 6 Characteristics of the items that measure Coping

Internal locus of control

This is made up of 5 items (χ2 = 13.56, df = 5, p = 0.019, χ2/gl = 2.71, CFI = 0.98, TLI = 0.97, RMSEA (90% CI) = 0.08 (0.03-0.13), SRMR = 0.05) (Table 7), presents good internal consistency (ordinal α = 0.69; ω = 0.69), high association with the scores in the gold standard (rs = 0.69) and an acceptable agreement index with the classification of the gold standard (K = 0.28).

Table 7 Characteristics of the items that measure Locus of Control

Childbirth self-efficacy

This is made up of 11 items (χ2 = 151.80, df = 43, p < 0.001, χ2/df = 3.53, CFI = 0.98, TLI = 0.97, RMSEA (90%CI) = 0.09 (0.08-0.11), SRMR = 0.07) (Table 8), presents very high internal consistency (ordinal α = 0.87; ω = 0.86), high association with gold standard scores (rs = 0.59) and a good agreement index in the classification of at risk of presenting difficulties (K = 0.43).

Table 8 Characteristics of the items that measure Self-efficacy for childbirth

Perception of childbirth as a medicalised process

This is a scale with 8 items (see Table 9) which shows a good fit to a two-dimensional model of related factors (χ2 = 37.38, gl = 19, p < 0.001, χ2/gl = 1.97, CFI = 0.98, TLI = 0.98, RMSEA (90%CI) = 0.06 (0.03-0.09), SRMR = 0.06). The first 4 items evaluate the perception of childbirth as a medical process and present good reliability and convergent validity (ordinal α = 0.79; ω = 0.77, rs = 0.64, K = 0.23) and the other 4 evaluate the perception as a process as natural, also with good metric properties (ordinal α = 0.75; ω = 0.72,rs = -0.44). These aspects correlate inversely (r = -0.34).

Table 9 Characteristics in the items that measure the perception of childbirth as a medical process

Fear of childbirth

This is made up of 5 items (χ2 = 4.69, df = 5, p = 0.460, χ2/gl = 0.94, CFI = 0.99, TLI = 0.99, RMSEA (90%CI) = 0.01 (0.00-0.08), SRMR = 0.04) (Table 10), and presents very high internal consistency (ordinal α = 0.90; ω = 0.87), moderate association with the scores in the gold standard (rs = 0.57) and good fitto the classification of the gold standard (K = 0.40).

Table 10 Characteristics of the items that measure Fear of childbirth

Discussion

The promotion, prevention, early intervention, and treatment of perinatal psychosocial problems should be a global priority [54, 55]. Given the scarcity of complete, specific evaluation instruments with checked metric quality, a four-phase process has been developed, to produce a digital questionnaire made up of 55 items that evaluate 8 essential aspects for good psychosocial adjustment during pregnancy and successful coping with pregnancy.

Among the issues that are related with adaptation to pregnancy, the following stand out: 1) The screening of anxious-depressive symptoms through 5 items that serve to assess whether the cognitive and emotional state allows the woman to use her mental and social skills; 2) Acceptance of the pregnancy, reflected in 3 items that evaluate the positive response to being pregnant and; 3) The perception of support from the partner, measured with 6 items that include the partner's interest in the woman's needs as a future mother, her adjustment to the new paternal/maternal role, and empathy shown. These three factors have already been highlighted in the literature, some of them interrelated. For example, identifying low acceptance of pregnancy can potentially improve current medical practice by improving early detection of maternal depression [56], which is a good predictor of general mental health and the future bond that the woman will make with the baby [57].

The aspects that can be decisive for the birth process are: 1) The perception of coping skills made up of 12 items that evaluate the cognitive and behavioural strategies that the woman uses to manage the internal or external demands of a difficult situation such as childbirth; 2) The locus of control or perception that the woman has about the causes of what happens in her life, in this case measured with 5 items that evaluate the attribution/internal control over them; 3) Self-efficacy, or the belief a woman has that she possesses the capabilities to perform the necessary actions that allow her to obtain the desired results during the childbirth process, measured with 11 items; 4) The perception of childbirth as medicalised, which is the a priori belief in the risk that childbirth poses to the life of the mother and the baby and the need for it to be carried out in instrumentalised medical environments, with a list of 8 items and; 5) Fear of childbirth, measured with 5 items, understood as suspicion, apprehension or distressing mood disturbance due to a real or imaginary risk or harm, or fear of something happening contrary to what is desired during childbirth.

All the dimensions showed good global fit to the starting theoretical models, with values ​​of the χ2/df ratio between 0.91 and 4.38, making them all under 5, the maximum value recommended by the psychometric community. This also occurred with the RMSEA and SRMR indicators, which ranged between 0.01 and 0.09 in the first case and between 0.02 and 0.07 in the second, with 0.10 being the maximum value tolerated. None of the items presented local fit problems, all saturating above 0.4 in their respective dimensions (the majority above 0.60). The associations of the scores obtained in the dimensions with those found in the gold standard were moderate to high, ranging from 0.40 to 0.86, which is considered good evidence of convergent validity. The values ​​found in the kappa agreement indices between the classifications made with the new tool and the gold standard point in the same direction, varying from 0.23 in the case of the perception of childbirth as dangerous and in need of medicalised assistance, to 0.69 in the case of negative childbirth coping skills. In terms of reliability, we can say that internal consistency was very good in all dimensions, with values ​​between 0.69 and 0.93 in ordinal alpha and between 0.69 and 0.90 in McDonald's omega.

It is important to have a global vision of all dimensions when evaluating and intervening. For example, it has been seen that coping can function as a mediating factor between other factors: greater social support decreased the likelihood of depression, not only directly but also through the mediating role of coping styles [58]. The fear of birth could be concurrent with depressive symptoms [59], and can have a negative impact on a woman’s psychological wellbeing during pregnancy and her experience of birth. It has also been associated with adverse obstetric outcomes and postpartum mental health difficulties [60]. Interventions can consist of working on one dimension to achieve an effect on another or others, and we can also evaluate the dimensions, together or separately, to adapt the intervention methodology to each case.

The greatest strength of this study is the use of a participatory approach, which obtains evidence on the validity of its content from the scientific literature and in a consensual and collaborative manner from health experts and users. It also has good psychometric properties, as it is much shorter than the total of the questionnaires used as the gold standard (55 vs. 173 items), in addition to the possibility of evaluating each area in isolation. Among its uses, it makes it easier to collect information from pregnant women without consuming a great deal of time. The instrument systematically and efficiently evaluates aspects of health that may be difficult for the professional to address in the usual consultation time, facilitating care that is more comprehensive. In maternal education sessions, it can be used to adapt the contents to the needs of the group, and also to evaluate if objectives have been achieved. In addition, it makes it easier for the woman to take an active role in finding the resources she needs to attend to her health and that of her baby, once she has self-assessed her needs. Furthermore, in the longer term this type of tool, in digital format, may prove useful for collecting data at community level. This characteristic will facilitate the progressive and continuous adaptation to the needs of different types of populations (immigration, socioeconomic differences, fathers/mothers), increasing the effectiveness of interventions.

Limitations

The fact that the selection of the sample was not random might increase the representation of more proactive women with a higher educational level. However, measures have been taken to avoid introducing bias: the women were selected by 25 midwives belonging to public health centres located in various population areas, both rural and urban, and of different socioeconomic and social characteristics. In light of the socio-demographic data, it can be considered that the women in our study are representative of the study population.

The length of the scale was very considerable when it was presented together with the gold standard questionnaires, which might reduce the participation of women with less motivation. To remedy this effect, the questionnaire was presented in digital format, accessible by mobile phone, which favoured its use and management in a young population of all social origins, who use smartphones in their daily lives. The questionnaire also allowed it to be completed in stages.

Finally, there is a lack of evidence regarding the stability over time of the scores, but we expect information on this aspect will be obtained in future studies.

Conclusions

Based on the results found and the changes made to them, it can be stated that the scores that will be obtained from the final version of the questionnaire have the guarantees of validity and reliability required by the international standards for test creation and adaptation [61].

The result of this work aims to respond to the demand for addressing the mental health of women from a global perspective, centred on the individual [55], in a multidimensional way [55, 62].