Introduction

Postpartum care has been described as the most neglected aspect of maternity services and generally, fewer resources are devoted to it than to prenatal care or delivery [1,2,3]. In our health system, if there is no particular condition that requires special attention, visits to the midwife are usually limited to two, one in the first 10 days after the birth and the other a month after that [4]. Consequently, attention to health from a comprehensive perspective and centered on the needs of women, which includes the promotion of health and its follow-up, is impeded. This happens despite the fact that it is a crucial stage, since the health of the baby and even the rest of the family will depend on the current and future health of the mother, in the short and long term [3].

The main objectives of care for women in the postpartum period are to identify risks, problems and complications, promote the health of the mother and the newborn, and support parents in the new situation of having a baby [5]. Along with the available resources, Maternal Education (ME) focuses on addressing some of these objectives, especially those related to the promotion of health and preparation for motherhood. However, until now and at least in our area, matters related to postpartum are treated at an inappropriate time for women in ME sessions, between weeks 32 and 42, which means that the information is almost ignored. Possibly due to this, one of the demands of the women themselves [6, 7] is to adapt ME to the needs of this period, giving it greater continuity and providing relevant information at each stage. Puerperal in our area also want the health system to allow them to have the autonomy to manage their own health and that of their babies [8, 9].

Likewise, the Global Strategy for Women’s, Children’s and Adolescents’ Health (2016–2030) recognises the special importance of care for women during the postpartum period, highlighting the need to identify and measure not only serious morbidity, but also any another condition that may affect women’s well-being in some way or prevent them from progressing in their recovery and adapting to motherhood [10]. They also recommend follow-up tools with solid data that will serve as a basis for decision-making and greater transparency and accountability in the health system as a way to transform services provided at this stage. Consequently, it is essential to have valid, reliable instruments that will enable the health system to gather information as a basis for improving and monitoring postpartum services, but which will also enable women to assess their own health and seek resources to meet their needs [10].

In line with the above, the International Consortium for Health Outcomes Measurement (ICHOM) carried out a study aimed at defining a minimum set of appropriate outcome measures at international level to evaluate and improve perinatal care, including a postpartum period up to six months [11]. The outcome measures proposed as the most important according to the ICHOM are survival, morbidity, patient satisfaction with care, and health and well-being reported by the patients themselves. Among the health and well-being results reported by patients, the areas classified as most important for women are (1) health-related quality of life (HRQOL), (2) incontinence, (3) pain in sexual relations, (4) confidence and success in breastfeeding, (5) transition and adaptation to the role of mother, and (6) mental health [11].

It is indisputable that there are large numbers of tools available for the postpartum period, aimed at identifying signs and symptoms of a physical and psychological nature. There are instruments – self-administered or requiring administration by a professional – to identify aspects such as depression and/or anxiety [12,13,14,15,16], evaluate urinary incontinence [17], efficacy with breastfeeding [18,19,20], quality of life [21,22,23,24], fatigue [25], quality of sleep [26], quality of sexual relations [27], social support [28], and the mother–child bond [29, 30].

However, there are no well-validated self-report instruments that give comprehensive, efficient coverage of the 6 areas identified by the ICHOM as being important for women. This means that the only possible way to measure the results reported by patients in the postpartum period, covering the 6 areas, is to use a combination of tools [31, 32], as the ICHOM itself suggests. This means carrying out a fragmentary evaluation and analysis of women's health, which is not the ideal method for approaching health from a thorough, person-centred perspective. It is not a useful basis for providing comprehensive family and community healthcare (such as consensual care plans, care management, distance monitoring or searches for social resources and support networks); nor does it help women themselves to manage their own health.

Given the absence in our area of global, efficient instruments, the objective of this study was to design a complete, specific measurement tool with good metric qualities in digital format for the evaluation of self-reported health and well-being during the puerperium, to conform to what was proposed by the ICHOM. This instrument or dossier of scales that have optimized the various instruments that measure these issues separately can be used globally or partially, making use of 1 or more sections separately. It is housed in the EMAeHealth app, which we have created using a collaborative research process [33] in which women, healthcare professionals, managers and researchers have participated. EMAeHealth has, among other things, an area for self-management of health, which enables women to evaluate their own needs, as a basis for making informed and/or shared decisions regarding their health and that of their families. This digital tool was designed as a complement to Maternal Education (ME), with resources that facilitate its accessibility, continuity and adaptation to the health needs of each woman.

This article describes how the instrument was developed to assess the priority health and well-being needs of women in the postpartum period, and analyzes its psychometric properties.

Method

Design

This study is part of a larger body of research in which the perceptions and needs of women during pregnancy, childbirth and postpartum were analysed, as well as the resources available to them for adapting to each moment of the process. The protocol is now available for consultation [34].

It is a cross-sectional study designed to evaluate of the metric characteristics of a digital tool for detecting health needs in the postpartum, and it was carried out between September 2019 and June 2022 in the Basque Health Service (Osakidetza). This is a public health service that serves a population of just over two million inhabitants and that currently has 7 hospitals where women give birth. Each hospital coordinates with a set of primary healthcare centres for pregnancy and postpartum follow-up.

Procedure

The postpartum health needs detection questionnaire was created in four steps:

  1. 1)

    Focused review of the scientific literature.

Two bibliographical searches were carried out of English and Spanish databases: PubMed, Web of Science, Embase, CINAHL, PsychLIT, PsycINFO, PsicoDoc, IBECS, Cochrane library plus and Google Scholar.

In the first search, articles related to postpartum health needs were looked for with the aim of identifying relevant constructs and defining them. The search terms we decided to use were those referring to psychosocial and health factors of priority for women (functional status, depression, anxiety, social support, breastfeeding, sexual activity, parental auoefficacy, etc.), as well as those relating to the specific period (postpartum, post-delivery, postnatal, puerperium…) both in free text and in controlled language. This work was checked by a group of midwives belonging to the research team, and they evaluated its fit for the area where the study was being carried out.

The second search looked for instruments that measured the previously identified constructs, with a double aim: 1) to operationalize them by generating a pool of items and 2) to choose the gold standard. To the terms used in the first search were added those referring to tools (tool, instrument, questionnaire, survey, test, data collection, measure…) and their metric properties (validity, reliability, psychometric, etc.). The “consensus-based standards for the selection of health measurement instruments (COSMIN)” checklist was used to evaluate the quality of the metric properties of the existing tools. This checklist describes the validity (content, construct and criteria), reliability (internal consistency, reproducibility, measurement error), and responsiveness of a questionnaire [35].

  1. 2)

    Review of the constructs and items by a committee of experts.

A multidisciplinary team was formed, with 4 primary care midwives, 3 puerperal and paediatric nurses, 1 paediatrician, 3 psychologists, 3 methodologists (a psychometrist and 2 researchers in health sciences) and 2 puerperal women. In order to avoid bias, the purpose of the tool to be developed and the definition of the aspects to be evaluated were explained in writing. Each of the 192 items from the initial pool was evaluated individually for (A) its relevance to a positive postpartum experience and (B) its fit to the reference population. They also checked (C) the clarity of the items and (D) the relevance of the response scales, giving them a score of 0 to 10 in each aspect. An average was estimated for each item and those that did not obtain a score of 8 or higher in relevance and fit to the population were eliminated. Any questions that were still considered important and suitable, but did not obtain averages equal to or higher than 9 points in clarity and relevance on the answer scale were reformulated. Finally, using an open question, the experts assessed the need to include new questions or answer categories. The resulting questionnaire was piloted with a sample of 12 women, who reported on their perception of the relevance, fit and clarity of each of the items.

  1. 3)

    Preliminary analysis of the properties of the instrument.

In this phase, the web layout of the pilot questionnaire was created and the gold standards selected for each construct were administered to a sample of 100 puerperas.

The women were recruited by their midwife in postpartum check-ups. They were offered the option of receiving a link to the questionnaire in digital format. They were also encouraged to share the link with other women in the same situation. All pregnant women over 18 who spoke enough Spanish to understand and answer the questions presented could be included. When the woman accessed the link, she received information about the characteristics of the study, the type of use that would be made of the data (for research purposes only) and about the possibility of withdrawing from the study at any time without this compromising her standard of care. The questionnaire was only filled in if informed consent was given.

Once the information was gathered, an analysis was carried out to evaluate the presence and patterns of absent results, atypical results and compliance with the basic assumptions underlying the general linear model (GLM). Next, the descriptive statistics of each item were calculated (% cases chosen in each option, mean, standard deviation, asymmetry and kurtosis). For the analysis of the internal structure, decision-making techniques were used for the optimal number of factors to be extracted within each construct, and exploratory factorial analyses (EFA) were carried out. The internal consistency of each section was also calculated, as well as how much this indicator would vary if each item were removed. If items had low levels of inclusion factor, and their removal increased the internal consistency of the section, they were eliminated.

  1. 4)

    Management and analysis of the metric properties of the final version.

The findings in the pilot (high commonalities, no cross-loadings, strong primary loadings per factor, high number of indicators per factor and no missing results), and the moderate length of each section of the questionnaire (15 items maximum), suggest that a sample size over 200 offers sufficient statistical power for the CFA of data [36, 37]. In addition, the possible effect of other variables was considered, such as age, parity, nationality (Spanish/immigrant), level of education (low/medium/high) or paid employment (yes/no). Following the same procedure for gathering information as in the pilot, registered healthcare professionals asked 443 women to take part in the study, while another 64 were added by other healthcare professionals or through informal contact between participants.

A preliminary analysis of the information gathered was carried out in order to refine the data and check compliance with the basic assumptions of the GLM. After that, the fit of the models resulting from the EFAs of step 3 was tested through confirmatory factor analysis (CFA). Given the ordinal nature of the items, the estimation method used was diagonally weighted least squares (DWLS) using a polychoric correlation matrix. The evaluation of fit of the model to the data was based on the value of the chi-square/df ratio, together with information provided by the comparative fit index (CFI), the root mean square error of approximation (RMSEA) and its standardization (SRMR). Models with chi-square/df ratio results less than 5, equal to or greater than 0.90 in CFI and equal to or less than 0.10 in RMSEA and SRMR were considered acceptable [38, 39]. The pattern of correlation with other variables to obtain evidence of external convergence was analysed using Spearman’s rank correlation coefficient (rs). Finally, the analysis of the internal consistency of the sections was carried out using the coefficients ordinal alpha (ordinal α) and McDonald’s omega (ω). As a criterion to determine the presence of a possible source of distress, we propose using scores above the 75th percentile value for risk factors and below the 25th percentile for protective factors. The statistical program R (v.4.0.2) was used.

Results

The main results of the four phases completed before reaching the optimized version of the questionnaire are described below:

Phase 1. Focused search of the scientific literature

After the first search, it was established that, in addition to the characteristics and evolution of the birth (newborn weight and state of health, type of delivery and surgical procedures performed, e.g. episiotomy), the variables that best determine well-being during the postpartum period could be classified into the 6 groups proposed by the ICHOM. These comprised: (1) functional status/quality of life related to health: possible complications (e.g. haemorrhages, diarrhoea or fever) pain and ability to perform daily activities, all of which are aspects linked to physical recovery in the puerperium [3, 40, 41]. (2) Incontinence as a sequela in the late postpartum period [41, 42]. (3) Recovery of sexual activity and satisfaction [43, 44]. (4) Successful breastfeeding [18]. (5) Transition and adaptation to the role of mother: evolution/confidence in parenting and the satisfaction of the newborn’s needs [45]. (6) Mental health: aspects related to body dissatisfaction, lack of sleep [46] and moods/postnatal depression [41, 47].

The second search focused on tools for the evaluation of these aspects. As a result of this process, it was concluded that there was a need to create a new, complete, updated tool that would be suitable for our social and health environment online.

Table 1 presents the sections that were established as most relevant, and the first column shows the number of items that were initially generated to operationalize each section.

Table 1 Evolution of the number and distribution of the items

In addition, the questionnaires that would be used as gold standards were chosen to determine the convergent validity of the instrument. These, together with their psychometric properties in our sample, are presented in Table 2:

Phase 2. Review of the constructs and items by a committee of experts

Table 2 Gold standards and their psychometric properties in our sample

After the review by the committee, the 81 items that obtained a median score of less than 8 in “Relevance” were eliminated, leaving almost all the scales with a lower number of items than at the beginning (see Table 1, column 2) and 2 new items were added to the questions on alarm signals. Finally, the content of 5 items was reformulated or qualified to increase their clarity and the number of possible responses to each item was homogenised, so that they were all a Likert scale with five alternatives.

After this version was given to the group of 12 women, a key question was added to the scales of pain, incontinence, sexual activity, practice and self-efficacy with breastfeeding, sleep problems and emotional difficulties/depression, so that the respondents would not have to answer all the questions if they did not have that problem. Additionally, the research team carried out a second check of the eliminated items, in case they considered it necessary to reinstate any questions, and the decisions made previously were reaffirmed.

Phase 3: Preliminary analysis of the properties of the instrument

To evaluate the comprehensibility, readability, duration and initial properties of the final questionnaire (comprising 62.75% of the initial items), it was formatted and a pilot was carried out with 100 postpartum women.

The results showed that it was likely to be used, since it takes around 25–30 min to complete it in its entirety and much less if the scales are used separately. Moreover, most of the respondents considered it easy to understand and interesting.

Based on the findings of the exploratory and internal consistency factor analyses, it was decided that all the elements would be kept, pending further evidence regarding their function.

Phase 4: Management and analysis of the metric properties of the refined version

Characteristics of the participants

Of the 443 women invited to participate by the research team, 348 women answered at least one question, and 216 (48.75%) completed the entire questionnaire. A further 64 were completed by puerperal women contacted through other healthcare workers or through informal contact between participants. Finally, 280 puerperal women with a mean age of 34.93 (SD = 4.74) gave their consent and answered all the questions. The characteristics of the participants can be seen in Table 3.

Table 3 Characteristics of the participants

After checking for the absence of patterns of missing results and impossible results in the data, we proceeded to make a formal description of each item (Arithmetic Mean (M) and its 95% confidence interval (95%CI), standard deviation (SD), asymmetry index (AI) and kurtosis index (Ku.)), and evaluate the internal structure, reliability and convergent validity of each of the sections that make up the questionnaire.

Functional state

Alarm signals

This is a one-dimensional scale (χ2 2 = 30.96, df = 27, p = 0.273, χ2/df = 1.15, CFI = 0.96, TLI = 0.95, RMSEA (90% CI) = 0.02 (0.00-. 05), SRMR = 0.12) made up of 11 binary items (see Table 4). Its internal consistency is around what was expected, given the heterogeneity of its content (ordinal α = 0.67; ω = 0.47).

Table 4 Characteristics of the items that measure alarm signals in the postpartum period

Positive response to any of the items is indicative of potential problems that may require specialized attention.

Pain

This is a scale made up of 8 items: 1 binary that acts as a key question; and 7 politomic ones to locate the origin and intensity of the pain (see Table 5). These 7 elements are adjusted to both bidimensional models (χ2 = 14.49, df = 13, p = 0.340, χ2/df = 1.11, CFI = 0.96, TLI = 0.94, RMSEA (90% CI) = 0.03 (. 00-0.09), SRMR = 0.07). 3 belong to the section on specific postpartum pain and 4 to more non-specific or general pain. The internal consistency of the subsection on specific pain is good (ordinal α = 0.73; ω = 0.71) while that of nonspecific pain is lower, as expected for its content (ordinal α = 0.51; ω = 0.50).

Table 5 Characteristics of items that measure pain in specific areas of the body

Scores of 3 or higher on the sum of specific pain or general pain items are considered high.

Functionality/QOL

This is a one-dimension scale (χ2 = 8.76, df = 2, p = 0.003, χ2/df = 4.38, CFI = 0.92, TLI = 0.91, RMSEA (90% CI) = 0.01 (0.00-0.12), SRMR = 0.02) made up of 4 items (see Table 6), with high internal consistency (ordinal α = 0.82; ω = 0.86) and high correlation with the scores obtained in the SF-12 v2, used as the gold standard (rs = 0.71).

Table 6 Characteristics of the items that measure functionality

Scores of 12 or higher on this functionality scale are considered indicative of difficulties.

Incontinence

This is a very simple scale, made up of one single item; its formal description is shown in Table 7. This item has a high correlation with the scores obtained in the ICIQ (rs = 0.94). It is unnecessary to calculate the internal consistency or the factor loading.

Table 7 Characteristics of the item that measures incontinence

The presence of incontinence is considered an aspect that may impair the quality of life and therefore require specific attention.

Sexuality

This is a scale made up of 12 items: 1 binary that acts as a key question and is used to indicate whether the woman has resumed sexual activity; 10 politomic questions, of which 6 are used to find out the degree of satisfaction with sexuality; and 4 questions to locate the possible origin of sexual discomfort, plus 1 additional question to evaluate knowledge about contraception (see Table 8). The 10 elements that measure sexual satisfaction fit the proposed bidimensional model very well (χ2 = 34.80, df = 33, p = 0.382, χ2/df = 1.11, CFI = 0.99, TLI = 0.99, RMSEA (90%CI) = 0.03 (0.00-0.09), SRMR = 0.06). The internal consistency of both the subsection of satisfaction (ordinal α = 0.93; ω = 0.91) and of pain (ordinal α = 0.89; ω = 0.87) are very good. The link of the scores with the gold standard in the pain subsection was moderate and inverse (rs = -0.36) and with the satisfaction subsection it was high and positive (rs = 0.76).

Table 8 Characteristics of the items that measure sexuality

Scores of 15 or less are considered indicative of dissatisfaction with sexual activity.

Breastfeeding

This is a section made up of 25 items, divided into the subsections Knowledge, Practice and Self-confidence in breastfeeding.

Knowledge about Breastfeeding

This is a one-dimensional scale (χ2 = 17.53, df = 9, p = 0.042, χ2/df = 1.95, CFI = 0.99, TLI = 0.99, RMSEA (90%CI) = 0.08 (0.02-0.13), SRMR = 0.06) made up of 6 politomic questions with 5 response alternatives that evaluate some beliefs regarding breastfeeding (see Table 9). The internal consistency was high (ordinal α = 0.89; ω = 0.83). The correlation with the PBSES scores was moderate (r = 0.30).

Table 9 Characteristics of the items linked to breastfeeding

Breastfeeding practice

This is a scale made up of 4 first-order factors: motivation for breastfeeding; personal difficulties with breastfeeding; external difficulties with breastfeeding; and practical breastfeeding, explained by a global second order factor (c2=147.74, df=50, p< .001, c2/df=1.95, CFI=0.96, TLI=0.95, RMSEA (90%CI) = .09 (.07-.11), SRMR=0.07) (see Table 9). The internal consistency of the 12 items that make up the global factor was high (ordinal α = .85; ω=.76). The correlation with the PBSES scores is moderate (r=.41).

Self-confidence in breastfeeding

This is also a one-dimensional scale (c2=10.96, df= 9, p=.278, c2/df=1.21, CFI=0.99, TLI=0.99, RMSEA (90%CI) = .03 (.00-.08), SRMR=0.04) made up of 6 items (see Table 9) with high internal consistency (ordinal α= .82; ω=.77). The correlation with the PBSES scores is moderate (r=.43)

Scores of 22 or less in knowledge and self-efficacy for breastfeeding and equal to or less than 49 in practice are considered low.

Adaptation to role of mother

Parental self-efficacy

This is a one-dimensional scale (χ2 = 270.72, df = 44, p < 0.001, χ2/df = 6.10, CFI = 0.99, TLI = 0.99, RMSEA (90%CI) = 0.12 (0.11-0.14), SRMR = 0.08) made up of 11 items (see Table 10) and very high internal consistency (ordinal α = 0.95; ω = 0.91). The correlation with the total score in the PSOC is moderate (rs = 0.60).

Table 10 Characteristics of the items that measure parental self-efficacy

Scores of 52 or less would be indicative of low perceived self-efficacy.

Perceived social support

This is a one-dimensional scale (χ2 = 54.6, df = 11, p < 0.001, χ2/df = 4.96, CFI = 0.99, TLI = 0.98, RMSEA (90%CI) = 0.13 (0.11-0.17), SRMR = 0.08) made up of 11 items (see Table 11) that present high internal consistency (ordinal α = 0.93; ω = 0.87). The correlation with the total score on the MOS-SSS is high (r = 0.80).

Table 11 Characteristics of the items that measure perceived social support

Scores of 24 or lower would be indicative of low perceived social support.

Mental Health

Self-image

This is a scale made up of 8 items: 1 binary that acts as a key question and is used for seeing whether the woman is satisfied with her current physical appearance; and 7 politomic ones that evaluate the degree of discomfort that body dissatisfaction generates (see Table 12) (χ2 = 41.33, df = 14, p < 0.011, χ2/df = 2.95, CFI = 0.99, TLI = 0.99, RMSEA (90%CI) = 0.08 (0.05-0.11), SRMR = 0.06). Internal consistency is high (ordinal α = 0.90; ω = 0.87). The correlation of the scores with the body dissatisfaction scale of the EDI-3 used as the gold standard was strong (rs = 0.76).

Table 12 Characteristics of the items that measure self-image

Scores of 22 or higher would indicate high body dissatisfaction.

Sleep problems

This is a scale made up of 6 items: 1 binary that acts as a key question and is used to find out if the woman feels satisfied with her quality of sleep in general; and 5 politomic questions that evaluate the origin and type of difficulty (see Table 13) (χ2 = 9.53, df = 4, p = 0.049, χ2/df = 2.38, CFI = 0.97, TLI = 0.93, RMSEA (90%CI) = 0.10 (0.01-0.19), SRMR = 0.08). The internal consistency is acceptable (ordinal α = 0.73; ω = 0.67). The correlation of scores with ISI was moderate-high (rs = 0.61).

Table 13 Characteristics of items that measure sleep problems

Scores of 19 or higher would indicate dissatisfaction with sleep quality.

Depression

This is a one-dimensional scale made up of 6 items (see Table 14) (χ2 = 9.21, df = 9, p = 0.417, χ2/df = 1.01, CFI = 0.99, TLI = 0.99, RMSEA (90%CI) = 0.01 (0.00-0.07), SRMR = 0.05). The internal consistency is very high (ordinal α = 0.89; ω = 0.83). The correlation of scores with the EDPS is very high (rs = 0.82).

Table 14 Characteristics of the items that measure depression

Scores of 14 or higher would indicate the presence of possible mood disorders.

Discussion

When it comes to care of a woman during the postpartum period, as we mentioned in the introduction, it is of special importance to identify and measure not only serious morbidity, but also any other condition that prevents her from progressing in her recovery and adaptation to motherhood or that can affect her well-being in some way [10]. In this task, it is essential not only to take into account the results reported by patients or users of the health service, for example using PROs [57], but also that the measurement instruments enable the woman herself to obtain useful information and make decisions about her own health and that of her family. Likewise, in the selection of these instruments, in addition to demonstrating good metric characteristics and documentation for their interpretation [58,59,60], efficiency at the time of collecting the information is a quality that we must not overlook: if there is the option for selecting a short questionnaire, it should not be necessary to submit the patients to endless batteries of questions that gather identical information.

In line with this, given the lack of comprehensive, specific evaluation instruments with proven metric quality adapted to our environment, a digital instrument or dossier of scales made up of 99 items was developed, which evaluates 6 essential aspects for good psychosocial adjustment and successful coping mechanisms during the postpartum period. These aspects are in perfect harmony with the proposal of self-reported measurements by patients (PROs) that ICHOM made for the collection of health and well-being data in the postpartum period with a focus on the outcomes that matter most to patients [11]. It should be noted that it is a multidimensional self-evaluation instrument which covers the 6 priority areas proposed by the ICHOM and has several advantages: the first is that it can be used as a single measure of global or partial evaluation of the mentioned areas, making use of one or more sections separately; the second is that it is short, as in only 20 min it evaluates the six areas, which have been compared to gold standards equal or similar to those contained in the measurement tools proposed by ICHOM; the third is that it has been conceived and validated as a self-evaluation instrument by women, so it is useful for making shared decisions with the appropriate professional; and the fourth is that it is designed to be used in the EMAeHealth app, which was created through a collaborative research process [33], with the participation of puerperal women, professionals, managers and researchers. This app can easily be linked to the patient’s clinical e-records.

We believe that, if used correctly, it will be an instrument that will permit the collection of useful data for professionals related to postpartum care (e.g. gynaecologists, midwives, nurses and physiotherapists) and other health professionals, but above all that it will be useful for the woman herself, since it will allow for the exploration of her physical, social, emotional and sexual sphere in a short space of time, making it easy for her to make informed decisions about her health during the 12 months following delivery.

Limitations

The ICHOM working groups understand that their function is not to design new measures of results, but to agree on a minimum set of well-validated measures, including the measures reported by patients, that everyone should use [61]. The purpose is to incorporate standard sets in patients’ medical e-records, to be able to make comparisons at different levels. Currently, our instrument has been developed in and for our health, social or cultural context; therefore, it would not be usable without previous adaptation and validation in a different environment. However, this fact is not something inherent to our tool but affects all psychometric measurement instruments; in fact, some recommend deepening evaluative research on the contribution of PRO instruments to a comparison between providers [62] which would contribute to the comparability of results. It might even be thought that, because it is a context-specific tool, it may have greater potential for use as a measurement tool in the clinical environment and in research implementation [63].

On the other hand, the fact that part of the sample has been selected by means of snowball sampling could increase the representation of more proactive women with a higher level of education. However, measures have been taken to avoid introducing bias: the overwhelming majority of the women were selected by 25 midwives belonging to public health centres located in various population areas, both rural and urban, and of different socioeconomic and social characteristics. Given that the women belonging to the snowball sample were referred by these same women, we can assume that they will come from equally varied backgrounds. In light of the sociodemographic data, it can be said that the women in our study are representative of the study population.

Finally, there is lack of evidence regarding the temporal stability of the scores. Information will probably be obtained on this aspect in future studies.

Conclusions

This digital tool for measuring the priority health issues for women during the postpartum period, which is adapted to the cultural and health environment where it was designed (public primary gynaecological care), and which is in Spanish and has good psychometric properties, is considered useful and accessible for women and professionals.

The psychometric quality of the EMA-postpartum instrument, together with the advantages it presents in terms of format and length, would justify its consideration by the ICHOM as a possible PROM tool, within the Standard Set of Outcome Measures for Pregnancy and Childbirth, for the Health and Welfare section. However, a previous psychometric analysis of the properties of the scores derived from EMA-postpartum in other populations of pregnant women is recommended.