Introduction

Over the last decades, healthcare researchers have witnessed the breakthrough changes that have led to new concepts regarding the evaluation and appraisal of health care. Standard measures of morbidity and mortality, just evaluate the effectiveness of medical interventions in chronic diseases. When treatment of this diseases does not significantly modify survival rates and when pharmaceutical or other interventions may cause serious adverse events, there is a need to evaluate the effectiveness in a new way [1]. Health Outcomes Research (HOR) is a relatively recent discipline that focuses on the measurement of the impact of the disease and treatment upon patient perceived health, providing an answer to these new current medical concerns [2]. HOR is applied to clinical and population-based research, leading to the enhancement of the healthcare end results in terms of benefits to the patient and society.

Human papillomavirus (HPV) causes one of the most common sexually transmitted infections (STIs), and high-risk genotypes are related to cancer in several localizations [3, 4]. Diagnosed in more than 90% of cervical cancers, the fourth deadliest cancer in women, human papillomavirus (HPV) is currently the most common pathogen responsible for female cancers. Moreover, HPV infection is associated with many other diseases, including cutaneous and anogenital warts, and genital and upper aero digestive tract cancers [5,6,7]. Recently, features of the cervicovaginal microbiota are found to be associated with the incidence of HPV-related diseases, presenting a novel approach to identify high-risk women through both blood and cervical samples [5]. HPV infection usually occurs soon after starting sexual life and is mostly prevalent in young adults [8, 9]. A previous study suggests an estimated 6.2 million HPV infection annual incidence in people ranging from 14 to 44 years old [10]. Moreover, the overall estimated prevalence of HPV cases (genital warts) in Spain has been notified to be 118/100,000 inhabitants [11]. Several studies have shown that HPV infection has a negative psychosocial impact on well-being and health-related quality of life (HRQoL), both low-risk HPV inducing genital warts and high-risk HPV inducing (pre-) cancerous lesions [12,13,14]. Patients with HPV infection experienced heavier psychosocial burden than the general population, and females experienced higher burden than males [12,13,14]. HPV infection impact on HRQoL dimensions depending on the questionnaire used, but those related with self-image and sexual impact were usually highly impacted [15, 16]. Literature has also found other health areas with a substantial concern among people with HPV, including anxiety, stress, and detriments in sexual functioning [17,18,19,20]. Anxiety is significantly higher in women testing HPV-positive than those without HPV infection [21]. Participating in a routine screening can cause pain, embarrassment, fear, and inconvenience, even when the tests detect no abnormalities [20,21,22]. Despite the wide previous literature, few researchers have used specific questionnaires to measure the psychosocial effects of HPV-infection. Many studies have used generic instruments to measure HPV-infected women feelings such as anxiety or depression, but these tools are broad and may not capture all the salient features caused by the condition. Although some HPV-specific questionnaires have been described, few have been formally developed or validated. The HPV impact profile (HIP) was developed to assess the psychosocial impact of HPV infection and related interventions. The “Cuestionario Específico para Condiloma Acuminado” (CECA-specific questionnaire for condyloma acuminata) [23,24,25], only explores the impact of having genital warts, a particular presentation of HPV clinical spectrum, in patient quality of life. The Functional Assessment of Chronic Illness Therapy-Cervical Dysplasia (FACIT-CD) assesses the quality of life related to the finding of cervical dysplasia, focusing on the physical and psychological fields, but is tedious and should be validated in further studies [26]. To our knowledge, there is no comprehensive instrument designed to evaluate the full spectrum HPV infection-health related quality-of-life impact in women. The HPV-Quality-of-Life (HPV-QoL) questionnaire was developed to determine the impact of HPV infection and related interventions in women HR-QoL. This study provides, for the first time, the development of a new HPV-QoL questionnaire in adult women with HPV along with its preliminary psychometric properties.

Materials and methods

Panel of experts

The questionnaire development process began with the selection of a panel of eight experts composed of six gynaecologists specialized in lower genital tract diseases, one gynaecologist specialist in sexuality and HRQoL, and one methodologist expert in HRQoL. Every step of the development and validation of the questionnaire was supervised by the panel of experts. A literature review and compilation of the articles published about the impact of HPV and/or genital warms in different life aspects of woman HRQoL were first carried out. This search included Medline, Embase, Current Contents, Cochrane Library, and Google scholar. Taking these reviews as a starting point, the panel of experts generated an initial set of questions related to the impact of HPV on different domains of woman quality-of-life: social wellbeing, psychological wellbeing, and interference with activities of daily-living, mental health, concern about infecting other people, worrisome about general health consequences, anxiety, depression sensation, sexual desire, and characteristics of sexual relationship.

Patient’s interviews

Patient’s interviews were carried out to record their opinion on the relevance of the dimensions and items initially considered by the panel of experts, and to collect additional information about those aspects of treatment the expert’s panel might have missed. Patients were questioned about the time during which they had been infected by HPV, their concern about the impact on daily-living, general health possible consequences and patient’s feelings, sensation, and usual behaviour after getting HPV. A total of 20 patients were interviewed in outpatient gynaecology clinic in the Community of Madrid and Málaga (Spain).

Item generation

Combining the initial considerations from the experts, literature review and the information recorded from the patient´s interviews, dimensions held to be relevant were included in the questionnaire, thus generating a comprehensive list of items in affirmative format and reflecting the expressions directly recorded from the patients interviews at its most. The items were designed to ensure they referred to a single concept, avoiding double negation and ambiguity, and using a first-person format. The answers were scored on a Likert-type scale from 1 to 5, as follows: 1 = “Totally agree”; 2 = “Agree”; 3 = “Neither agree nor disagree”; 4 = “Disagree” and 5 = “Totally disagree” [27]. The initially formulated items were evaluated by a semantic discussion and screening process, resulting in 20 items grouped into 4 domains (Fig. 1): General well-being including social well-being and psychological well-being (subdomains), sexuality, concern about infecting other people and worrisome about general health consequences. Social well-being focuses on patient interferences with activities of daily-living and psychological well-being evaluates different aspects related to woman emotional status. Sexuality domain assesses arousal, sexual desire and satisfaction with sexual relations.

Fig. 1
figure 1

Conceptual modelling of the Human Papillomavirus-Quality of Life (HPV-QoL) questionnaire

Subjects

Since the new instrument aims to measure the impact of HPV infection on women HRQoL, its development and validation were based on the selection of adult women previously diagnosed with HPV infection. For patient recruitment, the researchers conducted a systematic sampling in two hospital centres: Hospital Clínico San Carlos (Madrid) and Hospital Quirón (Málaga). Patients were selected if they met the following inclusion criteria: outpatient women over 18 years of age, with HPV infection diagnosis irrespective of previous vaccination against HPV, ability to understand and answer health questionnaires provided in Spanish language (Spain), and willing to sign the informed consent form. Exclusion criteria were pregnancy at the time of the assessment, recent diagnosis of any malignancies and current diagnosis of psychiatric disorders. A multicentre cross-sectional study was conducted under the conditions of usual clinical practice regarding disease treatment, and all patients were requested to grant informed consent.

Three different samples were used: (1) pilot sample: composed of 30 randomly recruited patients to check feasibility, pertinence and understandable items; (2) reduced sample: the first version of questionnaire was administered to a limited sample of 102 women (mean [SD] age: 38.0 [8.6] years, range: 21 to 65 years old) with HPV infection to confirm feasibility and to check eventual items redundancy, wording of items, rating system (5-categories Likert scale) and to explore the scale dimensionality [27]; (3) validation sample: this sample was formed by 252 women with HPV infection enrolled systematically from the Low Genital Tract Unit and fulfilling inclusion–exclusion criteria mentioned above, and was used to check psychometric properties of the HPV-QoL questionnaire. The size of the validation sample was determined based on the criterion of Rummel [28], whereby the ratio subjects/variables should be 4/1 to 7/1. Considering the number of items of the first version of the questionnaire, a minimum of 150 patients was considered an adequate sample. The size of the validation sample was over-dimensioned to allow statistical comparisons between meaningful groups related to the validity study (known group validity) and to explore initial composition of the norms of the scale. Thus, a minimum of 250 women were selected. Table 1 reports the main demographics data of the validation sample.

Table 1 Demographic and clinical data of women (n = 252) included in the psychometric validation study of the HPV-QoL questionnaire

Reduction of the questionnaire

The initial 20 items of the questionnaire were administered to the pilot sample. Patient comments and information about comprehension and reading problems with the proposed items were collected. Contributions of the pilot sample were incorporated to the questionnaire and it was in turn administered to the reduction sample. Data obtained from this sample were then used for the following: (1) to check adjustment of the patient responses to the structure (dimensions or subscales) proposed by the group of experts; (2) to assess the metric properties of the items; and (3) to reword four items considered confusing or need further clarification by interviewers. Reduction of the questionnaire and determination of the underlying dimensions were carried out using a sequence of exploratory factor analyses and were based on the analysis of internal consistency. This factor analysis made use of principal components extraction method. Principal axes method was also explored. Rotation methods included varimax, direct oblimin, promax, quartimax and equamax [24]. The optimum number of factors was determined by application of the Kaiser K1 rule, the percentage of variance accounted for, and the magnitude of the eigenvalues after rotation [29,30,31,32]. Internal consistency was studied with the Cronbach’s alpha reliability coefficient and the change in alpha coefficient after deleting each item from the scale [29,30,31,32].

In the reduction of the length of the questionnaire and analysis of dimensionality, we adopted the proposals of Gorusch and Russell [33,34,35]. Firstly, we discarded those items with a clear floor or ceiling effect (i.e., items with more than 50% of answers concentrated in the first or last answer category). Secondly, an exploratory factor analysis was made with the items of the scale to determine the number of underlying factors or dimensions (subdomains). Lastly, dimensionality (factor analysis) and internal consistency (Cronbach’s alpha coefficient) of each subscale were analysed, assuming them as unidimensional. In this latter step, those items with lower loading in the first dimension or cross-loading in more than one dimension were discarded. Items with lowest contribution to the scale overall alpha coefficient were also dismissed. After each deletion, the same analyses were repeated until the unidimensional structure in each subscale was found to be stable, showing no improvement in the alpha coefficient. Finally, an exploratory factor analysis was carried out with all the optimized subscales to check that the structure remained stable. Throughout this process, the initial questionnaire was reduced first to a version of 11 items and, subsequently, during the reduction phase and factor analysis exploration, four items were split, increasing the total number up to 15. This final version was tested in the validation phase.

Psychometric properties of the final version

The final version of the questionnaire was included in a case report form (see Spanish and English versions of questionnaire in supplementary material), together with information about the clinical impact of the infection in daily life, sociodemographic information, general questions regarding HPV infection and its vaccination, sexual status and related therapies. Spanish versions of the GHQ-12 questionnaire from Goldberg et al. [36] to explore mental functioning, psychological well-being and coping, along with the Female Sexual Function Index were also included [37, 38]. The data obtained from this sample were then used for the following: (1) to assess the metric properties of the questionnaire; and (2) to elaborate exploratory norms for the Spanish female population with HPV infection.

The following metric properties were studied: (1) feasibility: administration time, floor and ceiling effects, percentage of missing values in each item; (2) reliability: internal consistency, evaluated by means of Cronbach’s alpha coefficient and the Pearson correlation coefficient among items and between each item and the total composite score. A test–retest was also performed to check the temporal stability. This was tested by asking 30 women to complete HPV-QoL questionnaire for a second time at a mean of 1.6 days (range 1–3 days) after first administration and evaluated using an intraclass correlation coefficient (ICC) with a mixed effect two factor model and total agreement [39,40,41]; (3) content validity: ensured by the panel of experts and (4) construct validity: was tested by means of different approaches, convergent and divergent analysis was carried out by estimating the Pearson moment r coefficient between each domain, between domain and items and between each item to analyse correlations between items and domains of the questionnaire. Extrinsic construct validity was explored by means of estimating the concurrent validity of the HPV-QoL domains and total scores with those of the GHQ-12 and the Female Sexual Function Index. This estimation was performed by calculating Pearson r coefficients. Known groups validity was performed to analyse the power of the HPV-QoL total and domain scores to discriminate between groups of women with different responses in clinical and demographic criteria that were hypothesized a priori to be different. These criteria included the split of the sample in two subgroups according to responses in GHQ-12 (mental deterioration; GHQ-12 total score < or ≥ 17 points) and sexual dysfunction (FSFI scale > or ≤ 26.55 points) [38, 42]. Differences were analysed with a general linear model including covariates (age, body mass index, menopause, antidepressant therapy, sexual activity, and STIs). Bonferroni adjustment was performed in case of multiple comparisons and estimation of the 95% confidence interval of differences by non-parametric 1000 bootstrap iterations. Magnitude of differences were calculated by determining the statistic effect size [43, 44]. All analyses were made using the IBM SPSS version 20.0, NY, USA, statistical package (https://www.ibm.com/analytics/spss-statistics-software).

Scoring

Summing up the direct scores of the items yields a total composite score ranged from 15 to 75. This total composite score can be transformed to a more intuitive and easier to understand metric with a minimum of 0 (worst quality-of-life) and a maximum of 100 (best quality-of-life), using the following expression: Y′ = [YobsYmin] / [YmaxYmin] × 100  = Yobs; in which Ymax = 75 (maximum total score); Ymin = 15 (minimum total score); Yobs = total score obtained by the patient; and Y′ = transformed score. A similar expression can be used to change the metric of each individual dimension. To facilitate interpretation, a decile distribution was implemented for the total score questionnaire and dimensions as a proxy of scale norms.

Reporting guidelines

The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement guidelines for reporting observational studies were followed www.strobe-statement.org.

Results

The HPV-QoL questionnaire was administered to 252 women, 21 to 65 years of age, in which 18 (7.14%) left 26 of 3,780 items blank (0.69%). The mean completion time was 4.99 (DE = 3.92) minutes (range: 1 to 15 min). Completion time was not statistically associated with either educational level (F = 0.97, p = 0.382) or age (r = − 0.032, p = 0.638). The proportion of unanswered items was not statistically significant regarding the level of studies (linear Chi2 = 2.354, p = 0.125) or age (t = 0.325, p = 0.745). No correlation between age and total scores or scale dimensions was found as rho coefficients ranged from − 0.051 to 0.089 (p = 0.159 or greater). Table 1 shows the main demographic and clinical data.

Factorial structure and components

Taking the Kaiser criterion of eigenvalues equal to or greater than 1 and the visual examination of the Cattel sedimentation graph as a reference, the results obtained using factor analysis revealed the presence of a multifactorial solution with four dimensions (Table 2), which explains 69.37% of the total variance (determinant of the matrix = 0.000; KMO = 0.88; Bartlett’s sphericity test; Chi2 = 2040.813, p < 0.0001). The non-additive test showed an F-statistic of 9.827 (p = 0.002) ruling out the additive hypothesis of the questionnaire. In all items, high factorial weights were observed in their corresponding dimensions, ranging from 0.638 of item 12 to 0.903 of items 14 and 15.

Table 2 Principal component analysis supporting a factorial structure with four domains confirming the conceptual model of the questionnaire HPV-QoL

The first factor (1) was “General well-being”. It included items 1 to 6 and explained 41.65% of the total variance and showed two sub domains that explained 79.74% of the variance of this factor and that were called “Psychological well-being” (items 1 to 4) and “Social well-being” (items 5 and 6). The second factor, which explains 12.97% of the variance, has been labelled as “Health” and contains items 9 to 12. The third factor explains 8.36% and has been labelled as “Sexuality” (items 13 to 15) and the fourth factor explains 6.40% and was labelled as “Contagiousness” (items 7 and 8).

Reliability and variability

Table 3 shows reliability and variability of the HPV-QoL questionnaire together with the mean (SD) scores in each domain and the total questionnaire. Missing items were negligible (less than 2%) and floor and ceiling effects were acceptable except for a ceiling effect of 100% in social well-being subdomain. Internal consistency coefficient was above 0.7 in total score and in every domain apart from contagiousness, with a lower Cronbach´α coefficient (0.285). Temporal stability was supported by a test–retest ICC punctuation above 0.7 in total score (0.817) and in every domain, except contagiousness, with a moderate score of 0.533.

Table 3 Reliability and variability of the HPV-QoL questionnaire in the validation sample (n = 252) of adult Spanish women

Construct validity

Intrinsic convergent and divergent validity is shown in Table 4 and Fig. 2. Table 4 includes correlation coefficients of convergent and divergent validity for item to item and item to domain. Figure 2 includes convergent and divergent validity for total score to domains, domain to domain and domain to items. Most of the correlation coefficients showed higher values, with items in its own domain or total domain, than with items in other domains of the scale. Likewise, items correlations with total score were lower than with items in its own domain. Extrinsic construct validity was supported by means of exploring concurrent validity of the HPV-QoL questionnaire (total score and domains) with FSFI and GHQ-12 questionnaires in total score and domain scores. Table 5 includes correlation coefficients of HPV-QoL with FSFI and GHQ-12, showing highest values of moderate intensity in domains that were related at first: Sexuality in HPV-QoL showed good correlation with all domains and total score of the FSFI. Well-being (general, psychological, and social) had also strong correlation with all domains and total score of GHQ-12. Additionally, total scores of HPV-QoL and GHQ-12 showed correlation coefficients with moderate intensity.

Table 4 Construct validity of the HPV-QoL; convergent and divergent validity item to item and item to dimension
Fig. 2
figure 2

Construct validity of the HPV-QoL questionnaire: convergent and divergent validity total to domain, domain to domain and domain to item

Table 5 Construct validity: concurrent validity of HPV-QoL questionnaire with FSFI and GHQ-12 questionnaires

Finally, construct validity was supported by discriminant or known groups validity. Figure 3 shows that domain scores and total score of HPV-QoL were significantly different according to the presence of sexual dysfunction. Significantly lower punctuation in total score, sexuality and general well-being in women with sexual dysfunction than those without sexual dysfunction was found. Also, HPV-QoL total and all domain scores were significantly lower in women with mental deterioration according with GHQ-12 than those without mental deterioration (Fig. 3). Magnitude of differences as assessed by effect size calculation were moderate to high (> 0.40) in all domains when GHQ-12 was applied and in sexuality and total score when FSFI was used (Table 6). Table 7 shows preliminary normative values for HPV-QoL questionnaire.

Fig. 3
figure 3

Known groups validity of HPV-QoL questionnaire in women according to presence of sexual dysfunction (score in questionnaire FSFI ≤ 26.55 points, graph a) or mental deterioration by GHQ-12 questionnaire (≥ 17 points, graph b). FSFI = Female Sexual Function Index de R. Rosen; GHQ-12 = General Health Questionnaire de Golberg de 12 items

Table 6 Known groups validity effect sizes for active HPV infection, HPV vaccination, sexual activity, menopause status, antidepressant therapy and sexually transmitted infection
Table 7 Interpretability of the HPV-QoL questionnaire: decil values by domain and total score in the validated sample of 252 women 21–65 years old

Discussion

The main goal of the female Lower Tract Disease Units is to improve the accuracy in the diagnosis of the HPV-related diseases. This aim has been traditionally focused on the evolution of the terminology, teaching and colposcopy technologies [45,46,47]. Through the last decades, the patient feelings have become more relevant when assessing their health status. In women with HPV infection, quality of life evaluation may constitute an essential point in decision-making. This study delves into the development of the metric properties of a new specific instrument to measure HRQoL in HPV infected women. According to the results, this clinical tool, called HPV-QoL questionnaire, in its paper and pencil version, is valid, reliable and feasible to use in the daily clinical practice. It can be implemented as a unidimensional instrument, if used as a total composite score.

The results reveal that the HPV-QoL questionnaire provides very good metric properties. Regarding its feasibility, the response rate is highly satisfactory, as almost all patients answered every question with negligible missing items, and irrespective of age and educational level. Additionally, time to administer the instrument was very brief: less than 5 min on average—thus making it feasible to use at any level of healthcare and particularly among outpatients, whose time for the visit is usually short. The reliability of the questionnaire, internal consistency, and temporal stability showed values above the acceptable standards [48, 49], as related to the total composite score and in terms of the individual domains, except for dimension contagiousness, in which internal consistency coefficient was low. It should be noticed that social well-being subdomain was significantly associated with ceiling effect. The different aspects analysed in relation to the validity of the questionnaire have also led to satisfactory results. The study of the responses based on exploratory factor analysis, and the relationship between items, items to domains and total score, supported the initially proposed theoretical structure. Specifically, the present study corroborated the presence of four main domains, with two subdomains: general well-being (this with subdomains psychological well-being and social well-being), health, contagiousness, and sexuality. The combination of all domains provides a composite meaningful total score that represents the impact of the HPV infection on women HRQoL. The instrument was able to correlate with domains of GHQ-12, including stress, self-esteem, coping and total mental score. In addition, total and sexuality domain scores of the HPV-QoL questionnaire were significantly correlated (with moderate correlation coefficient) with all domains of FSFI questionnaire as hypothesized initially. Finally, this study stresses the ability of the questionnaire, based on the total score and domains, to discriminate among groups that were assumed to be different because of the presence of either sexual dysfunction or mental deterioration. This last statement is another important component of construct validity testing. Total and sexuality domain scores were significantly different in presence of sexual dysfunction, showing moderate to high magnitude effect sizes. Furthermore, total and specific domain scores were able to find significant differences among women with mental deterioration assessed with GHQ-12, with moderate to high effect sizes. Likewise, given the limited number of participants used to test instrument properties, the present study was able to identify preliminary norms of the tool that should be confirmed with larger samples.

The HPV-QoL questionnaire offers several advantages over another existing related instrument [23,24,25,26]. Although some HPV-specific questionnaires have been described, few have been formally developed or validated. According to the previous literature, few questionnaires can evaluate the psychosocial impact of abnormal Papanicolaou tests or cervical disease [26, 50,51,52,53,54]. Moreover, this is the first study that displays a scale that focus on the impact of HPV-infection in women concerning their perception and feelings. The CECA questionnaire is an instrument only validated for women with genital warts [24, 25], being able to contrast the quality of life in patients with these genital lesions compared with the general population [25, 55]. The HPV Impact Profile (HIP) was developed as a self-administered tool to assess the psychosocial impact of an abnormal Papanicolaou test, CIN of any grade severity, and genital warts. Nonetheless, it does not analyse the total spectra of HPV infection impact [23].

The present study has as a limitation its cross-sectional design that restricts the possibility to examine longitudinal properties of the instrument such as sensitivity to change, responsiveness or predictive validity. Further studies will be needed regarding the ability of the instrument to detect changes in women health status, patient response to treatment or has a role in anticipating prognosis of HPV infection.

Conclusion

The HPV-QoL questionnaire is a novel patient-reported outcomes specific measure to assess health-related quality-of-life in women with HPV infection. The findings of the study suggest that the instrument has good acceptability as well as satisfactory psychometric properties, including validity and reliability of the composite total score. The findings support the use of the HPV-QoL questionnaire as a HRQoL measurement tool in daily medical practice and clinical research.