Response probabilities and response-mode preferences in a self-administered survey

Sjetne, Ingeborg Strømseng; Iversen, Hilde Hestad; Holmboe, Olaf; Helgeland, Jon

doi:10.1186/s13104-019-4328-7

Research note
Open access
Published: 27 May 2019

Volume 12, article number 289, (2019)

BMC Research Notes


Ingeborg Strømseng Sjetne¹, Hilde Hestad Iversen¹, Olaf Holmboe¹ & Jon Helgeland¹

Abstract

Objective

Response rates in surveys continue to fall, and electronic online versions are increasingly replacing paper questionnaires in order to save costs and time. This can influence the composition of the respondent group in surveys. Using data from a national survey of patient experiences with maternity care, we aimed to (1) classify all of the women invited to participate in the study according to their different probabilities of responding, based on registry data, and (2) classify all of the respondents according to different probabilities of choosing a paper questionnaire when an online alternative was available, based on registry and self-reported data.

Results

We found that the likelihood of responding to surveys is strongly influenced by background variables, with age, number of previous births and geographic origin predicting the response probability (range 0.25–0.73). Education level predicted the likelihood of choosing a paper questionnaire: women with less education were more likely (probability 0.50) than women with more education (probability 0.38) to choose a paper questionnaire rather than answer online.

Introduction

A high response rate is a common goal when conducting surveys, but response rates have generally been declining for decades, and there is little hope that this will change for the better [1]. Technological developments make electronic online versions increasingly available, and replacing paper questionnaires with digital solutions saves costs and time.

One approach to understanding the effect of non-responses is to investigate how the backgrounds of subjects influence their propensity to respond. The relevant variables may be available from registries or from a survey itself.

After issuing a white paper in 2009 about pregnancy, birth and postnatal care, the Norwegian Ministry of Health and Care Services commissioned a national survey among the users of the relevant health-care services. All phases of the care were to be included, with special attention paid to immigrant women. The Norwegian Institute of Public Health is responsible for conducting surveys to collect patient-reported experience measures among health-care users.

This paper presents two analyses that were a side product of the national survey of patient-reported experiences with maternity care in Norway in 2011. The aims were:

1. To classify all women invited to participate in the study according to their different probabilities of responding, based on individual data collected from registries.
2. To classify all of the respondents according to their different probabilities of using a paper questionnaire when an online version is available as an alternative, based on individual data collected from registries and additional respondent-reported data.

Main text

Methods

A national survey

A questionnaire and data collection routines were developed for this specific population. The final questionnaire consisted of 145 items in total (comprising 16 pages in the printed version) collecting the women’s description of their experiences and sociodemographic information [2].

We included women who gave birth in a birthing institution or hospital department during the last quarter of 2011 and were aged 16 years or older. Based on our experience from previous patient surveys, the sample size was set to 400 potential respondents in each institution. All women at hospitals with fewer than 400 births during the inclusion period were included, while a random sample of women was drawn from hospitals with more than 400 births. The Medical Birth Registry, which also provided clinical information about the women, performed the sampling. Statistics Norway provided data about the countries where the women were born, and this information was coded in four categories: (1) Norway; (2) Asia, Turkey, Africa, South America; (3) Eastern Europe; or (4) Western Europe, North America, Oceania.
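The sampling rule described above can be sketched in a few lines. This is only an illustration under stated assumptions, not the procedure used by the Medical Birth Registry: the function name `draw_sample`, the hospital labels and the ID lists are hypothetical, and the source does not say how a hospital with exactly 400 births was treated (both readings give the same result here).

```python
import random

def draw_sample(births_by_hospital, cap=400, seed=0):
    """Return {hospital: list of sampled IDs} under a per-hospital cap.

    Hospitals at or below the cap contribute every woman (a census);
    larger hospitals contribute a simple random sample of `cap` women.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    sample = {}
    for hospital, women in births_by_hospital.items():
        if len(women) <= cap:
            sample[hospital] = list(women)
        else:
            sample[hospital] = rng.sample(women, cap)  # without replacement
    return sample

# Hypothetical sampling frame: one small unit and one large hospital.
frame = {
    "small_unit": [f"S{i}" for i in range(150)],
    "large_hospital": [f"L{i}" for i in range(1200)],
}
sample = draw_sample(frame)
sizes = {hospital: len(women) for hospital, women in sample.items()}
```

With this design, women at large hospitals have a lower inclusion probability than women at small units, which is one reason weighting is needed before national estimates are produced.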

Before the national study, the postal and electronic data collection modes were compared in a randomized study of effectiveness and costs [3]. Based on the findings of that study, all the included women were contacted by mail in the national survey about 17 weeks after the birth. The initial invitation offered an electronic response option only, and a printed questionnaire was enclosed with each of the two reminders that were subsequently sent to non-respondents.

Statistical procedures

The Response Homogeneity Group (RHG) model was used to reduce bias from nonresponse [4] and to model response preference. In this model, the initial sample is partitioned into groups based on data in the sampling frame or registry. The response probability is assumed constant within each group, and is estimated from the observed response rates.

In addition to being an important step in weighting procedures, the models produce observations about the composition of the survey sample that are valuable per se.
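The core of the RHG model, estimating one response probability per group from the observed response rate, can be sketched as follows. This is a minimal illustration: the group labels and counts are invented, and the inverse-probability weight is the standard use of such estimates in weighting, which is an assumption here because the text does not spell out the weighting formula.

```python
from collections import defaultdict

def rhg_response_probabilities(records):
    """Estimate the response probability in each group.

    `records` is an iterable of (group_label, responded) pairs, one per
    invited person. The estimate is the observed response rate per group.
    """
    invited = defaultdict(int)
    responded = defaultdict(int)
    for group, did_respond in records:
        invited[group] += 1
        responded[group] += int(did_respond)
    return {group: responded[group] / invited[group] for group in invited}

def nonresponse_weights(probabilities):
    """Inverse-probability weights (assumed weighting scheme)."""
    return {group: 1.0 / p for group, p in probabilities.items() if p > 0}

# Invented data: group "A" responds at 73%, group "B" at 25%.
records = (
    [("A", True)] * 73 + [("A", False)] * 27
    + [("B", True)] * 25 + [("B", False)] * 75
)
probs = rhg_response_probabilities(records)
weights = nonresponse_weights(probs)
```

In this toy example, each respondent in group "B" stands in for four invited women, whereas a respondent in group "A" stands in for about 1.4.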

To identify predictors for responding, we initially tested 15 variables that we hypothesized to be associated with responding, and that were available in our data set. The candidate variables were tested in bivariate logistic regression models with response as the outcome variable. The woman’s age, number of previous births, geographic origin (four categories), Caesarean section and episiotomy were significantly associated with response to the survey (p < 0.001). These variables were all entered into a multivariate regression model for response probability, addressing the first aim of the study. We used the recursive partitioning method with bootstrapping to construct a regression tree [5,6,7], using the rpart package in R, version 3.0.3 [8].

In order to classify the participating women according to their probability of responding via a printed questionnaire when there was the alternative of answering online (to address the second study aim), we selected potential predictors in the same way as described above and supplemented with self-reported data from the respondents. The variables included were the women’s age, number of previous births, region of birth, Caesarean section, instrument use, episiotomy, size of the municipality of residence, self-reported employment status, self-rated health and education level.

The register data were complete, and the item missing rates were all below 2.4% in the self-reported variables.

For both models, we set the minimum size of groups to 100 women per RHG, to avoid generating RHGs with very few women.
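To make the role of the minimum group size concrete, the sketch below grows a small binary tree on a 0/1 response outcome and refuses any split that would leave fewer than `min_size` women in a group. It is a simplified stand-in and not the rpart algorithm: there is no bootstrapping, pruning or complexity parameter, the split criterion is a plain reduction in squared error, and the data are synthetic.

```python
def sse(ys):
    """Sum of squared deviations from the mean of a 0/1 outcome."""
    if not ys:
        return 0.0
    mean = sum(ys) / len(ys)
    return sum((y - mean) ** 2 for y in ys)

def best_split(rows, ys, min_size):
    """Return (gain, feature, threshold) for the best allowed split, or None."""
    base = sse(ys)
    best = None
    for f in range(len(rows[0])):
        # Candidate thresholds: every observed value except the largest.
        for t in sorted({r[f] for r in rows})[:-1]:
            left = [y for r, y in zip(rows, ys) if r[f] <= t]
            right = [y for r, y in zip(rows, ys) if r[f] > t]
            if len(left) < min_size or len(right) < min_size:
                continue  # would create a group that is too small
            gain = base - sse(left) - sse(right)
            if gain > 1e-9 and (best is None or gain > best[0]):
                best = (gain, f, t)
    return best

def grow(rows, ys, min_size=100):
    """Grow the tree; each leaf stores its size and observed response rate."""
    split = best_split(rows, ys, min_size)
    if split is None:
        return {"n": len(ys), "p": sum(ys) / len(ys)}
    _, f, t = split
    left = [(r, y) for r, y in zip(rows, ys) if r[f] <= t]
    right = [(r, y) for r, y in zip(rows, ys) if r[f] > t]
    return {
        "feature": f,
        "threshold": t,
        "left": grow([r for r, _ in left], [y for _, y in left], min_size),
        "right": grow([r for r, _ in right], [y for _, y in right], min_size),
    }

def leaves(tree):
    """List the leaves (the resulting groups) from left to right."""
    if "p" in tree:
        return [tree]
    return leaves(tree["left"]) + leaves(tree["right"])

# Synthetic data: rows are (age_group, parity); only age_group matters.
rows, ys = [], []
for age_group, n_responding in ((0, 30), (1, 70)):
    for parity in (0, 1):
        for i in range(100):
            rows.append((age_group, parity))
            ys.append(1 if i < n_responding else 0)

tree = grow(rows, ys, min_size=100)
groups = [(leaf["n"], leaf["p"]) for leaf in leaves(tree)]
```

Here the tree splits once on the informative variable and then stops, giving two groups of 200 women with response rates of 0.3 and 0.7. The uninformative variable is never used because splitting on it yields no gain.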

Results

Of the 8670 sampled women, 4904 (56.6%) responded. Table 1 lists the characteristics of the RHGs, that is, the groups of women with the same estimated probability of responding. The response probability in the eight RHGs varied from 0.25 to 0.73.
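As a check on these figures, the overall response rate and its relation to the group-level estimates can be written out. The symbols are introduced here for illustration only: n and r are the numbers of invited and responding women overall, and n_g and r_g are the corresponding numbers in RHG g.

```latex
\[
\hat{p} = \frac{r}{n} = \frac{4904}{8670} \approx 0.566,
\qquad
\hat{p}_g = \frac{r_g}{n_g},
\qquad
\hat{p} = \sum_{g=1}^{8} \frac{n_g}{n}\,\hat{p}_g .
\]
```

Because the overall rate is a size-weighted average of the group rates, it necessarily lies between the smallest and largest group estimates, here 0.25 and 0.73.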

Table 1 Response homogeneity groups

Table 2 shows the results of applying the same modelling procedure to predicting whether the respondents chose to respond on a paper questionnaire. Only educational level was eventually retained in the model.

Table 2 Response homogeneity groups for the use of a paper questionnaire among survey respondents

Discussion

In this side product of a national survey, we have confirmed that the likelihood of responding to surveys is strongly influenced by background characteristics. The response probability varied considerably (from 0.25 to 0.73) among groups in our sample. Age, number of previous births and geographic origin predicted the response probability, and education level alone predicted the probability of respondents opting to use a paper questionnaire as a response mode.

To our knowledge, there are no previous publications about using this specific approach to explore survey participation in different sample subgroups. That survey participation in general may vary between groups is a known phenomenon in surveys using self-administered data collection. In a similar national survey of experiences with maternity care in the United Kingdom in 2010, the respondents were more likely to be older, to be married, to be living in the least deprived areas and to be born in the United Kingdom, compared to non-respondents [9]. Our analysis also showed that response probability was higher for older women and women from Norway or other western countries. The immigrant population in Norway has increased markedly over the past 20 years, from approximately 5% in 1999 to 17% in 2018 [10]. Most likely, this has consequences for the response rates in many populations.

A review of studies comparing response rates between different data collection methods found that response rates in web surveys are lower than in alternative response modes, but that web surveys are the most efficient with regard to time and costs [11]. Internet use in Norway in 2012 was ubiquitous among women of child-bearing age, with 100% of that population having used the Internet within the previous 3 months, and digital skills among the general Norwegian population are among the best in Europe [12, 13]. We therefore assume that online responding is an easily accessible option in this relatively young population. In a study comparing postal versus mixed mode (internet and paper questionnaire in combination), the authors concluded that a mixed mode solution should be a method to consider, in particular if the target population is young and well educated [14]. In showing that education level predicted response mode preference, our study draws attention to possible consequences of ceasing to offer a postal response mode in populations that also include older persons. According to Norwegian statistics, 49% of the general population between 30 and 34 years was educated at college or university level in 2017, compared to 22% among persons older than 66 years [15, 16]. Thus, there is a risk that older persons will be underrepresented in surveys that offer online responding only.

The availability of a large high-quality data set provided the main motivation for reporting on this side product. We believe that our findings show that exploring the consequences of population diversity is highly relevant, and that the findings represent helpful input in informing considerations before deciding on future data collection procedures.

Limitations

The present data were collected in 2012, which could be regarded as a limitation given the rapid ongoing developments in this field. We believe that even if response rates continue to decrease, it can be assumed that patterns like those we found are still present, and hence worthy of attention.

Future studies should include a larger set of background data about the complete population, such as the education level of non-respondents.

Availability of data and materials

The data from the current study are available to named researchers at the Norwegian Institute of Public Health.

References

1. Galea S, Tracy M. Participation rates in epidemiologic studies. Ann Epidemiol. 2007;17(9):643–53.
2. Sjetne IS, Iversen HH, Kjøllesdal JG. A questionnaire to measure women’s experiences with pregnancy, birth and postnatal care: instrument development and assessment following a national survey in Norway. BMC Pregnancy Childbirth. 2015;15:182.
3. Bjertnaes OA, Iversen HH. User-experience surveys with maternity services: a randomized comparison of two data collection models. Int J Qual Health Care. 2012;24:433–8.
4. Särndal C-E, Swensson B, Wretman J. Chapter 15: Nonresponse. Model assisted survey sampling. New York: Springer; 1992. p. 556–95.
5. Clark LA, Pregibon D. Tree-based models. In: Chambers JM, Hastie TJ, editors. Statistical models in S. Boca Raton: Chapman & Hall/CRC; 1992. p. 377–419.
6. Therneau TM, Atkinson EJ. An introduction to recursive partitioning using the RPART routines. 2019. https://cran.r-project.org/web/packages/rpart/vignettes/longintro.pdf. Accessed 25 Apr 2019.
7. Breiman L, Friedman JH, Olshen RA. Classification and regression trees. Belmont: Wadsworth; 1983.
8. R Core Team. R: a language and environment for statistical computing. Vienna: R Foundation for Statistical Computing; 2017.
9. Redshaw M, Heikkila K. Delivered with care: a national survey of women's experiences of maternity care. Oxford: National Perinatal Epidemiology Unit, University of Oxford; 2010.
10. Statistisk sentralbyrå [Statistics Norway]. Innvandrere og norskfødte med innvandrerforeldre [Immigrants and Norwegian-born to immigrant parents]. Oslo/Kongsvinger: Statistisk sentralbyrå [Statistics Norway]. https://www.ssb.no/en/befolkning/statistikker/innvbef/aar. Accessed 05 Mar 2018.
11. Blumenberg C, Barros AJD. Response rate differences between web and alternative data collection methods for public health research: a systematic review of the literature. Int J Public Health. 2018;63(6):765–73.
12. Eurostat. Digital economy and society statistics - households and individuals. 2018.
13. Fjørtoft TO. Norge i Europatoppen på digitale ferdigheter [Norway among the best in Europe in digital skills]. Oslo/Kongsvinger: Statistisk sentralbyrå [Statistics Norway]; 2017.
14. Zuidgeest M, Hendriks M, Koopman L, Spreeuwenberg P, Rademakers J. A comparison of a postal survey and mixed-mode survey using a questionnaire on patients' experiences with breast care. J Med Internet Res. 2011;13(3):e68.
15. Statistisk sentralbyrå [Statistics Norway]. Befolkningens utdanningsnivå [Educational attainment of the population]: Statistics Norway. https://www.ssb.no/utdanning/statistikker/utniv/aar. Accessed 09 June 2018.
16. Fjørtoft TO. Unge og høyt utdannede er flinkest foran PC-en [Young and well educated are the best in front of the PC]. Statistisk sentralbyrå [Statistics Norway]; 2017.

Acknowledgements

This study used data from the Medical Birth Registry of Norway. The interpretation and reporting of these data are the sole responsibility of the authors, and no endorsement by the Medical Birth Registry of Norway is intended nor should be inferred.

Advice given by colleague Liv Merete Reinar (RNM MSc) has been a great help during this study.

Funding

The authors are employees at the Norwegian Institute of Public Health. Conducting surveys and evaluating their design is among their regular work activities. The study had no external funding.

Author information

Authors and Affiliations

Health Services Research, Norwegian Institute of Public Health, Oslo, Norway
Ingeborg Strømseng Sjetne, Hilde Hestad Iversen, Olaf Holmboe & Jon Helgeland

Contributions

ISS, HHI, OH, and JH contributed substantially to the design of the study and the acquisition, analysis and interpretation of the data. ISS, HHI, OH, and JH participated in drafting and revising the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Ingeborg Strømseng Sjetne.

Ethics declarations

Ethics approval and consent to participate

The Regional Committee for Medical and Health Research Ethics approved the study. Informed consent was considered as obtained when the women actively responded to the survey after having received the written information about the study.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

About this article

Cite this article

Sjetne, I.S., Iversen, H.H., Holmboe, O. et al. Response probabilities and response-mode preferences in a self-administered survey. BMC Res Notes 12, 289 (2019). https://doi.org/10.1186/s13104-019-4328-7

Received: 04 March 2019
Accepted: 21 May 2019
Published: 27 May 2019
DOI: https://doi.org/10.1186/s13104-019-4328-7
