Study design
We used the SHS, a repeated, cross-sectional, nationally representative survey of residents 15 years and older living in Switzerland. The SHS surveyed participants in three languages (German, French and Italian), depending on their region of residency. The SHS was administered in 1992, 1997, 2002, 2007 and 2012 and the participation rates (the number of participants having participated divided by the number of people invited to participate in the survey) were 70.8%, 68.8%, 63.9%, 66.3% and 53.1%, respectively. For the analyses, we excluded respondents aged less than 18 (N = 2′780), and used data from the remaining 58,023 respondents who participated in waves 2002, 2007 and 2012. The SHS data are anonymous and available upon payment of fees. Ethical approval has been obtained by the Swiss Federal Statistical Office.
Self-rated health forms
The three forms of the self-rated health item are detailed in appendix (Table S2).
Coding schemes of response options
We used four coding schemes of response options (Table S3). First, we coded response options as a binary variable with an emphasis on positive options: “very good, good” versus “middle/moderate/ relatively good, poor, very poor”. This coding scheme was named “Dichotomised with positive focus”. Second, we coded response options as a binary variable with an emphasis on negative options: “very good, good, moderate/relatively good” versus “poor, very poor”. This coding scheme was named “Dichotomised with negative focus”. Third, we treated response options as linear. Fourth, we “linearised” response options with an alternative coding scheme, by recoding response options with ratings values: 1, 2, 3.7, 4.5 and 5, corresponding to an evenly spaced distance on a visual analogue scale [18]. Such transformation improves the interpretation of the mean values of SRH. This recoding has been developed for response options “excellent, very good, good, fair, poor” (three positive and two negative). This linearised coding scheme was applied to form 3, which was the only form to have adopted the “US form” for its responses, but not to forms 1 and 2 which response options were too different (two positive, one neutral, two negative). These four schemes are the most frequently used treatment of response options in health research [19,20,21,22].
Health status variables
The SRH item captures a range of health dimensions [10]. Several health status variables were grouped into four dimensions: physical health, mental health, functional health [11, 23, 24] and health behaviours [10]. The first three dimensions mirror the WHO definition of health, which is “a state of complete physical, mental and social well-being” [25]. Physical health variables included body mass index (BMI), back pains, headaches, cardiac irregularities, chest pain, diarrhoea or constipation, fever and stomach pain or bloating. They also included chronic disease variables such as treatment for allergies, bronchitis, cancer or a tumour, hypertension, kidney stones, mental breakdown, myocardial infarction, stroke and diabetes in the 12 months that preceded the survey. Mental health variables included feeling unable to overcome barriers, loss of control, feeling overwhelmed with problems, feeling tired or exhausted or without energy, and problems with sleeping. Functional health variables included needing assistance to walk, to read and to hear. Health behaviour variables included smoking (yes, no), frequency of alcohol consumption (never, once a day and less, twice a day, three times a day), physical activity during free time, eating fruits daily and eating vegetables daily.
Response options of physical, mental, functional health and chronic disease variables were re-coded as present (1) vs. absent (0). Respondents with missing information were imputed as 0 (absence). BMI was defined following the Quetelet definition (kg/m2). All these health status variables were used as predictors of SRH.
Covariates of self-rated health
We used the following known factors associated with SRH [19, 26,27,28,29,30]: age (continuous), marital status (single, married, widowed, divorced and separated), number of children younger than 15 years living in the household (0, 1, 2, 3 and more), nationality (Swiss, other), education (primary, secondary, tertiary), household monthly income (≤ CHF 2000, CHF 2001–4000, CHF 4001–6000, > CHF 6000), employment status, urban vs. rural area of residency, linguistic region (German, French, Italian), use of medicine in the last 7 days (yes, no) and having friends or relatives to discuss personal issues (yes, no). Employment status had three categories: out of the labour force (including student, unemployed, retired and others), employed full time, and employed part time. Household income was weighted with the number of persons living the household and the number of children less than 15 years old.
Statistical analyses
The three forms of SRH were used as dependent variables. Multivariable regression models were used to assess the contribution of health variables (thirty health status variables, representing thirty health predictors). Linear regression was used when the coding schemes were continuous (linear and linearised) and logistic regression when coding schemes were binary (dichotomous with positive focus and with negative focus). All models were adjusted with covariates of SRH. Age was included as a continuous variable. In sensitivity analyse, age was used as category for stratification purpose (see section Sensitivity analyses). We computed the percentage of explained variances using the adjusted R squared for the linear coding scheme, the MacKelvey and Zavoina pseudo R squared for the dichotomous coding schemes, and reported these percentage of explained variances for the overall model (all health status variables) and by health dimensions (physical health, chronic diseases, mental health, functional health, health behaviours). Analyses were conducted overall and separately for women and men because gendered differences in the production of self-rated health assessments [31, 32].
The three SRH forms were administered at different periods (2002, 2007 and 2012); thus, differences across SRH forms may reflect “true” differences in health status of the general population. To limit the impact of these different periods, all models were adjusted with covariates known to be associated with SRH [19, 26,27,28,29,30]: age, marital status, number of children, nationality (Swiss or foreign), education, income, employment status, living in urban or rural area, linguistic regions, use of medicine in the last 7 days, and having friends or relatives to discuss personal issues.
When the events per variable (EPV) were smaller than 10, we did not estimate the model as they are known to produce incorrect estimates [33]. This occurred to the full model, including all covariates, and using the coding scheme “Dichotomised with negative focus” (out of 7300 patients, 320 had a value of very poor or poor and there were 45 predictors). However, we estimated the models using the coding scheme “Dichotomised with negative focus” for each health dimensions taken separately.
Sensitivity analyses
First, we ran the same analyses on all waves of the SHS surveys, i.e. including waves 1992 and 1997 in which the SRH form was similar to form 1 (2002). Thus, the sample size for SRH form 1 increased to 20,873 men and 25,809 women. Second, we replicated the models stratifying by age groups: 18–35, 36–59 and 60 + . Health status ratings are age dependant [9, 34], as elderly people have been shown to be more optimistic [35,36,37]. Third, we replicated the models stratifying by education because evidence suggests that reliability of SRH may be lower among disadvantaged people [38, 39] and the meaning of rating may vary by education [34].