The study population consisted of the ongoing Kuopio Osteoporosis Risk Factor and Prevention (OSTPRE) study cohort. This population-based long-term follow-up study includes all the 14,220 Caucasian women born between 1932 and 1941 who lived in the Kuopio Province, Eastern Finland in April 1989. A postal questionnaire was mailed to 14,120 of these women at baseline in 1989 with 13,100 (92.8%) responding. The follow-up questionnaires were mailed at 5-year intervals (1994, 1999, 2004, 2009, and 2014) to women who responded to the baseline enquiry and were alive at the time. The response rate varied between 93% and 80% throughout the study. The OSTPRE study was approved by the Kuopio University Hospital Ethics Committee on 28 October 1986 and is performed in accordance with the ethical standards of the Declaration of Helsinki. Informed consent has been provided before the onset of each data collection.
Follow-up and baseline age
The final study sample (n = 12,715) consisted of women with available height and weight information. Due to the 10-year age range, women’s initial follow-up age in 1989 varied from 47 to 57 years. Therefore, start of the follow-up was age matched according to women’s 58th birthday by using either the baseline, 5th, or 10th year questionnaire information, whichever closest. Correspondingly, the start of register follow-up period was also matched at the 58-year birthday for both outcomes of interest (hip fracture and death). The individual baseline day for the analysis was set from the oldest to youngest women between 1 January 1990 and 31 December 1999, respectively. Follow-up was terminated to end of the registry data (31 December 2014) or to the event of interest (hip fracture and death) whichever occurred first. During the follow-up analysis, women reached a mean (median) age of 77.9 (77.8) years over a follow-up time of 18.3 (18.7) years. Hip fracture risk was investigated in two separate age periods as early (58–70 years) and late (> 70 years) due to non-proportional hazard between the groups during the first half of the follow-up.
Bone mineral density measurements
Baseline responders were asked about their willingness to participate in bone densitometry (DXA) and clinical measurement protocols. Altogether, 11,055 women stated their willingness, which formed a pool for a stratified random sample of 3686 women invited to the measurements. Out of these, 3222 women underwent the baseline DXA scan. This sample consisted of a random population sample (n = 2025) and 100% samples (n = 1197) of women with higher risk profiles, including menopause within 2 years, diseases, or medication affecting bone, such as rheumatoid arthritis [17]. The DXA subsample used in this study (n = 3163) consisted of women with valid femoral neck bone densitometry at baseline 1989, at 5th year or at 10th year follow-up, where closest matching to 58 years of age was used as a baseline value. Subjects with both baseline and 25-year BMD follow-up data available (n = 792) formed the subsample for proportional bone loss rate (%) comparison between groups. Body composition follow-up data was not recorded at the time being. Femoral neck T-score value was determined according to the Third National Health and Nutrition Examination Survey (NHANES III) with reference to white women aged 20–29 years [18].
The BMD measurements were carried out using a Lunar DPX scanner (Lunar, Madison, WI, USA) at both baseline (1989) and 5th year follow-up (1994). The original scanner was replaced with a DPX-IQ during the 10th year measurement in 1999. The manufacturer’s quality protocol was followed daily. To reveal the agreement between the scanners, 90 women were scanned with both instruments on the same day. The use of patient measurements rather than phantom measurements was emphasized. The long-term reproducibility of the DXA scanner determined by phantom measurements was 0.4% [1]. The detailed description of OSTPRE clinical measurement follow-up protocol and DXA cross-calibration has been published previously [6].
Body mass index
Body mass index was calculated from self-reported height and weight. The self-reported weight and measured weight have previously shown a good level of agreement in postmenopausal women and are considered reasonably reliable for BMI calculations [19]. In this cohort, the baseline body weight of the DXA subsample (n = 3163) showed a high correlation (Pearson r = 0.97) between the self-reported (67.8 kg (SD 11.7)) and measured baseline values (68.8 kg (SD 11.9) which persisted throughout the follow-up.
The BMI was categorized in two ways: first according to the WHO definition (kg/m2) as normal weight (18.5–24.9), overweight (25.0–29.9), and obese (30.0 or higher) and second, BMI was modeled in deciles to represent the overall 25-year hip fracture incidence. The hip fracture probability associated with a given BMI level was calculated by dividing the total number of hip fractures by the attributable follow-up time in person years for each BMI decile.
Available weight and height information closest to the age of 58 years were used as a baseline value to investigate BMI and early hip fracture incidence between 58 and 70 years. Correspondingly, self-reported weight at 70 years of age was utilized to update the BMI information for late hip fracture analysis. Both BMI values were determined by utilizing available follow-up information (baseline, 5th, 10th, 15th, 20th) whichever was closest to the age of interest. The correlation of baseline weight against the 5th and 10th year values was 0.92 and 0.87 with a mean weight increase of + 2.2 kg ((SD) 4.9) and + 3.3 kg (6.1) towards the latter, respectively. A small proportion of women with low BMI (15.0–18.49, n = 59) were excluded from the normal-weight category in the WHO classification but are included to the lowest decile of BMI presenting the overall incidence rate of hip fractures in the 25-year follow-up.
Registry outcomes: fractures, causes of fracture, and deaths
The cohort’s hip fracture information was obtained from the nationwide Hospital Discharge Register data (HILMO) as well as by postal enquiries sent to the participants (at 10, 15, 20, and 25 follow-up years). All self-reported hip fractures during the years 1989–2014 were validated by cross-checking patient records. The Finnish Hospital Discharge Register has previously been shown to reliably identify virtually all hip fractures [20]. The relevant International Classification of Diseases (ICD) codes were used to include femoral neck, pertrochanteric, and subtrochanteric fractures (ICD-10 S72.0–S72.2). Women with pathologic (e.g., metastasis) and periprosthetic hip fractures were excluded. We have previously shown that the number of hip fractures from the register was significantly higher than the self-reported one. Women who did not respond to the OSTPRE survey were shown to have significantly higher hip fracture incidence. A study based solely on self-reported hip fractures would have resulted in biased risk estimates. Our previous validation study showed self-reports lacking 38% of hip fractures in this long-term follow-up [21].
External causes of hip fracture were categorized according to the Finnish version of ICD-9 that was in use between 1987 and 1995 (E800–E999), and ICD-10 was used since 1996 (V01.0–Y96.9) into low- and high-energy categories. Fractures due to road traffic crashes resulting in motor vehicle, bicycle, or pedestrian injuries or due to a fall from a height of 1 m or over were regarded as high-energy trauma (ICD-9; E803E, E882A, and ICD-10; V01–V89, W10, W11, W17, W22). Fractures related to a fall on level, slip fall, or falling from a height of less than 1 m were considered low-energy trauma (ICD-9; E881A, E883A, E889A, and ICD-10; W00–W06, W19). Events with missing or unspecified information of trauma energy involved (such as stress fracture, intoxication, fire, complication) were categorized as unclear (ICD-9; E929X, E979A, and ICD-10; X50, X57, XX58, X59, Y83).
Causes of death were recorded by a certified physician according to the national adaptation of the ICD. The hip fracture risk analysis was stopped at the end of available information on time and cause of death from the National Causes of Death Register (31 December 2014), which have previously been shown to be accurate [22]. Survival analyses were carried out based on all-cause mortality.
Statistical analyses
The 58th birthday formed the baseline for the analysis. Depending on the outcome of interest, follow-up was terminated on the date of death, hip fracture, or at the end of the registry period in 31 December 2014. Overall, hip fracture risk for normal, overweight, and obese groups was estimated by using Kaplan-Meier estimation with a mean (median) follow-up time of 18.5 (19.0) years, 18.4 (18.8) years, and 17.7 (18.0) years, respectively. Since proportional hazard assumption of obese women was not met across the whole age, we split the follow-up data in two age strata: from 58 to 70 years as “early” and over 70 years as “late.” Cox proportional hazard regression model was used to estimate hazard ratios with their respective 95% CI in two follow-up sections. By using time-dependent covariate analysis in Cox regression, a non-proportional hazard was observed between the study groups hip fracture rate during the first follow-up section from 58 to 70 years of age (p < 0.001). Difference between baseline characteristics, distribution of fracture type, and trauma energy were investigated using t test and chi-squared methods. Overweight group was considered the reference for the primary analysis due to lowest incidence rate in hip fracture outcomes, while normal weight group was the reference group in the mortality analyses.