Background

Consequences of excess gestational weight gain (GWG)

The documented adverse sequelae of excess GWG for both mother and child include undesirable pregnancy/birth outcomes and/or later health outcomes that are likely to impose a substantive downstream burden on healthcare costs [1]. The most common adverse effect of excessive GWG on maternal health is weight retention (five months and even up to three years postpartum) [2,3,4]. Mothers entering subsequent pregnancies at a higher weight are at greater risk for long-term obesity and future co-morbidities such as diabetes and cardiovascular disease [5, 6]. The emerging evidence of developmental origins of health and disease implicates the impact of mothers’ health during pregnancy on “programming” of the growing fetus in utero to develop adverse health outcomes [7, 8]. Maternal excess GWG has been reported as the strongest predictor of obesity in the offspring [9,10,11].

The use of low-fat dairy foods as the source of protein in pregnancy is proposed based on studies in animals [12] and humans [13,14,15] that provide support for the putative effects of components of dairy foods such as calcium and leucine as anti-adipogenic and pro-lipolytic agents. Consumption of high dairy protein may not only benefit GWG but also improve other maternal pregnancy outcomes such as dysglycemia, elevated blood pressure, blood lipid profiles, and risk of pre-eclampsia, the latter due to higher intake of calcium and vitamin D [16].

Guidelines for GWG exist and have been interpreted for practice in Canada [17, 18]. Until there is evidence of effective population level interventions to assist women in achieving healthy weights, programs aimed at interventions during pregnancy to improve weight management and pregnancy outcomes are a starting point. Pregnancy has been described as a “teachable moment” for weight control and obesity prevention in women [19]. The most convincing approach to achieve reduction in GWG is the combination of physical activity and diet counseling, preferably in combination with weight monitoring [1, 20, 21]. Yet current clinical practice guidelines for managing GWG are not founded on randomized trials and lack specific “active intervention ingredients” [21] that are proven effective in achieving target GWG.

Maternal programming of bone health

Findings from epidemiological and animal studies support a role for the programming of fetal, neonatal, and adult bone outcomes in response to exposures during pregnancy such as maternal nutrition and other lifestyle variables [22,23,24,25]. In three longitudinal observational studies, maternal intake of dairy foods during pregnancy was associated with higher bone mass in offspring aged 6–16 years [26,27,28]. In prospective cohort studies, maternal physical activity level has also been positively associated with infant bone mass and geometry at birth [29, 30]. Being born with optimal bone mass may have long-term benefits since childhood bone mass tracks until skeletal maturity, when peak bone mass is achieved [31]. Collectively, these findings suggest that bone mass during childhood and adolescence may be predictive of an individual’s risk for osteoporosis, as peak bone mass is a strong predictor of osteoporotic fracture later in life [32].

To date, only one randomized controlled trial (RCT) in a small sample (N = 36) investigated the effect of dairy product supplementation on bone mineral density (BMD) and bone turn-over in pregnant women and these women entered pregnancy with a habitually low calcium intake [33]. The study showed a beneficial effect of increased milk consumption on maternal bone mass density at the spine and on suppression of bone resorption when measured at 6 ± 1 weeks postpartum. While promising, further studies are needed with larger sample sizes and evaluation of bone health outcomes in the offspring. In addition, no study to date has investigated the impact of dairy product supplementation during pregnancy on bone health outcomes of women after delivery.

Informed by our recent clinical and qualitative research, as well as existing systematic reviews, we have designed a RCT comprising several unique features with the objective of controlling GWG and optimizing bone health of mother and infant. We will test the effectiveness of a novel structured and monitored nutrition (high-protein dairy-based diet) along with an exercise (walking) intervention that is science-based, vetted for feasibility in pregnant women and care providers through focus groups, in pregnant women of most pre-pregnancy weight categories (except extreme obesity) in a community setting (as opposed to hospital-based).

Specific objectives and hypothesis

The primary research objective of the Be Healthy in Pregnancy (BHIP) study is to determine whether introducing a structured and monitored nutrition (high-dairy protein diet) and exercise (walking) program (intervention) in early pregnancy, compared to standard prenatal care (control), will increase the number of women attaining GWG (outcome) within the Institute of Medicine (IOM) recommendations for their pre-pregnancy body mass index (BMI) sub-category [17].

Secondary objectives include determination of the impact of a maternal high-dairy diet with exercise compared to standard care diet during pregnancy on: (1) bone status (e.g. blood biomarkers procollagen type I that contains N-terminal extensions (PINP), C-terminal telopeptide of type I collagen (CTX-I), insulin-like growth factor-1 (IGF-1), 25-hydroxyvitamin D (25(OH)D), and 1,25-dihydroxyvitamin D) in mothers during pregnancy; (2) infant bone status at birth via cord blood; and (3) mother and infant bone status at six months postpartum by dual energy x-ray absorptiometry (DXA) scan.

The over-arching hypothesis is that maternal consumption of a high-dairy diet during pregnancy will have a positive impact on: (1) GWG; (2) bone health status of the mother during and after pregnancy; and (3) bone mass and bone size of the offspring, after adjustment for factors known to influence skeletal status.

Methods/design

Trial design

The BHIP study is a two-arm, two-site prospective RCT. This study is designed as a prospective superiority trial, with 1:1 allocation ratio to either the Intervention group (i.e. Nutrition + Exercise intervention: high-dairy nutrition intervention combined with structured exercise) or the Control group (i.e. usual care as per National Health Canada recommendations) during pregnancy. The study is open-label with blinded endpoints. The study protocol is described following the standard protocol items: recommendations for interventional trials (SPIRIT) guidelines [34]. See Additional file 1 for the SPIRIT checklist and Additional file 2 for the WHO trial registration data set.

The core BHIP trial follows women up to delivery as the primary outcome relates to GWG. The Bone-BHIP study is an extension of the core BHIP trial that continues a follow-up of the women and their offspring until six months after delivery, with the objective to assess maternal and infant bone status. This trial is registered at ClinicalTrial.gov (NCT01689961). The BHIP trial is conducted by collaborative research teams at McMaster University (Hamilton, ON) and Western University (London, ON) in Canada. The study takes place in academic hospitals and community healthcare clinics in London, Burlington, and Hamilton, Ontario, Canada. Recruitment is facilitated by healthcare professionals (e.g. family doctors and midwives) who ask their patients for consent to be contacted by the BHIP study team, as well as poster advertisements in midwifery, family practice, and ultrasound clinics, at community sites such as the YMCA, libraries, and coffee shops, and on Facebook or Kijiji.

Study recruitment began in January 2013 and is expected to be completed by April 2018, reaching the a priori calculated sample size of 242 participants for our primary outcome. All participants who sign consent to contact forms receive a phone call by study personnel and are provided a general overview of the study and expectations using a scripted text. Figure 1 outlines the participant’s timeline in the BHIP study. Participants are screened for eligibility during the telephone interview according to the criteria list below. Eligible women are enrolled in the study and sign informed consent by the end of the first trimester of pregnancy (i.e. 12–17 weeks of gestation).

Inclusion criteria

  • Healthy pregnant women aged > 18 years

  • Singleton pregnancy (either nulliparous or multiparous)

  • Able to be randomized to group allocation by 17 weeks and six days of gestation

  • Pre-pregnancy BMI < 40 kg/m2

  • Planning to deliver at a Hamilton, Burlington, or London regional hospital or by home birth and willing to attend research visits at either study site

  • Approval of primary care provider to participate in exercise

  • Able to provide signed informed consent

Exclusion criteria

  • Not conversant in English

  • Known contraindications to exercise as recommended by the Canadian clinical practice guidelines for pregnancy [35]

  • Severe chronic gastrointestinal, heart, kidney, liver, or pancreatic diseases or conditions

  • Refusal to consume dairy foods due to intolerance or dislike

  • Pre-existing diabetes

  • Currently smoking and will not discontinue smoking during the pregnancy

  • Depression score > 12 on the validated Edinburgh Depression scale

Randomization: allocation and implementation

Block randomization is used with block sizes of two, four, and six selected at random, using an online Research Electronic Data Capture (REDCap) randomization service managed by an independent team in the Biostatistics Unit at St Joseph’s Healthcare in Hamilton, ON, Canada. Randomization to the two study arms occurs in a 1:1 ratio and is stratified by study site and pre-pregnancy BMI category following IOM guidelines (underweight: BMI < 18.5, normal weight: BMI 18.5–24.9, overweight: BMI 25.0–29.9, and obese: BMI ≥ 30.0 kg/m2). The eligibility of participants is confirmed at the first visit, when baseline data are also collected. The research assistant randomizes the participant at the second visit to the study center using an online third-party automatic randomization system. Once randomized, the research assistant consents the participant to the appropriate study arm at a time that is 14–17 weeks/6 days of gestation. Assessments, regardless of the study arm, occur at < 18 weeks of gestation, 26–28 weeks of gestation, 36–38 weeks of gestation, birth, three months postpartum, and six months postpartum. The intervention arm consists of weekly or biweekly in-person visits (based on the participant’s preferences) during pregnancy (i.e. from allocation to study group until 36–38 weeks of gestation). Baseline measures (i.e. before randomization), primary and secondary outcome measures, and other measures assessed during and after pregnancy are measured in all participants, regardless of their study group allocation.

The flow diagram (Fig. 2) will be included in future published results.

Fig. 1
figure 1

Flow diagram

This study is open-label due to the nature of the intervention. The study research assistants (data collectors) administering the personalized Nutrition + Exercise intervention are not blinded to group allocation. In addition, the participants are not blinded to their own group allocation, in order to maximize adherence to the lifestyle treatment. Mothers randomized to the intervention and control groups are assessed at the study clinic on different days, to prevent interaction between participants of each study arm. The primary data outcome collector is blinded to group allocation and is conducting the study visits for all participants, regardless of their group allocation. To maximize the objectivity of findings, the primary outcome assessor and data analysts/statistician are blinded to the study allocation and are not collecting any study data.

Interventions

All participants receive usual care as delivered by their healthcare practitioner (family physician or midwife) during pregnancy. In addition, after enrolment in the RCT and regardless of their group allocation, all participants receive counseling by the study nutritionist on the latest recommendations by Health Canada as published online (https://www.canada.ca/en/health-canada/services/healthy-living/healthy-pregnancy.html). After delivery, both study groups receive usual care from their healthcare practitioners with no further intervention; the research team prospectively follows participants at three months and six months postpartum. Table 1 outlines the study arms.

Table 1 BHIP study arms

Participants randomized to the personalized, monitored, and structured Nutrition + Exercise program group (intervention) begin early in the second trimester of their pregnancy until the end of pregnancy with counseling on a weekly or biweekly basis (Table 1 and Fig. 1). The nutrition component is an individualized nutrition plan tailored to each participant’s energy requirements with a high protein content (25% protein energy) provided primarily by dairy foods. Dairy foods are accepted by women during pregnancy as a healthy choice as indicated from our pilot study [36] and in a recent birth cohort study in the same community in which women consumed an average of ≥ 3 servings of dairy foods per day [37]. Dairy foods are sources of high-quality proteins, calcium, and vitamin D in the case of milk as it is under mandatory fortification with vitamin D in Canada. The Nutrition + Exercise intervention is considered safe. First, the nutrient intake associated with the increased low-fat dairy consumption is less than the tolerable upper intake level for all nutrients as recommended per the Dietary Reference Intake by Health Canada. In addition, walking (the exercise component) is the easiest physical activity to undertake and implement in terms of goal-setting for step count and monitoring adherence using accelerometer-type devices as demonstrated in our previous studies [38,39,40]. Walking was the most practical exercise intervention of choice since women reduce moderate and vigorous physical activity during pregnancy yet maintain levels of walking [41]. Most importantly, these exercise guidelines are based on the Physical Activity Readiness Medical Examination (PARmed-X) for Pregnancy [35]. Data from the Nutrition and Exercise Lifestyle Intervention Program study [39] indicates that the goal of 10,000 daily steps is feasible under free-living conditions and has been effectively utilized in a number of trials [42].

Fig. 2
figure 2

BHIP study protocol timeline

A component of the intervention is discontinued if the medical condition of a participant changes and the intervention is modified if a participant requests a change due to personal reasons, such as food aversion or pain episode. The intervention is personalized for every woman, to ensure the goal of 25% energy from protein and 10,000 steps/day are achieved. Additionally, estimated energy requirements are calculated for each mother using the equation derived in the Dietary Reference Intake report for normal and overweight women [43] with energy intakes adjusted as recommended in the second and third trimesters [17]. The individualized nutrition plan is fashioned upon a standard three-day food intake record completed by the mothers that includes one weekend day as used previously in pregnant women [39]. The individualized nutrition plans and counseling are conducted by a study nutritionist providing the number of food servings to meet their estimated energy requirements. The nutrition plan is modified if GWG monitoring indicates weight loss or excess gain has occurred as recommended in the GWG guidelines [17, 44]. Adverse events related to the intervention are reported to our local ethics boards. Other non-intervention-related adverse events are noted in the participant’s chart and monitored, in accordance with the N2 Network of Networks guidelines Canada [45].

Several strategies are used to improve adherence to the intervention. For the exercise component, participants wear and track their step counts on a weekly basis, increasing motivation and adherence to the treatment. In addition, they go for a walk with the research staff and have a personalized counseling session with the study nutritionist. For the nutrition component, participants are provided with low-fat milk, cottage cheese, and/or yogurt, as per their preference, to ensure they consume the recommended amount of dairy foods. Participants come to the study site at least every other week to receive their dairy foods and the study nutritionist shares strategies and recipes with the participants to ensure they consume the targeted amount of dairy food servings. All these actions improved the adherence to the study protocol.

Participants randomized to the usual care group (control) receive usual prenatal care. Participants are given the most recent advice from Health Canada including Healthy Weight Gain During Pregnancy [17], the Pregnancy Weight Gain Calculator [44], and Eating Well with Canada’s Food Guide [46]. Mothers are followed by their primary care provider who also receives the same Health Canada materials. After delivery, they are followed by their healthcare practitioners. In addition, the control participants are invited to participate in a focus group and to attend an information session led by a midwife; topics discussed include pain relief options during labor and breastfeeding techniques (Table 1).

Outcome measurements

Primary and secondary outcomes are summarized in Table 2 and described below.

  • Primary outcome

    • ○ GWG

  • Secondary outcomes

    • ○ Maternal and cord blood circulating bone markers: PINP, CTX-I, IGF-1, 25(OH)D, 1,25dihydroxyvitamin D

    • ○ Maternal bone status at six months postpartum: whole-body bone mineral content (BMC), whole body BMD, lumbar spine BMD

    • ○ Infant bone status at six months of age: whole body minus the head BMC

  • Other outcomes

    • ○ Maternal body weight and fat mass

    • ○ Maternal metabolic status (including fasting glucose and lipid profiles, leptin and adiponectin)

    • ○ Maternal blood pressure

    • ○ Maternal pregnancy outcomes such as gestational diabetes and pre-eclampsia

    • ○ Infant birth outcomes such as birth weight and body fat mass at six months

    • ○ Safety outcomes

    • ○ Quality of life using the EQ-5D questionnaire

    • ○ Healthcare costs

    • ○ Maternal and cord blood gene variant profiles

Table 2 Analysis plan: objectives, outcomes, hypotheses, and methods of analysis

Data and biosample collection

The pre-pregnancy BMI is calculated using the measured weight at study entry minus the mother’s self-reported GWG. Body weight is measured by trained research assistants using a body impedance Tanita Body Composition Analyzer BF-350 scale (Tanita, IL, USA) at 12–17 weeks of gestation, 26–28 weeks of gestation, 36–38 weeks of gestation, three months postpartum (self-reported weight only), and six months postpartum. GWG is calculated by subtracting the weight at the 36–38 weeks of gestation study visit by the pre-pregnancy weight. Adjustments such as rate of GWG will be made to account for gestational age at delivery, including pre-term births.

Fasted maternal blood samples for metabolic and bone health biomarkers are collected at 12–17 weeks of gestation, 36–38 weeks of gestation, and at six months postpartum; venous cord blood is collected at delivery. Serum is collected in serum-separating with gel (SST™) vacutainers and in spray-coated silica vacutainers; plasma is collected in sodium fluoride/ Na2 ethylenediaminetetraacetic acid (Na2EDTA) vacutainers. Sample are centrifuged for 10 min at 3000 rpm at 4 °C; serum separator tubes are spun for an additional 5 min, aliquoted, and stored at − 80 °C. Samples are batched to include all samples from each mother for analysis in our laboratories when participants have completed the study. Fasting plasma glucose is determined using a hexokinase photometric assay (Architect kit, Abbott, Abbott Park, IL, USA). Serum triglycerides are analyzed using a glycerol phosphate oxidase photometric assay (Architect kit, Abbott, Abbott Park, IL, USA) completed by Hamilton Health Sciences Regional Laboratory Medicine Program. Serum leptin and insulin are analyzed using Luminex® human premixed multi-analyte enzyme-linked immunosorbent assay (ELISA) kit supplied by R&D Systems (Minneapolis, MN, USA). Serum adiponectin and C-reactive protein are analyzed by Luminex® premixed multi-analyte ELISA kit supplied by R&D Systems. Bone formation is assessed by measuring serum PINP by ELISA (Cloud Clone Corp., Houston, TX, USA). Bone resorption is assessed by measuring serum CTX-I by (Serum Crosslaps (CTX-1) ELISA, product code AC-02F1 Immunodiagnostic Systems, UK). Growth is assessed by measuring serum IGF-1 by ELISA (R&D Systems, Minneapolis, MN, USA). All standards and samples are analyzed in duplicate, following the manufacturer’s instructions. A serum quality control is run in triplicate at the beginning and end of each ELISA plate to calculate inter- and intra-assay coefficients of variability. Serum vitamin D metabolites are measured by liquid chromatography tandem mass spectrometry (LC-MS/MS) using a Waters AQUITY Tandem Quadropole Detector coupled to an AQUITY UPLC system (Waters Corporation), after extraction that includes a saponification step as adapted from Hymoller [47]. Accuracy is determined using the certified Vitamin D analytes serum-based standard reference material (SRM 972a) from the US National Institute for Standards and Technology (NIST; National Bureau of Standards, Washington, DC, USA) [48]. Precision is measured using a serum quality control in all runs of samples tested, allowing calculation of inter- and intra-assay coefficients of variability.

DNA extracted from maternal and cord blood plasma samples will be purified using the Chemagen 500 MSM I (PerkinElmer chemagen Technologie GmbH, Baesweiler, Germany) and genotyped using the Human Core Exome Bead Chip by Illumina. The chip is specifically designed to allow imputation of up to 18 million markers in multi-ethnic populations. Data cleaning and quality control of SNPs data are done according to the guidelines published by Winkler et al. [49].

Bone mass is measured by DXA scan using the research dedicated QDR®4500 series Hologic Inc. Discovery™ DXA machine (Waltham, MA, USA; Adult whole body software version 12.3.1 and Infant whole body software) at the McMaster University study site and the General Electric-Luna iDXA (Ames Medical enCORE, Version 14.1, Waukesha, WI, USA; CoreScan, GE) at the Western University study site. Daily quality control tests are conducted using an artificial L1–4 lumbar spine made from hydroxyapatite encased in epoxyresin. Weekly, a calibration test is performed using a step phantom (composed of soft-tissue and lean-tissue equivalent materials); at this time, the uniformity test is also performed to evaluate the contribution of air molecules to the attenuation of the X-rays. The coefficient of variations for BMC and BMD were 0.65% and 0.38%, respectively (Hologic Inc., Bedford, MA, USA) in our past study [37]. Inter-site comparison of DXA measures is conducted by having two phantoms (i.e. Hologic lumbar spine phantom (#2603), BFP® phantom) tested at both study sites. This provides a cross-calibration standard and enables us to account for inter-site differences and to harmonize results. It also allows for verification of the instrument stability over the course of the study.

At six months postpartum, bone scans are performed on mothers and infants at six months by staff following a standard operating procedure. Mothers are dressed in regular clothes without any metal parts or in a light hospital gown, while infants are in a clean diaper only. The primary outcome assessor, who has expertise in analyzing DXA scans, reviews all scans from both study sites. This ensures consistency in analysis across all participants at both study sites. If objects are included in the scan such as a child’s toy, a sub-regional analysis is done to isolate and subtract the object’s contribution to overall results. If considerable movement artifacts are seen in infant scans, the well-captured limb is used as a surrogate for the one with movement [50]. Any scans with unsalvageable distortions due to movement are not included. Z-scores are calculated for all BMC and BMD results using data from an age- and sex-specific standard curve. Results for women are interpreted as z-scores in reference to an adult women population embedded in the Hologic software. Results for infants are expressed as whole body minus the head BMC and BMD. Data are interpreted as z-scores in reference to an infant population from in-house data collection on normal healthy term infants from birth to one year of age.

All participants receive instructions to complete a standard three-day food intake record at 12–17 weeks of gestation, 26–28 weeks of gestation, 36–38 weeks of gestation, and six months postpartum. The food record includes two weekdays and one weekend day, as used previously in pregnant women [39]. It captures food and beverage intake as well as drug and supplement (vitamin and mineral) intake. Trained assessors analyze all food records using standard operating procedure on the computer-based Nutritionist Pro Software (Axxya Systems, Stafford, TX, USA). Nutritionist Pro computes the average amount of nutrients consumed per day.

Physical activity level is assessed through wearing a BodyMedia Sensewear Pro II armband monitor device (BodyMedia, Pittsburgh, PA, USA) in all participants for the same three days (as food monitoring above) in their work/home environment. These small devices are comfortable and easily wearable on the arm [38]. The data collected for three days at each visit (12–17, 26–28, and 36–38 weeks of gestation and six months postpartum) includes daily steps, daily energy expenditure, and minutes of activity level in Metabolic Equivalent of Task (MET; very vigorous, vigorous, moderate, sedentary). The Sensewear unit does not provide feedback to the participant.

Pregnancy and infant outcomes are obtained from participant’s medical records. Details are extracted about Cesarean section and/or vaginal delivery. For infant outcomes, data regarding gestational age, birth weight, birth length, birth head circumference, 1-min and 5-min Apgar scores, complications related to birth, and feeding practices at birth are extracted.

Infant weight and length at birth are recorded from medical charts and at three months of age as reported by the mother during the telephone interview. At the six-month visit, trained assessors measure infant weight by electronic scale (Medela BabyWeigh, McHenry, IL, USA), length using a measuring board for term infants (Pediatric Stadiometer, Ellard Instrumentation Ltd., Monroe, WA, USA), and head circumference using a constant-tension measuring tape (OHAUS, Dundas, ON, Canada), following standard operating procedures.

Infant feeding practices over the first three months are recorded by phone call and at six months in person using a standardized questionnaire administered by a trained assessor [51]. The questionnaire includes feeding history, duration and extent of breastfeeding, introduction of solid foods, and use of vitamins and supplements.

Study management and governance

This RCT is led by investigators from McMaster University (Department of Pediatrics, Department of Family Medicine, Department of Obstetrics and Gynecology, Department of Health Research Methods, Evidence and Impact, and School of Nursing) and Western University (School of Kinesiology) as well as practitioners from the City of Hamilton Public Health, all located in Ontario, Canada. The oversight of the study is guided by the steering committee that is composed of the lead investigators (SAA, SMP, MFM, LT, and EKH). The sponsors of the study are McMaster University and Western University. The sponsors are indemnified for any harms arising from trial participation and they approve protocol amendments when ethics approval has been obtained. Day-to-day running of the study is provided by the trial principal investigators, the study coordinators, and research assistants. The monitoring board consists of three well-established investigators from universities within Canada but outside McMaster and Western Universities.

All case report forms are anonymized using study identifiers and are stored in locked cabinets in a locked office. An administrative assistant to the study team who is not involved in the study holds the key to participants’ names. Research staff who are not involved in data collection enter data in a two-step process (entry and verification, following standard operating procedures) in our REDCap projects hosted at McMaster University [52]. REDCap is a secure, web-based application designed to support data capture for research studies, providing: (1) an intuitive interface for validated data entry; (2) audit trails for tracking data manipulation and export procedures; (3) automated export procedures for seamless data downloads to common statistical packages; and (4) procedures for importing data from external sources. Use of range check for data values promotes data quality. Double data entry is performed to ensure inter-study site reliability.

The research assistant monitors intervention adherence biweekly. For the nutrition portion, this includes ensuring dietary intake meets each participant’s caloric needs and are within an acceptable macronutrient distribution range. In terms of the exercise portion, participants monitor and report biweekly their physical activity using a diary and by wearing a pedometer. Participants report any discomfort experienced (due to the natural course of pregnancy and/or the intervention) to the study coordinator who personalizes the intervention as needed.

Statistical methods

The analysis and reporting of the results will follow the SPIRIT statement for reporting RCTs [53]. The process of patient selection and flow throughout the study will be summarized using a flow-diagram (Fig. 2).

The results of patient baseline characteristics and outcome variables (both primary and secondary) will be summarized using descriptive summary measures: expressed as mean (standard deviation) or median (range) for continuous variables and n (%) for categorical variables. All statistical tests will be performed using two-sided tests at the 0.05 level of significance. Binary outcomes will be analyzed using logistic regression and continuous outcomes will be analyzed using linear regression. All analyses will account for stratification by including study site and participants’ BMI category in the models. For all models, the results will be expressed as estimate of the difference for continuous outcomes (odds ratio [OR] or relative risk [RR] for binary outcomes), corresponding two-sided 95% confidence intervals (CI) and associated p values. P values will be reported to three decimal places with values < 0.001 reported as < 0.001. The Hosmer–Lemshow will be used to assess goodness-of-fit for logistic regression and explore the residuals to assess goodness-of-fit and model assumptions for linear regression.

An intention-to-treat principle will be adopted to analyze all outcomes. Multiple-imputation will also be used to handle missing data during data collection. Sensitivity analyses will be performed using some of the commonly used RCT patient-level methods (e.g. generalized estimating equations, the random-effect model which account for the potential correlation within sites, etc.) to assess the robustness of the results [54]. All the analyses will be done using SAS 9.2 (Cary, NC, USA) or SPSS 13 (Chicago, IL, USA). Table 2 provides a summary of the planned methods of analyses for each outcome and subgroup.

Power calculation

The sample size calculation for the core study is based on the test of the null hypothesis that the percentages of women with GWG within IOM guidelines in the two populations (intervention and control) are equal. To account for the uncertainty in these prior estimates, we calculated the sample size for different values of the percentage of women with weight within IOM guidelines in the control group in the range of 50–65%, with a risk difference of 15%, 20%, 25%, and 30%. While initially we targeted a total sample size of 350, the constraints of our funding timelines (which include a one-year CHIR grant extension) and a recruitment rate that was less than projected despite our best efforts, we revised our desired sample size to 242. Based on our a priori calculation, a sample size 111 per group corresponds to an absolute difference of 30% with our Nutrition + Exercise intervention, resulting in 30% of women in the treatment group having a GWG exceeding the IOM recommendations compared to 65% in the control group. With the revised sample size (i.e. assuming a 1:1 allocation ratio), the study will have power of 80% to yield a statistically significant result assuming a binomial distribution (using an intention-to-treat principle for the analysis) of the difference between percentages of women with GWG within IOM guidelines at alpha = 0.05. We hope to increase the sample size to 242 total participants in order to allow us to account for the two stratification variables – site (2 degrees of freedom) and baseline BMI strata (3 degrees of freedom).

The calculated sample size for the secondary outcome of maternal bone biomarker status is a total of 177 participants (allocation 1:1, α = 0.05, β = 20 for two-group t-test), based on literature with similar outcomes [55, 56]. A sample size was also calculated for the secondary outcome of infant whole body minus the head BMD. The total is 240 participants (allocation 1:1, α = 0.05, β = 20 for two-group t-test) based on the literature with similar primary outcomes in infant populations [57, 58]. For both sample sizes, a 15% attrition rate was calculated as a precaution in this type of clinical trial with follow-up.

Discussion

The BHIP study is novel as it proposes a feasible Nutrition + Exercise intervention that pregnant women could incorporate into their lifestyle and improve adherence to GWG recommendations. In addition, it will contribute to the knowledge of the perinatal developmental programming of body composition and skeletal phenotypes.

The BHIP RCT enrolls women in early pregnancy (12–17 weeks of gestation). Ideally, women should have access to general health interventions before pregnancy; however, currently pregnancy is often the first opportunity for health professionals to address issues beyond those directly related to the pregnancy state. Thus, one of the strengths of the BHIP study is that it starts in early pregnancy and monitors participants until six months postpartum. In addition, measures of health outcomes in both mother and child at various time points throughout pregnancy and postpartum allow for a comprehensive evaluation of the intervention effects. Lastly, one particular strength of the BHIP study is the individualized intervention and weekly monitoring. Evidence shows that the consumption of dairy foods (nutrition component) is well accepted during pregnancy [36], while walking (exercise component) is the best physical activity to promote adherence during pregnancy [38,39,40].

The primary outcome of GWG was selected because it is the key clinical problem we are addressing with our Nutrition + Exercise intervention. Pre-pregnancy obesity and excess GWG are the strongest maternal characteristics associated with offspring obesity [9, 11, 59] and childhood metabolic dysfunction [60]. Excess GWG in pregnancy is a major clinical challenge affecting Canadian women who enter pregnancy overweight and even women of normal pre-pregnancy weight [61, 62]. At the national level, a cross-sectional study on pregnancy experiences in Canada determined that over one-third of women entered pregnancy overweight or obese and nearly 60% experienced GWG greater than that recommended by the IOM recommendations for GWG [63]. The Alberta Pregnancy Outcomes and Nutrition prospective cohort study reported similar statistics with a greater number of overweight and obese women (80%) gaining excess weight during pregnancy [64]. Likewise, in the Hamilton region, data from the Family Atherosclerosis Monitoring In earLY life birth cohort demonstrated that > 50% of women entered pregnancy with a pre-pregnancy BMI of > 25.0 kg/m2 and > 50% exceeded the IOM guidelines for GWG [65]. Evidence to date suggest that the most convincing approach to achieve appropriate GWG is an intervention combining physical activity and nutrition, in combination with weight monitoring [1, 20, 21].

The secondary outcome of bone status was selected due to the emerging evidence of risk of osteoporosis in later life being programmed from the womb [24]. Bone biomarkers were selected as outcomes following the International Osteoporosis Foundation and the International Federation of Clinical Chemistry and Laboratory Medicine recommendations, which is to measure a marker of bone formation and a marker of bone resorption to assess bone turnover [66]. Currently, the most sensitive markers recommended are subgroups of type I collagen representing the predominant component of the bone matrix wherein sub-products can be measured as markers of formation and resorption of the bone [66]. To support fetal bone growth, pregnant women undergo bone metabolic adaptations over the course of pregnancy, where bone resorption usually rises and formation declines [67]. These changes result in high bone turnover during pregnancy [68]. Maternal bone turnover peaks during the third trimester, with a marked increase in bone resorption observed at 38 weeks of gestation [69]. Based on this knowledge, two biomarkers of bone modelling are quantified in maternal blood at 36–38 weeks of gestation and in cord blood. The marker of bone formation is PINP and the marker of bone resorption is CTX-I. Blood samples are consistently collected in a fasted state and in the morning, as suggested by the National Bone Health Alliance [70]. In addition, vitamin D status is measured by serum 25(OH)D as it plays an important role in calcium homeostasis and bone metabolism [71] and is a reflection of dietary vitamin D intake and sun exposure. Both the frequency of milk consumption and sun exposure were shown to be positive predictors of serum 25(OH)D concentrations in the recent multivariate analysis of three cycles of the Canadian Community Health Measures surveys [72]. The active form of vitamin D, 1,25dihydroxyvitamin D, is also measured since it is known to be upregulated during pregnancy [73].

Bone mineral content was selected as an outcome because it is linked to the peak bone mass (i.e. highest bone mass accrued for an individual) and is predictive of osteoporotic fracture [32]. Furthermore, recent evidence suggests that childhood BMC persists until peak bone mass and could indicate individuals at risk of osteoporotic fracture [31].

The nature of the intervention brings a limitation; the research study personnel in direct contact with the participants are not blinded to the study allocation of each participant. To preserve the integrity of the study findings and the internal validity of the study, all other investigators and statisticians are blinded to the study allocation, including people collecting and performing the analysis of primary and secondary outcomes. Some sources of bias are inevitable in the context of the study: recruitment sites are mostly primary care clinics, with few participants recruited from advertisements in the community. Since women are excluded with limited comprehension of the English language, if documented signs of depression or a pre-pregnancy BMI > 40 kg/m2, the generalization of the study would exclude such groups. From previous studies completed in urban Southern Ontario [39, 65] we anticipate that our study population will be mostly Caucasian, holding a university degree, and with a medium to high socioeconomic status. This sample is representative of the populations in the cities were recruitment occurred as they are modern urban centers with universities, colleges, and major commerce. Thus, the results will be generalizable to populations of women living in many Canadian cities with academic centers and major commerce. In addition, our results will still be of high value for the general Canadian pregnant women population who are served through provincial public health units, as it will bring new knowledge to refine the Canada Prenatal Nutrition Program (CPNP, https://www.canada.ca/en/public-health/services/health-promotion/childhood-adolescence/programs-initiatives/canada-prenatal-nutrition-program-cpnp.html), which is currently offered through public health but not based on evidence.

The strengths of the design include that the Nutrition + Exercise intervention was vetted for feasibility in pregnant women and care providers through focus groups and was conducted in pregnant women of all pre-pregnancy BMI categories (except extreme obesity) and in a community setting (as opposed to hospital-based as in many reported studies).

For the timely recruitment of study participants, challenges included prolonged ethics approval processes at secondary recruitment sites, reduced recruitment at community family practices due to influx of immigrants who were non-English speaking (thus not eligible), and competition with other studies sampling pregnant women. In light of these, we have taken measures to achieve the predetermined sample size based on a power analysis for the primary outcome so that the integrity of the study is preserved and the results are able to properly test the stated hypothesis and provide the needed new knowledge. To expand recruitment at McMaster University a new local study site was set up in a nearby city, Burlington, Ontario, with the collaboration of midwifery and obstetric clinics, as well as the local hospital. This partnership is very successful thus helping to achieve the recruitment goal. To date, retention after randomization to the study visit at 36–38 week of gestation when we assessed our primary outcome is 87%. To maximize retention of participants, we implemented a number of procedures such as study visit times at the convenience of the participants, including home visits. We also offered in-person, over the phone, and email correspondence to keep participants engaged. For participants in the intervention group, providing them with the dairy products (which are an integral part of the intervention) increases the likelihood of consuming the recommended amount. For the exercise component, participants receive a free pedometer as a self-motivation tool to reach their step count goals. Lastly, for both groups, participants are compensated on three occasions with $25 gift cards that they can redeem at a local grocery store.

If effective, this RCT will generate high-quality evidence to refine the nutrition guidelines during pregnancy to improve health of women during pregnancy and their offspring in early life, including acquisition of a strong skeleton.

Trial status

Active recruitment continues since a no-cost extension of one year from CIHR was obtained in December 2016, to allow the BHP study to achieve the target sample size.