Background

Pre-eclampsia, a pregnancy-specific condition with hypertension and multi-organ dysfunction, is a leading contributor to maternal and offspring mortality and morbidity. Early identification of women at risk of pre-eclampsia is key to planning effective antenatal care, including closer monitoring or commencement of prophylactic aspirin in early pregnancy to reduce the risk of developing pre-eclampsia and associated adverse outcomes. Accurate prediction of pre-eclampsia continues to be a clinical and research priority [1, 2]. To date, over 120 systematic reviews have been published on the accuracy of various tests to predict pre-eclampsia; more than 100 prediction models have been developed using various combinations of clinical, biochemical, and ultrasound predictors [3,4,5,6]. However, no single prediction model is recommended by guidelines to predict pre-eclampsia. Risk stratification continues to be based on the presence or absence of individual clinical markers, and not on multivariable risk prediction models.

Any recommendation to use a prediction model in clinical practice must be underpinned by robust evidence on the reproducibility of the models, their predictive performance across various settings, and their clinical utility. An individual participant data (IPD) meta-analysis that combines multiple datasets has great potential to externally validate existing models [7,8,9,10]. In addition to increasing the sample size beyond what is feasibly achievable in a single study, access to IPD from multiple studies offers the unique opportunity to evaluate the generalisability of the predictive performance of existing models across a range of clinical settings. This approach is particularly advantageous for predicting the rare but serious condition of early-onset pre-eclampsia that affects 0.5% of all pregnancies [11].

We undertook an IPD meta-analysis to externally validate the predictive performance of existing multivariable models to predict the risk of pre-eclampsia in pregnant women managed within the National Health Service (NHS) in the UK and assessed the clinical utility of the models using decision curve analysis.

Methods

International Prediction of Pregnancy Complications (IPPIC) Network

We undertook a systematic review of reviews by searching Medline, Embase, and the Cochrane Library (including the Database of Abstracts of Reviews of Effects, DARE), from database inception to March 2017, to identify relevant systematic reviews on clinical characteristics, biochemical markers, and ultrasound markers for predicting pre-eclampsia [12]. We then identified research groups that had undertaken studies reported in these systematic reviews and invited the authors of relevant studies and cohorts with data on prediction of pre-eclampsia to share their IPD [13] and join the IPPIC (International Prediction of Pregnancy Complications) Collaborative Network. We also searched major databases and data repositories, and directly contacted researchers, to identify relevant studies or datasets that may have been missed, including unpublished research and birth cohorts. The Network includes 125 collaborators from 25 countries, is supported by the World Health Organization, and holds IPD on over 5 million pregnancies with information on various maternal and offspring complications. Details of the search strategy are given elsewhere [12].

Selection of prediction models for external validation

We updated our previous literature search of prediction models for pre-eclampsia [3] (July 2012–December 2017), by searching Medline via PubMed. Details of the search strategy and study selection are given elsewhere (Supplementary Table S1, Additional file 1) [12]. We evaluated all prediction models with clinical, biochemical, and ultrasound predictors at various gestational ages (Supplementary Table S2, Additional file 1) for predicting any, early (delivery < 34 weeks), and late (delivery ≥ 34 weeks’ gestation) onset pre-eclampsia. We did not validate prediction models if they did not provide the full model equation (including the intercept and predictor effects), if any predictor in the model was not measured in the validation cohorts, or if the outcomes predicted by the model were not relevant.

Inclusion criteria for IPPIC validation cohorts

We externally validated the models in the IPPIC IPD cohorts that contained participants from the UK (the IPPIC-UK subset) to determine their performance within the context of the UK healthcare system and to reduce heterogeneity in outcome definitions [14, 15]. We included whole datasets of UK participants and the UK subsets of international datasets (where country was recorded). If a dataset contained IPD from multiple studies, we checked the identity of each study to avoid duplication. We excluded cohorts if one or more of the predictors (i.e. the variables included in a model's equation) were not measured, or if there was no variation in the values of the model predictors across individuals (i.e. every individual had the same predicted probability due to strict eligibility criteria in the study). We also excluded cohorts in which no individuals, or only one individual, developed pre-eclampsia. Since the published models were intended to predict the risk of pre-eclampsia in women with singleton pregnancies only, we excluded women with multi-foetal pregnancies.

IPD collection and harmonisation

We obtained data from cohorts in prospective and retrospective observational studies (including cohorts nested within randomised trials, birth cohorts, and registry-based cohorts). Collaborators sent their pseudo-anonymised IPD in the most convenient format for them, and we then formatted, harmonised, and cleaned the data. Full details on the eligibility criteria, selection of the studies and datasets, and data preparation have previously been reported in our published protocol [13].

Quality assessment of the datasets

Two independent reviewers assessed the quality of each IPD cohort using a modified version of the PROBAST (Prediction model Risk Of Bias ASsessment Tool) [16]. The tool assesses the quality of the cohort datasets and individual studies, and we used three of its four domains: participant selection, predictors, and outcomes. The fourth domain, 'analysis', was not relevant for assessing the quality of the collected data, as we performed the prediction model analyses ourselves given that we had access to the IPD. We classified the risk of bias as low, high, or unclear for each of the relevant domains. Each domain included signalling questions rated as 'yes', 'probably yes', 'probably no', 'no', or 'no information'. Any signalling question rated as 'probably no' or 'no' was considered to indicate a potential for bias, and the corresponding domain was classed as high risk of bias. The overall risk of bias of an IPD dataset was considered low if it scored low in all domains, high if any one domain was at high risk of bias, and unclear for any other combination.

Statistical analysis

We summarised the total number of participants and number of events in each dataset, and the overall numbers available for validating each model.

Missing data

We could validate the predictive performance of a model only when the values of all its predictors were available for participants in at least one IPD dataset, i.e. in datasets where none of the predictors was systematically missing (unavailable for all participants). In such datasets, when predictor or outcome values were missing for some participants ('partially missing data'), we used a three-stage approach. First, where possible, we filled in the missing value using knowledge of the study's eligibility criteria or other available data in the same dataset; for example, we set nulliparous = 1 for all individuals in a dataset if only nulliparous women were eligible for inclusion. Secondly, after a preliminary comparison in datasets that recorded both, we used early second trimester information in place of missing first trimester information; for example, early second trimester values of body mass index (BMI) or mean arterial pressure (MAP) were used when the first trimester values were missing. Where required, we also reclassified variables into broader categories: women of Afro-Caribbean or African-American origin were classified as Black, and those of Indian or Pakistani origin as Asian. Thirdly, for any remaining missing values, we imputed all partially missing predictor and outcome values using multiple imputation by chained equations (MICE) [17, 18]. After preliminary checks comparing baseline characteristics in individuals with and without missing values for each variable, data were assumed to be missing at random (i.e. missingness conditional on other observed variables).

We conducted the imputations in each IPD dataset separately. This approach acknowledges the clustering of individuals within a dataset and retains potential heterogeneity across datasets. We generated 100 imputed datasets for each IPD dataset with any missing predictor or outcome values. In the multiple imputation models, continuous variables with missing values were imputed using linear regression (or predictive mean matching if skewed), binary variables were imputed using logistic regression, and categorical variables were imputed using multinomial logistic regression. Complete predictors were also included in the imputation models as auxiliary variables. To retain congeniality between the imputation models and predictive models [19], the scale used to impute the continuous predictors was chosen to match the prediction models. For example, pregnancy-associated plasma protein A (PAPP-A) was modelled on the log scale in many models and was therefore imputed as log(PAPP-A). We undertook imputation checks by looking at histograms, summary statistics, and tables of values across imputations, as well as by checking the trace plots for convergence issues.
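As an illustration only (all analyses in this study were run in Stata MP 15), a minimal Python sketch of per-cohort chained-equation imputation is shown below. The file and column names are hypothetical, and statsmodels' MICEData uses predictive mean matching for each variable by default rather than the variable-type-specific imputation models described above.

```python
import numpy as np
import pandas as pd
from statsmodels.imputation.mice import MICEData

# One IPD cohort at a time, so that imputation respects the clustering of
# individuals within datasets (hypothetical file and column names).
cohort = pd.read_csv("ippic_uk_cohort_01.csv")

# Impute on the scale used in the prediction models to keep the imputation
# and prediction models congenial, e.g. log(PAPP-A).
cohort["log_pappa"] = np.log(cohort["pappa"])

mice = MICEData(cohort[["age", "bmi", "map", "log_pappa", "pre_eclampsia"]])

imputed_datasets = []
for m in range(100):          # 100 imputed datasets per cohort
    mice.update_all()         # one full cycle of chained equations
    imputed_datasets.append(mice.data.copy())

# Basic imputation check: summary statistics in one imputed dataset
print(imputed_datasets[0]["bmi"].describe())
```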

Evaluating predictive performance of models

For each model that we could validate, we applied the model equation to each individual in each (imputed) dataset to obtain their linear predictor value. For each prediction model, we summarised the overall distribution of the linear predictor values in each dataset using the median, interquartile range, and full range, averaging these statistics across imputations where necessary [20].
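For instance, for a logistic regression model, the published equation gives a linear predictor that is converted to a predicted probability via the inverse logit. A short sketch with purely illustrative coefficients (not taken from any of the validated models; the column names are hypothetical):

```python
import numpy as np
import pandas as pd

def predicted_risk(df: pd.DataFrame) -> pd.Series:
    """Apply a published logistic model equation to one (imputed) dataset.
    The intercept and coefficients below are illustrative only."""
    lp = (-6.5                        # intercept from the published equation
          + 0.09 * df["bmi"]
          + 1.20 * df["previous_pe"]  # 1 = previous pre-eclampsia, 0 = none
          + 0.80 * df["chronic_htn"])
    return 1 / (1 + np.exp(-lp))      # inverse logit -> predicted probability
```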

We examined the predictive performance of each model separately, using measures of discrimination and calibration, firstly in the IPD for each available dataset and then at the meta-analysis level. We assessed model discrimination using the C-statistic with a value of 1 indicating perfect discrimination and 0.5 indicating no discrimination beyond chance [21]. Good values of the C-statistic are hard to define, but we generally considered C-statistic values of 0.6 to 0.75 as moderate discrimination [22]. Calibration was assessed using the calibration slope (ideal value = 1, slope < 1 indicates overfitting, where predictions are too extreme) and the calibration-in-the-large (ideal value = 0). For each dataset containing over 100 outcome events, we also produced calibration plots to visually compare observed and predicted probabilities when there were enough events to categorise participants into 10 risk groups. These plots also included a lowess smoothed calibration curve over all individuals.
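These measures can be estimated directly from the linear predictor. A sketch (assuming scikit-learn and statsmodels are available; `lp` holds each individual's linear predictor and `y` the observed binary outcome):

```python
import numpy as np
import statsmodels.api as sm
from sklearn.metrics import roc_auc_score

def performance(y: np.ndarray, lp: np.ndarray) -> dict:
    """C-statistic, calibration slope, and calibration-in-the-large for one
    validation dataset (or one imputed copy of it)."""
    c_statistic = roc_auc_score(y, lp)

    # Calibration slope: logistic regression of the outcome on the linear
    # predictor (ideal value = 1; < 1 suggests overfitting).
    slope_fit = sm.GLM(y, sm.add_constant(lp),
                       family=sm.families.Binomial()).fit()
    cal_slope = slope_fit.params[1]

    # Calibration-in-the-large: intercept when the linear predictor is
    # entered as an offset (ideal value = 0).
    citl_fit = sm.GLM(y, np.ones_like(lp, dtype=float),
                      family=sm.families.Binomial(), offset=lp).fit()
    citl = citl_fit.params[0]

    return {"c_statistic": c_statistic,
            "calibration_slope": cal_slope,
            "calibration_in_the_large": citl}
```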

Where data had been imputed in a particular IPD dataset, the predictive performance measures were calculated in each of the imputed datasets, and then Rubin’s rules were applied to combine statistics (and corresponding standard errors) across imputations [20, 23, 24].
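A minimal sketch of Rubin's rules for pooling an estimate and its standard error across m imputed datasets (applied to each performance measure on an appropriate scale, e.g. the logit of the C-statistic):

```python
import numpy as np

def rubins_rules(estimates, standard_errors):
    """Pool point estimates and standard errors across m imputations."""
    est = np.asarray(estimates, dtype=float)
    se = np.asarray(standard_errors, dtype=float)
    m = len(est)
    pooled = est.mean()                        # pooled point estimate
    within = (se ** 2).mean()                  # within-imputation variance
    between = est.var(ddof=1)                  # between-imputation variance
    total_var = within + (1 + 1 / m) * between
    return pooled, np.sqrt(total_var)          # pooled estimate and its SE
```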

When it was possible to validate a model in multiple cohorts, we summarised the performance measures across cohorts using a random-effects meta-analysis estimated using restricted maximum likelihood (for each performance measure separately) [25, 26]. Summary (average) performance statistics were reported with 95% confidence intervals (derived using the Hartung-Knapp-Sidik-Jonkman approach as recommended) [27, 28]. We also reported the estimate of between-study heterogeneity (τ2) and the proportion of variability due to between-study heterogeneity (I2). Where there were five or more cohorts in the meta-analysis, we also reported the approximate 95% prediction interval (using the t-distribution to account for uncertainty in τ) [29]. We only reported the model performance in individual cohorts if the total number of events was over 100. We also compared the performance of the models in the same validation cohort where possible. We used forest plots to show a model’s performance in multiple datasets and to compare the average performance (across datasets) of multiple models.
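In notation, with each cohort j = 1, ..., K contributing a performance estimate (e.g. the logit C-statistic or the calibration slope) and its variance, the random-effects model and the approximate 95% prediction interval take the standard forms below; this is a sketch of the usual formulation rather than a restatement of the exact software implementation.

```latex
% Random-effects meta-analysis of a performance measure across K cohorts
\hat{\theta}_j \sim N\!\big(\theta_j,\ \widehat{\operatorname{var}}(\hat{\theta}_j)\big),
\qquad \theta_j \sim N(\mu,\ \tau^2), \qquad j = 1, \dots, K

% Approximate 95% prediction interval for performance in a new setting
\hat{\mu} \ \pm\ t_{K-2,\,0.975}\,\sqrt{\hat{\tau}^2 + \widehat{\operatorname{var}}(\hat{\mu})}
```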

A particular challenge is to predict pre-eclampsia in nulliparous women, as they have no history from prior pregnancies (which provides strong predictors); we therefore also conducted a subgroup analysis assessing the performance of the models in only the nulliparous women from each study.

Decision curve analysis

For each pre-eclampsia outcome (any, early, or late onset), we compared prediction models using decision curve analysis [30, 31]. Decision curves show the net benefit (i.e. benefit versus harm) over a range of clinically relevant threshold probabilities. The model with the greatest net benefit for a particular threshold is considered to have the most clinical value. For this investigation, we chose the IPD that was most frequently used in the external validation of the prediction models and which allowed multiple models to be compared in the same IPD (thus enabling a direct, within-dataset comparison of the models).
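Net benefit at a chosen threshold probability weighs true positives against false positives, with the threshold itself defining the harm-to-benefit ratio. A sketch of how a decision curve can be computed for one model in one dataset (illustrative only; `risks` are the model's predicted probabilities and `y` the observed outcomes):

```python
import numpy as np

def net_benefit(y: np.ndarray, risks: np.ndarray, threshold: float) -> float:
    """Net benefit of intervening when predicted risk >= threshold."""
    n = len(y)
    treat = risks >= threshold
    true_pos = np.sum(treat & (y == 1))
    false_pos = np.sum(treat & (y == 0))
    weight = threshold / (1 - threshold)   # harm-to-benefit ratio
    return true_pos / n - weight * false_pos / n

# Decision curve over clinically relevant thresholds, compared with the
# 'treat all' and 'treat none' strategies:
# thresholds = np.arange(0.01, 0.31, 0.01)
# curve_model = [net_benefit(y, risks, t) for t in thresholds]
# curve_all   = [net_benefit(y, np.ones_like(risks), t) for t in thresholds]
# curve_none  = 0.0
```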

All statistical analyses were performed using Stata MP Version 15. TRIPOD guidelines were followed for transparent reporting of risk prediction model validation studies [32, 33]. Additional details on the missing data checks, performance measures, meta-analysis, and decision curves are given in Supplementary Methods, Additional file 1 [20, 26, 34,35,36,37,38,39,40,41,42,43,44,45].

Results

Of the 131 models published on prediction of pre-eclampsia, only 67 reported the full model equation needed for validation (67/131, 51%) (Supplementary Table S3, Additional file 1). Twenty-four of these 67 models (24/67, 36%) met the inclusion criteria for external validation in the IPD datasets (Table 1) [35, 46,47,48,49,50,51,52,53,54,55,56], and the remaining models (43/67, 64%) did not meet the criteria due to the required predictor information not being available in the IPD datasets (Fig. 1).

Table 1 Pre-eclampsia prediction model equations externally validated in the IPPIC-UK cohorts

Fig. 1 Identification of prediction models for validation in IPPIC-UK cohorts

Characteristics and quality of the validation cohorts

IPD from 11 cohorts within the IPPIC Network contained the relevant predictors and outcomes needed to validate at least one of the 24 prediction models. Four of the 11 validation cohorts were prospective observational studies (Allen 2017, POP, SCOPE, and Velauthar 2012) [36, 37, 45], four were nested within randomised trials (Chappell 1999, EMPOWAR, Poston 2006, and UPBEAT) [39,40,41,42], and three were prospective registry datasets (ALSPAC, AMND, and St George's) [38, 43, 44, 57]. Six cohorts included pregnant women at both high and low risk of pre-eclampsia [37, 38, 43,44,45], four included high-risk women only [39,40,41,42], and one included low-risk women only [36]. Two of the 11 cohorts (SCOPE, POP) included only nulliparous women with singleton pregnancies, at low risk [36] and at any risk of pre-eclampsia [45], respectively. In the other nine cohorts, the proportion of nulliparous women ranged from 43 to 65%. Ten of the 11 cohorts reported on any-, early-, and late-onset pre-eclampsia, while one had no women with early-onset pre-eclampsia [40]. The characteristics of the validation cohorts and a summary of the missing data for each predictor and outcome are provided in Supplementary Tables S4, S5, and S6 (Additional file 1), respectively.

Two of the 11 validation cohorts (18%) were classed as having an overall low risk of bias across all three PROBAST domains of participant selection, predictor evaluation, and outcome assessment. Seven (7/11, 64%) had low risk of bias for the participant selection domain, and ten (10/11, 91%) had low risk of bias for predictor assessment, while one had an unclear risk of bias for that domain. For outcome assessment, five cohorts (5/11, 45%) had low risk of bias and the rest were unclear (6/11, 55%) (Supplementary Table S7, Additional file 1).

Characteristics of the validated models

All of the models we validated were developed in unselected populations of both high- and low-risk women. About two thirds of the models (15/24, 63%) included only clinical characteristics as predictors [35, 46, 47, 49, 51,52,53, 55], five (21%) included clinical characteristics and biochemical markers [46, 48, 50, 54], and four (17%) included clinical characteristics and ultrasound markers [50, 56]. Most models predicted the risk of pre-eclampsia using first trimester predictors (21/24, 88%), and three used first and second trimester predictors (13%). Eight models predicted any-onset pre-eclampsia, nine early-onset, and seven late-onset pre-eclampsia (Table 1). Only a quarter of the models (6/24, 25%) [35, 47, 48, 56] were developed with a sample size considered adequate, based on having at least 10 events per predictor evaluated, to reduce the potential for model overfitting.

External validation and meta-analysis of predictive performance

We validated the predictive performance of each of the 24 included models in at least one and up to eight validation cohorts. The distributions of the linear predictor and the predicted probability are shown for each model and validation cohort in Supplementary Table S8 (Additional file 1). Performance of models is given for each cohort separately (including smaller datasets) in Supplementary Table S9 (Additional file 1).

Performance of models predicting any-onset pre-eclampsia

Two clinical characteristics models (Plasencia 2007a; Poon 2008), with predictors such as ethnicity, family history of pre-eclampsia, and previous history of pre-eclampsia, showed reasonable discrimination in the validation cohorts, with a summary C-statistic of 0.69 (95% CI 0.53 to 0.81) for both models (Table 2). Both models were potentially overfitted (summary calibration slope < 1), indicating predictions that were too extreme compared with the observed risks, with wide confidence intervals and large heterogeneity in both discrimination and calibration (Table 2). The third model (Wright 2015a) included additional predictors such as history of systemic lupus erythematosus, anti-phospholipid syndrome, history of in vitro fertilisation, chronic hypertension, and interval between pregnancies, and showed lower discrimination (summary C-statistic 0.62, 95% CI 0.48 to 0.75), with evidence of overfitting (summary calibration slope 0.64) (Table 2).

Table 2 Summary estimates of predictive performance for each model across validation cohorts

The three models with clinical and biochemical predictors (Baschat 2014a; Goetzinger 2010; Odibo 2011a) showed moderate discrimination (summary C-statistics 0.66 to 0.72) (Table 2). We observed underfitting (summary calibration slope > 1), with predictions that did not span a wide enough range of probabilities compared with what was observed in the validation cohorts (Fig. 2). Amongst these three models, the Odibo 2011a model, with ethnicity, BMI, history of hypertension, and PAPP-A as predictors, showed the highest discrimination (summary C-statistic 0.72, 95% CI 0.51 to 0.86); its summary calibration slope of 1.20 had a wide confidence interval (95% CI 0.24 to 2.00) owing to heterogeneity in calibration performance across the three cohorts.

Fig. 2 Calibration plots for clinical characteristic and biomarker models predicting any-onset pre-eclampsia (cohorts with ≥ 100 events)

When validated in individual cohorts, the Odibo 2011a model demonstrated better discrimination in the POP cohort of any-risk nulliparous women (C-statistic 0.78, 95% CI 0.74 to 0.81) than in the St George's cohort of all pregnant women (C-statistic 0.67, 95% CI 0.65 to 0.69). The calibration estimates for the Odibo 2011a model in these two cohorts showed underfitting in the POP cohort (calibration slope 1.49, 95% CI 1.33 to 1.65) and reasonably adequate calibration in the St George's cohort (slope 0.96, 95% CI 0.89 to 1.04). The calibration-in-the-large of the Odibo 2011a model showed systematic overprediction in the St George's cohort (− 0.90, 95% CI − 0.95 to − 0.85) and less so in the POP cohort, where the value was close to 0. Both the Baschat 2014a and Goetzinger 2010 models also showed moderate discrimination in the POP cohort, with C-statistics ranging from 0.70 to 0.76. When validated in the POP cohort, the Baschat 2014a model systematically underpredicted risk (calibration-in-the-large 0.66, 95% CI 0.53 to 0.78), and the Goetzinger 2010 model did so to a lesser extent. One model (Yu 2005a) that included second trimester ultrasound markers and clinical characteristics had low discrimination (C-statistic 0.61, 95% CI 0.57 to 0.65) and poor calibration (slope 0.08, 95% CI 0.01 to 0.14), and was only validated in the POP cohort (Table 3).

Table 3 Predictive performance statistics for models in the individual IPPIC-UK cohorts with over 100 events

Performance of models predicting early-onset pre-eclampsia

We then considered the prediction of early-onset pre-eclampsia. The two clinical characteristics models, Baschat 2014b with predictors such as history of diabetes, hypertension, and mean arterial pressure [46], and Kuc 2013a with maternal height, weight, parity, age, and smoking status [49], showed reasonable discrimination (summary C-statistics 0.68 and 0.66, respectively) with minimal heterogeneity when validated in up to six datasets. Summary calibration was suboptimal, with either under- or overfitting. When validated in individual cohorts (Poston 2006, St George's, and AMND), the Kuc 2013a model showed moderate discrimination in the St George's and AMND cohorts of unselected pregnant women (C-statistics of 0.64 and 0.68, respectively), but the model was overfitted in both cohorts (calibration slopes 0.34 and 0.47) and systematically overpredicted the risks (calibration-in-the-large greater than 1 in absolute value). In the external cohort of obese pregnant women (Poston 2006), the Baschat 2014b model showed moderate discrimination (C-statistic 0.67, 95% CI 0.63 to 0.72), but there was some evidence that its predictions did not span a wide enough range of probabilities and that it systematically underpredicted the risks (Table 3).

The other six models were each validated in cohorts with a combined total of fewer than 50 events [35, 47, 51, 52, 55]. Of these, the clinical characteristics models of Scazzocchio 2013a and Wright 2015b, and the clinical and biochemical marker-based model of Poon 2009a, showed promising discrimination (summary C-statistic 0.74), but with imprecise estimates reflecting the small sample sizes of the validation cohorts. All three models appeared to be overfitted (summary calibration slopes ranging from 0.45 to 0.91), though again the confidence intervals were wide. The second trimester Yu 2005b model, with ultrasound markers and clinical characteristics, was validated in one cohort with 10 events, resulting in very imprecise estimates that were still indicative of overfitting (calibration slope 0.56, 95% CI 0.29 to 0.82).

Performance of models predicting late-onset pre-eclampsia

Of the five clinical characteristics models, four (Crovetto 2015b, Kuc 2013b, Plasencia 2007c, Poon 2010b) were validated across multiple cohorts. These models showed reasonable discrimination, with summary C-statistics ranging between 0.62 and 0.67 [47, 49, 51, 52]. We observed overfitting (summary calibration slopes 0.56 to 0.66), with imprecise estimates except for the Kuc 2013b model. The models appeared to either systematically underpredict (Plasencia 2007c, Poon 2010b) or overpredict (Crovetto 2015b, Kuc 2013b) the risks, with imprecise calibration-in-the-large estimates. There was moderate to large heterogeneity in both discrimination and calibration measures.

When validated in the POP cohort of nulliparous women, the Crovetto 2015b model, with predictors such as maternal ethnicity, parity, chronic hypertension, smoking status, and previous history of pre-eclampsia, showed good discrimination (C-statistic 0.78, 95% CI 0.75 to 0.81) but some underfitting (calibration slope 1.25, 95% CI 1.10 to 1.38), and it systematically underpredicted the risks (calibration-in-the-large 1.31, 95% CI 1.18 to 1.44). In the same cohort, the Kuc 2013b model showed low discrimination (C-statistic 0.60, 95% CI 0.56 to 0.64) and poor calibration (calibration slope 0.67, 95% CI 0.45 to 0.89). In the ALSPAC, St George's, and AMND unselected pregnancy cohorts, the Kuc 2013b model showed varied discrimination, with C-statistics ranging from 0.64 to 0.84, but with overfitting (calibration slope < 1) and systematic overprediction (calibration-in-the-large − 1.97, 95% CI − 1.57 to − 1.44). In the POP cohort, the Yu 2005c model, with clinical and second trimester ultrasound markers, had a C-statistic of 0.61 (95% CI 0.57 to 0.64) with severe overfitting (calibration slope 0.08, 95% CI 0.01 to 0.15).

Supplementary Table S10 (Additional file 1) shows the performance of the models in nulliparous women only, reported separately for the IPPIC-UK datasets and for the POP cohort alone.

Heterogeneity

Where it was possible to estimate it, heterogeneity across studies varied from small (e.g. Plasencia 2007a and Poon 2008 models had I2 ≤ 3%, τ2 ≤ 0.002) to large heterogeneity (e.g. Goetzinger 2010 and Odibo 2011a models had I2 ≥ 90%, τ2 ≥ 0.1) for the C-statistic (on the logit scale), and moderate to large heterogeneity in the calibration slope for about two thirds (8/13, 62%) of all models validated in datasets with around 100 events in total. All models validated in multiple IPD datasets had high levels of heterogeneity in calibration-in-the-large performance. For the majority of models validated in cohorts with a combined event size of around 100 events in total (9/13, 69%), the summary calibration slope was less than or equal to 0.7 suggesting a general concern of overfitting in the model development (as ideal value is 1, and values < 1 indicate predictions are too extreme). The exceptions to this were Baschat 2014a, Goetzinger 2010, and Odibo 2011a models (for any-onset pre-eclampsia) and Baschat 2014b (for early-onset pre-eclampsia).

Decision curve analysis

We compared the clinical utility of models for any-onset pre-eclampsia in the SCOPE (3 models), Allen 2017 (6 models), UPBEAT (4 models), and POP (3 models) cohorts, as these allowed more than one model to be compared. Of the three models validated in the POP cohort [46, 48, 50], the Odibo 2011a model had the highest clinical utility over a range of thresholds for predicting any-onset pre-eclampsia (Fig. 3). However, this net benefit was not observed for Odibo 2011a or the other models when validated in the other cohorts. Decision curves for early- and late-onset pre-eclampsia models are given in Supplementary Figures S1 and S2 (Additional file 1), respectively. These showed little opportunity for net benefit with the early-onset pre-eclampsia prediction models, primarily because of how rare the condition is. For late-onset pre-eclampsia, the models showed some net benefit across a very narrow range of threshold probabilities.

Fig. 3 Decision curves for models of any-onset pre-eclampsia

Discussion

Summary of findings

Of the 131 prediction models developed for predicting the risk of pre-eclampsia, only half published the model equation necessary for others to externally validate them, and of those, only 25 included predictors that were available to us in the datasets of the validation cohorts. One further model could not be validated because of too few events in the validation cohorts. In general, the models discriminated moderately between women who did and did not develop any-, early-, or late-onset pre-eclampsia. Performance did not appear to vary noticeably according to the type of predictors (clinical characteristics only; additional biochemical or ultrasound markers) or the trimester in which they were measured. Overall, calibration of the predicted risks was generally suboptimal. In particular, the summary calibration slope was often much less than 1, suggesting that the developed models were overfitted to their development datasets and thus do not transport well to new populations. Even for models with promising summary calibration performance (e.g. summary calibration slopes close to 1 from the meta-analysis), we found large heterogeneity across datasets, indicating that their calibration performance is unlikely to be reliable across all UK settings represented by the validation cohorts. Some models showed promising performance in nulliparous women, but this was not observed in other populations.

Strengths and limitations

To our knowledge, this is the first IPD meta-analysis to externally validate existing prediction models for pre-eclampsia. Our comprehensive search identified over 130 published models, illustrating the desire for risk prediction in this field, but also the confusion about which models are reliable. The global IPPIC Network brought together key researchers involved in this field, and their cohorts provided access to the largest IPD on prediction of pregnancy complications. We evaluated whether any of the identified models demonstrated good predictive performance in the UK health system, both on average and within individual cohorts. Access to raw data meant that we could exclude ineligible women, account for timing of predictor measurement and outcome, and increase the sample size for rare outcomes such as early-onset pre-eclampsia.

We could only validate 24 of the 131 published pre-eclampsia prediction models, restricted by poor reporting of the published models and by the unavailability in our IPD of predictors used in some reported models. It is possible that a better performing model exists that we were unable to validate. However, the issue of missing predictors may also reflect the limited availability of these predictors in routine clinical practice and the inconvenience of measuring them, highlighting the need for a practical prediction model with easy-to-measure and commonly recorded variables [58].

We limited our validation to UK datasets to reduce the heterogeneity arising from outcome definitions and variations in management. Despite this, often considerable heterogeneity remained in predictive performance. Direct comparison of the prediction models is difficult due to different datasets contributing towards the validation of each model.

Comparison to existing studies

Currently, none of the published models on pre-eclampsia has been recommended for clinical practice. We consider the following issues to contribute to this. Firstly, most of the models have never been externally validated, and their performance in other populations is unknown [6, 37, 59,60,61]. Secondly, even when models have been validated, the findings are limited by the relatively small numbers of events in the validation cohorts, making it difficult to draw robust conclusions, for example about calibration performance. Recently, first trimester models for any-onset pre-eclampsia comprising easily available predictors were validated in two separate Dutch cohorts in line with current recommendations; however, both validation cohorts included fewer than 100 events each, below the recommended minimum sample size [6]. Discrimination of these models was moderate and similar to what we observed, most models showed overfitting and systematic overprediction of the risks, and the best performing models showed net benefit only over a narrow range of probabilities. Thirdly, there is fatigue amongst the research community and clinicians due to the vast number of prediction models that have been published with various combinations and permutations of predictor variables, often in overlapping populations and without external validation [35, 51, 53, 54, 62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79].

Fourthly, many models have been developed by treating them as a 'screening test' for pre-eclampsia, similar to the approach used in Down syndrome screening with biomarkers. In addition to the lack of information on multiple of the median (MoM) values in the validation cohorts, such an approach has inherent limitations. The models' performance is reported in terms of detection rate (sensitivity) for a specific false positive rate of 10% [35, 51, 54, 63,64,65,66, 68,69,70,71, 75, 77,78,79], but unlike diagnostic tests (where the focus is on sensitivity and specificity), when predicting future outcomes it is more important to provide absolute risk predictions, potentially across the whole spectrum of risk (from 0 to 1) [80]. Such risk predictions then guide patient counselling, shared clinical decision-making, and personalisation of healthcare. As such, the calibration of these risk predictions must be checked. In population-based cohorts, only a small proportion of individuals are at high risk of pre-eclampsia, with a preponderance at low or very low risk. However, the performance of many models continues to be evaluated and compared solely on the basis of their discrimination ability, with calibration ignored [81].

In the recent ASPRE (Combined Multimarker Screening and Randomized Patient Treatment with Aspirin for Evidence-Based Preeclampsia Prevention) trial [82], aspirin significantly reduced the risk of pre-eclampsia in women stratified as being at high risk of preterm pre-eclampsia using the prediction model of Akolekar 2013 [62]. In the control group, 4.3% of women were considered to have preterm pre-eclampsia, against the 7.6% expected to be identified by the model. The discrimination of the model was published recently, and its calibration was reported in two separate datasets [83]. This so-called competing risks model appears to have exceptional performance and very high discrimination (> 0.8) when validated in datasets from a standardised population akin to that used for model development. While this is laudable, caution is needed. The model showed evidence of some problems with calibration-in-the-large, and heterogeneity in calibration performance across centres was not examined. Even if all centres across the UK use the same standardisation as the SPREE studies (in terms of timing and methods of predictor measurement), there may still be heterogeneity in model performance, for example if the baseline risk of pre-eclampsia varies across centres. Therefore, before widespread uptake or implementation of this model, a detailed exploration of its performance in a wide range of realistic settings of application is needed, including decision curve analyses. We were not able to validate this model in the IPPIC-UK datasets due to a lack of information on its predictors and on the other information needed to calculate the MoMs.

Relevance to clinical practice

A clinically useful prediction model should be able to accurately identify women who are at risk of pre-eclampsia in all healthcare settings in which the model will be used. There is no evidence from this IPD meta-analysis that, for the subset of published models we could evaluate, any model is applicable for use across all populations within the UK healthcare setting. In particular, the poor observed calibration and the large heterogeneity across different datasets suggest that these models are not robust enough for widespread use. It is likely that the predictive performance of the models would be improved by recalibration to particular settings and populations, for which local data are needed; this may not always be practical.

Recommendations for further research

A major issue is that, based on the subset of models evaluated, existing prediction models in the pre-eclampsia field appear to suffer from calibration slopes < 1 in new data, which is likely to reflect overfitting during model development. This is known to be a general problem for the prediction model field in other disease areas [84]. To reduce the impact of overfitting, predictor effects can be shrunk (i.e. using penalisation techniques during model development, a similar concept to regression to the mean) [85,86,87,88], alongside appropriate internal validation (e.g. using bootstrapping) [89]. Furthermore, to improve overall calibration across settings, the baseline risk (through the intercept) may need to be tailored to the different settings. This can, for instance, be achieved by comparing the 'local' outcome incidence with the reported incidence from the original development study, or by re-estimating the intercept using new patient data, as sketched below. Another important option is to extend the existing models by including new predictors, both to improve discrimination and to reduce heterogeneity in baseline risk. To address this, further work could include imputation of systematically missing predictors by borrowing information across studies; techniques for across-dataset imputation have only recently been developed [90,91,92,93,94], and further evidence on their performance is needed before implementation. There is also a need to improve homogeneity across studies, for example in predictor measurement method, timing of predictor measurement, and outcome definition. Finally, the various risk thresholds that mothers would consider when making decisions on management need to be identified in order to apply the findings of decision curve analysis.
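As an illustration of the intercept re-estimation mentioned above, a sketch assuming statsmodels, where `lp` is the linear predictor from the existing model applied to local data and `y` the locally observed outcomes; the published predictor effects are kept fixed by entering the linear predictor as an offset.

```python
import numpy as np
import statsmodels.api as sm

def recalibrate_intercept(y: np.ndarray, lp: np.ndarray) -> float:
    """Return the adjustment to add to the model intercept so that the
    recalibrated model matches the local baseline risk."""
    fit = sm.GLM(y, np.ones_like(lp, dtype=float),
                 family=sm.families.Binomial(), offset=lp).fit()
    return fit.params[0]

# Recalibrated risk for individual i: 1 / (1 + exp(-(lp_i + adjustment)))
```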

Conclusion

A pre-eclampsia prediction model with good predictive performance would be beneficial to the UK NHS, but the evidence here suggests that, for the 24 models we could validate, predictive performance is generally only moderate, with miscalibration and heterogeneity across the UK settings represented by the available datasets. Thus, there is not enough evidence to recommend their routine use in clinical practice. Other models exist that we could not validate; these should also be examined in terms of their predictive performance, net benefit, and any heterogeneity across multiple UK settings before being considered for use in practice.