Introduction

It is challenging to determine the optimal number of cleavage stage embryos to transfer at one time. Current recommendations from the American Society for Reproductive Medicine suggest an upper limit of embryos to transfer at once based on age, embryo stage, and embryo prognosis. Individual clinics are encouraged to use their own data to aid in this decision in order to minimize multiple gestations [1]. Although there is limited information on how to develop a quantitative model for predicting transfer outcomes, determining the live birth rate per embryo is an essential starting point. Once live birth rates per embryo are known, models incorporating factors that affect all embryos (such as uterine receptivity) can be used to predict rates of singleton, twin, and higher order multiple gestations after transfer of more than one embryo.

Determination of the live birth rate for individual cleavage stage embryos based on morphology poses three unique challenges that limit research on this topic. First, since cleavage stage embryos are often transferred in multiples, it is difficult to determine how individual embryo characteristics impact the live birth rate. Previously published studies have dealt with this by stratifying the analysis by single or double embryo transfer and omitting transfers of three or more embryos. Analysis of double embryo transfers is sometimes limited to embryos of the same grade or the grade of the embryo with the more advanced stage is used in the model for both embryos [2, 3].

Second, analysis is complicated by the large number of data points for each cleavage stage embryo (oocyte age at retrieval, cell number, fragmentation, and cell symmetry). The Society for Assisted Reproductive Technology Clinic Outcome Reporting System (SART CORS) collects detailed information on embryo morphology and allows clinics to assign embryos an overall grade of good, fair, or poor [4]. Since clinics may assign overall embryo grades differently, modeling studies using the overall grade are limited by a lack of standardization across clinics. Assigning the overall embryo grade based on the live birth rate for an embryo with given morphological characteristics is preferred.

Third, accounting for age-related fertility decline is a challenge. Many analyses use fixed age groups of < 35 years, 35–37 years, 38–40 years, 41–42 years, and > 42 years [2, 3]. With fixed age groups, data for patients at the end of an age range is less accurate because it is influenced by data from the opposite end. Since most age-related fertility decline occurs between age 35 and 40, patients 37 years of age experience a sharp drop in predicted outcomes when they turn 38 years old when these age groups are used.

Logistic regression has often been used to model embryo transfers but has several limitations. First of all, only dichotomous outcomes (dependent variables) can be incorporated in the model. The dichotomous outcome most often used is live birth or no live birth. In most models, differentiation between outcomes of singleton, twins, and higher order multiple births cannot be done. Logistic regression also limits use of independent variables in the model. For example, one analysis omitted double embryo transfers that resulted in a single live birth because in these scenarios it cannot be known precisely which embryo resulted in the live birth. As a result, the embryo transfer information could not be incorporated into the model [5]. Another logistic regression model only took into account transfer of one or two embryos. In this model, when two embryos were transferred, the morphology of the embryo with the most advanced stage was used; the separate morphologies of each embryo were not incorporated into the model [3].

Although single blastocyst transfer is a good option for many patients, transfer of one or more cleavage stage embryos may be desirable for poor prognosis patients such as those with embryos that failed to make it to the blastocyst stage in prior cycles. Current literature shows an increased risk of multiple gestations when more than one cleavage stage embryo is transferred [6]. With double cleavage stage embryo transfer, the rate of multiple birth can be as high as 30–50% while the rate of multiple birth with single cleavage stage embryo transfer is less than 2% [7,8,9,10,11]. Accurate prediction of the live birth rate for a specific cleavage stage embryo may help determine when elective single cleavage stage embryo transfer may be appropriate [12].

In order to safely limit multiple gestations after cleavage stage embryo transfers, the live birth rate per cleavage stage embryo needs to be more precisely determined. The first objective of this study is to apply new techniques to determine the live birth rate per cleavage stage embryo based on morphology and age at oocyte retrieval. The second objective is to use these rates to assign overall embryo grades of good, fair, and poor.

Materials and Methods

Study Population

A model building analysis was performed using data from 357 fresh and frozen autologous cleavage stage embryo transfers in 267 unique patients at a single center. All day 3 embryo transfer cycles from March 15, 2015, through December 30, 2018, were included. Transfers were excluded if one or more compacting embryos or morulas were transferred, if oocyte retrieval occurred before January 2015, or if an embryo was thawed and grown to day 3 (Fig. 1). A total of 996 embryos were transferred for an average of 2.8 embryos transferred at once. The mean maternal age at oocyte retrieval was 38.9 (SD 3.8) years and the mean maternal BMI was 24.1 (4.4). Other maternal demographics and transfer cycle characteristics are shown in Table 1 and the embryo morphology distribution is shown in Table 2. The embryo transfers resulted in a total of 93 ongoing gestations (defined as a fetal heartbeat at 6 to 8 weeks of gestation) and 75 live born infants (61 singleton deliveries and 7 twin deliveries).

Fig. 1
figure 1

Flow diagram with inclusion and exclusion criteria

Table 1 Maternal demographics and transfer cycle characteristics
Table 2 Embryo morphology distribution of 996 embryos

Cleavage Stage Embryo Grading

At our center, day 3 embryos are assessed for number of cells, fragmentation, cell symmetry, and overall quality. Fragmentation is recorded as 0%, 1–4%, 5%, 10%, 15%, 20%, 25%, 30%, or > 30%. Morphology grade is assessed as follows: A for a symmetric blastomeres with proper cleavage rate (4 cells on day 2 and 8 cells on day 3) and no fragmentation; AB for equal size cells with <5% fragmentation or slightly irregular cells with no fragmentation; B for slow or fast cleavage, 5–15% fragmentation, or equal size with < 5% fragmentation and slightly irregular; BC for 16–30% fragmentation, extreme irregularity, or slow/fast cleavage rate with 5–15% fragmentation; C for > 30% fragmentation, extremely irregular cells with 5–15% fragmentation, or slow/fast cleavage rate with 16–30% fragmentation; CD for in between C and D; D for few blastomeric cells of any size and/or severe or complete fragmentation. A notation is made if embryos are partially compacting, compacting, or have reached the morula stage. Additional details on the in vitro fertilization protocols used are included in the Supplemental Methods.

Determination of Live Birth Rates for Each Embryo Morphology Category

To determine the live birth rate per embryo, each of the 357 embryo transfers was modeled as an equation with unknown variables representing the live birth rate (LBR) per embryo for each category of embryo morphology. The coefficients (N) in the equation represent the number of embryos in each category that were transferred. The sum of the coefficients multiplied by their respective unknown LBR variables was set equal to the number of live births that resulted from the embryo transfer (Eq. 1). The equations were solved for the unknown variables with linear algebra to give the least squares solution to the system of equations using MATLAB version 9.5 (MathWorks). The MATLAB code used for the data analysis in this manuscript is being provided through Mendeley Data and can be accessed through the link in the reference [13].

$$ {N}_{\mathrm{category}\ 1}\times {\mathrm{LBR}}_{\mathrm{category}\ 1}+{N}_{\mathrm{category}\ 2}\times {\mathrm{LBR}}_{\mathrm{category}\ 2}+\dots =\mathrm{number}\ \mathrm{of}\ \mathrm{live}\ \mathrm{births} $$
(1)

Moving Centered Age Groups

Live birth rates for each embryo morphology category were determined by age at oocyte retrieval in 1-year increments for patients aged 35 to 42 years old. For each age, we determined live birth rates based off of embryo transfers in patients that were 4 years younger to 4 years older giving us a 9-year age range centered on the age of interest. For example, the 32- to 40-year age range is centered on age 36 years. A 9-year age range was used because this was the smallest age range that smoothed out random variation in the data when analyzing 3 to 5 different embryo categories concurrently. Since the average age in each 9-year age group differed slightly from the original center age of interest, linear interpolation was used to re-center the live birth rates at the intended center age.

Determination of Analysis Groups

The distribution of embryo morphological characteristics (Table 2) was reviewed to determine how best to group embryos for analysis of live birth rates. Morphology grade was determined to be less useful than number of cells and fragmentation since 69% of embryos were scored as a grade B. For this reason, morphology grade as assigned by the embryologist was not used in favor of the more quantitative measures of embryo cell number and fragmentation percentage. Embryos were first grouped by number of cells into categories of 2–5 cells, 6–7 cells, 8 cells, and 9–12 cells. This grouping put over 100 embryos into each category and showed a smooth decline in LBR with advancing oocyte age. Analysis with more categories based on cell number showed evidence of overfitting such as obvious signs of random error resulting in changes in LBR with age that were not biologically plausible.

There were 416 embryos with 8 cells and a decision was made to split this category up into two groups for ≤ 5% fragmentation and > 5% fragmentation. Other cell number categories were not able to be split up further because doing so gave results that did not fit the biological expectation that the LBR per embryo for a given category would smoothly decline with advancing age. An attempt was made to first group embryos by fragmentation percentage and then further by cell number but this produced results that were more difficult to interpret and appeared less biologically plausible.

When analyzing all transfers for all ages together, the best fit LBR for the 2–5 cell group was − 1% which was interpreted to be likely close to 0 (Fig. 2). For this reason, the analysis with moving centered age groups was performed assuming that embryos with 2–5 cells do not significantly contribute to the LBR (Fig. 3). This step was determined to be important to prevent overfitting of the model.

Fig. 2
figure 2

Best fit live birth rates per embryo by morphology group for all cleavage stage transfers

Fig. 3
figure 3

Best fit live birth rate per cleavage stage embryo by morphology group and age at oocyte retrieval. Nine-year moving age groups were utilized in the data analysis for this figure. Linear interpolation was used to center live birth rates on each integer of age

Statistical Analysis

Linear algebra was used to account for the number of embryos transferred and embryo morphology in the model. Stratification by age was achieved by using moving centered age groups to control for the confounding effect of age on the relationship between morphology and live birth rates. The results of the model are displayed graphically in Fig. 3 and were determined to fit the expectation of declining LBRs with increasing age for each embryo morphology group. The best fit results were consistent with those of other published studies.

Results

The best fit live birth rates for all transfers (mean age 38.9 years) analyzed concurrently were 15%, 8%, 4%, 3%, and − 1% for embryos in the 8-cell with ≤ 5% fragmentation, 8-cell with > 5% fragmentation, 9–12 cell, 6–7 cell, and 2–5 cell groups, respectively (Fig. 2). The best fit live birth rates based on 9-year moving centered age groups are shown in Fig. 3. In the 35-year-old age group, the 8-cell embryos with ≤ 5% fragmentation had the highest best fit LBR (29%, good quality) followed by 8-cell embryos with > 5% fragmentation (13%, fair quality), 9–12 cell embryos (10%, poor quality), and 6–7 cell embryos (9%, poor quality). The 2–5 cell group was excluded from this analysis to prevent overfitting.

We have previously validated an embryo transfer model based on the logic that if universal factors (such as uterine factors and others) are not favorable, no embryos will implant, but if universal factors are favorable, embryos are more likely to implant (Fig. 4). The model found favorable universal factors approximately 70% of the time which is consistent with the results of other studies [14, 15]. The model assigns LBR per embryo based on age at oocyte retrieval, cleavage or blastocyst stage transfer, and fresh or frozen embryo transfer. Due to the number of other factors considered, embryo morphology was not considered in the validated universal factors model. This current study is intended as an extension of that study to determine how cleavage stage embryo morphology affects the live birth rate per embryo.

Fig. 4
figure 4

Logic for predicting outcomes of a double embryo transfer incorporating a universal factors fraction and live birth rates for embryo 1 and embryo 2. The universal factors fraction (UNI) is a fraction from 0 (never favorable) to 1 (always favorable). The probability of each outcome is equal to the product of the terms next to the corresponding arrows above and the corresponding terms on the perimeter of the square. For example, the probability of twins is UNI × \( \frac{\mathrm{LBRe}1\ }{\mathrm{UNI}}\times \frac{\mathrm{LBRe}2\ }{\mathrm{UNI}} \). This same logic can be applied to transfer of more than two embryos. A UNI value of 0.70, representing adequate universal factors (such as uterine receptivity) 70% of the time, can be used as a best estimate for both fresh and frozen embryo transfers. UNI, universal factors fraction; LBRe1, live birth rate for embryo 1; LBRe2, live birth rate for embryo 2

In order to estimate the risk of multiples for a cleavage stage embryo transfer, the live birth rate per embryo and universal factors can be considered. For example, assume a 36-year-old has two day 3 embryos: one 8-cell embryo with 5% fragmentation and one 8-cell embryo with 10% fragmentation. From Fig. 3, the corresponding live birth rates per embryo are 23% and 14% respectively. The universal factors model suggests that 30% of the time universal factors are not favorable and neither embryo will result in a live birth. The remaining 70% of the time each embryo behaves independently of the other embryo with live birth rates of 0.23/0.7 and 0.14/0.7, or 33% and 20% respectively (Fig. 4). The end result is that incorporating universal factors does not change the live birth rate of an individual embryo but accounts for the increased risk of multiples when universal factors are favorable.

To simplify calculation of the risk of twins, the average live birth rate of all transferred embryos can be considered. Using the average will make our calculation more conservative by slightly overestimating the risk of twins. For the above example, the average of 23% and 14% is 18.5%. A simple table based on the logic in Fig. 4 can be used to give the expected rates of live birth and multiple birth for any given combination of number of embryos transferred and average live birth rate per embryo (Fig. 5). For transfer of two embryos with an average live birth rate rounded to 18%, the table shows a 31% live birth rate with 15% of the live deliveries predicted to be twin deliveries. The exact predicted result without averaging the live birth rates or rounding is a 32% live birth rate with 14% of those deliveries predicted to be twins.

Fig. 5
figure 5

Predicted transfer outcomes by average live birth rate per embryo and number of embryos for a universal factors fraction of 0.70. The multiples column results are shaded green, yellow, red, or gray to indicate the risk of multiples at delivery (0–9%, 10–19%, 20–29%, or ≥ 30% respectively). LBR, total live birth rate per embryo transfer; % mult., percentage of live deliveries that are multiples; % twins, percentage of live deliveries that are twin deliveries; % trip. or >, percentage of live deliveries that are triplets or greater

Discussion

In order to assess the live birth rates from embryo transfers of single and multiple embryos, a system of equations approach is useful as it allows for solving with linear algebra. We grouped embryos into five groups for our analysis of live birth rate per embryo using all embryo transfers: 2–5 cells, 6–7 cells, 8 cells with ≤ 5% fragmentation, 8 cells with > 5% fragmentation, and 9–12 cells (Fig. 2). The best fit live birth rate for the 2–5 cell group of − 1% can have two interpretations. First, this may indicate that transferring these embryos decreased the live birth rate of other embryos. While this interpretation may be biologically plausible, we are not aware of any studies that support this possibility. The second possible interpretation is that the best fit live birth rate of embryos with 2–5 cells is close to zero. This is biologically plausible and is supported by other studies [3, 5]. With the linear algebra approach, an unknown variable with a true value of zero is never expected to have a best fit value of exactly zero. It is expected that the best fit value would be a small positive number 50% of the time and a small negative number 50% of the time. In the case here, a value of − 1% is interpreted to be a small negative number close to zero. The average age at oocyte retrieval in this study was 39 years old. It is likely that embryos with 4–5 cells on day 3 from younger patients have a small but clinically significant implantation potential. In order to prevent overfitting, embryos with 2–5 cells were omitted from our analysis based on age (Fig. 3).

Based on live birth rates per embryo, we determined how to best assign overall embryo grades of good, fair, and poor. Embryos consisting of 8 cells with ≤ 5% fragmentation (good) had the highest best fit live birth rates followed by 8 cells with > 5% fragmentation (fair) and 6–7 or 9–12 cells (poor) as shown in Fig. 3. Other studies have also found the highest live birth rates for embryos with normal cleavage rates (exactly 4 cells on day 2 or 8 cells on day 3) [5, 16]. Future database studies could consider using these same groupings for greater consistency across clinics rather than a subjectively assigned overall embryo grade. The SART CORS database records fragmentation in four categories: 0%, 1–10%, 11–25%, and > 25% [4]. 8-cell embryos with 0% or 1–10% fragmentation could be considered to have good overall quality and 8-cell embryos with 11–25% or > 25% fragmentation could be considered to have fair overall quality.

Using moving groups centered on the age of interest is more preferable than fixed age groups. Our analysis is repeated for a group of embryos centered at each age from 35 through 42. This allows us to derive clinically useful age-related information that can be used for patient counseling. We assume that errors from including a large age range are offset by centering the range on the age of interest. With this method, even large age ranges such as 9-year age groups introduce only small errors as long as the decrease in live birth rate over the age range is approximately linear.

Our study, as is the case in many studies of individual clinics, was limited by a small dataset. However, our methods were designed to optimize use of a small dataset through using a moving centered age group and including both single and multiple embryo transfers. To avoid overcomplicating the model, we only considered age, cell number, and fragmentation. Transfer cycle type (fresh or frozen cycle) was not able to be incorporated into the model because further stratification was not possible after accounting for age and four different morphological categories. Although our clinical data from a previous analysis shows a higher live birth rate per embryo with fresh cleavage stage embryo transfers compared with frozen cleavage stage embryo transfers, this is likely due to selection bias since the best embryos are selected for fresh transfers. Finally, it is unclear if using either aggregated data from multicenter databases or data from an individual clinic is applicable to other individual clinics. There may be limited ability to compare this data with that of other published studies because age may be reported or grouped differently between studies.

In conclusion, age and embryo morphology can be used to estimate the live birth rate and risk of multiples for cleavage stage embryo transfers. Incorporation of universal factors into the model is important because with multiple embryo transfers, there are factors affecting all embryos transferred together. These factors may result in higher rates of multiple gestation than if embryos implant independently. This model, which can be fit to data from other clinics, can help to more accurately guide clinicians and patients on the maximum number of embryos that can be safely transferred at one time. Since the applicability of data from one clinic to another is uncertain, if this data is used by other clinics, it should only be used to limit the planned number of embryos to transfer. We hope these methods and our results help the field of infertility move towards quantitatively assessing the risk of multiple gestations prior to every embryo transfer.