An adaptive detection method for fetal chromosomal aneuploidy using cell-free DNA from 447 Korean women

Kim, Sunshin; Jung, HeeJung; Han, Sung Hee; Lee, SeungJae; Kwon, JeongSub; Kim, Min Gyun; Chu, Hyungsik; Han, Kyudong; Kwak, Hwanjong; Park, Sunghoon; Joo, Hee Jae; An, Minae; Ha, Jungsu; Lee, Kyusang; Kim, Byung Chul; Zheng, Hailing; Zhu, Xinqiang; Chen, Hongliang; Bhak, Jong

doi:10.1186/s12920-016-0222-5

An adaptive detection method for fetal chromosomal aneuploidy using cell-free DNA from 447 Korean women

Research article
Open access
Published: 03 October 2016

Volume 9, article number 61, (2016)
Cite this article

Download PDF

You have full access to this open access article

BMC Medical Genomics Aims and scope Submit manuscript

An adaptive detection method for fetal chromosomal aneuploidy using cell-free DNA from 447 Korean women

Download PDF

Sunshin Kim¹,
HeeJung Jung²,
Sung Hee Han³,
SeungJae Lee²,
JeongSub Kwon²,
Min Gyun Kim⁴,
Hyungsik Chu⁵,
Kyudong Han⁶,
Hwanjong Kwak⁷,
Sunghoon Park⁷,
Hee Jae Joo⁷,
Minae An¹,
Jungsu Ha¹,
Kyusang Lee⁸,
Byung Chul Kim⁸,
Hailing Zheng⁹,
Xinqiang Zhu⁹,
Hongliang Chen⁹ &
…
Jong Bhak^1,8,10,11

3590 Accesses
10 Citations
1 Altmetric
Explore all metrics

Abstract

Background

Noninvasive prenatal testing (NIPT) using massively parallel sequencing of cell-free DNA (cfDNA) is increasingly being used to predict fetal chromosomal abnormalities. However, concerns over erroneous predictions which occur while performing NIPT still exist in pregnant women at high risk for fetal aneuploidy. We performed the largest-scale clinical NIPT study in Korea to date to assess the risk of false negatives and false positives using next-generation sequencing.

Methods

A total of 447 pregnant women at high risk for fetal aneuploidy were enrolled at 12 hospitals in Korea. They underwent definitive diagnoses by full karyotyping by blind analysis and received aneuploidy screening at 11–22 weeks of gestation. Three steps were employed for cfDNA analyses. First, cfDNA was sequenced. Second, the effect of GC bias was corrected using normalization of samples as well as LOESS and linear regressions. Finally, statistical analysis was performed after selecting a set of reference samples optimally adapted to a test sample from the whole reference samples. We evaluated our approach by performing cfDNA testing to assess the risk of trisomies 13, 18, and 21 using the sets of extracted reference samples.

Results

The adaptive selection algorithm presented here was used to choose a more optimized reference sample, which was evaluated by the coefficient of variation (CV), demonstrated a lower CV and higher sensitivity than standard approaches. Our adaptive approach also showed that fetal aneuploidies could be detected correctly by clearly splitting the z scores obtained for positive and negative samples.

Conclusions

We show that our adaptive reference selection algorithm for optimizing trisomy detection showed improved reliability and will further support practitioners in reducing both false negative and positive results.

View this article's peer review reports

Statistical Approach to Decreasing the Error Rate of Noninvasive Prenatal Aneuploid Detection caused by Maternal Copy Number Variation

Article Open access 04 November 2015

Novel Algorithms for Improved Sensitivity in Non-Invasive Prenatal Testing

Article Open access 12 May 2017

Genome-wide detection of additional fetal chromosomal abnormalities by cell-free DNA testing of 15,626 consecutive pregnant women

Article 02 August 2018

Background

In 1997 Lo et al. reported that Y-chromosome derived, male, cell-free fetal DNA exists in maternal female blood plasma and serum similar to tumor DNA using a polymerase chain method [1]. Since then, molecular screening of cell-free DNA (cfDNA) for detecting fetal aneuploidy has generated much interest because aneuploidy and other chromosome aberrations are fairly common (nine out of 1,000 live births) [2]. As a result, the discovery has inspired the development of many detection methods [3]. However, the main obstacle in the development of fast and low-cost diagnostic assays remains the low fraction (<4 %) of cell-free, fetal DNA in mothers [4]. Especially when cell-free fetal DNA is less than 3.5 %, the number of unique DNA fragments increases exponentially to retain the required aneuploidy detection power [5]. In addition, detecting fetal aneuploidy at an early diagnostic stage is still difficult because the fraction of original fetal DNA is proportional to gestational age [6]. Earlier detection could facilitate further diagnoses and actions. In twin pregnancies, it is more difficult to detect fetal aneuploidy because the fetal fraction (FF) of an affected fetus may be far lower than 4 % [7]. FF could be reduced by 50 % owing to the proportion of a second normal fetus.

A high risk of fetal aneuploidy has been identified by the first or second trimester screening, including maternal age, ultrasound and maternal serum markers [8]. Women at high risk are subjected to invasive sampling of fetal materials by amniocentesis for gestational age at week 15 and by chorionic villus sampling for gestational age at week 12 [9, 10]. However, these tests carry the risk of iatrogenic pregnancy loss [11]. CfDNA screening, on the other hand, offers two, major, clinical benefits compared to invasive prenatal diagnoses: no risk of pregnancy loss and earlier detection. CfDNA screening does have several limitations, such as requirements for further invasive tests to confirm positive outcomes in the case of discordant results that might arise from placental or maternal cell mosaicism [12–14], the average size of cfDNA being only around 150 base pairs (bp) [15] and short half-life [16]. Even with these shortcomings, sequencing-based, cfDNA screening using statistically improved counting methods has risen in popularity among pregnant women [17–19].

Since cfDNA screening for fetal aneuploidy was introduced, reducing GC bias to detect aneuploidy with higher sensitivities by reducing the coefficient of variation (CV) has become a key issue. Fan et al. [17], for example, detected fetal aneuploidy initially by counting the number of unique reads within each sliding window, enabling clear separation of fetal trisomy outliers. They successfully detected nine cases of trisomy 21 (T21), two cases of T18, and one case of T13 in a cohort of 18 pregnancies by measuring sequence tag density relative to the corresponding value of the genome DNA control to remove GC bias representing the higher GC content. Meanwhile, Chiu et al. suggested a method of detecting fetal aneuploidy involving counting the unique reads mapped to each chromosome and calculating z-scores with the percentage of all the unique reads of each chromosome for a sample [18]. They correctly detected 14 T21 fetuses and 14 euploid fetuses with z score > 3 without considering GC bias; however, the higher GC content for chromosome X produced a smaller z score. They also performed a large-scale validity study using a previously established method that employs next-generation sequencing to detect fetal trisomy 21 in high-risk pregnancies with high accuracy. They detected 86 T21 fetuses with 100 % sensitivity and 97.9 % specificity among 753 pregnancies [20]. Jiang et al. improved cfDNA screening by employing GC-correlation to minimize GC-bias and estimate the fraction of cell-free fetal DNA as a key index to detect autosomal and sex chromosome aneuploidies with high accuracy [5]. In a total of 903 pregnancies, they detected autosomal aneuploidies with 100 % sensitivity and 99.9 % specificity, and sex chromosome aneuploidies with 85.7 % sensitivity and 99.9 % specificity by employing GC-correlation and data normalization. Recently, Liao et al. [21], reported a methodology used to systematically detect both autosomal and sex chromosomal aneuploidies with high accuracy. They employed an integrated method for GC correction, which includes LOESS regression, normalization and linear regression to reduce the effect of GC bias in a total of 515.

Despite these advances, the risks of false negatives and false positives still exist. In particular, cfDNA screening at low or high risk for fetal trisomy generates more false negative and false positive results [21]. In this study, we designed a new algorithm based on selecting reference samples adaptively according to the shared ranges of GC content and DNA reads fraction of a test sample (The GC-related terminologies used here were defined in Table 1).

Table 1 GC-related terminologies

Full size table

Methods

Study participants and testing methods

From December 2014 through April 2015, 447 women at high risk for fetal aneuploidy were enrolled into this study from 12 hospitals (Mirae & Heemang, Namujungwon, and GN and others in Korea). The characteristics of the pregnant women are outlined in Table 2. The mean maternal age was 35, and ranged from 25 to 42 years. The mean gestational age was 15 weeks, and ranged from 11 to 22. Of these women, 29 were carrying twins, and their features are outlined in Table 3. All 447 women endured invasive prenatal diagnostic testing (amniocentesis) for fetal karyotyping, the results of which were obtained by blind analysis. The institutional review board at each participating hospital approved this study. Written informed consent was obtained from all participants.

Table 2 Demographic characteristics in 447 pregnancies. Demographic characteristics of 447 pregnant women in 12 hospitals in Korea

Full size table

Table 3 Demographic characteristics in 29 twin pregnancies. Demographic characteristics of 29 twin pregnancies from 12 hospitals in Korea

Full size table

All women underwent standard prenatal aneuploidy screening using accredited clinical laboratories. First-trimester screening includes the measurement of serum pregnancy-associated plasma protein A, total or free beta subunit of human chorionic gonadotropin (hCG), and nuchal translucency. Second-trimester screening comprises measuring maternal serum alpha-fetoprotein, hCG, unconjugated estriol and inhibin A.

CfDNA preparation and maternal plasma DNA sequencing

About 10 mL of peripheral blood was collected from each participant in a BCT™ tube (Streck, Omaha, NE, USA). The blood sample was centrifuged at 1,200 × g for 15 min at 4 °C. The plasma portion of blood was transferred to microcentrifuge tubes and centrifuged again at 16,000 × g for 10 min at 4 °C. CfDNA was extracted from 1 mL of plasma using a QIAamp Circulating Nucleic Acid Kit (Qiagen, Netherland). The end-repair of the plasma cfDNA was carried out using T4 DNA polymerase, Klenow DNA polymerase, and T4 polymerase kinase. DNA libraries for the Ion Proton sequencing systems were constructed according to the protocol provided by the manufacturer (Life Technologies, SD, USA). Proton PI Chip Kit version 2.0 was used to yield an average 0.3× sequencing coverage depth per nucleotide.

Data analysis

We used three processing steps for our comparative cfDNA analyses. First, cfDNA was sequenced massively using the Ion Proton system. All raw reads obtained from the Ion Torrent Suite Software (Life Technologies) were trimmed from the 3′ end using a sequencing quality threshold value of 20 (Q score) and filtered by a read length threshold of 50 bp. The remaining reads were aligned to the human genomic reference sequences (hg19) using BWA [22]. Duplicate DNA reads were filtered out by the Picard program (http://broadinstitute.github.io/picard/). Second, the effect of GC bias was reduced using LOESS regression [23], normalization of samples [5] and linear regression [24]. Each chromosome was divided into bins of 20 kb. After LOESS correction [24], given the corrected unique reads (RC_ij) on chromosome j of sample i, the fraction of reads (\( R{f}_{i{j}^{\hbox{'}}} \)) was calculated as follows: \( R{f}_{i{j}^{\hbox{'}}}=R{C}_{ij}/{\displaystyle \sum_{j=1}^{22}}R{C}_{ij} \). The normalization of samples was calculated as follows: \( R{f}_{i^{\prime }{j}^{\prime }}=R{f}_{i{j}^{\hbox{'}}}/{\displaystyle \sum_{i=1}^N}R{f}_{i{j}^{\hbox{'}}} \). The final step was to perform a statistical analysis after selecting reference samples adapted to a test sample from all the reference samples [24]. The full linear regression model was established based on equation, \( R{f}_{i^{\prime }{j}^{\prime }}=\alpha + \beta \times G{C}_{i^{\prime }{j}^{\prime }}+\mathrm{e} \) where \( G{C}_{i^{\prime }{j}^{\prime }} \) was the GC content of chromosome j’ of sample i’, β was the coefficient factor between the fraction of reads and GC content, and e was the error term. Fitting of the predicted fraction of reads was calculated as \( R{f}_{i^{\prime }{j}^{\prime}}^{\hbox{'}} = \upalpha +\upbeta \times G{C}_{i^{\prime }{j}^{\prime }} \). The residual obtained by the equation \( \mathrm{R} = R{f}_{i^{\prime }{j}^{\prime }}-R{f}_{i^{\prime }{j}^{\prime}}^{\hbox{'}} \) was fitted to a normal distribution and was used to derive a z score for fetal aneuploidies [24].

Optimally adaptive reference samples were extracted from all reference samples belonging to a shared range of the GC content and the reads fraction of a test sample as shown in Additional file 1: Figure S1. The GC content range in this study was set from −0.001 to +0.001 as a stepping unit value when setting the GC content of a test sample as the median. The reads fraction range was set from −0.00005 to +0.00005 as a stepping unit value when setting the reads fraction of a test sample as the median, which was determined by the fitting predicted fraction of reads calculated as \( R{f}_{i^{\prime }{j}^{\prime}}^{\hbox{'}} = \upalpha +\upbeta \times G{C}_{i^{\prime }{j}^{\prime }} \) from all reference samples. The representative sample in each test group of samples was selected and used to generate a set of reference samples increasing by 0.001 in GC content and by 0.00005 in reads fraction. By adjusting the unit value of GC content or reads fraction, the resolution of sets of reference samples could be changed. That is, a smaller unit could make more conservative ranges of sets, while a larger unit could make less conservative ranges of sets. In this study, changes of 0.001 in GC content and 0.00005 in reads fraction were set as the default stepping unit values.

We reasoned that a suboptimal threshold could result from a suboptimal reference sample collection that is not adapted statistically optimally to the test sample. Therefore, we tried to collect a set of more optimized, or “adaptive”, reference samples. First, the positive samples in 0.01 intervals of GC content value were categorized into four groups of 0.41, 0.42, 0.43, and 0.44 GC content regions. This was to efficiently collect adaptive reference samples before extending a shared range of the GC content and the reads fraction of a test sample using the unit value of GC content or reads fraction. Thus, the two positive test samples in the 0.41 GC content region, the five positive test samples in the 0.42 region, the two positive test samples in the 0.43 region, and the four positive test samples in the 0.44 region were clustered according to the GC regions, respectively. Second, if a sample size was >2, the median G + C content in each group was chosen as the representative test sample. If there were only 2 samples in a group, a representative was arbitrarily chosen. Third, the representative sample was used to generate a set of reference samples by increasing the GC content by 0.001 and the reads fraction by 0.00005. This sets the common region shared with the GC content of the representative sample ± 0.001 and the reads fraction of the representative sample ± 0.00005, to generate a set of reference samples from the shared region. We repeated this to extend the reads fraction of the representative sample by ±0.0001 (increasing an absolute unit value of reads fraction) until the reads fraction reached the preset threshold (±0.001). We repeated this after extending the GC content of the representative sample by ±0.002 (increasing the absolute unit value of GC content) until the GC content reached the preset threshold (±0.02). Finally, sets of optimized reference samples were selected by checking the CVs, which were used to evaluate the quality of the set of reference samples.

A z score > 3 indicated the fraction of chromosome reads greater than that of the 99.9th percentile of the set of the reference samples for a one-tailed distribution [18]. We evaluated our method by performing cfDNA testing to assess the risk of trisomies 21, 18, and 13.

Results

From 447 plasma samples with existing karyotyping diagnoses, we showed that the adaptive selection strategy of reference samples produced a more reliable and robust result than the previous approach of using all reference samples. There were 13 fetuses with T21 (including three twin samples), one fetus with T18 in a twin pregnancy, one fetus with T13, and two fetuses with XXY. Seventeen samples with aneuploidy, 29 samples with twins, and five samples recognized as outliers were excluded from 447 samples to produce more reliable reference samples. Thus, we compared the adaptive selection method with the non-adaptive selection method using 396 reference samples. An average of approximately 7.4 ± 2.1 million raw reads were obtained per sample. When sequence reads mapped to only one genome location in the reference human genome, they were termed unique reads. Approximately 44.6 %, or 3.3 million unique reads, of the total raw reads were retained. The distribution of GC contents of these 396 samples ranged from 40 % to 51 %. The CV to evaluate the performance between the traditional and new methods were used.

As shown in Additional file 1: Figure S2, GC correction played an important role in reducing the CV. Bars represent the CV for chromosomes 13, 18, and 21 with and without LOESS-based GC correction among reference samples (n = 396). However, despite the GC correction of the samples, Fig. 1 shows that the threshold is still suboptimal in separating positive and negative results using a traditional method for chromosome 21 perhaps due to suboptimal reference samples. Figure 2 shows the CV for chromosome 21 with and without adaptive sample selection using a representative sample with a GC = 0.424. Every CV for the adaptive approach was lower than those for the baseline approach. Therefore, the adaptive approach provided higher sensitivity for T21. In fact, Fig. 3 shows the clear thresholds obtained using the six sets of adaptively selected samples. One, a representative test sample, of five T21 samples containing GC contents in the 0.42 region was used to select adaptive samples according to a GC range and a reads fraction range of the representative sample. The remaining four T21 samples were used to evaluate positive results using the six sets. We also selected six sets of the euploid samples within 0.424 ± 0.001 to evaluate negative results.

Figures 4 and 5 show similar results with adaptive sample selection using a representative sample with a GC = 0.437. In addition, Additional file 1: Figure S3.1 and Additional file 1: Figure S3.2, using a representative sample with a GC = 0.416, and Additional file 1: Figure S4.1 and Additional file 1: Figure S4.2, using a representative sample with a GC = 0.446, show similar results to our adaptive sample selection. As we had only one T18 and one T13 sample, we could not test these results using other T18 and T13 samples. However, we found that only one T18 or T13 reference could generate a good set of adaptively selected samples to clearly separate the z scores obtained for positive and negative samples (Additional file 1: Figure S5 and Additional file 1: Figure S6). A significant linear model was set up to analyze the relationship of the reads fractions and the GC contents of samples (Additional file 1: Figure S7).

Notably, we correctly detected three T21 and one T18 aneuploid samples in twin pregnancies. Currently, it requires an FF of at least 4 % to get reliable results for accurate cfDNA analyses [20, 25–30]. In our twin pregnancies, three positive T21 results were dizygotic twin pregnancies and one positive T13 result was a monozygotic twin pregnancy. Therefore, the three positive results could have been false negatives because the FF of the affected fetus could be below the 4 % threshold. Instead of determining FF in this study, we used the z score of aneuploid chromosomes, which shows a positive correlation with FF [24] as it is difficult to determine FF precisely. Notably, setting cutoff values of the z score as 2 for a negative result and 4 for a positive result showed that the specific results of this study satisfied the criteria (Fig. 3; f, Fig. 5; d, e).

Discussion

We have noted that the number of unique reads is correlated statistically with the GC content [5]. Therefore, obtaining robust results for fetal aneuploidy detection suggests the hypothesis that the GC content of the sample under test belongs to the range of GC contents of the reference samples. The reason for this being that the key criteria for detecting fetal aneuploidy in a test sample is the fitting of the predicted value from the reference samples. Thus, the predictability of the state of the test sample depends on the statistical state of the reference samples. Therefore, we applied this concept to detect fetal aneuploidies by selecting the reference samples adaptively according to the GC content of a test sample.

We observed that in the process of selecting adaptive reference samples, the range of the reads fraction of a test sample is important to the collection of suitable reference samples. Therefore, we investigated the adaptive reference samples belonging to the shared region of a GC content value and a reads fraction value.

In earlier studies, cfDNA screening for fetal aneuploidy was performed successfully using smaller sized, reference samples [17, 18]. Fan et al. [17] successfully detected 12 fetal aneuploidies by counting the number of unique reads within each sliding window and separating the outliers of fetal trisomy clearly with six reference samples using the higher sample GC contents of (range, 42 % to 50 %). The GC distribution of these samples was very similar to the GC distribution of our 396 reference samples (GC range, 40 % to 51 %). We investigated why the previous method did not detect fetal aneuploidy correctly, although it used a comparatively large number (n = 396) of reference samples. Considering the distribution of reads fractions vs. GC contents of samples as shown in Additional file 1: Figure S7, we hypothesized that the reason was the unbalanced distribution of the reads fractions of samples according to the increasing GC content values, especially at higher GC contents. On the other hand, Liao et al. [24] detected aneuploidies with high accuracy in 515 pregnancies (GC range, 38 % to 42 %) with a comparatively balanced distribution of reads ratios vs. GC contents of samples. Nevertheless, their results also had borderline values of the z scores of chromosome 21 for positive and negative samples. This means that the quality of a set of reference samples is more important than the sample size. Until now, mainstream cfDNA screening has focused on the issue of reducing GC bias by increasing the sample size. This study suggested an adaptive selection approach to collect samples adaptively according to the GC content of a test sample.

Although this approach was practically feasible in our data for detecting chromosome 21 aneuploidy, an independent, larger sample size is required to confirm our results. A sufficiently large sample size is necessary to decide how many reference samples would be required to obtain sufficient evidence of the reliability and validity of our results. In addition, although only one T18 or T13 test sample could generate a good set of adaptively selected samples (Additional file 1: Figure S5 and Additional file 1: Figure S6), we need to confirm our results by detecting T18 or T13 in independent positive samples, using the set of selected reference samples.

Conclusions

Using 447 samples, we developed a new adaptive method of selecting reference samples according to the combined values of the GC content and the reads fraction of a test sample. The approach was compared with the previous method using all reference samples in order to detect fetal aneuploidy and demonstrated to be reliable and robust.

Abbreviations

bp:: Base pairs
cfDNA:: Cell-free DNA
CV:: Coefficient of variation
hCG:: Human chorionic gonadotropin
NIPT:: Non-invasive prenatal testing
SD:: Standard deviation

References

Lo YM, Corbetta N, Chamberlain PF, Rai V, Sargent IL, Redman CW, Wainscoat JS. Presence of fetal DNA in maternal plasma and serum. Lancet. 1997;350(9076):485–7.
Article CAS PubMed Google Scholar
Cunningham F, et al. Williams Obstetrics. McGraw-Hill Professional. New York. 2002; p. 942.
Benn P, Cuckle H, Pergament E. Non-invasive prenatal testing for aneuploidy: current status and future prospects. Ultrasound Obstet Gynecol. 2013;42(1):15–33.
Article CAS PubMed Google Scholar
Norton ME, Brar H, Weiss J, Karimi A, Laurent LC, Caughey AB, Rodriguez MH, Williams III J, Mitchell ME, Adair CD, et al. Non-Invasive Chromosomal Evaluation (NICE) Study: results of a multicenter prospective cohort study for detection of fetal trisomy 21 and trisomy 18. Am J Obstet Gynecol. 2012;207(2):137 e131-138.
Article Google Scholar
Jiang F, Ren J, Chen F, Zhou Y, Xie J, Dan S, Su Y, Yin B, Su W, Zhang H, et al. Noninvasive Fetal Trisomy (NIFTY) test: an advanced noninvasive prenatal diagnosis methodology for fetal autosomal and sex chromosomal aneuploidies. BMC Med Genomics. 2012;5:57.
Article CAS PubMed PubMed Central Google Scholar
Wang E, Batey A, Struble C, Musci T, Song K, Oliphant A. Gestational age and maternal weight effects on fetal cell-free DNA in maternal plasma. Prenat Diagn. 2013;33(7):662–6.
Article CAS PubMed Google Scholar
del Mar GM, Quezada MS, Bregant B, Syngelaki A, Nicolaides KH. Cell-free DNA analysis for trisomy risk assessment in first-trimester twin pregnancies. Fetal Diagn Ther. 2014;35(3):204–11.
Article Google Scholar
Nicolaides KH. Screening for fetal aneuploidies at 11 to 13 weeks. Prenat Diagn. 2011;31(1):7–15.
Article PubMed Google Scholar
Fortuny A, Borrell A, Soler A, Casals E, Costa D, Carrio A, Puerto B, Seres A, Cararach J, Delgado R. Chorionic villus sampling by biopsy forceps. Results of 1580 procedures from a single centre. Prenat Diagn. 1995;15(6):541–50.
Article CAS PubMed Google Scholar
Crandall BF, Kulch P, Tabsh K. Risk assessment of amniocentesis between 11 and 15 weeks. comparison to later amniocentesis controls. Prenat Diagn. 1994;14(10):913–9.
Article CAS PubMed Google Scholar
Tabor A, Alfirevic Z. Update on procedure-related risks for prenatal diagnosis techniques. Fetal Diagn Ther. 2010;27(1):1–7.
Article PubMed Google Scholar
Grati FR, Malvestiti F, Ferreira JC, Bajaj K, Gaetani E, Agrati C, Grimi B, Dulcetti F, Ruggeri AM, De Toffol S, et al. Fetoplacental mosaicism: potential implications for false-positive and false-negative noninvasive prenatal screening results. Genet Med. 2014;16(8):620–4.
Article PubMed Google Scholar
McNamara CJ, Limone LA, Westover T, Miller RC. Maternal source of false-positive fetal sex chromosome aneuploidy in noninvasive prenatal testing. Obstet Gynecol. 2015;125(2):390–2.
Article PubMed Google Scholar
Wang Y, Chen Y, Tian F, Zhang J, Song Z, Wu Y, Han X, Hu W, Ma D, Cram D, et al. Maternal mosaicism is a significant contributor to discordant sex chromosomal aneuploidies associated with noninvasive prenatal testing. Clin Chem. 2014;60(1):251–9.
Article PubMed Google Scholar
Chan KC, Zhang J, Hui AB, Wong N, Lau TK, Leung TN, Lo KW, Huang DW, Lo YM. Size distributions of maternal and fetal DNA in maternal plasma. Clin Chem. 2004;50(1):88–92.
Article CAS PubMed Google Scholar
Lo YM, Zhang J, Leung TN, Lau TK, Chang AM, Hjelm NM. Rapid clearance of fetal DNA from maternal plasma. Am J Hum Genet. 1999;64(1):218–24.
Article CAS PubMed PubMed Central Google Scholar
Fan HC, Blumenfeld YJ, Chitkara U, Hudgins L, Quake SR. Noninvasive diagnosis of fetal aneuploidy by shotgun sequencing DNA from maternal blood. Proc Natl Acad Sci U S A. 2008;105(42):16266–71.
Article CAS PubMed PubMed Central Google Scholar
Chiu RW, Chan KC, Gao Y, Lau VY, Zheng W, Leung TY, Foo CH, Xie B, Tsui NB, Lun FM, et al. Noninvasive prenatal diagnosis of fetal chromosomal aneuploidy by massively parallel genomic sequencing of DNA in maternal plasma. Proc Natl Acad Sci U S A. 2008;105(51):20458–63.
Article CAS PubMed PubMed Central Google Scholar
No CO. 640: Cell-Free DNA Screening For Fetal Aneuploidy. Obstet Gynecol. 2015;126(3):e31–37.
Article Google Scholar
Chiu RW, Akolekar R, Zheng YW, Leung TY, Sun H, Chan KC, Lun FM, Go AT, Lau ET, To WW, et al. Non-invasive prenatal assessment of trisomy 21 by multiplexed maternal plasma DNA sequencing: large scale validity study. BMJ. 2011;342:c7401.
Article PubMed PubMed Central Google Scholar
Cheung SW, Patel A, Leung TY. Accurate description of DNA-based noninvasive prenatal screening. N Engl J Med. 2015;372(17):1675–7.
Article PubMed Google Scholar
Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25(14):1754–60.
Article CAS PubMed PubMed Central Google Scholar
Alkan C, Kidd JM, Marques-Bonet T, Aksay G, Antonacci F, Hormozdiari F, Kitzman JO, Baker C, Malig M, Mutlu O, et al. Personalized copy number and segmental duplication maps using next-generation sequencing. Nat Genet. 2009;41(10):1061–7.
Article CAS PubMed PubMed Central Google Scholar
Liao C, Yin AH, Peng CF, Fu F, Yang JX, Li R, Chen YY, Luo DH, Zhang YL, Ou YM, et al. Noninvasive prenatal diagnosis of common aneuploidies by semiconductor sequencing. Proc Natl Acad Sci U S A. 2014;111(20):7415–20.
Article CAS PubMed PubMed Central Google Scholar
Ehrich M, Deciu C, Zwiefelhofer T, Tynan JA, Cagasan L, Tim R, Lu V, McCullough R, McCarthy E, Nygren AO, et al. Noninvasive detection of fetal trisomy 21 by sequencing of DNA in maternal blood: a study in a clinical setting. Am J Obstet Gynecol. 2011;204(3):205 e201-211.
Article Google Scholar
Palomaki GE, Kloza EM, Lambert-Messerlian GM, Haddow JE, Neveux LM, Ehrich M, van den Boom D, Bombard AT, Deciu C, Grody WW, et al. DNA sequencing of maternal plasma to detect Down syndrome: an international clinical validation study. Genet Med. 2011;13(11):913–20.
Article CAS PubMed Google Scholar
Sehnert AJ, Rhees B, Comstock D, de Feo E, Heilek G, Burke J, Rava RP. Optimal detection of fetal chromosomal abnormalities by massively parallel DNA sequencing of cell-free fetal DNA from maternal blood. Clin Chem. 2011;57(7):1042–9.
Article CAS PubMed Google Scholar
Ashoor G, Syngelaki A, Wagner M, Birdir C, Nicolaides KH. Chromosome-selective sequencing of maternal plasma cell-free DNA for first-trimester detection of trisomy 21 and trisomy 18. Am J Obstet Gyneco. 2012;206(4):322 e321-325.
Article Google Scholar
Hudecova I, Sahota D, Heung MM, Jin Y, Lee WS, Leung TY, Lo YM, Chiu RW. Maternal plasma fetal DNA fractions in pregnancies with low and high risks for fetal chromosomal aneuploidies. PLoS One. 2014;9(2):e88484.
Article PubMed PubMed Central Google Scholar
Palomaki GE, Deciu C, Kloza EM, Lambert-Messerlian GM, Haddow JE, Neveux LM, Ehrich M, van den Boom D, Bombard AT, Grody WW, et al. DNA sequencing of maternal plasma reliably identifies trisomy 18 and trisomy 13 as well as Down syndrome: an international collaborative study. Genet Med. 2012;14(3):296–305.
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank MyungJun Jeong and Kyungtae Min for facilitating and participating in the experiments and assisting with the manuscript preparation.

Funding

This work was supported by GenomeCare internal research funding. JB was primarily supported by the 2014 Research Fund (1.140113.01) of UNIST (Ulsan National Institute of Science & Technology). JB is partly supported by The Genome Research Foundation and Geromics Inc. internal research funding. The Ion Proton platform is supported by Creative Dasan LINC-Dankook University.

Availability of data and materials

Additional information supporting the conclusions of this article is included in the supporting information file.

Authors’ contributions

Conceived and designed the experiments: SK, BK, HLC and JB. Performed the experiments: HK, SP, JH and HLC. Analyzed the data: SK, SP, SL, JK, MK, HC, KL, HJJ, HZ, MA, JH and XZ. Contributed reagents/materials/analysis tools: HJ, SH, SL, JK, MK, HC, HLC, KH, HZ, XZ, KL and HK. Wrote the manuscript: SK, HJ, MA, SH, KH, BK, HLC and JB. All authors read and approved the final manuscript.

Competing interests

We declare the following interests: co-authors Sunshin Kim, Minae An, and Jungsu Ha are employed by GenomeCare. Co-authors Hwanjong Kwak, Sunghoon Park, and Hee Jae Joo are employed by TheragenEtex Bio Institute. Co-authors Hailing Zheng, Xinqiang Zhu, and Hongliang Chen are employed by Xiamen Vangenes BioTech. Co-authors Kyusang Lee and Byung Chul Kim are employed by Clinomics. Co-author Jong Bhak is employed by TGI, UNIST and Geromics Inc.

Consent for publication

All authors have seen and approved the manuscript for publication.

Ethics approval and consent to participate

This study was approved by the institutional review board of all the participating hospitals (Mirae & Heemang, Namujungwon Maternity, GN Maternity, W-woman, Koeun Women, MIREA LADYS, SUMOK WOMEN, SHILLA Women, WOMEN’s i, YONSEI WOMEN & CHILDREN, WIN Women, and POHANG WOMEN). All patients provided written informed consent to participate.

Author information

Authors and Affiliations

GenomeCare, Suwon, Republic of Korea
Sunshin Kim, Minae An, Jungsu Ha & Jong Bhak
Mirae & Heemang OB/GYN Clinic, Seoul, Republic of Korea
HeeJung Jung, SeungJae Lee & JeongSub Kwon
Seoul Clinical Laboratories (SCL), Yongin, Republic of Korea
Sung Hee Han
Namujungwon Maternity Hospital, Yangju, Republic of Korea
Min Gyun Kim
GN Maternity Hospital, Pyeongtak, Republic of Korea
Hyungsik Chu
Department of Nanobiomedical Science, BK21 PLUS NBM Global Research Center for, Regenerative Medicine, Dankook University, Cheonan, Republic of Korea
Kyudong Han
TheragenEtex, Suwon, Republic of Korea
Hwanjong Kwak, Sunghoon Park & Hee Jae Joo
The Genomics Institute (TGI), BioMedical Engineering, UNIST, Ulsan, 687-798, Republic of Korea
Kyusang Lee, Byung Chul Kim & Jong Bhak
Xiamen Vangenes BioTech, Xiamen, Fujian, China
Hailing Zheng, Xinqiang Zhu & Hongliang Chen
Geromics, Ulsan, 687-798, Republic of Korea
Jong Bhak
Genome Research Foundation, Osong, Chungbuk, Republic of Korea
Jong Bhak

Authors

Sunshin Kim
View author publications
You can also search for this author in PubMed Google Scholar
HeeJung Jung
View author publications
You can also search for this author in PubMed Google Scholar
Sung Hee Han
View author publications
You can also search for this author in PubMed Google Scholar
SeungJae Lee
View author publications
You can also search for this author in PubMed Google Scholar
JeongSub Kwon
View author publications
You can also search for this author in PubMed Google Scholar
Min Gyun Kim
View author publications
You can also search for this author in PubMed Google Scholar
Hyungsik Chu
View author publications
You can also search for this author in PubMed Google Scholar
Kyudong Han
View author publications
You can also search for this author in PubMed Google Scholar
Hwanjong Kwak
View author publications
You can also search for this author in PubMed Google Scholar
Sunghoon Park
View author publications
You can also search for this author in PubMed Google Scholar
Hee Jae Joo
View author publications
You can also search for this author in PubMed Google Scholar
Minae An
View author publications
You can also search for this author in PubMed Google Scholar
Jungsu Ha
View author publications
You can also search for this author in PubMed Google Scholar
Kyusang Lee
View author publications
You can also search for this author in PubMed Google Scholar
Byung Chul Kim
View author publications
You can also search for this author in PubMed Google Scholar
Hailing Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Xinqiang Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Hongliang Chen
View author publications
You can also search for this author in PubMed Google Scholar
Jong Bhak
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Hongliang Chen or Jong Bhak.

Additional file

Additional file 1:

Figure S1 showed optimally adaptive reference samples extracted from all reference samples. Figure S2 showed that GC correction played an important role in reducing the CV. Figures S3.1, S3.2, S4.1, S4.2, S5 and S6 represented similar results to our adaptive sample selection. Figure S7 represented the relationship of the reads fractions and the GC contents of samples. (DOCX 2063 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Kim, S., Jung, H., Han, S.H. et al. An adaptive detection method for fetal chromosomal aneuploidy using cell-free DNA from 447 Korean women. BMC Med Genomics 9, 61 (2016). https://doi.org/10.1186/s12920-016-0222-5

Download citation

Received: 04 March 2016
Accepted: 21 September 2016
Published: 03 October 2016
DOI: https://doi.org/10.1186/s12920-016-0222-5

An adaptive detection method for fetal chromosomal aneuploidy using cell-free DNA from 447 Korean women