Introduction

The discovery of cell-free fetal DNA (cffDNA) fragments in the maternal bloodstream1 in combination with the development of massively parallel sequencing has made it possible to perform non-invasive prenatal testing (NIPT). The traditional invasive procedures for prenatal aneuploidy testing, amniocentesis and chorionic villi biopsy, are associated with an elevated miscarriage risk2. This disadvantage can be overcome by NIPT, which can detect fetal aneuploidies in maternal blood as early as ten weeks into the pregnancy without the need for an invasive procedure3. NIPT makes use of cell-free DNA fragments isolated from blood plasma. Some of these fragments, the cffDNA, originate from the placenta and are informative of the fetus: when a chromosomal trisomy is present, the number of fragments originating from that chromosome will be higher than what is expected based upon statistical analysis using a set of non-trisomy control samples. Because NIPT is based upon analysis of very small amounts of DNA, measurements are very sensitive to the introduction of variability between samples and experiments. The statistical analysis in NIPT was first improved by the introduction of the Z-score calculation4, which compares the individual sample with a set of non-trisomy controls. However, when applying the standard Z-score calculation without prior data correction, a high variability was found for chromosomes 13 and 185. This is undesirable because it lowers the sensitivity of the test. Thus, if a low fraction of cffDNA is present, there is a risk of false-negative results.

An important cause of variability is the guanine and cytosine (GC) content of the DNA fragments analyzed. There are various GC-bias correction methods, such as those based on locally weighted scatterplot smoothing regression (LOESS)5,6,7,8 or on the average coverage of genomic regions having a similar GC-content9. We used the latter method in combination with a peak correction that removes regions having significantly more reads than average9.

Variability can also be reduced by adapting the Z-score calculation, for instance by using the normalized chromosome value (NCV)6, 10 or the median absolute deviation (MAD) based Z-score11.

Our aim here was to further decrease variability and thus increase the sensitivity of NIPT. We therefore developed three new algorithms: the chi-squared-based variation reduction (χ2VR), the regression-based Z-score (RBZ), and the Match QC score. The χ2VR reduces the weight of the number of reads in regions that have a higher variation than expected by chance, regardless of the origin of the bias. The RBZ uses a model based on forward regression for prediction. The Match QC score calculates whether the non-trisomy control set is representative for the analyzed sample.

We compared the performance of our algorithms against and in combination with existing algorithms. Furthermore, we show that the Match QC score can indicate whether a sample fits within a control set.

Material and Methods

To assess the added value of the χ2VR, RBZ and the Match QC score to the sensitivity and quality control of trisomy prediction, the performance of the algorithms was compared to that of existing variation reduction methods (peak correction and bin or LOESS GC correction) and trisomy prediction methods (standard Z-score, NCV and MAD-based Z-score) (Fig. 1). We included all methods used, except peak correction and the MAD-based Z-score, in NIPTeR, an R package publicly available under the GNU GPL open source license on CRAN and at https://github.com/molgenis/NIPTeR.

Figure 1

Flowchart showing the analysis steps. (a) First, sequenced reads are aligned, partitioned into 50,000 bp bins and counted. These bins are the units for further analysis and data quality can be improved using zero or more variation reduction methods. (b) Peak correction removes bins showing an unusually high coverage compared with the average coverage of bins on the same chromosome. GC correction corrects for coverage differences between bins having a different GC percentage, using one of two methods: ‘bin’ or ‘LOESS’ GC-correction. The chi-squared variation reduction corrects bins showing a higher variation in read counts between samples than expected by chance. Analysis is performed based on (corrected) read counts. (c) The Match QC indicates whether a control-group is informative for the analyzed sample. (d) Various algorithms (standard Z-score, MAD-based Z-score, Normalized Chromosome Value and Regression-based Z-score) are used for predicting trisomy.

We focused on whole genome sequencing analysis, in which the fraction of sequenced reads originating from the chromosome of interest in the sample is compared with that of a set of non-trisomy control samples. In all analyses, only data from autosomal chromosomes was used.

Each chromosome was partitioned into bins of 50,000 base pairs. This bin size is in line with previous methods3, 5,6,7, 9. In each bin, the numbers of reads aligned to the forward and reverse strands were counted. The bin counts were used as the basic components for all further processing.
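The binning step can be sketched as follows (an illustrative Python fragment, not the NIPTeR implementation; function and variable names are ours, and read positions and strand calls are assumed to come from the alignment):

```python
from collections import defaultdict

BIN_SIZE = 50_000  # base pairs per bin

def count_reads_per_bin(reads):
    """Count reads per (chromosome, bin index, strand).

    `reads` is an iterable of (chromosome, position, strand) tuples,
    with strand '+' for forward and '-' for reverse.
    """
    counts = defaultdict(int)
    for chrom, pos, strand in reads:
        counts[(chrom, pos // BIN_SIZE, strand)] += 1
    return dict(counts)

# Hypothetical example: three reads on chromosome 21
reads = [("21", 10_000, "+"), ("21", 49_999, "+"), ("21", 60_000, "-")]
counts = count_reads_per_bin(reads)
```

The per-strand counts are kept separate here because the regression-based Z-score described below uses forward- and reverse-strand counts as distinct predictors.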

Chi-squared-based variation reduction

The novel χ2VR reduces the weight of the number of reads in bins that show a higher variation than expected by chance, and thus reduces the impact of these bins on the chromosomal fractions. No prior knowledge of the origin of the variation is needed. Per bin, chi-squared values are summed over all selected control samples. For this calculation, the observed read counts o are first normalized by multiplying them with a normalization factor. This factor is the mean observed read count over all autosomal bins i of all control samples j divided by the mean observed read count over all autosomal bins of the sample s. In short, the observed normalized read count for a specific bin (onis) can be calculated as follows:

$$o{n}_{is}={o}_{is}\times \frac{({\sum }_{j=1}^{{n}_{j}}{\sum }_{i=1}^{{n}_{i}}{o}_{ij})/({n}_{i}\times {n}_{j})}{({\sum }_{i=1}^{{n}_{i}}{o}_{is})/{n}_{i}}$$
(1)

where ni is the number of bins and nj is the number of control samples. Then, the chi-squared value for each bin i is calculated for each control sample j by dividing the squared difference between the expected and observed normalized read count by the expected normalized read count for that bin, where the expected normalized read count is the average normalized read count for a specific bin in all control samples (µij). The sum chi-squared value is calculated by adding up the chi-squared values of all the control samples for the bin:

$$\sum _{j=1}^{n}{\chi }_{ij}^{2}=\sum _{j=1}^{n}\frac{{({\mu }_{ij}-o{n}_{ij})}^{2}}{{\mu }_{ij}}$$
(2)

The sum chi-squared value for each bin is transformed to a standard normal distribution N(0, 1) by subtracting the degrees of freedom df (number of control samples minus one) from the sum chi-squared value and dividing this by the square root of two times the degrees of freedom.

$$N(0,1)=\frac{({\sum }_{j=1}^{n}{\chi }_{ij}^{2})-df}{\sqrt{2df}}$$
(3)

This results in a Z-score, which gives the number of standard deviations (SD) by which an observation differs from the expectation. Normalized read counts in bins with a Z-score higher than 3.5 are divided by the ratio of the sum chi-squared value to the degrees of freedom, thereby reducing the variability between samples. Normalized read counts in bins with a Z-score of 3.5 or lower are not corrected. The justification for this procedure is that probability plots follow the expected chi-squared distribution up to a Z-score of about 3.5. Values above 3.5 are much more frequent than would be expected, so instead of discarding those bins we chose to reduce their weights, assuming that information is still present in the over-dispersed bin counts. An overview of the analysis steps and their effects is shown in Supplement 1.
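The steps above can be summarized in a short sketch (illustrative Python over a matrix of already-normalized bin counts; this is not the NIPTeR code, and the function name is ours):

```python
import numpy as np

def chi_squared_vr(norm_counts, z_cutoff=3.5):
    """Chi-squared-based variation reduction (sketch).

    norm_counts: array of shape (n_bins, n_controls) holding normalized
    read counts per bin for each control sample. Returns the corrected
    counts and the per-bin Z-scores.
    """
    n_bins, n_controls = norm_counts.shape
    mu = norm_counts.mean(axis=1, keepdims=True)         # expected count per bin
    chi_sq = ((mu - norm_counts) ** 2 / mu).sum(axis=1)  # sum chi-squared per bin
    df = n_controls - 1                                  # degrees of freedom
    z = (chi_sq - df) / np.sqrt(2 * df)                  # transform to N(0, 1)
    corrected = norm_counts.copy()
    overdispersed = z > z_cutoff
    # down-weight over-dispersed bins by chi_sq / df
    corrected[overdispersed] /= (chi_sq[overdispersed] / df)[:, None]
    return corrected, z
```

Bins whose counts follow the expected chi-squared distribution are left untouched; only the over-dispersed bins are down-weighted.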

Regression-based Z-score

The RBZ combines linear regression with a Z-score calculation. In the RBZ calculation, the fraction of the chromosome of interest is predicted using stepwise regression with forward selection, hereafter called forward regression. The reads aligned to the forward and reverse strands are used as separate predictors, because several chromosomes show a small, but consistent, over- or underrepresentation of reads aligned to the forward or reverse strand (Supplement 2). However, all reads aligned to the chromosome of interest are taken together rather than separated, because the higher number of reads leads to a lower variability in the number of reads aligned to the chromosome of interest.

For each chromosome of interest, the four best predictor sets, which each consist of four predictors, are determined by forward regression, using the adjusted R squared of the model as a selection criterion. The predictors can have either a positive or a negative correlation with the chromosome of interest. Within each predictor set only one predictor can be selected from each chromosome, limiting the risk of introducing bias.

Using the model, the expected chromosomal fraction (ef) of the chromosome of interest is calculated for the sample of interest and for each control sample. Subsequently, the observed chromosomal fraction of the total read count of the chromosome of interest (of) is divided by this expected fraction. In combination with the standard deviation of this ratio in the control group, a Z-score is calculated for each sample. Because the mean of the control group after regression is one, the coefficient of variation of the control group has the same value as the SD.

In short, the RBZ can be formulated as:

$$\frac{o{f}_{s}/e{f}_{s}-1}{\sqrt{{\sum }_{j=1}^{n}{(o{f}_{j}/e{f}_{j}-\overline{of/ef})}^{2}/(n-1)}}$$
(4)

where s is the sample of interest, j is an individual control sample and n is the total number of control samples.

The RBZ not only uses information from chromosomes having a positive correlation of read counts with the chromosome of interest, but also from chromosomes showing a negative correlation. An overview of an example RBZ calculation is shown in Supplement 3.
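As an illustration, a minimal Python sketch of forward selection and the Z-score of equation (4). It omits the paper's constraints (separate forward/reverse-strand predictors, at most one predictor per chromosome, four predictor sets of four predictors each) and uses the residual sum of squares as the selection criterion, which ranks models identically to the adjusted R-squared when the number of predictors at each step is fixed:

```python
import numpy as np

def forward_select(X, y, k=4):
    """Greedily pick k predictor columns of X for predicting y."""
    chosen, remaining = [], list(range(X.shape[1]))
    for _ in range(k):
        best, best_rss = None, np.inf
        for j in remaining:
            # fit ordinary least squares with an intercept
            A = np.column_stack([np.ones(len(y)), X[:, chosen + [j]]])
            coef = np.linalg.lstsq(A, y, rcond=None)[0]
            resid = y - A @ coef
            rss = float(resid @ resid)
            if rss < best_rss:
                best, best_rss = j, rss
        chosen.append(best)
        remaining.remove(best)
    return chosen

def rbz(of_s, ef_s, of_controls, ef_controls):
    """Regression-based Z-score (equation 4): the sample's observed/expected
    ratio, centred at one, divided by the SD of that ratio in the controls."""
    ratios = np.asarray(of_controls) / np.asarray(ef_controls)
    n = len(ratios)
    sd = np.sqrt(((ratios - ratios.mean()) ** 2).sum() / (n - 1))
    return (of_s / ef_s - 1) / sd
```

In the paper, four such predictor sets are estimated per chromosome of interest to check the consistency of the prediction; the sketch builds a single set.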

Match QC score

For the sample of interest, the novel Match QC score algorithm calculates how well the overall pattern of chromosomal fractions matches the pattern of the control samples. If the pattern of the sample differs too much from that of the controls, the sample does not fit within the control group, making the control set non-representative for the sample. Cut-offs are control-group-specific and can be set using the Match QC scores of the individual control group samples. The Match QC score uses the data used for trisomy prediction as input. Variation reduction, e.g. GC-correction or χ2VR, is applied before calculating the Match QC score.

To obtain the Match QC score, first the chromosomal fractions (of) are calculated for the sample and all control samples. This is done by dividing the (weighted or corrected) total read count of each chromosome by the total read count of all autosomal chromosomes, excluding chromosomes 13, 18 and 21. Subsequently, for each control sample, the sum of squared differences of the chromosomal fractions between the sample and the control for all autosomal chromosomes, excluding chromosomes 13, 18 and 21, is calculated.

In short, the Match QC score between a sample of interest s and an individual control sample j can be formulated as:

$$\sum _{k=1}^{m}{(o{f}_{ks}-o{f}_{kj})}^{2}$$
(5)

where k is the chromosome and m is the total number of chromosomes, excluding chromosomes 13, 18 and 21.

Smaller differences indicate a better match. An overall Match QC score is calculated by averaging the scores over all control samples. The formula for the overall Match QC score is:

$$\frac{{\sum }_{j=1}^{n}{\sum }_{k=1}^{m}{(o{f}_{ks}-o{f}_{kj})}^{2}}{n}$$
(6)

where n is the number of control samples.
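Equations (5) and (6) amount to a few lines of code (illustrative Python sketch; the chromosomal fractions are assumed to be precomputed over the autosomes excluding chromosomes 13, 18 and 21, and the function name is ours):

```python
import numpy as np

def match_qc(frac_sample, frac_controls):
    """Overall Match QC score: the mean over control samples of the sum
    of squared differences in chromosomal fractions.

    frac_sample: chromosomal fractions of the sample of interest.
    frac_controls: array of shape (n_controls, n_chromosomes).
    """
    frac_sample = np.asarray(frac_sample, dtype=float)
    frac_controls = np.asarray(frac_controls, dtype=float)
    per_control = ((frac_controls - frac_sample) ** 2).sum(axis=1)  # eq. (5)
    return per_control.mean()                                       # eq. (6)
```

A sample identical to every control yields a score of zero; larger scores indicate a poorer fit of the sample within the control group.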

Validation of algorithms

Samples

To assess the effects of different variation reduction and trisomy prediction algorithms, we sequenced 128 non-trisomy and 43 trisomy samples using the SOLiD Wildfire platform (Life Technologies, Carlsbad, CA, USA) and 142 non-trisomy and 7 trisomy samples using the HiSeq 2500 platform (Illumina, San Diego, CA, USA). A further 34 non-trisomy samples underwent an alternative plasma-isolation procedure and were sequenced on the HiSeq 2500. The trisomy status of all samples was determined using karyotyping or quantitative fluorescence PCR following amniocentesis or chorionic villi biopsy.

Samples were selected in accordance with, and as part of, the Trial by Dutch laboratories for Evaluation of Non-Invasive prenatal Testing (TRIDENT) program, supported by the Dutch Ministry of Health, Welfare and Sport (11016-118701-PG). The program was also approved by the Ethics Committee of the University Medical Center Groningen. All participants signed an informed consent form.

Plasma isolation, sample preparation and sequencing

Plasma was obtained from two different sources. The first source was fresh EDTA blood, either processed within 3 hours of blood collection or within 24 hours if stabilizing reagent was present in the tubes (Streck Inc., Omaha, NE, USA). For samples sequenced using the Illumina platform, blood was first centrifuged at 1200 rcf for 10 minutes, without using brakes to stop the rotor. The plasma was then transferred to another tube and centrifuged at 2400 rcf for 20 minutes. The plasma was transferred to a third tube and stored at −80 °C. For samples sequenced on the SOLiD platform, the centrifugal forces used were 1600 rcf and 16,000 rcf, respectively. The second source of plasma was obtained using an alternative isolation method using only the first centrifugation step at 1200 rcf, after which the blood plasma was stored at −20 °C.

For samples sequenced on the HiSeq, we isolated cell-free DNA (cfDNA) from 1.5 ml plasma with the QIAamp MinElute Virus Spin kit (Qiagen, Valencia, CA, USA) (90 non-trisomy and 6 trisomic samples), the Qiagen circulating nucleic acid kit (Qiagen) (21 non-trisomy samples) and the Akonni TruTip kit (Akonni Biosystems, Frederick, MD, USA) (31 non-trisomy samples and 1 trisomic sample). After DNA isolation, sample preparation was performed with NEBNext Multiplex Oligos for Illumina (New England Biolabs Inc., Ipswich, MA, USA). Before the amplification step, we performed a two-step size selection using Agencourt AMPure XP beads (Beckman Coulter, Inc., Brea, CA, USA), using a beads/sample ratio of 0.6:1 in the first step and a ratio of 1.2:1 in the second step. Samples were sequenced with a 50 bp read length on a HiSeq 2500 sequencing platform (Illumina).

For samples sequenced on the SOLiD, cfDNA was extracted from 1 ml plasma using the QIAamp DSP DNA Blood Mini kit (Qiagen). Libraries were prepared according to the manufacturer's protocol and sequenced with a 35 bp read length on the SOLiD 5500 Wildfire sequencing platform (Life Technologies).

Read alignment

For Illumina data, after an initial quality control of the fastq data using FastQC (v0.7.0), the data were aligned to the human reference genome build b37, as released by the 1000 Genomes project12, using BWA aln/samse (0.5.8_patched) with default settings13. After alignment, a Sam output file14 was created for each sample. Using Picard tools 1.6.1 (Broad Institute, Cambridge, USA; http://broadinstitute.github.io/picard/), the Sam files were converted into Bam files, which were then sorted and indexed; the Bam index files link the reads to their genome positions. Quality metrics files were then created and the duplicate reads in the Bam files were marked.

For SOLiD data, raw reads were mapped against the human reference genome (GRCh37/hg19) using BWA v0.5.913. Options used for mapping were -c, -l 25, -k 2, and -n 10. The Bam files were filtered using Sambamba v0.4.515 to retain non-duplicate reads, uniquely mapped reads (XT:A:R), reads with no mismatches to the reference genome (CM:i:0), and reads with no second-best hits in the reference genome (X1:i:0).

After filtering and removal of duplicate reads, the total autosomal read count was on average 20.2 million (SD 5.6 million) for SOLiD data and 12.5 million (SD 2.2 million) for Illumina data.

Variation reduction

Aligned reads were divided into 50,000 bp bins and variation between samples was reduced using all possible combinations of zero or more variation reduction methods: peak correction, GC-correction and χ2VR. When more than one method was used, they were performed in the order described above (Fig. 1). A maximum of one GC-correction method was used. Since the LOESS GC-correction has been described more often5,6,7,8 than the weighted bin GC-correction9, we used LOESS GC-correction to evaluate the other variation reduction and prediction methods.

Peak correction

Peak correction was performed as described by Fan and Quake9. This method removes bins whose read count significantly differs from the average, using the information of all control samples. A bin was considered to deviate if its total read count fell outside 1.96 SD of the total read counts of the bins on the same chromosome for that sample. Bins were considered to show a consistent pattern of region-specific variation, and were removed, if they deviated in 95% or more of the control samples.
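A minimal sketch of this filter (illustrative Python; the function name is ours and the exact handling in Fan and Quake9 may differ in detail):

```python
import numpy as np

def peak_bins(counts, z=1.96, consistency=0.95):
    """Identify peak bins to remove.

    counts: array of shape (n_controls, n_bins) of read counts for one
    chromosome. A bin is flagged in a sample when its count lies more
    than `z` SD from that sample's chromosome mean; bins flagged in at
    least `consistency` of the control samples are marked for removal.
    """
    mean = counts.mean(axis=1, keepdims=True)  # per-sample chromosome mean
    sd = counts.std(axis=1, keepdims=True)     # per-sample chromosome SD
    deviates = np.abs(counts - mean) > z * sd
    return deviates.mean(axis=0) >= consistency
```

The returned boolean mask can then be used to drop the flagged bins before further analysis.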

GC-correction

An important factor explaining the systematic uncontrolled variation between chromosomes is the guanine and cytosine (GC) content of the DNA fragments analyzed. Correcting this GC-bias during preprocessing of the data results in a significantly lower variability8. GC-correction was performed based on total read counts using two different methods. The first GC-correction method fits a LOESS curve to the read counts in bins sorted on GC content5,6,7,8, using the R v3.0.2 default settings (span = 0.75; degree = 2). The second GC-correction method is based on the average coverage of bins having a similar GC-content9. The GC% of each bin is determined for both methods. Bins not containing any reads and bins with an unknown base composition are ignored. The weights used as correction factors were based on GC-content intervals of 0.1% and consist of the average coverage of the bins within each GC interval divided by the average coverage of all bins.
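The second (weighted bin) method can be sketched as follows (illustrative Python; assumes per-bin read counts and GC percentages, with empty and unknown-composition bins already removed, and the function name is ours):

```python
import numpy as np

def bin_gc_correct(counts, gc, interval=0.1):
    """Weighted-bin GC correction (sketch): reads in each bin are divided
    by the average coverage of bins with a similar GC percentage,
    relative to the overall average coverage.

    counts: per-bin read counts; gc: per-bin GC percentage.
    """
    counts = np.asarray(counts, dtype=float)
    overall = counts.mean()
    # assign each bin to a 0.1% GC interval
    group = np.round(np.asarray(gc) / interval).astype(int)
    corrected = counts.copy()
    for g in np.unique(group):
        sel = group == g
        weight = counts[sel].mean() / overall  # interval coverage vs overall
        corrected[sel] = counts[sel] / weight
    return corrected
```

After correction, the average coverage of each GC interval equals the overall average, flattening the GC-coverage relationship.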

Trisomy prediction

We predicted trisomies using four different prediction methods: the standard Z-score5, the NCV (which uses only the most informative chromosomes)10, the MAD-based Z-score11, and the RBZ. Depending on the variation reduction methods employed, we used corrected or uncorrected read counts for prediction. In all analyses, chromosomes 13, 18 and 21 were not used as predictor chromosomes, since a trisomy in one of the chromosomes used for prediction would affect the prediction.

In short, the standard Z-score calculates the fraction of reads originating from the chromosome of interest compared with all reads originating from autosomal chromosomes, and then subtracts the mean fraction – which is the expected fraction – of the chromosome of interest in a set of control samples. The result is then divided by the SD of the fraction in the control set.

The NCV does not use all the autosomal chromosomes to calculate the fraction of the chromosome of interest, instead using the most informative chromosomes, which were selected using a training set10. All combinations of denominator chromosomes were tested for both the Illumina and SOLiD datasets, and the combinations yielding the lowest CVs were selected. The NCV is sometimes compared to using an internal reference6 because, during analysis, the selected reference chromosomes behave similarly to the chromosome of interest. This positive correlation results in less sample to sample variation, reduces the need for GC correction, and increases prediction precision.

The MAD-based Z-score replaces the SD by 1.4826 * MAD, making the calculation more tolerant of outliers in the control set11. The MAD was calculated in three steps. First, the median of the fractions of the chromosome of interest in the control set was calculated. Second, the absolute difference of the chromosomal fraction to the median was calculated for each control sample. Finally, the MAD was calculated by taking the median of these absolute differences.
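Both Z-score variants can be sketched together (illustrative Python; using the median as the centre of the MAD-based score is our assumption, since the text specifies only the replacement of the SD by 1.4826 * MAD):

```python
import numpy as np

def z_score(frac_sample, frac_controls, robust=False):
    """Standard Z-score, or MAD-based Z-score when robust=True."""
    frac_controls = np.asarray(frac_controls, dtype=float)
    if robust:
        med = np.median(frac_controls)
        mad = np.median(np.abs(frac_controls - med))  # median absolute deviation
        return (frac_sample - med) / (1.4826 * mad)
    # standard Z-score: deviation from the control mean in control SDs
    return (frac_sample - frac_controls.mean()) / frac_controls.std(ddof=1)
```

The factor 1.4826 makes the MAD a consistent estimator of the SD for normally distributed data, while remaining tolerant of outliers in the control set.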

Comparison of the algorithms

In comparing the algorithms, we used the CV as a benchmark for performance. The CV is a standardized measure of dispersion of a probability distribution, defined as the ratio of the SD to the mean; it enables comparison between normal distributions with different means. The magnitude of the CV of the control group, together with the percentage of cffDNA, determines the discriminative power between normal and trisomic samples: when the CV decreases, the sensitivity increases (Supplement 4). We determined the contribution of each variation reduction or prediction algorithm to lowering the CV in order to determine the best combination of algorithms.

For our analysis, we used as control samples all the non-trisomy samples sequenced with the same platform that underwent the same plasma isolation procedure. This resulted in control group sizes of 142 for the Illumina and 128 for the SOLiD sequencer. For all algorithms, the control group was only used when it was normally distributed, as determined using the Shapiro-Wilk test (p > 0.05).

Algorithm combinations tested

We evaluated the effects of both peak correction and χ2VR on the CV of the control samples, the effect of the two different GC correction methods in combination with all prediction methods on the CV, and the effect of the different prediction methods on CV and Z-scores in combination with all possible variation reduction methods, except peak correction and the bin GC correction. The consistency of the RBZ trisomy prediction was determined by estimating three additional trisomy prediction models for each analysis.

Match QC score

To provide a proof of principle for the Match QC score performance, we divided the Illumina control group into a training set of 85 and a test set of 57 samples. The 34 Illumina samples that underwent a different plasma isolation protocol were used as an example of samples having undergone an alternative procedure.

We then calculated the Match QC score for all samples, using uncorrected, χ2VR-corrected, LOESS GC-corrected, and combined LOESS GC- and χ2VR-corrected data. Cut-offs for the Match QC score were set at the average Match QC of the training set plus three SD. For all samples, Z-scores were calculated for chromosomes 13, 18 and 21 to determine whether the scores fell within three SD of the average of the control set.

Results

For both the SOLiD and Illumina control groups, the CV of chromosomes 13, 18 and 21 was determined for all combinations of variation reduction and trisomy prediction methods and their theoretical effect on sensitivity and specificity was calculated (Supplement 5). The estimated percentages of cffDNA in the tested trisomy samples are shown in Supplement 6.

Effect of peak correction

To examine the effect of correcting bins with a coverage that deviates significantly from the average, we compared the CV of the peak-corrected data with that of data on which no peak correction was performed. Peak correction reduced the CV in most analysis strategies (Fig. 2). The largest relative effect for all chromosomes was observed when a GC-correction was also performed. The effect was largest in chromosome 21, which was the chromosome showing the lowest GC-bias when no correction was applied, suggesting that the influence of coverage peaks on variability only comes to light when GC-bias is limited. In data that was also χ2VR corrected, the variation did not decrease further and sometimes even increased after peak correction. This suggests that the peak correction and the χ2VR partly correct the same sources of bias.

Figure 2

Effect of peak correction on the CV of control samples. The effect is shown for SOLiD (white) and Illumina data (black) with no other correction, for data that also had a chi-squared correction, or LOESS GC correction, or both LOESS GC and chi-squared correction. For each type of correction the CV of four prediction algorithms (standard Z-score, MAD-based Z-score, Normalized Chromosome Value and regression-based Z-score) are shown for (a) chromosome 13, (b) chromosome 18 and (c) chromosome 21. not peak corrected; *peak corrected.

Effects of the two GC correction methods

To examine the performance of the weighted bin GC correction and the LOESS GC-correction, we compared the performance of both methods in combination with all other variation reduction and prediction methods for chromosomes 13, 18 and 21 (Fig. 3). For chromosome 13, both GC correction methods performed equally well regardless of the other variation reduction and prediction methods used. For chromosome 18, the weighted bin GC correction had a better performance for the NCV and RBZ compared to LOESS GC correction. However, the Z-score and MAD-based Z-score predictions performed better using the LOESS GC-correction. For chromosome 21, the weighted bin GC correction performed best, regardless of the other methods used. The data sets used made no difference to the performance of either GC-correction method.

Figure 3

Comparison of the effect of two GC correction methods (bin GC correction and LOESS GC correction) on the CV of the control samples. SOLiD data (white) and Illumina data (black). For each type of correction the CVs of four prediction algorithms (standard Z-score, MAD-based Z-score, Normalized Chromosome Value and regression-based Z-score) are shown for (a) chromosome 13, (b) chromosome 18 and (c) chromosome 21. #Chi-squared corrected; not corrected; *peak corrected.

Effect of chi-squared-based variation reduction

To examine the performance of the χ2VR, we compared the control group CV using all other variation and prediction methods, with and without the χ2VR (Fig. 4). The χ2VR resulted in a lower CV in most analysis strategies for all chromosomes. The effect was most striking in chromosome 21, regardless of the other methods used.

Figure 4

Effect of chi-squared-based variation reduction on the CV of control samples. SOLiD (white) and Illumina data (black) with no other correction, or with a peak correction, or LOESS GC correction or both LOESS GC and peak correction. For each type of correction the CVs of four prediction algorithms (standard Z-score, MAD-based Z-score, Normalized Chromosome Value and regression-based Z-score) are shown for (a) chromosome 13, (b) chromosome 18 and (c) chromosome 21. not chi-squared corrected; #chi-squared corrected.

Effect of trisomy prediction algorithms

To examine the effect of the prediction algorithms (standard Z-score, MAD-based Z-score, NCV and RBZ), we compared the CV using uncorrected, χ2VR, LOESS GC, and combined χ2VR and LOESS GC corrected data. Since the peak correction provides no added value to the χ2VR, it was not used for comparison. The RBZ produced the lowest CV for all variation reduction methods except the SOLiD combined LOESS GC and χ2VR corrected data, in which the MAD-based Z-score for chromosome 13 produced an even lower CV (Fig. 5). The variation using the NCV is higher than that using the RBZ, but the CV is still much lower than the CVs of the methods that used all autosomal chromosomes. The standard Z-score had the highest coefficient of variation in all models.

Figure 5

Effect of the different prediction algorithms on the CV of control samples. SOLiD data (white) and Illumina data (black). Results from the four different prediction algorithms (standard Z-score, MAD-based Z-score, Normalized Chromosome Value and regression-based Z-score) are shown for (a) chromosome 13, (b) chromosome 18 and (c) chromosome 21. Variation was not reduced, #chi-squared corrected, “ LOESS GC corrected, #” both LOESS GC and chi-squared corrected before prediction.

A lower CV yields a more extreme Z-score, which means that in the case of a trisomy, the Z-score is more likely to be higher than the threshold, resulting in a higher sensitivity. The Z-scores of the trisomy samples of the four prediction algorithms for the uncorrected, χ2VR, LOESS GC, and combined χ2VR and LOESS GC corrected data are listed in Supplement 7. False-negative and false-positive results were determined for all the above combinations of variation reduction algorithms and prediction algorithms, based on a 99.7% confidence interval (Z-score threshold of three) (Supplement 8).

Of the 50 trisomic samples, a false-negative result was found in two trisomy 13 and three trisomy 18 samples for the Z-score or the MAD-based Z-score when no variation reduction was done. One confirmed trisomy 18 sample did not give a positive result with any combination of algorithms, possibly due to a low fetal percentage. No false-negatives were found for chromosome 21. For all true-positive results, all four RBZ models showed a Z-score higher than three.

To better show the effect of the different variation reduction and prediction algorithms on the Z-score, we selected three samples, sequenced on the SOLiD platform, each having a trisomy 13, 18 or 21 (Fig. 6). Based on the Z-scores and CVs, each sample had an estimated fetal percentage of 5–6%. The NCV and RBZ consistently yielded higher Z-scores than the standard Z-score and the MAD-based Z-score. The effect of the GC-correction is reflected in the results of the standard Z-score and the MAD-based Z-score for chromosome 13 and the effect of the χ2VR shows in the chromosome 21 results.

Figure 6

Z-scores for three trisomies using different combinations of variation reduction and prediction algorithms. All three examples are based on SOLiD data. Results from the four different prediction algorithms (standard Z-score, MAD-based Z-score, Normalized Chromosome Value, and regression-based Z-score), in combination with uncorrected, chi-squared corrected, LOESS GC corrected, and both LOESS GC and chi-squared corrected are shown for (a) chromosome 13, (b) chromosome 18 and (c) chromosome 21.

Of the 270 non-trisomy samples, four showed a false-positive result for more than one prediction algorithm. For one sample, all four prediction methods gave a result higher than three. The more sensitive NCV and RBZ prediction methods produced more false-positive results than the standard Z-score or MAD-based Z-score because more parameters are estimated, which leads to some overfitting and therefore underestimation of the prediction accuracy for new samples. This effect will be reduced when larger control groups are used. The other false-positive results were only seen with one of the variation reduction methods: one for NCV and three for RBZ. In all these cases, Z-scores were just above three, and adding or removing a variation reduction step resulted in a negative call. For samples having a false-positive RBZ result, at least one of the additional RBZ predictions resulted in a negative prediction, except for the sample with a Z-score higher than three in all prediction methods.

Match QC score

To examine whether the Match QC score could accurately predict whether a sample fits within a control group, we calculated the Match QC scores and all the Z-scores for a training set, a test set of samples that had been prepared in the same manner as the training set, and a third set of samples originating from single centrifuged plasma. For all three sets, we used uncorrected, χ2VR, LOESS GC and combined χ2VR- and LOESS GC-corrected data (Fig. 7). Test set samples had Match QC scores in the same range as the training set samples and Z-scores that fell within three SD of the mean for all types of corrected data. Single centrifuged samples, however, showed Match QC scores in the same range as the control group samples for uncorrected and χ2VR corrected data, but above the three-SD threshold for LOESS GC corrected data and combined LOESS GC- and χ2VR-corrected data.

Figure 7

Match QC scores and Z-scores for matching and non-matching samples. (a) Match QC scores per sample for uncorrected, chi-squared-corrected, LOESS GC-corrected and combined LOESS GC- and chi-squared-corrected data for the control group, matching samples, and non-matching samples. Chromosome 21 Z-scores for (b) uncorrected data, (c) chi-squared-corrected data, (d) LOESS GC-corrected data and (e) combined LOESS GC- and chi-squared-corrected data. + and black line, control group samples; ^ and green line, samples that underwent the same sample preparation procedure; ~ and red line, single-centrifuged plasma samples.

The Z-score distributions for the training set samples and the test set samples were indistinguishable for all correction methods, but Z-scores based on uncorrected or χ2VR-corrected data were not normally distributed for chromosomes 13 and 18. For the single-centrifuged samples, the Z-scores did not deviate from the normal distribution for the uncorrected chromosome 21 data. Match QC scores for all the samples analyzed, the thresholds, and the Z-score distributions for chromosomes 13, 18 and 21 are shown in Supplement 9.

Discussion

We show that both the χ2VR and the RBZ reduced the variability of the NIPT result and thus increased its sensitivity in both Illumina and SOLiD data. Furthermore, we show that a Match QC score exceeding a three-SD threshold, determined using control samples, identified those samples for which the controls were not representative. Although the algorithms described in this study were designed to improve the analysis of NIPT data, they may also be of use in similar analyses that require high sensitivity, such as copy number variation detection in liquid biopsy data16, 17.

The lower variability between samples decreases the percentage of fetal DNA needed for NIPT. A low percentage of fetal DNA is an important contributor to false-negative or inconclusive results18, and the average percentage of fetal DNA is lower in trisomy 13 and trisomy 18 pregnancies than in non-trisomy pregnancies19, 20. Low variability is therefore even more important for these pregnancies if the test is to have a high sensitivity. Moreover, our novel algorithms produce a lower variability for a given number of reads, so fewer reads are needed, which lowers sequencing costs. Alternatively, only DNA fragments originating from regions of interest could be selected21,22,23. However, such a selection requires additional amplification during sample preparation, which could itself create additional variation through increased over-dispersion24, 25. We therefore chose to reduce variation by correcting for bias in the read counts before analysis, leading to a more comparable distribution of reads over the chromosomes between samples. Other studies have shown that variability can be introduced by bias present in the data, such as GC-bias3, 5,6,7,8,9, or by peaks of extreme coverage, probably caused by repeats9. However, because more reads remain available, better results have been obtained using a non-repeat-masked reference genome5, 7. For this reason, we did not mask any regions based on mappability tracks or blacklisted regions in our comparison.

In our comparison, the lowest CVs for chromosomes 13, 18 and 21 were produced by combining the weighted-bin-based GC-correction method and the χ2VR with the RBZ. However, each variation reduction algorithm we tested reduced the variability when used alone. The effect of the peak variation reduction was small when combined with the χ2VR, which shows that the χ2VR corrects bias caused by regions of extreme coverage. Moreover, since the χ2VR focuses on the variation present in each specific bin, and not on chromosomal averages, it can correct variation that is too subtle for peak correction. Since no assumptions are made about the origin of the bias, no prior knowledge is needed for correction. However, when applying the χ2VR to the X chromosome, variability should be determined using only data from pregnancies with a female fetus, to prevent variability in the fetal percentage from adding to the total variability on that chromosome. After GC-correction was applied, the χ2VR reduced variation even further, suggesting that it corrects for sources of bias other than GC. Since up to 50% of the human genome is repetitive26, we suggest that part of this additionally corrected bias is due to repeat structures. It has also been suggested that biological factors contribute to bias in NIPT27, 28, so part of the corrected bias might have a biological origin.
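One plausible reading of the bin-wise χ2VR step described above is sketched below; the exact scaling rule and threshold are assumptions for illustration, not the published NIPTeR code. A chi-squared statistic is computed per bin across the control samples, and over-dispersed bins are scaled down toward their expected variation:

```python
import numpy as np

def chi_squared_vr(bin_counts, threshold=3.0):
    """Sketch of a chi-squared variation reduction (assumed form).

    bin_counts: (n_samples, n_bins) matrix of per-bin read counts
    from the control samples.
    """
    counts = np.asarray(bin_counts, dtype=float)
    n_samples = counts.shape[0]
    # Expected count per bin: the across-sample mean for that bin.
    expected = counts.mean(axis=0)
    # Per-bin chi-squared statistic over the control samples.
    chi2 = ((counts - expected) ** 2 / expected).sum(axis=0)
    df = n_samples - 1
    # Scale down bins whose variation clearly exceeds expectation;
    # leave well-behaved bins untouched.
    factor = np.where(chi2 > threshold * df,
                      df / np.maximum(chi2, 1e-12), 1.0)
    return counts * factor

# Middle bin is wildly over-dispersed and gets scaled down.
corrected = chi_squared_vr([[100, 400, 98], [102, 50, 101], [98, 250, 99]])
```

Because the statistic is computed per bin rather than per chromosome, this kind of correction needs no assumption about where the bias comes from, matching the argument above.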

Where peak correction and the χ2VR only remove reads to reduce variation, GC-correction removes reads from bins whose GC-percentage attracts more reads than average, but adds virtual reads to bins whose GC-percentage attracts fewer reads than average. Although more reads seem to be present for several chromosomes after GC-correction, the dispersion is still based on the original number of reads aligned to those chromosomes.
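The asymmetry described above, removing reads from over-represented GC strata while adding virtual reads to under-represented ones, can be sketched with a simple bin-based rescaling (an assumed form for illustration, not the published weighted-bin method):

```python
import numpy as np

def gc_correct(counts, gc_percent):
    """Sketch of a bin-based GC-correction (assumed form): each bin's
    count is rescaled by the ratio of the global mean count to the mean
    count of all bins sharing its (rounded) GC percentage. Bins in
    over-covered GC strata lose reads; bins in under-covered strata
    gain virtual reads.
    """
    counts = np.asarray(counts, dtype=float)
    gc = np.round(np.asarray(gc_percent, dtype=float)).astype(int)
    global_mean = counts.mean()
    corrected = counts.copy()
    for g in np.unique(gc):
        stratum = gc == g
        corrected[stratum] *= global_mean / counts[stratum].mean()
    return corrected

# Illustrative bins: GC-rich bins are over-covered before correction.
out = gc_correct([120, 118, 80, 82], [60, 60, 40, 40])
```

After this rescaling every GC stratum has the same mean coverage, which is why the corrected counts can exceed the physically sequenced reads; as noted above, the dispersion still reflects the original read numbers.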

We demonstrated that the choice of prediction method can also reduce variability and increase sensitivity. The RBZ resulted in the lowest variability and decreased the need for GC-correction because this method takes such systematic bias into account. However, there are potential pitfalls. As with the NCV, the prediction is based on a limited number of predictor chromosomes, so the effect of an aberration in one of the predictor chromosomes on the prediction is larger for the RBZ and the NCV than for the standard Z-score, which uses all autosomes for prediction. To limit the effect of possible aberrations, we recommend comparing four independent predictor sets for the RBZ; conflicting results between models are a warning of possible false-positive results. In our data, all 49 trisomies detected were predicted independently by the four RBZ prediction sets, and only one false-positive call was made by all four sets. This call was also made by all the other prediction methods, suggesting that a higher fraction of reads from the called chromosome may indeed be present in the data. Since the NCV can be based on a single denominator chromosome, we suggest that multiple predictions using different denominators also be used for the NCV.
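A minimal sketch of a regression-based Z-score of the kind described above, assuming an ordinary least-squares fit of the target chromosome fraction on a set of predictor chromosome fractions (the synthetic fractions below are illustrative, not real NIPT values):

```python
import numpy as np

def regression_z(sample_target, sample_preds, ctrl_target, ctrl_preds):
    """Sketch of a regression-based Z-score (assumed form): fit a linear
    model of the target chromosome fraction on predictor chromosome
    fractions in the controls, then express the sample's residual in
    units of the controls' residual SD."""
    ctrl_preds = np.asarray(ctrl_preds, dtype=float)
    X = np.column_stack([np.ones(len(ctrl_preds)), ctrl_preds])
    y = np.asarray(ctrl_target, dtype=float)
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid_sd = np.std(y - X @ beta, ddof=X.shape[1])
    predicted = beta[0] + np.dot(beta[1:], np.asarray(sample_preds, float))
    return (sample_target - predicted) / resid_sd

# Synthetic controls: target fraction depends linearly on two predictors.
rng = np.random.default_rng(1)
ctrl_preds = rng.normal(0.05, 0.001, size=(50, 2))
ctrl_target = (0.5 * ctrl_preds[:, 0] + 0.3 * ctrl_preds[:, 1]
               + rng.normal(0, 0.0001, size=50))
# An elevated sample fraction yields a Z-score well above three.
z = regression_z(0.045, [0.05, 0.05], ctrl_target, ctrl_preds)
```

Running such a fit with four disjoint predictor sets and comparing the resulting Z-scores mirrors the consistency check recommended above: a trisomy should be called by all sets, and disagreement flags a possible false positive.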

Our results show that a Match QC score below the three-SD threshold does not guarantee that the control group is representative of a sample, but a score exceeding the threshold does indicate that the analysis is not accurate. The central assumption in NIPT analysis is that the control set is representative of the sample analyzed: a non-representative control set leads to an inaccurate prediction and possibly to false-positive or false-negative results. It is therefore important that all samples undergo the same preparation, sequencing procedure and bioinformatics analysis. However, even when standard procedures are used, bias can vary between sequencing runs29. Prediction methods with a higher sensitivity are more vulnerable to the effects of unaccounted-for biological variation, because deviations from the expected chromosomal fractions will more quickly lead to false-positive results. Sample quality metrics are therefore essential for reliable analysis.

Our study shows that both the χ2VR and the RBZ increase the sensitivity of NIPT compared to previously published methods. Furthermore, we show that the Match QC score identifies samples for which the non-trisomy control set is not representative. These algorithms may also have broader applicability beyond NIPT, for instance in the analysis of copy number variations in liquid biopsy data. We recommend our novel algorithms, as implemented in the NIPTeR package, as a useful addition to the NIPT analysis toolbox; the resulting higher sensitivity makes it possible, in theory, to detect trisomies in blood with a fetal DNA fraction as low as 2%.