Read Depth Analysis to Identify CNV in Bacteria Using CNOGpro
Whole-genome sequencing with short-read technologies is well suited for calling single nucleotide polymorphisms, but has major problems with the detection of structural variants larger than the read length. One such type of variation is copy number variation (CNV), which entails deletion or duplication of genomic regions, and the expansion or contraction of repeated elements. Duplicated and deleted regions will typically be collapsed during de novo assembly of sequence data, or ignored when mapping reads toward a reference. However, signatures of the copy number variation can be detected in the resultant read depth at each position in the genome. We here provide instructions on how to analyze this read depth signal with the R package CNOGpro, allowing for estimation of copy numbers with uncertainty for each feature in a genome.
Key wordsRead depth CNV Bacteria CNOGpro Coverage Whole-genome sequencing
- 15.Li H (2013) Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv:1303.3997 [q-bio.GN]Google Scholar