Journal of Mathematical Biology, Volume 67, Issue 2, pp 389–410

Length distribution of sequencing by synthesis: fixed flow cycle model

Article

Abstract

Sequencing by synthesis is the underlying technology for many next-generation DNA sequencing platforms. We developed a new model, the fixed flow cycle model, to derive the distribution of sequence length for a given number of flow cycles under general conditions in which nucleotide incorporation is probabilistic and may be incomplete, as in some single-molecule sequencing technologies. Unlike the previous model, the new model yields the probability distribution for the sequence length. Explicit closed-form formulas are derived for the mean and variance of the distribution.

Keywords

Sequencing by synthesis; Next-generation sequencing; Sequence analysis; Generating function; Probability; Combinatorics

Mathematics Subject Classification (2000)

05A15 60C05 92D99 

Copyright information

© Springer-Verlag 2012

Authors and Affiliations

  1. Department of Molecular Biophysics and Biochemistry, W.M. Keck Foundation Biotechnology Resource Laboratory, Yale University, New Haven, USA
