Abstract
Oxford Nanopore Technologies’ (ONT) MinION device is capable of reading single molecule DNA strands tens of thousands of bases long, by passing a strand through a nanopore and recording the changes in electric current. The error rate of the platform is higher than most mature next-generation sequencing (NGS) platforms, with many of the deletions accumulating in stretches of identical bases (homopolymers). However, the mean time each 5-base long subsequence (k-mer) of the molecule spends inside of the pore (dwell time) can also be used to infer the length of the true sequence. We developed a method called NanoTimer, which estimates the homopolymer length from the dwell times. It relies on the redundancy of having multiple reads covering a reference sequence, and the depth of coverage determines its accuracy.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
1. Mikheyev AS, Tin MM (2014) A first look at the Oxford Nanopore MinION sequencer. Mol Ecol Resour 14(6): p. 1097-102. DOI 10.1111/1755-0998.12324
2. Cherf GM, Lieberman KR, Rashid H, Lam CE, Karplus K, Akeson M (2012) Automated Forward and Reverse Ratcheting of DNA in a Nanopore at Five Angstrom Precision. Nat Biotechnol 2012 Apr; 30(4): 344-348 DOI 10.1038/nbt.2147
3. Boza V, Brejova H, (2016) DeepNano: Deep Recurrent Neural Networks for Base Calling in MinION Nanopore Reads. arXiv, DOI arXiv:1603.09195
4. Brown C: Inside the skunkworx. In: London Calling. Oxford Nanopore Technologies. 2016. https://londoncallingconf.co.uk/lc/2016-plenary#168687629
5. Qiao W, et al. (2016) Long-Read Single Molecule Real-Time Full Gene Sequencing of Cytochrome P450-2D6. Hum Mutat. 2016;37:315–23. DOI 10.1002/humu.22936
6. Loman N. J., Quick J., Simpson J.T. (2015) A complete bacterial genome assembled de novo using only nanopore sequencing data. Nat Methods. 2015;12:733–5. DOI 10.1038/nmeth.3444
7. Stoiber M, Brown J. B. et al (2016) De novo Identification of DNA Modifications Enabled by Genome-Guided Nanopore Signal Processing. Biorxiv DOI 10.1101/094672
8. NA12878 Human Reference on Oxford Nanopore MinION at https://github.com/nanopore-wgs-consortium/NA12878
9. Zook JM et al, (2014) Integrating human sequence data sets provides a resource of benchmark SNP and indel genotype calls. Nature Biotechnology 32, 246-251. DOI 10.1038/nbt.2835
10. GRCh38 reference genome ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/technical/reference/GRCh38_reference_genome/
11. Li, H. (2013) Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv:1303.3997v1
12. Robinson JT, Thorvaldsdóttir H, Winckler W, Guttman M, Lander ES, Getz G, Mesirov JP (2011) Integrative Genomics Viewer. Nature Biotechnology 29, 24–26. DOI 10.1038/nbt.1754.
13. Ammar, R, Paton, T.A., Torti, D., Shlien, A. (2015) Long read nanopore sequencing for detection of HLA and CYP2D6 variants and haplotypes. F1000Research DOI:10.12688/f1000research.6037.1
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Sarkozy, P., Jobbágy, Á., Antal, P. (2018). Calling Homopolymer Stretches from Raw Nanopore Reads by Analyzing k-mer Dwell Times. In: Eskola, H., Väisänen, O., Viik, J., Hyttinen, J. (eds) EMBEC & NBC 2017. EMBEC NBC 2017 2017. IFMBE Proceedings, vol 65. Springer, Singapore. https://doi.org/10.1007/978-981-10-5122-7_61
Download citation
DOI: https://doi.org/10.1007/978-981-10-5122-7_61
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-5121-0
Online ISBN: 978-981-10-5122-7
eBook Packages: EngineeringEngineering (R0)