Skip to main content

Part of the book series: Mathematics and Visualization ((MATHVISUAL))

Summary

Genomes of hundreds of species have been sequenced to date, and many more are being sequenced. As more and more sequence data sets become available, and as the challenge of comparing these massive “billion basepair DNA sequences” becomes substan- tial, so does the need for more powerful tools supporting the exploration of these data sets. Similarity score data used to compare aligned DNA sequences is inherently one-dimensional. One-dimensional (1D) representations of these data sets do not effectively utilize screen real estate. As a result, tools using 1D representations are incapable of providing informatory overview for extremely large data sets. We present a technique to arrange 1D data in 3D space to allow us to apply state-of-the-art interactive volume visualization techniques for data exploration. We demonstrate our technique using multi-millions-basepair-long aligned DNA sequence data and compare it with traditional 1D line plots. The results show that our tech- nique is superior in providing an overview of entire data sets. Our technique, coupled with 1D line plots, results in effective multi-resolution visualization of very large aligned sequence data sets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Dan Ashlock and Jim Golden. Ch.11 Evolutionary Computation and Fractal Visualization of Sequence Data. Morgan Kaufmann, 2002.

    Google Scholar 

  2. Bob Beretta, Pat Brown, Matt Craighead, Cass Everitt, Evan Hart, Jon Leech, Bill Licea-Kane, Bimal Poddar, Jeremy Sandmel, Jon Paul Schelter, Avinash Seetharamaiah, and Nick Triantos. GL_ARB_fragment_program Specification. Online OpenGL Extension Registry, August 2003.

    Google Scholar 

  3. Michael Brudno, Chuong Do, Gregory Cooper, Michael F. Kim, Eugene Davydov, Eric D. Green, Arend Sidow, and Serafim Batzoglou. Lagan and multi-lagan: efficient tools for large-scale multiple alignment of genomic DNA. Genome Research, 13(4):721–731, 2003.

    Article  Google Scholar 

  4. Hsuan T. Chang, Neng-Wen Lo, Wei C. Lu, and Chung J. Kuo. Visualization and comparison of DNA sequences by use of three-dimensional trajectories. In Proceedings of the First Asia-Pacific Bioinformatics Conference on Bioinformatics 2003, pages 81–85, Adelaide, Australia, 2003.

    Google Scholar 

  5. Ed Huai-hsin Chi, Phillip Barry, Elizabeth Shoop, John Carlis, Ernest Retzel, and John Riedl. Visualization of biological sequence similarity search results. In Gregory M. Nielson and Deborah Silver, editors, Proceedings of IEEE Visualization 1995, IEEE Visualization, Annual Conference Series, pages 44–51, Atlanta, USA, 1995. IEEE, IEEE Computer Society Press.

    Google Scholar 

  6. Ed Huai-hsin Chi, John Riedl, Elizabeth Shoop, John V. Carlis, Ernest Retzel, and Phillip Barry. Flexible information visualization of multivariate data from biological sequence similarity searches. In Roni Yagel and Gregory M. Nielson, editors, Proceedings of IEEE Visualization 1996, IEEE Visualization, Annual Conference Series, pages 133–140, San Francisco, USA, 1996. IEEE, IEEE Computer Society Press.

    Google Scholar 

  7. Kelly A. Frazer, Laura Elnitski, Deanna M. Church, Inna Dubchak, and Ross C. Hardison. Cross-species sequence comparisons: A review of methods and available resources. Genome Research, 13(1):1–12, 2003.

    Article  Google Scholar 

  8. Berthold Göttgens, Linda M. Barton, Michael A. Chapman, Angus M. Sinclair, Bjarne Knudsen, Darren Grafham, James G.R. Gilbert, Jane Rogers, David R. Bentley, and Anthony R. Green. Transcriptional regulation of the stem cell leukemia gene (scl) comparative analysis of five vertebrate scl loci. Genome Research, 12(5):749–759, 2002.

    Article  Google Scholar 

  9. Berthold Göttgens, James G R. Gilbert, Linda M. Barton, Darren Grafham, Jane Rogers, David R. Bentley, and Anthony R. Green. Long-range comparison of human and mouse scl loci: Localized regions of sensitivity to restriction endonucleases correspond precisely with peaks of conserved noncoding sequences. Genome Research, 11(1):87–97, 2001.

    Article  Google Scholar 

  10. W. Heidrich, M. McCool, and J. Stevens. Interactive maximum projection volume rendering. In Gregory M. Nielson and Deborah Silver, editors, Proceedings of IEEE Visualization 1995, IEEE Visualization, Annual Conference Series, pages 11–18, Atlanta, USA, 1995. IEEE, IEEE Computer Society Press.

    Chapter  Google Scholar 

  11. T. Hubbard, D. Barker, E. Birney1, G. Cameron, Y. Chen, L. Clark, T. Cox, J. Cuff, V. Curwen, T. Down, R. Durbin, E. Eyras, J. Gilbert, M. Hammond, L. Huminiecki, A. Kasprzyk, H. Lehvaslaiho, P. Lijnzaad, C. Melsopp, E. Mongin, R. Pettett, M. Pocock, S. Potter, A. Rust, E. Schmidt, S. Searle, G. Slater, J. Smith, W. Spooner, A. Stabenau, J. Stalker, E. Stupka, A. Ureta-Vidal, I. Vastrik, and M. Clamp. The ensembl genome database project. Nucleic Acids Research, 30(1):38–41, 2002.

    Article  Google Scholar 

  12. H. Joel Jeffrey. Chaos game representation of gene structure. Nucleic Acids Research, 18(8):2163–2170, 1990.

    Article  Google Scholar 

  13. Daniel A. Keim, Mihael Ankerst, and Hans-Peter Kriegel. Recursive pattern: A technique for visualizing very large amounts of data. In Gregory M. Nielson and Deborah Silver, editors, Proceedings of IEEE Visualization 1995, IEEE Visualization, Annual Conference Series, pages 279–288, Atlanta, Georgia, 1995. IEEE, IEEE Computer Society.

    Chapter  Google Scholar 

  14. W. James Kent, Charles W. Sugnet, Terrence S. Furey, Krishna M. Roskin, Tom H. Pringle, Alan M. Zahler, and David Haussler. The human genome browser at UCSC. Genome Research, 12(6):996–1006, 2002.

    Google Scholar 

  15. Nelson L. Max. Visualizing Hilbert curves. IEEE visualization 1998. In David S. Ebert, Holly Rushmeier, and Hans Hagen, editors, Proceedings of IEEE Visualization 1998, IEEE Visualization, Annual Conference Series, pages 447–450, North Carolina, USA, 1998. IEEE, IEEE Computer Society Press.

    Google Scholar 

  16. C. Mayor, M. Brudno, J. R. Schwartz, A. Poliakov, E. M. Rubin, Kelly A. Frazer, Lior S. Pachter, and Inna Dubchak. VISTA: visualizing global DNA sequence alignments of arbitrary length. Bioinformatics, 16(11):1046–1047, 2000.

    Article  Google Scholar 

  17. Hans Sagan. Space-Filling Curves. Springer-Verlag, 1994.

    Google Scholar 

  18. S. Schwartz, L. Elnitski, M. Li, M. Weirauch, C. Riemer, A. Smit, E. D. Green, R. C. Hardison, W. Miller, and NISC Comparative Sequencing Program. Multipipmaker and supporting tools: Alignments and analysis of multiple genomic DNA sequences. Nucleic Acids Research, 31(13):3518–3524, 2003.

    Google Scholar 

  19. Nameeta Shah, Olivier Couronne, Len A. Pennacchio, M. Brudno, Serafim Batzoglou, E. W. Bethel, E. M. Rubin, Bernd Hamann, and Inna Dubchak. Phylo-vista: interactive visualization of multiple DNA sequence alignments. Bioinformatics, 20(5):636–643, 2004.

    Article  Google Scholar 

  20. Ben Shneiderman. The eyes have it: A task by data type taxonomy for information visualizations. In Proceedings of IEEE Symposium on Visual Languages 1996, pages 336–343. IEEE Computer Society, 1996.

    Google Scholar 

  21. James Slack, Kristian Hildebrand, Tamara Munzner, and Katherine St. John. Sequence-juxtaposer: Fluid navigation for large-scale sequence comparison in context. In Robert Giegerich and Jens Stoye, editors, German Conference on Bioinformatics, volume 53 of LNI. GI, 2004.

    Google Scholar 

  22. Rhazes Spell, Rachael Brady, and Fred Dietrich. BARD: A visualization tool for biological sequence analysis. In Tamara Munzner and Stephen North, editors, IEEE Symposium on Information Visualization, 2003, pages 219–226. IEEE Computer Society, 2003.

    Google Scholar 

  23. Praveen Thiagarajan and Guang Gao. Visualizing biosequence data using texture mapping. In Pak Chung Wong and Keith Andrews, editors, IEEE Symposium on Information Visualization, 2002, pages 103–109. IEEE Computer Society Press, 2002.

    Google Scholar 

  24. J. W. Thomas, J. W. Touchman, R. W. Blakesley, G. G. Bouffard, S. M. Beckstrom-Sternberg, E. H. Margulies, M. Blanchette, A. C. Siepel, P. J. Thomas, J. C. Mcdowell, B. Maskeri, N. F. Hansen, M. S. Schwartz, R. J. Weber, W. J. Kent, D. Karolchik, T. C. Bruen, R. Bevan, D. J. Cutler, S. Schwartz, L. Elnitski, J. R. Idol, A. B. Prasad, S.-Q Lee-Lin, V. V. B. Maduro, T. J. Summers, M. E. Portnoy, N. L. Dietrich, N. Akhter, K. Ayele, B. Benjamin, K. Cariaga, C. P. Brinkley, S. Y. Brooks, S. Granite, X. Guan, J. Gupta, P. Haghighi, S.-L Ho, M. C. Huang, E. Karlins, P. L. Laric, R. Legaspi, M. J. Lim, Q. L. Maduro, C. A. Masiello, S. D. Mastrian, J. C. Mccloskey, R. Pearson, S. Stantripop, E. E. Tiongson, J. T. Tran, C. Tsurgeon, J. L. Vogt, M. A. Walker, K. D. Wetherby, L. S. Wiggins, A. C. Young, L.-H Zhang, K. Osoegawa, B. Zhu, B. Zhao, C. L. Shu, P. J. De Jong, C. E. Lawrence, A. F. Smit, A. Chakravarti, D. Haussler, P. Green, W. Miller, and E. D. Green. Comparative analyses of multi-species sequences from targeted genomic regions. Nature, 424(14):788–793, 2003.

    Article  Google Scholar 

  25. Douglas Voorhies. Space-filling curves and a measure of coherence. In James Arvo, editor, Graphics Gems II, Graphics Gems, pages 26–30. Academic Press, 1991.

    Google Scholar 

  26. David L. Wheeler, Colombe Chappey, Alex E. Lash, Detlef D. Leipe, Thomas L. Madden, Gregory D. Schuler, Tatiana A. Tatusova, and Barbara A. Rapp. Database resources of the national center for biotechnology information. Nucleic Acids Research, 28(1):10–14, 2000.

    Article  Google Scholar 

  27. Pak Chung Wong, Kwong Kwok Wong, Harlan Foote, and Jim Thomas. Global visualization and alignment of whole bacterial genomes. IEEE Transactions on Visualization and Computer Graphics, 9(3):361–377, 2003.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Shah, N., Dillard, S.E., Weber, G.H., Hamann, B. (2009). Volume Visualization of Multiple Alignment of Large Genomic DNA. In: Möller, T., Hamann, B., Russell, R.D. (eds) Mathematical Foundations of Scientific Visualization, Computer Graphics, and Massive Data Exploration. Mathematics and Visualization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/b106657_17

Download citation

Publish with us

Policies and ethics