Skip to main content

Efficient Calculation of Interval Scores for DNA Copy Number Data Analysis

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 3500))

Abstract

Background. DNA amplifications and deletions characterize cancer genome and are often related to disease evolution. Microarray based techniques for measuring these DNA copy-number changes use fluorescence ratios at arrayed DNA elements (BACs, cDNA or oligonucleotides) to provide signals at high resolution, in terms of genomic locations. These data are then further analyzed to map aberrations and boundaries and identify biologically significant structures.

Methods. We develop a statistical framework that enables the casting of several DNA copy number data analysis questions as optimization problems over real valued vectors of signals. The simplest form of the optimization problem seeks to maximize \(\varphi (I) = \sum v_i/\sqrt{|I|}\) over all subintervals I in the input vector. We present and prove a linear time approximation scheme for this problem. Namely, a process with time complexity O( − 2) that outputs an interval for which ϕ(I) is at least Opt/α(ε), where Opt is the actual optimum and α(ε) → 1 as ε → 0. We further develop practical implementations that improve the performance of the naive quadratic approach by orders of magnitude. We discuss properties of optimal intervals and how they apply to the algorithm performance.

Examples. We benchmark our algorithms on synthetic as well as publicly available DNA copy number data. We demonstrate the use of these methods for identifying aberrations in single samples as well as common alterations in fixed sets and subsets of breast cancer samples.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Balsara, B.R., Testa, J.R.: Chromosomal imbalances in human lung cancer. Oncogene 21(45), 6877–6883 (2002)

    Article  Google Scholar 

  2. Barrett, M.T., Scheffer, A., Ben-Dor, A., Sampas, N., Lipson, D., Kincaid, R., Tsang, P., Curry, B., Baird, K., Meltzer, P.S., Yakhini, Z., Bruhn, L., Laderman, S.: Comparative genomic hybridization using oligonucleotide microarrays and total genomic DNA. PNAS 101(51), 17765–17770 (2004)

    Article  Google Scholar 

  3. Bignell, G., Huang, J., Greshock, J., Watt, S., Butler, A., West, S., Grigorova, M., Jones, K., Wei, W., Stratton, M., Futreal, P., Weber, B., Shapero, M., Wooster, R.: High-resolution analysis of DNA copy number using oligonucleotide microarrays. Genome Research 14(2), 287–295 (2004)

    Article  Google Scholar 

  4. Brennan, C., Zhang, Y., Leo, C., Feng, B., Cauwels, C., Aguirre, A.J., Kim, M., Protopopov, A., Chin, L.: High-resolution global profiling of genomic alterations with long oligonucleotide microarray. Cancer Research 64(14), 4744–4748 (2004)

    Article  Google Scholar 

  5. DeGroot, M.H.: Probability and Statistics, ch. 5.7, p. 275. Addison-Wesley, Reading (1989)

    Google Scholar 

  6. Sebat, J., et al.: Large-scale copy number polymorphism in the human genome. Science 305(5683), 525–528 (2004)

    Article  Google Scholar 

  7. Feller, W.: An Introduction to Probability Theory and Its Applications, ch. VII.6, vol. I, p. 193. John Wiley & Sons, Chichester (1970)

    Google Scholar 

  8. Fu, K.S., Mui, J.K.: A survey of image segmentation. Pattern Recognition 13(1), 3–16 (1981)

    Article  MathSciNet  Google Scholar 

  9. Himberg, J., Korpiaho, K., Mannila, H., Tikanmki, J., Toivonen, H.T.T.: Time series segmentation for context recognition in mobile devices. In: Proceedings of the 2001 IEEE International Conference on Data Mining (ICDM 2001), pp. 203–210 (2001)

    Google Scholar 

  10. Hyman, E., Kauraniemi, P., Hautaniemi, S., Wolf, M., Mousses, S., Rozenblum, E., Ringner, M., Sauter, G., Monni, O., Elkahloun, A., Kallioniemi, O.P., Kallioniemi, A.: Impact of DNA amplification on gene expression patterns in breast cancer. Cancer Research 62, 6240–6245 (2002)

    Google Scholar 

  11. Kallioniemi, O.P., Kallioniemi, A., Sudar, D., Rutovitz, D., Gray, J.W., Waldman, F., Pinkel, D.: Comparative genomic hybridization: a rapid new method for detecting and mapping DNA amplification in tumors. Semin Cancer Biol. 4(1), 41–46 (1993)

    Google Scholar 

  12. Lipson, D., Ben-Dor, A., Dehan, E., Yakhini, Z.: Joint analysis of DNA copy numbers and expression levels. In: Jonassen, I., Kim, J. (eds.) WABI 2004. LNCS (LNBI), vol. 3240, pp. 135–146. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  13. Mertens, F., Johansson, B., Hoglund, M., Mitelman, F.: Chromosomal imbalance maps of malignant solid tumors: a cytogenetic survey of 3185 neoplasms. Can. Res. 57, 2765–2780 (1997)

    Google Scholar 

  14. Morley, M., Molony, C., Weber, T., Devlin, J., Ewens, K., Spielman, R., Cheung, V.: Genetic analysis of genome-wide variation in human gene expression. Nature 430(7001), 743–747 (2004)

    Article  Google Scholar 

  15. Hupe, P., Stransky, N., Thiery, J.P., Radvanyi, F., Barillot, E.: Analysis of array CGH data: from signal ratio to gain and loss of DNA regions. Bioinformatics (2004) (Epub ahead of print)

    Google Scholar 

  16. Pinkel, D., Segraves, R., Sudar, D., Clark, S., Poole, I., Kowbel, D., Collins, C., Kuo, W.L., Chen, C., Zhai, Y., Dairkee, S.H., Ljung, B.M., Gray, J.W., Albertson, D.G.: High resolution analysis of DNA copy number variation using comparative genomic hybridization to microarrays. Nature Genetics 20(2), 207–211 (1998)

    Article  Google Scholar 

  17. Platzer, P., Upender, M.B., Wilson, K., Willis, J., Lutterbaugh, J., Nosrati, A., Willson, J.K., Mack, D., Ried, T., Markowitz, S.: Silence of chromosomal amplifications in colon cancer. Cancer Research 62(4), 1134–1138 (2002)

    Google Scholar 

  18. Pollack, J.R., Perou, C.M., Alizadeh, A.A., Eisen, M.B., Pergamenschikov, A., Williams, C.F., Jeffrey, S.S., Botstein, D., Brown, P.O.: Genome-wide analysis of DNA copy-number changes using cDNA microarrays. Nature Genetics 23(1), 41–46 (1999)

    Article  Google Scholar 

  19. Pollack, J.R., Sorlie, T., Perou, C.M., Rees, C.A., Jeffrey, S.S., Lonning, P.E., Tibshirani, R., Botstein, D., Borresen-Dale, A., Brown, P.O.: Microarray analysis reveals a major direct role of DNA copy number alteration in the transcriptional program of human breast tumors. PNAS 99(20), 12963–12968 (2002)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lipson, D., Aumann, Y., Ben-Dor, A., Linial, N., Yakhini, Z. (2005). Efficient Calculation of Interval Scores for DNA Copy Number Data Analysis. In: Miyano, S., Mesirov, J., Kasif, S., Istrail, S., Pevzner, P.A., Waterman, M. (eds) Research in Computational Molecular Biology. RECOMB 2005. Lecture Notes in Computer Science(), vol 3500. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11415770_6

Download citation

  • DOI: https://doi.org/10.1007/11415770_6

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-25866-7

  • Online ISBN: 978-3-540-31950-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics