Skip to main content

Fingerprint Clustering with Bounded Number of Missing Values

  • Conference paper
  • 898 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4009))

Abstract

The problem of clustering fingerprint vectors with missing values is an interesting problem in Computational Biology that has been proposed in [6]. In this paper we show some improvements in closing the gaps between the known lower bounds and upper bounds on the approximability of variants of the biological problem. Moreover, we have studied two additional variants of the original problem. We prove that all such problems are APX-hard even when each fingerprint contains only two unknown positions and we present a greedy algorithm that has constant approximation factors for these variants. Despite the hardness of these restricted versions of the problem, we show that the general clustering problem on an unbounded number of missing values such that they occur for every fixed position of an input vector in at most one fingerprint is polynomial time solvable.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Alimonti, P., Kann, V.: Some APX-completeness results for cubic graphs. Theoretical Computer Science 237(1–2), 123–134 (2000)

    Article  MathSciNet  MATH  Google Scholar 

  2. Ausiello, G., Crescenzi, P., Gambosi, G., Kann, V., Marchetti-Spaccamela, A., Protasi, M.: Complexity and Approximation: Combinatorial optimization problems and their approximability properties. Springer, Heidelberg (1999)

    MATH  Google Scholar 

  3. Drmanac, R.: cDNA screening by array hybridization. Meth. in Enzym. 303, 165–178 (1999)

    Article  Google Scholar 

  4. Drmanac, S., Drmanac, R.: Processing of cDNA and genomic kilobase-size clones for massive screening mapping and sequencing by hybridization. Biotechn. 17, 328–336 (1994)

    Google Scholar 

  5. Drmanac, S., Stavropoulos, N., Labat, I., Vonau, J., Hauser, B., Soares, M., Drmanac, R.: Gene-representation cDNA clusters defined by hybridization of 57 419 clones from infant brain libraries with short oligonucleotide probes. Genomics 37, 29–40 (1996)

    Article  Google Scholar 

  6. Figueroa, A., Borneman, J., Jiang, T.: Clustering binary fingerprint vectors with missing values for DNA array data analysis. Journal of Computational Biology 11(5), 887–901 (2004)

    Article  Google Scholar 

  7. Figueroa, A., Goldstein, A., Jiang, T., Kurowski, M., Lingas, A., Persson, M.: Approximate clustering of fingerprint vectors with missing values. In: Proc. 11th Computing: The Australasian Theory Symposium (CATS). CRPIT, vol. 41, pp. 57–60 (2005)

    Google Scholar 

  8. Valinsky, L., Della Vedova, G., Jiang, T., Borneman, J.: Oligonucleotide fingerprinting of rRNA genes for analysis of fungal community composition. Applied and Environmental Microbiology 68(12), 5999–6004 (2002)

    Article  Google Scholar 

  9. Valinsky, L., Della Vedova, G., Scupham, A., Alvey, S., Figueroa, A., Yin, B., Hartin, R., Chrobak, M., Crowley, D., Jiang, T., Borneman, J.: Analysis of bacterial microbial community composition by oligonucleotide fingerprinting of rRNA genes. Applied and Environmental Microbiology 68(7), 3243–3250 (2002)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bonizzoni, P., Della Vedova, G., Dondi, R., Mauri, G. (2006). Fingerprint Clustering with Bounded Number of Missing Values. In: Lewenstein, M., Valiente, G. (eds) Combinatorial Pattern Matching. CPM 2006. Lecture Notes in Computer Science, vol 4009. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11780441_11

Download citation

  • DOI: https://doi.org/10.1007/11780441_11

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-35455-0

  • Online ISBN: 978-3-540-35461-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics