Skip to main content

From Combinatorial Optimization to Data Mining

  • Chapter
  • First Online:
Book cover Journeys to Data Mining

Abstract

My work in data mining started almost immediately after my Ph.D. was completed and was initially unrelated to my Ph.D. thesis, until more recently, when I started working in the field of graph mining. At the time I started working in data mining, the field was still in its infancy. I had graduated in 1996 from MIT in the field of combinatorial optimization and network flows and was mostly interested in problems of a theoretical nature. I had joined IBM Research, which, at the time, contained some of the strongest researchers in the field of data mining, including some of its key founding fathers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. S. Berchtold, C. Bohm, H.-P. Kriegel, The pyramid-technique: towards breaking the curse of dimensionality, in ACM SIGMOD Conference, 1998

    Google Scholar 

  2. A.K. Jain, R.C. Dubes, Algorithms for Clustering Data (Prentice Hall, Upper Saddle River, NJ, 1988)

    MATH  Google Scholar 

  3. C.C. Aggarwal, Re-designing distance functions and distance-based applications for high dimensional data, in ACM SIGMOD Record, March 2001

    Google Scholar 

  4. A. Hinneburg, C.C. Aggarwal, D.A. Keim, What is the nearest neighbor in high dimensional space? in VLDB Conference, 2000

    Google Scholar 

  5. C.C. Aggarwal, P.S. Yu, The IGrid index: reversing the dimensionality curse for similarity indexing in high dimensional space, in ACM KDD Conference, 2000

    Google Scholar 

  6. C.C. Aggarwal, A. Hinneburg, D.A. Keim, On the surprising behavior of distance metrics in high dimensional space, in ICDT Conference, 2010

    Google Scholar 

  7. C.C. Aggarwal, Towards systematic design of distance functions for data mining applications, in ACM KDD Conference, 2003

    Google Scholar 

  8. C.C. Aggarwal, On the effects of dimensionality reduction on high dimensional similarity search, in ACM PODS Conference, 2001

    Google Scholar 

  9. C.C. Aggarwal, P.S. Yu, Finding generalized projected clusters in high dimensional spaces, in ACM SIGMOD Conference, 2000

    Google Scholar 

  10. C.C. Aggarwal, P.S. Yu, Outlier detection for high dimensional data, in ACM SIGMOD Conference, 2001

    Google Scholar 

  11. C.C. Aggarwal, J. Han, J. Wang, P. Yu, A framework for clustering evolving data streams, in VLDB Conference, 2003

    Google Scholar 

  12. C.C. Aggarwal, J. Han, J. Wang, P. Yu, On demand classification of data streams, in ACM KDD Conference, 2004

    Google Scholar 

  13. C.C. Aggarwal, On abnormality detection in spuriously populated data streams, in SDM Conference, 2005

    Google Scholar 

  14. C.C. Aggarwal, On biased reservoir sampling in the presence of stream evolution, in VLDB Conference, 2006

    Google Scholar 

  15. C.C. Aggarwal, Data Streams: Models and Algorithms (Springer, New York, 2007)

    Google Scholar 

  16. C.C. Aggarwal, P.S. Yu, On static and dynamic methods for condensation-based privacy-preserving data mining. ACM Trans. Database Syst. 33(1), 1–39 (2008)

    Article  Google Scholar 

  17. C.C. Aggarwal, P.S. Yu, Privacy-Preserving Data Mining: Models and Algorithms (Springer, New York, 2008)

    Google Scholar 

  18. C.C. Aggarwal, Managing and Mining Uncertain Data (Springer, New York, 2009)

    Google Scholar 

  19. C.C. Aggarwal, Managing and Mining Graph Data (Springer, New York, 2010)

    Google Scholar 

  20. C.C. Aggarwal, Social Network Data Analytics (Springer, New York, 2011)

    Google Scholar 

  21. R. Agrawal, R. Srikant, Privacy-preserving data mining, in ACM SIGMOD Conference, 2000

    Google Scholar 

  22. D. Agrawal, C.C. Aggarwal, On the design and quantification of privacy-preserving data mining algorithms, in ACM PODS Conference, 2001

    Google Scholar 

  23. C.C. Aggarwal, On k-anonymity and the curse of dimensionality, in VLDB Conference, 2005

    Google Scholar 

  24. C.C. Aggarwal, On randomization, public information, and the curse of dimensionality, in ICDE Conference, 2007

    Google Scholar 

  25. C.C. Aggarwal, On the design and quantification of privacy-preserving data mining algorithms, in ICDE Conference, 2009

    Google Scholar 

  26. C.C. Aggarwal, P.S. Yu, A framework for clustering uncertain data streams, in ICDE Conference, 2008

    Google Scholar 

  27. C.C. Aggarwal, P.S. Yu, Outlier detection with uncertain data, in ICDE Conference, 2008

    Google Scholar 

  28. C.C. Aggarwal, On density-based transforms for uncertain data mining, in ICDE Conference, 2007

    Google Scholar 

  29. C.C. Aggarwal, Y. Li, P. Yu, R. Jin, On dense pattern mining in graph streams, in VLDB Conference, 2010

    Google Scholar 

  30. C.C. Aggarwal, Y. Zhao, P. Yu, Outlier detection in graph streams, in ICDE Conference, 2010

    Google Scholar 

  31. C.C. Aggarwal, Y. Li, J. Wang, J. Wang, Frequent pattern mining with uncertain data, in ACM KDD Conference, 2009

    Google Scholar 

  32. R. Agrawal, T. Imielinski, A. Swami, Mining association rules between sets of items in large databases, in SIGMOD Conference, 1993

    Google Scholar 

  33. J.B. Orlin, A polynomial time primal network simplex algorithm for minimum cost flows. Math. Program 77, 109–129 (1997)

    MathSciNet  Google Scholar 

  34. R.J. Bayardo Jr., Efficiently mining long patterns from databases, in ACM SIGMOD Conference, 1998

    Google Scholar 

  35. J. Han, J. Pei, Y. Yin, Mining frequent patterns without candidate generation, in ACM SIGMOD Conference, 2000

    Google Scholar 

  36. S. Brin, L. Page, The anatomy of a large scale hypertextual engine, in WWW Conference, 1998

    Google Scholar 

  37. R.K. Ahuja, T.L. Magnanti, J.B. Orlin, Network Flows: Theory, Algorithms and Applications (Prentice Hall, Englewood Cliffs, NJ, 1992)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Charu C. Aggarwal .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Aggarwal, C.C. (2012). From Combinatorial Optimization to Data Mining. In: Gaber, M. (eds) Journeys to Data Mining. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28047-4_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-28047-4_3

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-28046-7

  • Online ISBN: 978-3-642-28047-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics