Skip to main content

How to Analyze 1 Billion CDRs per Sec on $200K Hardware

  • Conference paper
  • 200 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1819))

Abstract

Modern telecommunication systems generate large amounts of data such as details of network traffic and service usage (call detail records). Amongst other information, these huge databases contain the behavioral patterns of the company’s customers. By extracting this data, a telecommunications company (Telco) can better understand the needs of its customers.

Traditional database technology scales to hold vast amounts of data but has severe performance limitations when it comes to analyzing this data. Data mining tools, which often store data in a private representation, offer fast analysis on small data sets but generally do not scale beyond a few million rows.

This paper presents a scalable, parallel data analysis engine capable of processing tens of millions of rows per second per CPU. This technology enables knowledge workers to get sub-second responses to queries that would previously have taken minutes or even hours.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Berry, M.J.A., Linoff, G.: Data Mining Techniques. John Wiley & Sons, Chichester (1997)

    Google Scholar 

  2. Boncz, A., Ruhl, T., Kwakkel, K.: The Drill Down Benchmark. In: Proc. of 24th VLDB Conf. (1998)

    Google Scholar 

  3. Chattratichat, J., et al.: Large Scale Data Mining: Challenges and Responses. In: Proceedings KDD 1997 (1997)

    Google Scholar 

  4. Inmon, W.H.: Building the Data Warehouse. John Wiley & Sons, Chichester (1996)

    Google Scholar 

  5. TPC-D benchmark for ProLiant 65000 6/200-1, report data september 19 (1997)

    Google Scholar 

  6. Triangle Database Marketing: Report produced for internal use of TANTAU Software Inc. June 03 (1999)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pattison, I., Green, R. (2000). How to Analyze 1 Billion CDRs per Sec on $200K Hardware. In: Jonker, W. (eds) Databases in Telecommunications. DBTel 1999. Lecture Notes in Computer Science, vol 1819. Springer, Berlin, Heidelberg. https://doi.org/10.1007/10721056_11

Download citation

  • DOI: https://doi.org/10.1007/10721056_11

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-67667-6

  • Online ISBN: 978-3-540-45100-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics