Skip to main content

BAC: A Bagged Associative Classifier for Big Data Frameworks

  • Conference paper
  • First Online:
New Trends in Databases and Information Systems (ADBIS 2016)

Abstract

Big Data frameworks allow powerful distributed computations extending the results achievable on a single machine. In this work, we present a novel distributed associative classifier, named BAC, based on ensemble techniques. Ensembles are a popular approach that builds several models on different subsets of the original dataset, eventually voting to provide a unique classification outcome. Experiments on Apache Spark and preliminary results showed the capability of the proposed ensemble classifier to obtain a quality comparable with the single-machine version on popular real-world datasets, and overcome their scalability limits on large synthetic datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Tsai, C.W., Lai, C.F., Chao, H.C., Vasilakos, A.V.: Big data analytics: a survey. J. Big Data 2(1), 1–32 (2015)

    Article  Google Scholar 

  2. Baralis, E., Garza, P.: A lazy approach to pruning classification rules. In: ICDM 2002, Maebashi, Japan, December 2002

    Google Scholar 

  3. Liu, B., Hsu, W., Ma, Y.: Integrating classification and association rule mining. In: KDD 1998, New York, NY., August 1998

    Google Scholar 

  4. Agrawal, R., Imilienski, T., Swami, A.: Mining association rules between sets of items in large databases. In: SIGMOD 1993, Washington DC., May 1993

    Google Scholar 

  5. Blake, C., Merz, C.: UCI repository of machine learning databases (1998)

    Google Scholar 

  6. Han, J.: Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA (2005)

    Google Scholar 

  7. Quinlan, J.: C4.5: program for classification learning. Morgan Kaufmann, San Mateo (1992)

    Google Scholar 

  8. Rokach, L., Maimon, O.: Data Mining with Decision Trees: Theory and Applications. World Scientific Publishing Co. Inc., River Edge, NJ, USA (2008)

    MATH  Google Scholar 

  9. Sun, Y., Wang, Y., Wong, A.K.C.: Boosting an associative classifier. IEEE Trans. Knowl. Data Eng. 18(7), 988–992 (2006)

    Article  Google Scholar 

Download references

Acknowledgment

The research leading to these results has received funding from the European Union under the FP7 Grant Agreement n. 619633 (“ONTIC” Project).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Luca Venturini .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Venturini, L., Garza, P., Apiletti, D. (2016). BAC: A Bagged Associative Classifier for Big Data Frameworks. In: Ivanović, M., et al. New Trends in Databases and Information Systems. ADBIS 2016. Communications in Computer and Information Science, vol 637. Springer, Cham. https://doi.org/10.1007/978-3-319-44066-8_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-44066-8_15

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-44065-1

  • Online ISBN: 978-3-319-44066-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics