Datenbank-Spektrum

, Volume 13, Issue 1, pp 17–22 | Cite as

Efficient OR Hadoop: Why Not Both?

Schwerpunktbeitrag

Abstract

In this article, we give an overview of research related to Big Data processing in Hadoop going on at the Information Systems Group at Saarland University. We discuss how to make Hadoop efficient. We briefly survey three of our projects in this context: Hadoop++, Trojan Layouts, and HAIL.

Keywords

Hadoop HDFS MapReduce Indexing Big data 

Notes

Acknowledgements

Research partially supported by BMBF. We would like to thank all authors and team members of the Hadoop++, Cloud Variance, RAFT, Trojan Layouts, HAIL, and LIAH projects for their support.

References

  1. 1.
    Ailamaki A et al. (2001) Weaving relations for cache performance. In: VLDB, pp 169–180 Google Scholar
  2. 2.
    Cafarella MJ, Ré C (2010) Manimal: relational optimization for data-intensive programs. In: WebDB Google Scholar
  3. 3.
    Dittrich J, Quiané-Ruiz JA (2012) Efficient big data processing in Hadoop MapReduce. Proc VLDB Endow 5(12):2014–2015 Google Scholar
  4. 4.
    Dittrich J, Quiané-Ruiz JA, Jindal A, Kargin Y, Setty V, Schad J (2010) Hadoop++: making a yellow elephant run like a Cheetah (without it even noticing). Proc VLDB Endow 3(1–2):515–529 Google Scholar
  5. 5.
    Dittrich J, Quiané-Ruiz JA, Richter S, Schuh S, Jindal A, Schad J (2012) Only aggressive elephants are fast elephants. Proc VLDB Endow 5(11):1591–1602 Google Scholar
  6. 6.
    Jindal A, Quiané-Ruiz JA, Dittrich J (2011) Trojan data layouts: right shoes for a running elephant. In: SOCC Google Scholar
  7. 7.
    Jindal A, Quiané-Ruiz JA, Dittrich J (2013) WWHow! Freeing data storage from cages. In: CIDR Google Scholar
  8. 8.
    Jindal A, Schuhknecht FM, Dittrich J, Khachatryan K, Bunte A (2013) How Achaeans would construct columns in Troy. In: CIDR Google Scholar
  9. 9.
    Pavlo A et al. (2009) A comparison of approaches to large-scale data analysis. In: SIGMOD, pp 165–178 CrossRefGoogle Scholar
  10. 10.
    Quiané-Ruiz JA, Pinkel C, Schad J, Dittrich J (2011) RAFT at work: speeding-up MapReduce applications under task and node failures. In: SIGMOD, pp 1225–1228 Google Scholar
  11. 11.
    Quiané-Ruiz JA, Pinkel C, Schad J, Dittrich J (2011) RAFTing MapReduce: fast recovery on the RAFT. In: ICDE, pp 589–600 Google Scholar
  12. 12.
    Richter S, Quiané-Ruiz JA, Schuh S, Dittrich J (2012) Towards zero-overhead adaptive indexing in Hadoop. arXiv:1212.3480 [cs.db]
  13. 13.
    Schad J, Dittrich J, Quiané-Ruiz JA (2010) Runtime measurements in the cloud: observing, analyzing, and reducing variance. Proc VLDB Endow 3(1):460–471 Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  1. 1.Jens Dittrich Information Systems Group, Campus E1 1Saarland UniversitySaarbrückenGermany

Personalised recommendations