Encyclopedia of Big Data Technologies

Living Edition
| Editors: Sherif Sakr, Albert Zomaya

Apache Mahout

  • Andrew Musselman
Living reference work entry
DOI: https://doi.org/10.1007/978-3-319-63962-8_144-1

Definitions

Apache Mahout (http://mahout.apache.org) is a distributed linear algebra framework that includes a mathematically expressive domain-specific language (DSL). It is designed to aid mathematicians, statisticians, and data scientists to quickly implement numerical algorithms while focusing on the mathematical concepts in their work, rather than on code syntax. Mahout uses an extensible plug-in interface to systems such as Apache Spark and Apache Flink.

Historical Background

Mahout was founded as a sub-project of Apache Lucene in late 2007 and was promoted to a top-level Apache Software Foundation (ASF) (ASF 2017) project in 2010 (Khudairi 2010). The goal of the project from the outset has been to provide a machine learning framework that was both accessible to practitioners and able to perform sophisticated numerical computation on large data sets.

Mahout has undergone two major stages of architecture design. The first versions relied on the Apache Hadoop MapReduce framework, a...

This is a preview of subscription content, log in to check access.

References

Copyright information

© Springer International Publishing AG 2018

Authors and Affiliations

  1. 1.Apache Software FoundationSeattleUSA

Section editors and affiliations

  • Domenico Talia
    • 1
  • Paolo Trunfio
    • 1
  1. 1.DIMESUniversity of CalabriaRendeItaly