Encyclopedia of Big Data Technologies

Living Edition
Editors: Sherif Sakr, Albert Zomaya

Apache Mahout

Living reference work entry
DOI: https://doi.org/10.1007/978-3-319-63962-8_144-1


Apache Mahout (http://mahout.apache.org) is a distributed linear algebra framework that includes a mathematically expressive domain-specific language (DSL). It is designed to help mathematicians, statisticians, and data scientists quickly implement numerical algorithms while focusing on the mathematical concepts in their work rather than on code syntax. Mahout uses an extensible plug-in interface to systems such as Apache Spark and Apache Flink.

Historical Background

Mahout was founded as a sub-project of Apache Lucene in late 2007 and was promoted to a top-level Apache Software Foundation (ASF) (ASF 2017) project in 2010 (Khudairi 2010). From the outset, the goal of the project has been to provide a machine learning framework that is both accessible to practitioners and able to perform sophisticated numerical computation on large data sets.

Mahout has undergone two major stages of architecture design. The first versions relied on the Apache Hadoop MapReduce framework, a...


Keywords: Apache Mahout; Compatible GPUs; Apache Spark; HTTP Access Logs; Hadoop
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Authors and Affiliations

  1. Apache Software Foundation, Seattle, USA

Section editors and affiliations

  • Domenico Talia (1)
  • Paolo Trunfio (1)
  1. DIMES, University of Calabria, Rende, Italy