Apache Hama is a framework for Big Data analytics which uses the Bulk Synchronous Parallel (BSP) computing model, which was established in 2012 as a Top-Level Project of The Apache Software Foundation.
It provides not only pure BSP programming model but also SQL-like query interface (Apache MRQL) and vertex/neuron centric programming models, inspired by Google's Pregel and DistBelief (Apache Horn). For the latest information about Hama, please visit our website at: https://hama.apache.org/ and our wiki at: https://wiki.apache.org/hama/
Please refer to the Installation Guide in the online documentation for an overview on how to getting started with Hama.
Hama provides examples package that allows you to quickly run examples on your Hama Cluster. To run one of them, use % $HAMA_HOME/bin/hama jar hama-examples-x.x.x.jar
. For example:
Download a Iris dataset. And then, run K-Means using:
% $HAMA_HOME/bin/hama jar hama-examples-x.x.x.jar kmeans /tmp/kmeans.txt /tmp/result 10 3
Hama is an open source volunteer project under the Apache Software Foundation. We encourage you to learn about the project and contribute your expertise.