() is a programming model and an associated implementation for processing and generating big data sets with a parallel, distributed algorithm on a cluster. The model is a specialization of the split-apply-combine strategy for data analysis.
A. HDFS
B. Chukwa
C. MapReduce
D. HBase