MapReduce metapatterns Archives

Job merging: an optimization that lets two unrelated jobs loading the same data share the MapReduce pipeline

June, 2017 adarsh

Job merging is an optimization aimed at reducing the amount of I/O through the MapReduce pipeline. Job merging is a…

adarsh

Chain folding is an optimization that is applied to MapReduce job chains. Take a look at the map phases in the…

June, 2017 adarsh

Job chaining is extremely important to understand and have an operational plan for in your environment. Many people find that…