Big Data

hadoop mapreduce example with custom inputformat,inputsplit,recordreader,outputformat and recordwriter for generating test data

June, 2017 adarsh

Generating data isn’t common. Typically you’ll generate a bunch of the data at once then use it over and over…

adarsh 7d Comments

Problem to solve : We wish to know how have the genres ranked by Average Rating, for each profession and…

June, 2017 adarsh

Problem to solve : Top twenty rated movies (Condition: The movie should be rated/viewed by at least 40 users) The…

adarsh 17d Comments

The problem mentioned below revolves around movies dataset. The dataset contains 2 files which are follows, File Name …

June, 2017 adarsh

job merging is an optimization aimed to reduce the amount of I/O through the MapReduce pipeline. Job merging is a…

adarsh

Chain folding is an optimization that is applied to MapReduce job chains.Take a look at the map phases in the…

June, 2017 adarsh

Job chaining is extremely important to understand and have an operational plan for in your environment. Many people find that…