
Big Data

Analytics And More

pig tutorial 2 – pig data types, relations, bags, tuples, fields and parameter substitution

July 2017 adarsh

Relations, Bags, Tuples, Fields: Pig Latin statements work with relations. A relation is a bag and a bag is a…

Continue Reading →

Posted in: Data Analytics, Pig, pig latin | Filed under: pig latin, pig script

pig tutorial 1 – multiquery execution, store, dump, dependencies and replicated, skewed, merge joins

adarsh

A Pig Latin statement is an operator that takes a relation as input and produces another relation as output. This…

Continue Reading →

Posted in: Data Analytics, Pig, pig latin | Filed under: pig latin, pig script

input formats and output formats in hadoop and mapreduce

July 2017 adarsh

There are many input and output formats supported in hadoop out of the box and we will explore the same…
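
As a rough, illustrative sketch of how that choice shows up on a job (not taken from the post itself; the class name FormatConfigExample and the pairing of TextInputFormat with SequenceFileOutputFormat are just assumptions for the example):

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.input.TextInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.hadoop.mapreduce.lib.output.SequenceFileOutputFormat;

public class FormatConfigExample {
  public static void main(String[] args) throws Exception {
    Job job = Job.getInstance(new Configuration(), "format demo");
    job.setJarByClass(FormatConfigExample.class);

    // read plain text lines (byte offset -> line text) ...
    job.setInputFormatClass(TextInputFormat.class);
    // ... and write the records back out as a binary SequenceFile
    job.setOutputFormatClass(SequenceFileOutputFormat.class);

    job.setOutputKeyClass(LongWritable.class);
    job.setOutputValueClass(Text.class);

    FileInputFormat.addInputPath(job, new Path(args[0]));
    FileOutputFormat.setOutputPath(job, new Path(args[1]));

    System.exit(job.waitForCompletion(true) ? 0 : 1);
  }
}

With no mapper or reducer set, the identity implementations simply pass the TextInputFormat records straight through to the configured output format.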

Continue Reading →

Posted in: Data Analytics, hadoop input/output, Hdfs, Map Reduce | Filed under: hadoop input output, hdfs, map reduce

default mapper, reducer, partitioner, multithreadedmapper and split size configuration in hadoop and mapreduce

adarsh

What will be the mapper, reducer and the partitioner that will be used in a mapreduce program if we don't specify any…
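
As a quick preview of the answer (a sketch assuming the new org.apache.hadoop.mapreduce API, not code from the post), setting the defaults explicitly is equivalent to not setting them at all:

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.TextInputFormat;
import org.apache.hadoop.mapreduce.lib.output.TextOutputFormat;
import org.apache.hadoop.mapreduce.lib.partition.HashPartitioner;

public class DefaultsMadeExplicit {
  public static void main(String[] args) throws Exception {
    Job job = Job.getInstance(new Configuration(), "defaults made explicit");

    // each call below matches what the framework falls back to when nothing is configured
    job.setInputFormatClass(TextInputFormat.class);   // default input format
    job.setMapperClass(Mapper.class);                 // identity mapper
    job.setPartitionerClass(HashPartitioner.class);   // hash of the key modulo number of reducers
    job.setNumReduceTasks(1);                         // a single reducer by default
    job.setReducerClass(Reducer.class);               // identity reducer
    job.setOutputFormatClass(TextOutputFormat.class); // default output format
  }
}

The default split size for FileInputFormat works out to the HDFS block size of the file, since it is computed as max(minSplitSize, min(maxSplitSize, blockSize)).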

Continue Reading →

Posted in: hadoop input/output, Hdfs, Map Reduce | Filed under: hadoop input output, hdfs, map reduce

hadoop mapreduce reading the entire file content without splitting the file for example reading an xml file

adarsh 2 Comments

Some applications don’t want files to be split, as this allows a single mapper to process each input file in…
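
The usual trick (sketched below as an assumption about the technique, with the class name NonSplittableTextInputFormat invented for the example) is to subclass an input format and have isSplitable() return false:

import org.apache.hadoop.fs.Path;
import org.apache.hadoop.mapreduce.JobContext;
import org.apache.hadoop.mapreduce.lib.input.TextInputFormat;

// an input format that never splits a file, so a single mapper
// sees each input file (for example one whole XML document) in full
public class NonSplittableTextInputFormat extends TextInputFormat {
  @Override
  protected boolean isSplitable(JobContext context, Path file) {
    return false;
  }
}

A job would opt in with job.setInputFormatClass(NonSplittableTextInputFormat.class).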

Continue Reading →

Posted in: Hdfs, Map Reduce | Filed under: hdfs, hdfs filesystem, map reduce

handling failures in hadoop, mapreduce and yarn

July 2017 adarsh 1 Comment

In the real world, user code is buggy, processes crash, and machines fail. One of the major benefits of using…

Continue Reading →

Posted in: Data Analytics, Hdfs, Map Reduce, yarn | Filed under: hdfs, map reduce, yarn

life cycle of a mapreduce program – job submission,job initialization, task assignment, task execution, progress updates and job completion

adarsh

You can run a mapreduce job with a single method call, submit(), on a Job object, or you can also…
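
For a small side-by-side (illustrative only; the class name SubmitVsWait is a placeholder, not the post's code), the two calls on a Job look like this:

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.mapreduce.Job;

public class SubmitVsWait {
  public static void main(String[] args) throws Exception {
    Job job = Job.getInstance(new Configuration(), "submit demo");
    // ... mapper, reducer and input/output paths would be configured here ...

    // option 1: submit() returns as soon as the job has been handed to the cluster
    // job.submit();

    // option 2: waitForCompletion(true) blocks until the job finishes and prints progress
    boolean success = job.waitForCompletion(true);
    System.exit(success ? 0 : 1);
  }
}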

Continue Reading →

Posted in: Data Analytics, Hdfs, Map Reduce, yarn | Filed under: hdfs, map reduce, yarn


Copyright © 2017 Time Pass Techies
 
