Big Data

Oozie workflow example for Sqoop action with end-to-end configuration

August, 2017 adarsh

Apache Sqoop is a Hadoop tool used for importing and exporting data between relational databases (MySQL, Oracle, etc.) and Hadoop…

August, 2017 adarsh

Oozie’s Pig action runs a Pig job in Hadoop. The Pig action requires you to bundle the Pig script with all the…

adarsh

Hive actions run a Hive query on the cluster. The Hive query and the required configuration, libraries, and code for…

adarsh

The following is an ordered sequence of XML elements for a map-reduce job, and you must specify them in order when…

August, 2017 adarsh

These are user-defined aggregating functions that operate row-wise or group-wise and output one row or one row for each group…

adarsh

These are regular user-defined functions that operate row-wise and output one result per row. Let’s say we have an input…

adarsh

Common join: The common join is also called the reduce-side join. It is a basic join in Hive and works…