Skip to content

Big Data

Analytics And More
  • Home
  • Spark
  • Design Patterns
  • streaming
  • Map Reduce
  • Hive
  • Hdfs & Yarn
  • Pig
  • Oozie
  • Hbase

Analytics && More

oozie workflow example for sqoop action with end to end configuration

August, 2017 adarsh

Apache Sqoop is a Hadoop tool used for importing and exporting data between relational databases MySQL, Oracle, etc. and Hadoop…

Continue Reading →

Posted in: Data Analytics, Oozie Filed under: oozie workflow, sqoop

oozie workflow example for pig action with end to end configuration

August, 2017 adarsh

Oozie’s Pig action runs a Pig job in Hadoop.Pig action requires you to bundle the Pig script with all the…

Continue Reading →

Posted in: Data Analytics, Oozie, Pig Filed under: oozie workflow, pig latin

oozie workflow example for hive action with end to end configuration

adarsh

Hive actions run a Hive query on the cluster. The Hive query and the required configuration, libraries, and code for…

Continue Reading →

Posted in: Hive, Oozie Filed under: hive, oozie workflow

oozie workflow example for map reduce action with end to end configuration

adarsh 1 Comment

The following is an ordered sequence of XML elements for map-reduce job and you must specify them in order when…

Continue Reading →

Posted in: Data Analytics, Map Reduce, Oozie Filed under: map reduce, oozie workflow

Hive tutorial 11 – hive example for writing custom user defined aggregate functions

August, 2017 adarsh

These are user-defined aggregating functions that operate row-wise or group-wise and output one row or one row for each group…

Continue Reading →

Posted in: Data Analytics, Hive Filed under: hive, hive custom function

Hive tutorial 10 – Hive example for writing custom user defined function

adarsh

These are regular user-defined functions that operate row-wise and output one result for one row.Lets say we have a input…

Continue Reading →

Posted in: Data Analytics, Hive Filed under: hive, hive custom function

Hive tutorial 9 – Hive performance tuning using join optimization with common, map, bucket and skew join

adarsh

Common join The common join is also called reduce side join. It is a basic join in Hive and works…

Continue Reading →

Posted in: Data Analytics, Hive, performance tuning Filed under: hive, hive performance tuning

Post navigation

Page 23 of 33
← Previous 1 … 22 23 24 … 33 Next →

Recent Posts

  • Optimization for Using AWS Lambda to Send Messages to Amazon MSK
  • Rebalancing a Kafka Cluster in AWS MSK using CLI Commands
  • Using StsAssumeRoleCredentialsProvider with Glue Schema Registry Integration in Kafka Producer
  • Home
  • Contact Me
  • About Me
Copyright © 2017 Time Pass Techies