Skip to content

Big Data

Analytics And More
  • Home
  • Spark
  • Design Patterns
  • streaming
  • Map Reduce
  • Hive
  • Hdfs & Yarn
  • Pig
  • Oozie
  • Hbase

Category: Oozie

oozie spark action workflow example

March, 2018 adarsh 1 Comment

Lets create oozie workflow with spark action for creating a inverted index use case. Inverted index pattern is used to…

Continue Reading →

Posted in: Data Analytics, Oozie, Spark Filed under: oozie workflow, Spark Rdd

oozie workflow example to use multipleinputs and orcinputformat to process the data from different mappers and joining the dataset in the reducer

August, 2017 adarsh 1 Comment

I will explain how to use multipleinputs to process linelength and speeddata from ems . The input format we will…

Continue Reading →

Posted in: Data Analytics, Oozie Filed under: oozie workflow

oozie using if else,fork and join,ssh,distcp and sub-workflow action

adarsh

I have covered most of the oozie actions in the previous tutorial and below are some of the random topics…

Continue Reading →

Posted in: Data Analytics, Oozie Filed under: oozie workflow

oozie workflow example for java action with end to end configuration

adarsh

Java action is a great way to run custom Java code on the Hadoop cluster. The Java action will execute…

Continue Reading →

Posted in: Data Analytics, Oozie Filed under: oozie workflow

oozie workflow example for hdfs file system action with end to end configuration

adarsh 1 Comment

Users can run HDFS commands using Oozie’s FS action. Not all HDFS commands are supported, but the following common operations…

Continue Reading →

Posted in: Data Analytics, Hdfs, Oozie Filed under: hdfs, hdfs filesystem, oozie workflow

oozie workflow example for shell action with end to end configuration

adarsh

Oozie provides a convenient way to run any shell command. This could be Unix commands, Perl,Python or even Java programs…

Continue Reading →

Posted in: Data Analytics, Oozie Filed under: oozie workflow

oozie workflow example for sqoop action with end to end configuration

adarsh

Apache Sqoop is a Hadoop tool used for importing and exporting data between relational databases MySQL, Oracle, etc. and Hadoop…

Continue Reading →

Posted in: Data Analytics, Oozie Filed under: oozie workflow, sqoop

Post navigation

Page 1 of 2
1 2 Next →

Recent Posts

  • Optimization for Using AWS Lambda to Send Messages to Amazon MSK
  • Rebalancing a Kafka Cluster in AWS MSK using CLI Commands
  • Using StsAssumeRoleCredentialsProvider with Glue Schema Registry Integration in Kafka Producer
  • Home
  • Contact Me
  • About Me
Copyright © 2017 Time Pass Techies