Skip to content

Big Data

Analytics And More
  • Home
  • Spark
  • Design Patterns
  • streaming
  • Map Reduce
  • Hive
  • Hdfs & Yarn
  • Pig
  • Oozie
  • Hbase

Category: Data Analytics

Spark pair rdd and transformations in scala and java – tutorial 2

November, 2017 adarsh

There are a number of ways to get pair RDDs in Spark and many formats will directly load pair RDDs…

Continue Reading →

Posted in: Data Analytics, Spark Filed under: Spark Rdd

Spark rdd api transformations and actions tutorial with examples – tutorial 1

October, 2017 adarsh

An RDD in Spark is simply an immutable distributed collection of objects. Each RDD is split into multiple partitions, which…

Continue Reading →

Posted in: Data Analytics, Spark Filed under: Spark Rdd

chain of responsibility design pattern real world example in java

September, 2017 adarsh

Chain of responsibility is a behavioral design pattern which helps in decoupling the sender of a request from its receiver…

Continue Reading →

Posted in: Data Analytics, Design Patterns Filed under: java design pattern

scala decorator design pattern with real world example

September, 2017 adarsh

The Decorator Pattern is part of the structural design pattern and this is a pattern which attaches additional responsibilities to…

Continue Reading →

Posted in: Data Analytics, Design Patterns Filed under: scala design pattern

scala builder pattern real world example using the named and default parameters feature

September, 2017 adarsh

The builder pattern is an object creation software design pattern. The intention of the builder pattern is to find a…

Continue Reading →

Posted in: Data Analytics, Design Patterns Filed under: scala design pattern

oozie workflow example to use multipleinputs and orcinputformat to process the data from different mappers and joining the dataset in the reducer

August, 2017 adarsh 1 Comment

I will explain how to use multipleinputs to process linelength and speeddata from ems . The input format we will…

Continue Reading →

Posted in: Data Analytics, Oozie Filed under: oozie workflow

oozie using if else,fork and join,ssh,distcp and sub-workflow action

adarsh

I have covered most of the oozie actions in the previous tutorial and below are some of the random topics…

Continue Reading →

Posted in: Data Analytics, Oozie Filed under: oozie workflow

Post navigation

Page 15 of 26
← Previous 1 … 14 15 16 … 26 Next →

Recent Posts

  • Optimization for Using AWS Lambda to Send Messages to Amazon MSK
  • Rebalancing a Kafka Cluster in AWS MSK using CLI Commands
  • Using StsAssumeRoleCredentialsProvider with Glue Schema Registry Integration in Kafka Producer
  • Home
  • Contact Me
  • About Me
Copyright © 2017 Time Pass Techies
 

Loading Comments...