
Big Data

Analytics And More

Tag: map reduce design pattern

mapreduce reduce side join, average and top n records pattern with real world example

June, 2017 adarsh

Problem to solve: Top twenty rated movies (condition: the movie should be rated/viewed by at least 40 users). The…


Posted in: Data Analytics, Map Reduce Filed under: map reduce, map reduce design pattern, mapreduce real world example

mapreduce reduce side join and top n records pattern with real world example

adarsh 17d Comments

The problem below revolves around a movies dataset. The dataset contains 2 files, which are as follows: File Name …


Posted in: Data Analytics, Map Reduce Filed under: map reduce, map reduce design pattern, mapreduce real world example

job merging optimization to process two unrelated jobs that are loading the same data to share the mapreduce pipeline

June, 2017 adarsh

Job merging is an optimization aimed at reducing the amount of I/O through the MapReduce pipeline. Job merging is a…


Posted in: Data Analytics, Map Reduce Filed under: map reduce, map reduce design pattern, mapreduce metapatterns

Chain folding optimization in mapreduce using chainmapper and chainreducer approach

adarsh

Chain folding is an optimization that is applied to MapReduce job chains. Take a look at the map phases in the…


Posted in: Data Analytics, Map Reduce Filed under: map reduce, map reduce design pattern, mapreduce metapatterns

job chaining in mapreduce with JobControl, ControlledJob and driver

June, 2017 adarsh

Job chaining is extremely important to understand and have an operational plan for in your environment. Many people find that…


Posted in: Data Analytics, Map Reduce Filed under: map reduce, map reduce design pattern, mapreduce metapatterns

mapreduce example to do a composite join with many very large formatted inputs on the map side

adarsh

Composite joins are particularly useful if you want to join very large data sets together. However, the data sets must…


Posted in: Data Analytics, Map Reduce Filed under: map reduce, map reduce design pattern, mapreduce join patterns

mapreduce example using replicated join between one large and many small data sets that can be performed on the map side

adarsh

A replicated join is extremely useful, but has a strict size limit on all but one of the data…


Posted in: Data Analytics, Map Reduce Filed under: map reduce, map reduce design pattern, mapreduce join patterns


Copyright © 2017 Time Pass Techies