Skip to content

Big Data

Analytics And More
  • Home
  • Spark
  • Design Patterns
  • streaming
  • Map Reduce
  • Hive
  • Hdfs & Yarn
  • Pig
  • Oozie
  • Hbase

Tag: map reduce design pattern

mapreduce example to find top n records in a sample data

June, 2017 adarsh 1 Comment

Finding outliers is an important part of data analysis because these records are typically the most interesting and unique pieces…

Continue Reading →

Posted in: Data Analytics, Map Reduce Filed under: map reduce, map reduce design pattern, mapreduce filtering patterns

mapreduce bloom filter example,pattern and optimization with sample data

June, 2017 adarsh 4d Comments

Bloom filtering is similar to generic filtering in that it is looking at each record and deciding whether to keep…

Continue Reading →

Posted in: Data Analytics, Map Reduce Filed under: map reduce, map reduce design pattern, mapreduce filtering patterns

mapreduce example for simple random sampling of data

June, 2017 adarsh

In simple random sampling (SRS), we want to grab a subset of our larger data set in which each record…

Continue Reading →

Posted in: Data Analytics, Map Reduce Filed under: map reduce, map reduce design pattern, mapreduce filtering patterns

mapreduce example to filter data

adarsh

Filtering serves as an abstract pattern for some of the other patterns. Filtering simply evaluates each record separately and decides,…

Continue Reading →

Posted in: Data Analytics, Map Reduce Filed under: map reduce, map reduce design pattern, mapreduce filtering patterns

mapreduce count example using mapreduce frameworks counters utility

June, 2017 adarsh

A count or summation can tell you a lot about particular fields of data, or your data as a whole.…

Continue Reading →

Posted in: Data Analytics, Map Reduce Filed under: map reduce, map reduce design pattern, mapreduce numerical summarization pattern

mapreduce example to find the inverted index of a sample

adarsh

Inverted index pattern is used to generate an index from a data set to allow for faster searches or data…

Continue Reading →

Posted in: Data Analytics, Map Reduce Filed under: map reduce, map reduce design pattern, mapreduce numerical summarization pattern

mapreduce example for calculating standard deviation and median on a sample data

June, 2017 adarsh 11d Comments

A standard deviation shows how much variation exists in the data from the average, thus requiring the average to be…

Continue Reading →

Posted in: Data Analytics, Map Reduce Filed under: map reduce, map reduce design pattern, mapreduce numerical summarization pattern

Post navigation

Page 4 of 5
← Previous 1 … 3 4 5 Next →

Recent Posts

  • Optimization for Using AWS Lambda to Send Messages to Amazon MSK
  • Rebalancing a Kafka Cluster in AWS MSK using CLI Commands
  • Using StsAssumeRoleCredentialsProvider with Glue Schema Registry Integration in Kafka Producer
  • Home
  • Contact Me
  • About Me
Copyright © 2017 Time Pass Techies