Skip to content

Big Data

Analytics And More
  • Home
  • Spark
  • Design Patterns
  • streaming
  • Map Reduce
  • Hive
  • Hdfs & Yarn
  • Pig
  • Oozie
  • Hbase

Analytics && More

spark example for jaccard similarity for lsh algorithm

October, 2018 adarsh

The Jaccard similarity index or the jaccard similarity coefficient compares two datasets to see which data is shared and which…

Continue Reading →

Posted in: Data Analytics, Spark Filed under: Spark Rdd

hbase batch request example – client side buffering

September, 2018 adarsh

Hbase client uses RPC to send the data from client to server and it is recommended to enable client side…

Continue Reading →

Posted in: Hbase Filed under: hbase

hbase CheckAndPut example – java client api

adarsh

Checkandput allows us to compare and then set the value of a row . Using the checkandput we can only…

Continue Reading →

Posted in: Hbase Filed under: hbase

hbase put example java client api – CRUD operations in hbase

adarsh

The put operations has two variations the fist variation works on a single row and the second on a lists…

Continue Reading →

Posted in: Hbase Filed under: hbase

spark treeAggregate example and treeReduce example

May, 2018 adarsh 1 Comment

treeAggregate is a specialized implementation of aggregate that iteratively applies the combine function to a subset of partitions. This is…

Continue Reading →

Posted in: Data Analytics, Spark Filed under: Spark Rdd

spark aggregatebykey example in java

adarsh

Both foldByKey() and reduceByKey() require that the return type of our result be the same type as that of the…

Continue Reading →

Posted in: Spark Filed under: Spark Rdd

spark partition level functions by examples

May, 2018 adarsh

Spark has support for partition level functions which operate on per partition data. Working with data on a per partition…

Continue Reading →

Posted in: Spark Filed under: Spark Rdd

Post navigation

Page 8 of 33
← Previous 1 … 7 8 9 … 33 Next →

Recent Posts

  • Optimization for Using AWS Lambda to Send Messages to Amazon MSK
  • Rebalancing a Kafka Cluster in AWS MSK using CLI Commands
  • Using StsAssumeRoleCredentialsProvider with Glue Schema Registry Integration in Kafka Producer
  • Home
  • Contact Me
  • About Me
Copyright © 2017 Time Pass Techies