Skip to content

Big Data

Analytics And More
  • Home
  • Map Reduce
  • Spark
  • Hive
  • Hdfs & Yarn
  • Pig
  • Oozie
  • Hbase
  • Design Patterns
  • streaming

Analytics && More

spark dataframe fuzzy string matching

January, 2021 adarsh Leave a comment

In this article we will implement fuzzy string matching in a spark data frame, using the Levenshtein distance algorithm. With…

Continue Reading →

Posted in: Data Analytics, Spark

spark sql consecutive sequence example

September, 2020 adarsh Leave a comment

In this short article I will show how to find consecutive sequence in spark sql. Below is the sample dataset,…

Continue Reading →

Posted in: Spark

spark sql example to find second highest average

adarsh Leave a comment

In this short article I will show how to find second highest average in spark sql. Below is the sample…

Continue Reading →

Posted in: Spark

spark sql example to find max of average

adarsh Leave a comment

In this short article I will show how to find max of average in spark sql. Below is the sample…

Continue Reading →

Posted in: Spark

spark create dataset from scala list

adarsh Leave a comment

In this short article I will show how to create dataframe/dataset in spark sql. In scala we can use the…

Continue Reading →

Posted in: Spark

Pass array as UDF parameter in Spark SQL

September, 2020 adarsh 1 Comment

1. Overview In this short article I will show how to pass an array as udf parameter in spark sql.…

Continue Reading →

Posted in: Data Analytics

spark couchbase read and write data

September, 2020 adarsh Leave a comment

1. Overview In this article I will explain how to integrate spark and couchbase with Spark Connector to read and…

Continue Reading →

Posted in: Data Analytics, Spark

Post navigation

Page 1 of 31
1 2 … 31 Next →

Recent Posts

  • spark dataframe fuzzy string matching
  • spark sql consecutive sequence example
  • spark sql example to find second highest average
  • Home
  • Contact Me
  • About Me
Copyright © 2017 Time Pass Techies