Spark has support for zipping rdds using functions like zip, zipPartition, zipWithIndex and zipWithUniqueId . Lets go through each of…
A window function calculates a return value for every input row of a table based on a group of rows,…
Here we want to find the difference between two dataframes at a column level . We can use the dataframe1.except(dataframe2)…
Lets say we have dataset as below and we want to split a single column into multiple columns using withcolumn…
Lets convert the dataframe of string into the dataframe of Row using the rowencoder. We create the structfield and add…
Lets create a dataframe from list of row object . First populate the list with row object and then we…
In the below example lets see how to create a custom Writable that can be used as a key in…