stream processing Archives

Producing events and handling credentials refresh for IAM enabled aws msk cluster using aws msk IAM auth library

January, 2023 adarsh

Amazon Managed Streaming for Apache Kafka (MSK) is a fully managed service that makes it easy to build and run…

Continue Reading →

submit spark job programmatically using SparkLauncher

March, 2019 adarsh

In this article I will illustrate how to submit a spark job programmatically using SparkLauncher. Let us take a use…

Continue Reading →

kafka example for custom serializer, deserializer and encoder with spark streaming integration

November, 2017 adarsh 1 Comment

Lets say we want to send a custom object as the kafka value type and we need to push this…

Continue Reading →

performance tuning in spark streaming

adarsh

Batch and Window Sizes – The most common question is what minimum batch size Spark Streaming can use. In general,…

Continue Reading →

checkpointing and fault tolerance in spark streaming

adarsh

Checkpointing is the main mechanism that needs to be set up for fault tolerance in Spark Streaming. It allows Spark…

Continue Reading →

stateful transformation spark streaming example

adarsh

Stateful transformations are operations on DStreams that track data across time that is, some data from previous batches is used…

Continue Reading →

stateless transformation spark streaming example

adarsh

Stateless transformations like map(), flatMap(), filter(), repartition(), reduceByKey(), groupByKey() are simple RDD transformations being applied on every batch. Keep in…

Continue Reading →

Big Data

Category: stream processing

Producing events and handling credentials refresh for IAM enabled aws msk cluster using aws msk IAM auth library

submit spark job programmatically using SparkLauncher

kafka example for custom serializer, deserializer and encoder with spark streaming integration

performance tuning in spark streaming

checkpointing and fault tolerance in spark streaming

stateful transformation spark streaming example

stateless transformation spark streaming example