The Internals of Apache Spark

Taking notes about the core of Apache Spark while exploring the lowest depths of the amazing piece of software (towards its mastery)

Last updated 2 months ago


The Internals of Spark Structured Streaming

Notes about the internals of Spark Structured Streaming

Last updated 5 months ago


The Internals of Spark SQL

Notes about the internals of Spark SQL (the Apache Spark module for structured queries)

Last updated 5 days ago


The Internals of Kafka Streams

Gitbook about Kafka Streams -- a library for developing distributed applications for processing record streams with Apache Kafka as the data storage for input and output records.

Last updated 8 months ago