Welcome to Mastering Apache Spark gitbook! I’m very excited to have you here and hope you will enjoy exploring the internals of Apache Spark (Core) as much as I have.
I write to discover what I know.
I’m Jacek Laskowski, an independent consultant, software developer and technical instructor specializing in Apache Spark, Apache Kafka and Kafka Streams (with Scala, sbt, Kubernetes, DC/OS, Apache Mesos, and Hadoop YARN).
|I’m also writing Mastering Spark SQL, Mastering Kafka Streams, Apache Kafka Notebook and Spark Structured Streaming Notebook gitbooks.|
Expect text and code snippets from a variety of public sources. Attribution follows.
Now, let me introduce you to Apache Spark.