The Internals of Apache Spark

Taking notes about the core of Apache Spark while exploring the lowest depths of the amazing piece of software (towards its mastery)

Last updated 4 months ago

1538

Computational and Inferential Thinking

The Foundations of Data Science

Last updated 2 months ago

235

The Internals of Spark Structured Streaming

Notes about the internals of Spark Structured Streaming

Last updated 15 days ago

200

The Internals of Spark SQL

Notes about the internals of Spark SQL (the Apache Spark module for structured queries)

Last updated 2 months ago

114

Apache Spark - Best Practices and Tuning

This is a collections of notes about Apache Spark's best practices. The notes aim to help me design and develop better programs with Apache Spark.

Last updated a year ago

94

Hadoop and Kerberos: The Madness Beyond the Gate

Hadoop and Kerberos: The details. If you don't use Hadoop, or don't want to know about the darkness that is Kerberos, leave this book alone —it will only damage your mind.

Last updated 6 months ago

89