Independent Consultant passionate about #ApacheSpark, #ApacheKafka, #Scala, #sbt (and #Mesos #DCOS) ~ @theASF member ~ @WarszawScaLa leader ~ Java Champion
This is a collections of notes about Apache Spark's best practices. The notes aim to help me design and develop better programs with Apache Spark.
Last updated 3 months ago
All about Spark Internals, RDD, DataSet, DataFrame, Catalyst Optimizer, etc
Last updated a year ago
My personal user notes, tidied up into a book format. Its a work in progress.
Last updated 8 months ago