Independent Consultant passionate about #ApacheSpark, #ApacheKafka, #Scala, #sbt (and #Mesos #DCOS) ~ @theASF member ~ @WarszawScaLa leader ~ Java Champion
This is a collections of notes about Apache Spark's best practices. The notes aim to help me design and develop better programs with Apache Spark.
Last updated 24 days ago
All about Spark Internals, RDD, DataSet, DataFrame, Catalyst Optimizer, etc
Last updated a year ago
My personal user notes, tidied up into a book format. Its a work in progress.
Last updated 6 months ago