jaceklaskowski
Mastering Apache Spark

Updated a month ago

kkolman (@kkolman) started discussion #136

a year ago · 1 comment

Open

RDD lineage graph

Nice book you put together, thanks !

In the RDD lineage page, there is a possible error:

https://jaceklaskowski.gitbooks.io/mastering-apache-spark/content/spark-rdd-lineage.html

The pseudo-code says

val r10 = r00 cartesian r01

But the graph does not show an arrow r01 -> r10.

Jacek Laskowski @jaceklaskowski commented a year ago

Thanks for kind words about the book! It seems it's getting better every day as more and more people find it useful.

Regarding the pseudo-code, you may be right since I didn't pay that much attention to it (it's a pseudo-code, isn't it?)

If you can improve it with a proper code to have the RDD lineage, I'd be more than happy to approve the change. Mind sending a pull request?


to join this conversation on GitBook. Already have an account? Sign in to comment
Notifications

You’re not receiving notifications from this thread.


2 participants