al/real applications are better covered in this book.
For DataFrame and other online Apache Spark documentation is still the best
source.
Keep in mind Spark and its different subsystems are constantly evolving.
Publications will be always somewhat outdated but not the key fundamental
concep
civilized.
Have a great day!
- Nicos
> On Jan 22, 2015, at 7:49 AM, Sean Owen wrote:
>
> Yes, this isn't a well-formed question, and got maybe the response it
> deserved, but the tone is veering off the rails. I just got a much
> ruder reply from Sudipta privately, whic
ems. In case you are interested
in further tuning the Java GC, continue reading below.
Complete list of tips here:
https://spark.apache.org/docs/latest/tuning.html#serialized-rdd-storage
<https://spark.apache.org/docs/latest/tuning.html#serialized-rdd-storage>
Cheers,
- Nicos
> On J