Zeming, Jacek also has a really good online spark book for spark 2, "mastering spark". I found it very helpful when trying to understand spark 2's encoders.
his book is here: https://www.gitbook.com/book/jaceklaskowski/mastering-apache-spark/details On Wed, May 3, 2017 at 8:16 PM, Neelesh Salian <neeleshssal...@gmail.com> wrote: > The Apache Spark documentation is good to begin with. > All the programming guides, particularly. > > > On Wed, May 3, 2017 at 5:07 PM, ayan guha <guha.a...@gmail.com> wrote: > >> I would suggest do not buy any book, just start with databricks community >> edition >> >> On Thu, May 4, 2017 at 9:30 AM, Tobi Bosede <ani.to...@gmail.com> wrote: >> >>> Well that is the nature of technology, ever evolving. There will always >>> be new concepts. If you're trying to get started ASAP and the internet >>> isn't enough, I'd recommend buying a book and using Spark 1.6. A lot of >>> production stacks are still on that version and the knowledge from >>> mastering 1.6 is transferable to 2+. I think that beats waiting forever. >>> >>> On Wed, May 3, 2017 at 6:35 PM, Zeming Yu <zemin...@gmail.com> wrote: >>> >>>> I'm trying to decide whether to buy the book learning spark, spark for >>>> machine learning etc. or wait for a new edition covering the new concepts >>>> like dataframe and datasets. Anyone got any suggestions? >>>> >>> >>> >> >> >> -- >> Best Regards, >> Ayan Guha >> > > > > -- > Regards, > Neelesh S. Salian > >