Well the obvious point is security. Ranger and Sentry can secure jdbc endpoints only. For performance aspect, I am equally curious 🤓
On Wed, 4 Oct 2017 at 10:30 pm, Gourav Sengupta <gourav.sengu...@gmail.com> wrote: > Hi, > > I am genuinely curious to see whether any one responds to this question. > > Its very hard to shake off JAVA, OOPs and JDBC's :) > > > > Regards, > Gourav Sengupta > > On Tue, Oct 3, 2017 at 7:08 PM, Nicolas Paris <nipari...@gmail.com> wrote: > >> Hi >> >> I wonder the differences accessing HIVE tables in two different ways: >> - with jdbc access >> - with sparkContext >> >> I would say that jdbc is better since it uses HIVE that is based on >> map-reduce / TEZ and then works on disk. >> Using spark rdd can lead to memory errors on very huge datasets. >> >> >> Anybody knows or can point me to relevant documentation ? >> >> --------------------------------------------------------------------- >> To unsubscribe e-mail: user-unsubscr...@spark.apache.org >> >> > -- Best Regards, Ayan Guha