Re: Hive From Spark: Jdbc VS sparkContext

ayan guha Wed, 04 Oct 2017 05:00:56 -0700

Well the obvious point is security. Ranger and Sentry can secure jdbc
endpoints only. For performance aspect, I am equally curious 🤓


On Wed, 4 Oct 2017 at 10:30 pm, Gourav Sengupta <gourav.sengu...@gmail.com>
wrote:

> Hi,
>
> I am genuinely curious to see whether any one responds to this question.
>
> Its very hard to shake off JAVA, OOPs and JDBC's :)
>
>
>
> Regards,
> Gourav Sengupta
>
> On Tue, Oct 3, 2017 at 7:08 PM, Nicolas Paris <nipari...@gmail.com> wrote:
>
>> Hi
>>
>> I wonder the differences accessing HIVE tables in two different ways:
>> - with jdbc access
>> - with sparkContext
>>
>> I would say that jdbc is better since it uses HIVE that is based on
>> map-reduce / TEZ and then works on disk.
>> Using spark rdd can lead to memory errors on very huge datasets.
>>
>>
>> Anybody knows or can point me to relevant documentation ?
>>
>> ---------------------------------------------------------------------
>> To unsubscribe e-mail: user-unsubscr...@spark.apache.org
>>
>>
> --
Best Regards,
Ayan Guha

Re: Hive From Spark: Jdbc VS sparkContext

Reply via email to