spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From ayan guha <guha.a...@gmail.com>
Subject Re: Hive From Spark: Jdbc VS sparkContext
Date Wed, 04 Oct 2017 12:00:20 GMT
Well the obvious point is security. Ranger and Sentry can secure jdbc
endpoints only. For performance aspect, I am equally curious 🤓

On Wed, 4 Oct 2017 at 10:30 pm, Gourav Sengupta <gourav.sengupta@gmail.com>
wrote:

> Hi,
>
> I am genuinely curious to see whether any one responds to this question.
>
> Its very hard to shake off JAVA, OOPs and JDBC's :)
>
>
>
> Regards,
> Gourav Sengupta
>
> On Tue, Oct 3, 2017 at 7:08 PM, Nicolas Paris <niparisco@gmail.com> wrote:
>
>> Hi
>>
>> I wonder the differences accessing HIVE tables in two different ways:
>> - with jdbc access
>> - with sparkContext
>>
>> I would say that jdbc is better since it uses HIVE that is based on
>> map-reduce / TEZ and then works on disk.
>> Using spark rdd can lead to memory errors on very huge datasets.
>>
>>
>> Anybody knows or can point me to relevant documentation ?
>>
>> ---------------------------------------------------------------------
>> To unsubscribe e-mail: user-unsubscribe@spark.apache.org
>>
>>
> --
Best Regards,
Ayan Guha

Mime
View raw message