spark-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mayur Rustagi <mayur.rust...@gmail.com>
Subject Re: Better option to use Querying in Spark
Date Tue, 06 May 2014 06:14:42 GMT
All three have different usecases. If you are looking for more of a
warehouse you are better off with Shark.
SparkSQL is a way to query regular data in sql like syntax leveraging
columnar store.

BlinkDB is a experiment, meant to integrate with Shark in the long term.
Not meant for production usecase directly.


Mayur Rustagi
Ph: +1 (760) 203 3257
http://www.sigmoidanalytics.com
@mayur_rustagi <https://twitter.com/mayur_rustagi>



On Tue, May 6, 2014 at 11:22 AM, prabeesh k <prabsmails@gmail.com> wrote:

> Hi,
>
> I have seen three different ways to query data from Spark
>
>    1. Default SQL support(
>
> https://github.com/apache/spark/blob/master/examples/src/main/scala/org/apache/spark/sql/examples/HiveFromSpark.scala
>    )
>    2. Shark
>    3. Blink DB
>
> I would like know which one is more efficient
>
> Regards.
> prabeesh
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message