spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Michael Armbrust <mich...@databricks.com>
Subject Re: Using SQL statements vs. SchemaRDD methods
Date Wed, 05 Nov 2014 01:43:30 GMT
They both compile down to the same logical plans so the performance of
running the query should be the same.  The Scala DSL uses a lot of Scala
magic and thus is experimental where as HiveQL is pretty set in stone.

On Tue, Nov 4, 2014 at 5:22 PM, SK <skrishna.id@gmail.com> wrote:

> SchemaRDD  supports some of the SQL-like functionality like groupBy(),
> distinct(), select(). However, SparkSQL also supports SQL statements which
> provide this functionality. In terms of future support and performance, is
> it better to use SQL statements or the SchemaRDD methods that provide
> equivalent functionality?
>
> thanks
>
>
>
> --
> View this message in context:
> http://apache-spark-user-list.1001560.n3.nabble.com/Using-SQL-statements-vs-SchemaRDD-methods-tp18124.html
> Sent from the Apache Spark User List mailing list archive at Nabble.com.
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscribe@spark.apache.org
> For additional commands, e-mail: user-help@spark.apache.org
>
>

Mime
View raw message