spark-user mailing list archives

From Michael Artz <>
Subject Spark SQL vs HiveQL
Date Mon, 28 Aug 2017 14:50:25 GMT
I can't find a good source that answers whether Hive, as a SQL-on-Hadoop
engine, is now just as fast as Spark SQL. Has anyone done a recent
comparison of HiveQL vs Spark SQL on Spark 2.1 or later? I have a large ETL
process with many table joins and some string manipulation, and I don't
think anyone has done this kind of testing in a while. I am trying to make
the case for using Spark, but Hive LLAP is very performant, and some of the
architects are light on experience, so they are scared of Scala.

