spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ajay <>
Subject Clarifications on Spark
Date Fri, 05 Dec 2014 07:25:03 GMT

I work for an eCommerce company. Currently we are looking at building a Data
warehouse platform as described below:

DW as a Service
SQL On No SQL (Drill/Pig/Hive/Spark SQL)
No SQL databases (One or more. May be RDBMS directly too)
    | (Bulk load)
My SQL Database    

I wish to get a few clarifications on Apache Drill as follows:

1) Can we use Spark for SQL on No SQL or do we need to mix them with
Pig/Hive or any other for any reason?
2) Can Spark SQL be used a query interface for Business Intelligence,
Analytics and Reporting
3) Is Spark supports only Hadoop, HBase?. We may use
Cassandra/MongoDb/CouchBase as well.
4) Is Spark supports RDBMS too?. We can have a single interface to pull out
data from multiple data sources?
5) Any recommendations(not limited to usage of Spark) for our specific
requirement described above.


Note : I have posted a similar post on the Drill User list as well as I am
not sure which one best fits for our usecase.

View this message in context:
Sent from the Apache Spark User List mailing list archive at

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message