spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Manjunath Shetty H <manjunathshe...@live.com>
Subject Spark 1.6 and ORC bucketed queries
Date Thu, 02 Apr 2020 03:18:05 GMT
Hi,

Is it possible to do ORC bucked queries in Spark 1.6 ?

Folder structure is like this:
<partition1>/
                     bucket1.orc
                     bucket2.orc
                     bucket3.orc

And the Spark SQL query will be like `select * from <table> where partition = partition1
and bucket = bucket1`, this query should only read `bucket1.orc` file.

Is this possible with Spark 1.6, if so please let me know how to achieve that ?


Thanks
Manjunath Shetty

Mime
View raw message