spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From yana <>
Subject Re: Spark 1.3.1 and Parquet Partitions
Date Thu, 07 May 2015 12:43:23 GMT
I believe this is a regression. Does not work for me either. There is a Jira on parquet wildcards
which is resolved, I'll see about getting it reopened

Sent on the new Sprint Network from my Samsung Galaxy S®4.

<div>-------- Original message --------</div><div>From: Vaxuki <>
</div><div>Date:05/07/2015  7:38 AM  (GMT-05:00) </div><div>To: Olivier
Girardot <> </div><div>Cc: </div><div>Subject:
Re: Spark 1.3.1 and Parquet Partitions </div><div>
Nope. Wildcard extensions don't work I am debugging the code to figure out what's wrong I
know I am using 1.3.1 for sure

Pardon typos...

On May 7, 2015, at 7:06 AM, Olivier Girardot <> wrote:

"hdfs://some ip:8029/dataset/*/*.parquet" doesn't work for you ?

Le jeu. 7 mai 2015 à 03:32, vasuki <> a écrit :
Spark 1.3.1 -
i have a parquet file on hdfs partitioned by some string looking like this

I am trying to get to load it using sqlContext using sqlcontext.parquetFile(
"hdfs://some ip:8029/dataset/< what do i put here >

No leads so far. is there i can load the partitions ? I am running on
cluster and not local..

View this message in context:
Sent from the Apache Spark User List mailing list archive at

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message