spark-user mailing list archives

From Tobias Pfeiffer <>
Subject Re: SQL query over (Long, JSON string) tuples
Date Thu, 29 Jan 2015 09:29:12 GMT
Hi Ayoub,

thanks for your mail!

On Thu, Jan 29, 2015 at 6:23 PM, Ayoub <> wrote:
> SQLContext and HiveContext have a "jsonRDD" method which accepts an
> RDD[String], where each string is a JSON string, and returns a SchemaRDD;
> it extends RDD[Row], which is the type you want.
> Afterwards you should be able to do a join to keep your tuple.

I'm afraid that's not so easy, because you can only join on a certain key,
and the key is exactly what I have to drop in order to infer the schema.
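
One possible way around this, sketched below in plain Python, might be to
write the Long key into each JSON object before schema inference, so that it
comes back as an ordinary column and no join is needed. This is only a sketch
under assumptions: the helper name embed_key and the field name "_key" are
made up for illustration, and it assumes every value is a JSON object that
does not already use that field name.

```python
import json

def embed_key(pair, key_field="_key"):
    """Return a JSON string that carries the tuple's key as an extra field.

    `embed_key` and `_key` are hypothetical names chosen for this sketch.
    """
    key, json_str = pair
    obj = json.loads(json_str)
    if not isinstance(obj, dict):
        raise ValueError("expected a JSON object, got %r" % type(obj).__name__)
    if key_field in obj:
        raise ValueError("field %r already exists in the JSON object" % key_field)
    obj[key_field] = key
    return json.dumps(obj)

# Stand-in for the (Long, JSON string) tuples; with Spark this would be an RDD
# and the list comprehension would become rdd.map(embed_key).
pairs = [(1, '{"name": "a"}'), (2, '{"name": "b", "age": 3}')]
keyed = [embed_key(p) for p in pairs]
```

The resulting strings could then presumably be passed to jsonRDD as before,
with the key available as a column in the inferred schema. The cost is one
extra parse and serialization per record.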

