spark-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Emre Sevinc <emre.sev...@gmail.com>
Subject Re: Multi-Line JSON in SparkSQL
Date Mon, 04 May 2015 06:13:13 GMT
You can check out the following library:

   https://github.com/alexholmes/json-mapreduce

--
Emre Sevinç


On Sun, May 3, 2015 at 10:04 PM, Olivier Girardot <
o.girardot@lateral-thoughts.com> wrote:

> Hi everyone,
> Is there any way in Spark SQL to load multi-line JSON data efficiently, I
> think there was in the mailing list a reference to
> http://pivotal-field-engineering.github.io/pmr-common/ for its
> JSONInputFormat
>
> But it's rather inaccessible considering the dependency is not available in
> any public maven repo (If you know of one, I'd be glad to hear it).
>
> Is there any plan to address this or any public recommendation ?
> (considering the documentation clearly states that sqlContext.jsonFile will
> not work for multi-line json(s))
>
> Regards,
>
> Olivier.
>



-- 
Emre Sevinc

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message