spark-user mailing list archives

From Jörn Franke <>
Subject Re: 3rd party hadoop input formats for EDI formats
Date Mon, 15 Jan 2018 18:42:38 GMT
I do not want to advertise specific third-party components.

Hence, just some food for thought:
Python Pandas supports some of those formats (it is not an inputformat though).

Some commercial offerings simply provide ETL to convert these formats into another format
that Spark already supports.

Then there are many message gateways that receive these messages and can also convert them.

Lastly, there are third-party libraries that support these formats, and it is fairly easy
to create your own inputformat on top of them.
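
To illustrate the kind of record-splitting logic such an inputformat would need, here is a minimal sketch in plain Python. It assumes HL7 v2 text in which each message begins with an MSH segment and segments are separated by carriage returns. The function name and the sample data are made up for illustration, and a real implementation would delegate the parsing to one of the third-party libraries.

```python
from typing import Iterable, Iterator, List


def split_hl7_messages(segments: Iterable[str]) -> Iterator[List[str]]:
    """Group HL7 v2 segments into messages.

    Assumption: every message starts with an MSH segment. Segments seen
    before the first MSH are emitted as their own group.
    """
    current: List[str] = []
    for raw in segments:
        segment = raw.strip()
        if not segment:
            continue  # skip empty pieces, e.g. after a trailing separator
        if segment[:3] == "MSH" and current:
            # A new message header closes the previous message.
            yield current
            current = []
        current.append(segment)
    if current:
        yield current


# Made-up sample: two tiny messages, segments separated by carriage returns.
sample = "MSH|^~\\&|APP|FAC\rPID|1||12345\rMSH|^~\\&|APP2|FAC2\rPID|1||67890\r"
messages = list(split_hl7_messages(sample.split("\r")))
```

Each element of `messages` is one record, which is roughly what a record reader inside a custom inputformat would hand to Spark.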

So it is not only about finding an inputformat; you may also make an architectural decision
to convert these formats into ones that Spark supports.
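
As a rough sketch of the conversion approach, the following plain Python turns X12-style text (such as an 835 or 837) into JSON Lines, which Spark can read directly. It assumes the commonly used delimiters `~` (segment terminator) and `*` (element separator); real interchanges declare their delimiters in the ISA header. The function names and the sample values are hypothetical.

```python
import json
from typing import Dict, Iterator


def x12_segments(text: str, seg_term: str = "~", elem_sep: str = "*") -> Iterator[Dict[str, object]]:
    """Yield one flat record per X12 segment.

    Assumption: the delimiters are passed in by the caller; this sketch
    does not read them from the ISA header.
    """
    pieces = (piece.strip() for piece in text.split(seg_term))
    for position, raw in enumerate(piece for piece in pieces if piece):
        elements = raw.split(elem_sep)
        yield {
            "position": position,        # order of the segment in the input
            "segment_id": elements[0],   # e.g. ST, BPR, SE
            "elements": elements[1:],    # remaining elements as strings
        }


def to_json_lines(text: str) -> str:
    """Serialize the segments as JSON Lines (one JSON object per line)."""
    return "\n".join(json.dumps(record) for record in x12_segments(text))


# Made-up fragment, not a complete or valid 835 transaction.
sample = "ST*835*0001~BPR*I*100.00*C~SE*3*0001~"
json_lines = to_json_lines(sample)
```

Written to a file, output like this could be loaded with Spark's JSON reader, so the EDI-specific parsing stays outside the Spark job.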

> On 15. Jan 2018, at 19:01, Saravanan Nagarajan <> wrote:
> Hello All,
> I need to research the availability of both open source and commercial libraries to read
> healthcare EDI formats such as HL7, 835, and 837. Each library needs to be researched and ranked on
> several criteria, such as pricing (if commercial), suitability for integration into sagacity, stability
> of the library, maturity and stability of the API, support options, etc. Any documentation or suggestions
> would help. Thanks!
