spark-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Joseph Torres <joseph.tor...@databricks.com>
Subject Re: TextSocketMicroBatchReader no longer supports nc utility
Date Mon, 04 Jun 2018 16:48:17 GMT
I tend to agree that this is a bug. It's kinda silly that nc does this, but
a socket connector that doesn't work with netcat will surely seem broken to
users. It wouldn't be a huge change to defer opening the socket until a
read is actually required.

On Sun, Jun 3, 2018 at 9:55 PM, Jungtaek Lim <kabhwan@gmail.com> wrote:

> Hi devs,
>
> Not sure I can hear back the response sooner since Spark summit is just
> around the corner, but just would want to post and wait.
>
> While playing with Spark 2.4.0-SNAPSHOT, I found nc command exits before
> reading actual data so the query also exits with error.
>
> The reason is due to launching temporary reader for reading schema, and
> closing reader, and re-opening reader. While reliable socket server should
> be able to handle this without any issue, nc command normally can't handle
> multiple connections and simply exits when closing temporary reader.
>
> I would like to file an issue and contribute on fixing this if we think
> this is a bug (otherwise we need to replace nc utility with another one,
> maybe our own implementation?), but not sure we are happy to apply
> workaround for specific source.
>
> Would like to hear opinions before giving a shot.
>
> Thanks,
> Jungtaek Lim (HeartSaVioR)
>

Mime
View raw message