spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Zhou (Joe) Xing" <>
Subject Re: Standalone cluster node utilization
Date Thu, 14 Jul 2016 16:45:00 GMT

i have seen similar behavior in my standalone cluster, I tried to increase the number of partitions
and at some point it seems all the executors or worker nodes start to make parallel connection
to remote data store. But it would be nice if someone could point us to some references on
how to make proper use of the repartition of data from a remote data store read by spark SQL,
thanks a lot


> On Jul 14, 2016, at 9:18 AM, Jakub Stransky <> wrote:
> <image.png>

To unsubscribe e-mail:

View raw message