spark-user mailing list archives

From asma zgolli <zgollia...@gmail.com>
Subject Re: Parallelize Join Problem
Date Wed, 17 Apr 2019 18:18:55 GMT
How can I figure out whether the data is skewed? Are there any statistics I
can check?

On Wed, 17 Apr 2019 at 20:12, Yeikel <email@yeikel.com> wrote:

> It is hard to tell, but your data may be skewed.

-- 
Asma ZGOLLI

PhD student in data engineering - computer science
