spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Malte <>
Subject GroupBy on RDD returns empty collection
Date Tue, 02 Jun 2015 02:34:40 GMT
I noticed that my spark jobs suddenly return empty data and tried to find out
why. It seems as if a groupBy operation is the cause of it. When I run 

val original:RDD[Data]
val x = original.cache().groupBy(x=>(x.first,x.last,

and then try

I get an
Exception in thread "main" java.lang.UnsupportedOperationException: empty

original definitely is not empty. 

I use Spark 1.2.1 on Mesos

any ideas?

View this message in context:
Sent from the Apache Spark User List mailing list archive at

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message