spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From skane <sk...@websense.com>
Subject Re: PySpark issue with sortByKey: "IndexError: list index out of range"
Date Thu, 06 Nov 2014 18:39:10 GMT
I don't have any insight into this bug, but on Spark version 1.0.0 I ran into
the same bug running the 'sort.py' example. On a smaller data set, it worked
fine. On a larger data set I got this error:

Traceback (most recent call last):
  File "/home/skane/spark/examples/src/main/python/sort.py", line 30, in
<module>
    .sortByKey(lambda x: x)
  File "/usr/lib/spark/python/pyspark/rdd.py", line 480, in sortByKey
    bounds.append(samples[index])
IndexError: list index out of range



--
View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/PySpark-issue-with-sortByKey-IndexError-list-index-out-of-range-tp16445p18288.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@spark.apache.org
For additional commands, e-mail: user-help@spark.apache.org


Mime
View raw message