spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Arun Kumar <arunpat...@gmail.com>
Subject almost sorted data
Date Fri, 25 Oct 2013 09:01:38 GMT
Hi,

I am trying to process some logs and the data is sorted(*almost*) by
timestamp.
If I do a full sort it takes a lot of time. Is there some way to sort more
efficiently (like restricting sort to per partition).

Thanks in advance

Mime
View raw message