spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Arun Kumar <>
Subject almost sorted data
Date Fri, 25 Oct 2013 09:01:38 GMT

I am trying to process some logs and the data is sorted(*almost*) by
If I do a full sort it takes a lot of time. Is there some way to sort more
efficiently (like restricting sort to per partition).

Thanks in advance

View raw message