spark-user mailing list archives

From Kevin Burton
Subject limit vs sample for indexing a small amount of data quickly?
Date Thu, 01 Jan 2015 03:00:44 GMT
Is there a limit function which just returns the first N records?

Sample is nice, but I want this to be super fast, since I'm only trying to
test the functionality of an algorithm.

With sample I’d first have to compute the percentage that would yield 1000 results…
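
To make the sample workaround concrete, here is a rough sketch of the calculation I mean. It assumes the total record count is already known (in Spark, getting it with count() is itself a full pass over the data). The helper name sample_fraction is made up for illustration, and the commented usage assumes an existing PySpark RDD called rdd.

```python
def sample_fraction(total_records: int, wanted: int) -> float:
    """Return the fraction to hand to a sampler so that roughly
    `wanted` records come back out of `total_records`."""
    if total_records <= 0:
        raise ValueError("total_records must be positive")
    if wanted < 0:
        raise ValueError("wanted must be non-negative")
    # Cap at 1.0: you cannot sample more than the whole dataset
    # when sampling without replacement.
    return min(1.0, wanted / total_records)


# Hypothetical usage with PySpark (rdd is an assumed, already-built RDD):
#   fraction = sample_fraction(rdd.count(), 1000)  # count() scans everything
#   subset = rdd.sample(False, fraction)           # about 1000 records, not exactly
```

Note that the result is only approximately 1000 records, and the extra count() is exactly the overhead I'd like to avoid.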
