spark-user mailing list archives

From satheessh chinnu <sathees...@gmail.com>
Subject RDD function question
Date Mon, 16 Sep 2013 20:36:29 GMT
I have a text file. Each line is a record, and the first ten characters of
each line are a date in YYYY-MM-DD format.

I would like to run a map function on this RDD over a specific date range
(i.e. from 2005-01-01 to 2007-12-31). I would like to avoid reading the
records outside the specified date range (i.e. something like a primary
index sorted by date).

Is there a way to implement this?
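
For illustration only, here is a minimal sketch of the baseline date-range predicate described above, written in plain Python. The sample records and the names START, END and in_range are hypothetical, and well-formed lines are assumed. Note that a predicate like this still touches every record; it does not provide the index-like skipping asked about.

```python
# Hypothetical bounds taken from the date range mentioned in the question.
START = "2005-01-01"  # inclusive lower bound
END = "2007-12-31"    # inclusive upper bound


def in_range(line, start=START, end=END):
    """Return True if the line's leading YYYY-MM-DD date lies in [start, end].

    Assumes each well-formed line begins with a ten-character ISO date.
    ISO dates sort lexicographically in chronological order, so plain
    string comparison is enough and no date parsing is needed.
    """
    if len(line) < 10:
        return False  # too short to carry a full date prefix
    return start <= line[:10] <= end


# Made-up sample records, standing in for the lines of the text file.
records = [
    "2004-12-31 before range",
    "2005-01-01 first day",
    "2006-06-15 middle",
    "2007-12-31 last day",
    "2008-01-01 after range",
]

selected = [r for r in records if in_range(r)]

# With PySpark, the same predicate could be passed to RDD.filter before
# the map step, e.g. sc.textFile(path).filter(in_range).map(...),
# where sc is a SparkContext and path is a placeholder.
```

The predicate is kept separate from any Spark call so it can be checked on its own before being applied to the RDD.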
