spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From satheessh chinnu <>
Subject RDD function question
Date Mon, 16 Sep 2013 20:36:29 GMT
i am having a text file.  Each line is a record and first ten characters on
each line is a date in YYYY-MM-DD format.

i would like to run a map function on this RDD with specific date range.
(i.e from 2005 -01-01 to 2007-12-31).  I would like to avoid reading the
records out of the specified data range. (i.e kind of primary index sorted
by date)

is there way to implement this?

View raw message