spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Tobias Pfeiffer <>
Subject Re: Create DStream consisting of HDFS and (then) Kafka data
Date Thu, 08 Jan 2015 05:33:30 GMT

On Thu, Jan 8, 2015 at 2:19 PM, <> wrote:

> dstream processing bulk HDFS data- is something I don't feel is super

well socialized yet, & fingers crossed that base gets built up a little
> more.

Just out of interest (and hoping not to hijack my own thread), why are you
not doing plain RDD processing when you are only processing HDFS data?
What's the advantage of doing DStream?


View raw message