spark-user mailing list archives

From Tathagata Das <tathagata.das1...@gmail.com>
Subject Re: Spark Streaming + reduceByWindow(reduceFunc, invReduceFunc, windowDuration, slideDuration
Date Wed, 06 Aug 2014 22:48:50 GMT
Why isn't a simple window function sufficient?

eventData.window(Minutes(15), Seconds(3)) will keep generating RDDs every 3
seconds, each containing the last 15 minutes of data.

TD
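
A minimal sketch of the windowed approach described above, assuming a 3-second batch interval and a text socket source on localhost:9999 that emits one comma-separated record per line. The source, the parse helper, and the object name are hypothetical and not part of the original thread; only the Data case class and the window(Minutes(15), Seconds(3)) call come from the messages.

```scala
import org.apache.spark.SparkConf
import org.apache.spark.streaming.{Minutes, Seconds, StreamingContext}
import org.apache.spark.streaming.dstream.DStream

import scala.util.Try

// Case class as defined in the original question.
case class Data(startDate: Long, endDate: Long, className: String, id: String, state: String)

object WindowSketch {
  // Hypothetical parser (assumption): "startDate,endDate,className,id,state".
  // Returns None for lines that do not have five fields or valid numbers.
  def parse(line: String): Option[Data] = line.split(",", -1) match {
    case Array(s, e, c, i, st) => Try(Data(s.trim.toLong, e.trim.toLong, c, i, st)).toOption
    case _                     => None
  }

  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setAppName("WindowSketch").setMaster("local[2]")
    // Assumed batch interval of 3 seconds; window and slide durations
    // must be multiples of the batch interval.
    val ssc = new StreamingContext(conf, Seconds(3))

    // Assumed input source, used here only to obtain a DStream[Data].
    val eventData: DStream[Data] =
      ssc.socketTextStream("localhost", 9999).flatMap(line => parse(line).toList)

    // Every 3 seconds, produce an RDD holding the last 15 minutes of events.
    val last15Min: DStream[Data] = eventData.window(Minutes(15), Seconds(3))

    // Example action: print the number of events currently in the window.
    last15Min.count().print()

    ssc.start()
    ssc.awaitTermination()
  }
}
```

With window(), no reduce or inverse reduce function is needed: each windowed RDD can be filtered or aggregated directly with ordinary RDD operations.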


On Wed, Aug 6, 2014 at 3:43 PM, salemi <alireza.salemi@udo.edu> wrote:

> Hi,
> I have a DStream called eventData and it contains a set of Data objects
> defined as follows:
>
> case class Data(startDate: Long, endDate: Long, className: String, id:
> String, state: String)
>
> What would the reducer and inverse reducer functions look like if I would
> like to add the data for the current 3 seconds and filter out the data for
> the last 3 seconds?
>
> eventData.reduceByWindow(reduceFunc, invReduceFunc, Minutes(15),
> Seconds(3))
>
> Thanks
> Ali
>

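For reference, a hedged sketch of what a reducer and inverse reducer pair could look like for the reduceByWindow call in the question. The inverse form only makes sense when the reduction can be undone, so this sketch assumes the goal is a running count of events over the window rather than a combination of Data objects themselves. The object and method names are hypothetical.

```scala
import org.apache.spark.streaming.{Minutes, Seconds}
import org.apache.spark.streaming.dstream.DStream

// Case class as defined in the original question.
case class Data(startDate: Long, endDate: Long, className: String, id: String, state: String)

object IncrementalWindowCount {
  // Applied to values from batches entering the window.
  val reduceFunc: (Long, Long) => Long = _ + _

  // Applied to values from batches leaving the window; it must undo
  // reduceFunc, i.e. invReduceFunc(reduceFunc(x, y), y) == x.
  val invReduceFunc: (Long, Long) => Long = _ - _

  // Assumed goal: number of events seen in the last 15 minutes,
  // updated every 3 seconds.
  def eventsInWindow(eventData: DStream[Data]): DStream[Long] =
    eventData
      .map(_ => 1L)
      .reduceByWindow(reduceFunc, invReduceFunc, Minutes(15), Seconds(3))
}
```

The inverse-function variant needs checkpointing to be enabled on the StreamingContext (ssc.checkpoint with a directory of your choice) before the context is started. If the window contents themselves are needed rather than an aggregate, the plain window() call from the reply is the simpler fit.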