spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From tog <>
Subject Re: Time series data
Date Mon, 29 Jun 2015 09:22:18 GMT

Have you tested the Cloudera project: ?
Let me know how did you progress on that route as I am also interested in
that topic ?


On 26 June 2015 at 14:07, Caio Cesar Trucolo <> wrote:

> Hi everyone!
> I am working with multiple time series data and in summary I have to
> adjust each time series (like inserting average values in data gaps) and
> then training regression models with mllib for each time series. The
> adjustment step I did with the adjustement function being mapped for each
> element of RDD (in this case being the ID[as key] and the grouped by key
> features). But for the regression models, it was not possible because the
> functions need RDDs and my solution would be map each element (grouped as
> time series) to a function of training. How can I deal with time series
> data in this context with Spark? I did'nt find a way.
> Thank you
> --
> Caio

PGP KeyID: 2048R/EA31CFC9

View raw message