flink-issues mailing list archives

From Tamás Krutki (JIRA) <j...@apache.org>
Subject [jira] [Issue Comment Deleted] (FLINK-1284) Uniform random sampling operator over windows
Date Sun, 03 May 2015 16:26:06 GMT

     [ https://issues.apache.org/jira/browse/FLINK-1284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tamás Krutki updated FLINK-1284:
--------------------------------
    Comment: was deleted

(was: Hi,
I'd like to try this, can you please assign it to me?)

> Uniform random sampling operator over windows
> ---------------------------------------------
>
>                 Key: FLINK-1284
>                 URL: https://issues.apache.org/jira/browse/FLINK-1284
>             Project: Flink
>          Issue Type: New Feature
>          Components: Streaming
>            Reporter: Paris Carbone
>            Priority: Minor
>
> It would be useful for several use cases to have a built-in uniform random sampling operator
> in the streaming API that can operate on windows. This can be used for example for online
> machine learning operations, evaluating heuristics or continuous visualisation of representative
> values.
> The operator could be given a field and a number of random samples needed, following
> a window statement as such:
> mystream.window(..).sample(fieldID,#samples)
> Given that pre-aggregation is enabled, this could perhaps be implemented as a binary
> reduce operator or a combinable groupreduce that pre-aggregates the empiricals of that field.
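
To illustrate the "binary reduce" idea in the description, here is a minimal sketch in plain Java of a sample that can be merged pairwise. It does not use the Flink API, and the proposed sample(fieldID,#samples) call does not appear in it. The class name MergeableSample and its methods of and merge are hypothetical names chosen for this example. Each partial result keeps at most k elements plus the count of elements it represents, and merging two partial results draws from each side in proportion to its remaining count, so the merged result is again a uniform sample without replacement. The sketch assumes every partial result was built with the same k.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

// Hypothetical helper, not part of Flink: a uniform sample of at most k
// elements together with the number of elements it was drawn from.
class MergeableSample<T> {
    final List<T> items; // the sampled elements
    final long seen;     // how many input elements this sample represents

    MergeableSample(List<T> items, long seen) {
        this.items = items;
        this.seen = seen;
    }

    // A single element is a trivial sample of itself.
    static <T> MergeableSample<T> of(T value) {
        List<T> one = new ArrayList<>();
        one.add(value);
        return new MergeableSample<>(one, 1);
    }

    // Binary merge: combine two uniform samples (both built with the same k)
    // into one uniform sample of size min(k, a.seen + b.seen).
    static <T> MergeableSample<T> merge(MergeableSample<T> a, MergeableSample<T> b,
                                        int k, Random rnd) {
        List<T> left = new ArrayList<>(a.items);
        List<T> right = new ArrayList<>(b.items);
        long n1 = a.seen; // elements of a's population not yet accounted for
        long n2 = b.seen; // same for b
        List<T> out = new ArrayList<>();
        while (out.size() < k && n1 + n2 > 0) {
            // Pick a side with probability proportional to its remaining count.
            boolean fromLeft = n2 == 0
                    || (n1 > 0 && rnd.nextDouble() * (n1 + n2) < n1);
            if (fromLeft) {
                out.add(left.remove(rnd.nextInt(left.size())));
                n1--;
            } else {
                out.add(right.remove(rnd.nextInt(right.size())));
                n2--;
            }
        }
        return new MergeableSample<>(out, a.seen + b.seen);
    }

    public static void main(String[] args) {
        Random rnd = new Random();
        // Fold a "window" of 1000 integers into a sample of 10 via pairwise merges.
        MergeableSample<Integer> acc = of(0);
        for (int i = 1; i < 1000; i++) {
            acc = merge(acc, of(i), 10, rnd);
        }
        System.out.println(acc.items + " sampled from " + acc.seen + " elements");
    }
}
```

Because merge only needs two partial results and k, it could in principle be applied per pre-aggregated slice and then across slices, which is the property a combinable reduce would rely on. How this would map onto the actual windowing operators is left open here.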



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
