flink-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (FLINK-11882) Introduce BytesHashMap to batch hash agg
Date Tue, 12 Mar 2019 08:01:00 GMT

     [ https://issues.apache.org/jira/browse/FLINK-11882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

ASF GitHub Bot updated FLINK-11882:
    Labels: pull-request-available  (was: )

> Introduce BytesHashMap to batch hash agg
> ----------------------------------------
>                 Key: FLINK-11882
>                 URL: https://issues.apache.org/jira/browse/FLINK-11882
>             Project: Flink
>          Issue Type: New Feature
>          Components: Runtime / Operators
>            Reporter: Jingsong Lee
>            Assignee: Jingsong Lee
>            Priority: Major
>              Labels: pull-request-available
> Introduce bytes based hash table.
> It can be used for performing aggregations where the aggregated values are fixed-width.
> Because the data is stored in continuous memory, AggBuffer of variable length cannot
be applied to this HashMap. The KeyValue form in hash map is designed to reduce the cost of
key fetching in lookup.
> Add a test to do a complete hash agg. When HashMap has enough memory, pure hash AGG
is performed; when memory is insufficient, it degenerates into sort agg.

This message was sent by Atlassian JIRA

View raw message