flink-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Stefan Richter (JIRA)" <j...@apache.org>
Subject [jira] [Assigned] (FLINK-6773) Use compression (e.g. snappy) for full check/savepoints
Date Thu, 15 Jun 2017 08:29:00 GMT

     [ https://issues.apache.org/jira/browse/FLINK-6773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Stefan Richter reassigned FLINK-6773:

    Assignee: Stefan Richter

> Use compression (e.g. snappy) for full check/savepoints
> -------------------------------------------------------
>                 Key: FLINK-6773
>                 URL: https://issues.apache.org/jira/browse/FLINK-6773
>             Project: Flink
>          Issue Type: Improvement
>          Components: State Backends, Checkpointing
>            Reporter: Stefan Richter
>            Assignee: Stefan Richter
> We could use compression (e.g. snappy stream compression) to decrease the size of our
full checkpoints and savepoints. From some initial experiments, I think there is great potential
to achieve compression rates around 30-50%. Given those numbers, I think this is very low
hanging fruit to implement.
> One point to consider in the implementation is that compression blocks should respect
key-groups, i.e. typically it should make sense to compress per key-group.

This message was sent by Atlassian JIRA

View raw message