flink-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Aljoscha Krettek (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (FLINK-4808) Allow skipping failed checkpoints
Date Wed, 19 Oct 2016 16:04:58 GMT

    [ https://issues.apache.org/jira/browse/FLINK-4808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15589126#comment-15589126
] 

Aljoscha Krettek commented on FLINK-4808:
-----------------------------------------

I think it might be good to allow a certain number of checkpoints within a given time frame.

> Allow skipping failed checkpoints
> ---------------------------------
>
>                 Key: FLINK-4808
>                 URL: https://issues.apache.org/jira/browse/FLINK-4808
>             Project: Flink
>          Issue Type: New Feature
>    Affects Versions: 1.1.2, 1.1.3
>            Reporter: Stephan Ewen
>            Assignee: Ufuk Celebi
>             Fix For: 1.2.0
>
>
> Currently, if Flink cannot complete a checkpoint, it results in a failure and recovery.
> To make the impact of less stable storage infrastructure on the performance of Flink
less severe, Flink should be able to tolerate a certain number of failed checkpoints and simply
keep executing.
> This should be controllable via a parameter, for example:
> {code}
> env.getCheckpointConfig().setAllowedFailedCheckpoints(3);
> {code}
> A value of {{-1}} could indicate an infinite number of checkpoint failures tolerated
by Flink.
> The default value should still be {{0}}, to keep compatibility with the existing behavior.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message