flink-issues mailing list archives

From "Ufuk Celebi (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (FLINK-2993) Set default DelayBetweenExecutionRetries to 0
Date Wed, 11 Nov 2015 09:07:10 GMT

    [ https://issues.apache.org/jira/browse/FLINK-2993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15000134#comment-15000134 ]

Ufuk Celebi commented on FLINK-2993:
------------------------------------

I agree that the 100-second default is too high. Let's go for something lower.

But bear in mind that it also gives a strange user experience when the restart does not
work because of missing task slots. If the delay is too aggressive, you will see redeployment
failures.

For HA, we should go for a less aggressive default delay (for now): job recovery is delayed
by this timeout as well, and the task managers have to be notified about the new leader
and reconnect within this time.
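
To make the trade-off concrete, here is a toy model in Python. It is not Flink code: the
function name, its parameters, and the assumption that all slots come back at one fixed
point in time are hypothetical and only illustrate the reasoning above.

```python
def simulate_restarts(retry_delay_ms, slots_available_after_ms, max_retries):
    """Toy model of a job restarting after a failure at t = 0.

    Assumptions (hypothetical, for illustration only):
    - every restart attempt waits retry_delay_ms after the previous failure,
    - an attempt succeeds only if the task slots are available again by then,
    - the slots become available at slots_available_after_ms and stay available.

    Returns a tuple (job_recovered, failed_redeployments).
    """
    now_ms = 0
    failed_redeployments = 0
    for _ in range(max_retries):
        now_ms += retry_delay_ms  # time spent waiting before redeploying
        if now_ms >= slots_available_after_ms:
            return True, failed_redeployments
        failed_redeployments += 1  # not enough task slots yet
    return False, failed_redeployments


if __name__ == "__main__":
    # Task managers need 5 s to reconnect; the job may retry 3 times.
    for delay_ms in (0, 2000, 10000):
        print(delay_ms, simulate_restarts(delay_ms, 5000, 3))
```

In this model a delay of 0 uses up all retries before any slot is back, a short delay
recovers only after some failed redeployments, and a delay longer than the reconnect time
recovers on the first attempt, at the cost of a longer wait.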

It would also be helpful to print this delay at the client when the job state changes to
RESTARTING. What do you think?

> Set default DelayBetweenExecutionRetries to 0
> ---------------------------------------------
>
>                 Key: FLINK-2993
>                 URL: https://issues.apache.org/jira/browse/FLINK-2993
>             Project: Flink
>          Issue Type: Improvement
>          Components: Core
>    Affects Versions: 0.10
>            Reporter: Stephan Ewen
>            Assignee: Stephan Ewen
>             Fix For: 1.0
>
>
> The default value is too high and gives a strange user experience.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
