flink-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Stephan Ewen (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (FLINK-3065) Can't cancel failing jobs
Date Tue, 24 Nov 2015 10:08:11 GMT

    [ https://issues.apache.org/jira/browse/FLINK-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15024173#comment-15024173

Stephan Ewen commented on FLINK-3065:

Robert is investigating the ZkClient issue (seems to still pull the wrong version).

If that does not work, we may have to implement our own ZooKeeper calls.

Last resort is to kill the JVM on non-responsive cancelling and rely on Yarn to restart them...

> Can't cancel failing jobs
> -------------------------
>                 Key: FLINK-3065
>                 URL: https://issues.apache.org/jira/browse/FLINK-3065
>             Project: Flink
>          Issue Type: Bug
>          Components: Command-line client, Webfrontend
>    Affects Versions: 0.10.0, 1.0.0
>            Reporter: Gyula Fora
>            Priority: Blocker
> It is currently not possible to stop a failing streaming job (if it get's stuck while
failing for instance).
> There is no cancel button in the web interface, also it doesnt show on the list of running
jobs in the command line.
> This means jobs getting stuck while failing will take down the cluster eventually.

This message was sent by Atlassian JIRA

View raw message