flink-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Stefan Richter (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (FLINK-10129) Flink job IDs are not getting deleted automatically from zookeeper metadata after canceling flink job in flink HA cluster
Date Mon, 13 Aug 2018 07:46:00 GMT

    [ https://issues.apache.org/jira/browse/FLINK-10129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16577929#comment-16577929
] 

Stefan Richter commented on FLINK-10129:
----------------------------------------

Could you please specify for which Flink version you observed this problem?

> Flink job IDs are not getting deleted automatically from zookeeper metadata after canceling
flink job in flink HA cluster 
> --------------------------------------------------------------------------------------------------------------------------
>
>                 Key: FLINK-10129
>                 URL: https://issues.apache.org/jira/browse/FLINK-10129
>             Project: Flink
>          Issue Type: Bug
>            Reporter: Keshav Lodhi
>            Priority: Blocker
>
> Hi Team,
> Here is, what i am looking for:
>  * We have  flink HA dockerized cluster with (3 zookeepers, 2 job-managers, 3 task-managers) 
>  * So whenever we are cancelling the flink job, it is getting cancelled but it is not
deleting the cancelled job ID from the zookeeper metadata (Inside flink/jobgraph folder in
zookeeper) automatically. 
>  * So whenever any one of the job-manager goes down/restarted , it doesn't come up
and throws exception like  "Could not find this job id xxxxxxxxxx".
>  * The current work around is to remove the canceled job ID from the zookeeper metadata
manually. (But this is not the recommended solution).     
>  
> Please advise.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message