flink-issues mailing list archives

From "ramkrishna.s.vasudevan (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (FLINK-5962) Cancel checkpoint canceller tasks in CheckpointCoordinator
Date Mon, 06 Mar 2017 09:36:32 GMT

    [ https://issues.apache.org/jira/browse/FLINK-5962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15896972#comment-15896972 ]

ramkrishna.s.vasudevan commented on FLINK-5962:

I can work on this [~till.rohrmann] - if you have not already started with it.

> Cancel checkpoint canceller tasks in CheckpointCoordinator
> ----------------------------------------------------------
>                 Key: FLINK-5962
>                 URL: https://issues.apache.org/jira/browse/FLINK-5962
>             Project: Flink
>          Issue Type: Bug
>          Components: State Backends, Checkpointing
>    Affects Versions: 1.2.0, 1.3.0
>            Reporter: Till Rohrmann
>            Priority: Critical
> The {{CheckpointCoordinator}} registers a canceller task for each running checkpoint.
> The canceller task's responsibility is to cancel a checkpoint if it takes too long to complete.
> We should cancel this task as soon as the checkpoint has been completed, because otherwise
> we will keep many canceller tasks around. This can eventually lead to an OOM exception.
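
The behaviour described above can be illustrated with a minimal sketch. It is not Flink's CheckpointCoordinator code: the class CancellerSketch and its methods are hypothetical names chosen for illustration, and only the java.util.concurrent calls are real APIs. It assumes one timeout task is scheduled per checkpoint, and shows the handle being cancelled as soon as the checkpoint completes so that the task does not linger in the timer queue.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.concurrent.ScheduledFuture;
import java.util.concurrent.ScheduledThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

/** Hypothetical stand-in for a coordinator; not Flink's actual implementation. */
final class CancellerSketch {
    private final ScheduledThreadPoolExecutor timer = new ScheduledThreadPoolExecutor(1);
    // One canceller handle per pending checkpoint, keyed by checkpoint id.
    private final Map<Long, ScheduledFuture<?>> cancellers = new HashMap<>();

    CancellerSketch() {
        // Remove cancelled tasks from the work queue immediately instead of
        // leaving them there until their delay elapses.
        timer.setRemoveOnCancelPolicy(true);
    }

    /** Registers a canceller that fires if the checkpoint exceeds its timeout. */
    synchronized void triggerCheckpoint(long checkpointId, long timeoutMillis) {
        ScheduledFuture<?> handle = timer.schedule(
                () -> expire(checkpointId), timeoutMillis, TimeUnit.MILLISECONDS);
        cancellers.put(checkpointId, handle);
    }

    /** Runs on the timer thread when a checkpoint has taken too long. */
    private synchronized void expire(long checkpointId) {
        cancellers.remove(checkpointId);
        // A real coordinator would abort the pending checkpoint here.
    }

    /** Called when the checkpoint completes: cancel its canceller right away. */
    synchronized void completeCheckpoint(long checkpointId) {
        ScheduledFuture<?> handle = cancellers.remove(checkpointId);
        if (handle != null) {
            handle.cancel(false);
        }
    }

    /** Number of canceller tasks still waiting in the timer queue. */
    int pendingCancellers() {
        return timer.getQueue().size();
    }

    void shutdown() {
        timer.shutdownNow();
    }
}
```

In this sketch, omitting the handle.cancel(false) call leaves every canceller queued until its timeout elapses, which is the accumulation the issue describes. By default ScheduledThreadPoolExecutor also keeps cancelled tasks in its queue until their delay expires, hence the setRemoveOnCancelPolicy(true) call.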

This message was sent by Atlassian JIRA
