flink-issues mailing list archives

From "AT-Fieldless (Jira)" <j...@apache.org>
Subject [jira] [Commented] (FLINK-5479) Per-partition watermarks in FlinkKafkaConsumer should consider idle partitions
Date Tue, 15 Oct 2019 11:43:01 GMT

    [ https://issues.apache.org/jira/browse/FLINK-5479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16951861#comment-16951861 ]

AT-Fieldless commented on FLINK-5479:

Hi all, has this bug been fixed? I am using Flink 1.6.2, and my job can't generate correct
watermarks when only one partition has data.

> Per-partition watermarks in FlinkKafkaConsumer should consider idle partitions
> ------------------------------------------------------------------------------
>                 Key: FLINK-5479
>                 URL: https://issues.apache.org/jira/browse/FLINK-5479
>             Project: Flink
>          Issue Type: Improvement
>          Components: Connectors / Kafka
>            Reporter: Tzu-Li (Gordon) Tai
>            Priority: Major
> Reported in ML: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Kafka-topic-partition-skewness-causes-watermark-not-being-emitted-td11008.html
> Similar to idle sources blocking watermark progression in downstream
> operators (see FLINK-5017), the per-partition watermark mechanism in {{FlinkKafkaConsumer}}
> is also blocked from advancing watermarks when a partition is idle. The watermark of
> an idle partition is always {{Long.MIN_VALUE}}, so the overall min watermark across all
> partitions of a consumer subtask will never advance.
> It's not common for Kafka partitions to produce no data, but
> it would probably be good to handle this case as well. I think we should have a localized solution
> similar to FLINK-5017 for the per-partition watermarks in {{AbstractFetcher}}.
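
To illustrate the behaviour described in the issue, here is a minimal sketch in plain Java. It is not Flink's actual AbstractFetcher code: the class name, the method names, and the per-partition idle flags are hypothetical and exist only for this example. How a partition would be marked idle (for instance after a timeout) is also an assumption and is not modelled here.

```java
// Hypothetical illustration only; not taken from Flink's AbstractFetcher.
final class IdleAwareMinWatermark {

    // A partition that has not emitted any watermark yet stays at Long.MIN_VALUE.
    static final long NO_WATERMARK = Long.MIN_VALUE;

    // Plain minimum across all partitions of one consumer subtask.
    // A single partition still at Long.MIN_VALUE pins the result there.
    static long naiveMin(long[] partitionWatermarks) {
        if (partitionWatermarks.length == 0) {
            return NO_WATERMARK;
        }
        long min = Long.MAX_VALUE;
        for (long wm : partitionWatermarks) {
            min = Math.min(min, wm);
        }
        return min;
    }

    // Minimum across only the partitions not flagged as idle.
    // The idle flags are an assumption of this sketch.
    static long idleAwareMin(long[] partitionWatermarks, boolean[] idle) {
        if (partitionWatermarks.length != idle.length) {
            throw new IllegalArgumentException("arrays must have the same length");
        }
        long min = Long.MAX_VALUE;
        boolean anyActive = false;
        for (int i = 0; i < partitionWatermarks.length; i++) {
            if (!idle[i]) {
                anyActive = true;
                min = Math.min(min, partitionWatermarks[i]);
            }
        }
        // If every partition is idle there is nothing to advance on.
        return anyActive ? min : NO_WATERMARK;
    }

    public static void main(String[] args) {
        // One partition has data (watermark 1000); two have never produced a record.
        long[] watermarks = {1000L, NO_WATERMARK, NO_WATERMARK};
        boolean[] idle = {false, true, true};

        System.out.println("naive min:      " + naiveMin(watermarks));
        System.out.println("idle-aware min: " + idleAwareMin(watermarks, idle));
    }
}
```

With one active partition and two idle ones, the naive minimum stays at Long.MIN_VALUE, which matches the symptom reported above, while the idle-aware variant follows the active partition.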
