flink-issues mailing list archives

From "Piotr Nowojski (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (FLINK-7845) Netty Exception when submitting batch job repeatedly
Date Fri, 10 Nov 2017 17:13:00 GMT

    [ https://issues.apache.org/jira/browse/FLINK-7845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16247785#comment-16247785 ]

Piotr Nowojski commented on FLINK-7845:

Could you strip down your example to some minimal code showing the problem? 

This IllegalAccessError is almost certainly unrelated. It is probably caused by a dependency convergence
error between Netty 3.8 and 4.0, which should be fixed by https://issues.apache.org/jira/browse/FLINK-7013.
You can try it out with Flink 1.4-SNAPSHOT or wait until the release of 1.4.
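
A quick way to check for this kind of conflict is to print which jar each Netty class is actually
loaded from. The following is only a minimal sketch: ClasspathProbe and locate are hypothetical helper
names, and it assumes the probe runs with the same classpath as the failing job. org.jboss.netty.channel.Channel
and io.netty.channel.Channel are the Channel interfaces of Netty 3.x and Netty 4.x respectively.

```java
import java.security.CodeSource;

public class ClasspathProbe {

    /** Returns the location a class is loaded from, or a short note if it cannot be loaded. */
    static String locate(String className) {
        try {
            // initialize=false: only resolve the class, do not run its static initializers
            Class<?> c = Class.forName(className, false, ClasspathProbe.class.getClassLoader());
            CodeSource src = c.getProtectionDomain().getCodeSource();
            // JDK core classes loaded by the bootstrap loader have no code source
            return src == null ? "bootstrap or unknown source" : String.valueOf(src.getLocation());
        } catch (ClassNotFoundException e) {
            return "not on classpath";
        } catch (LinkageError e) {
            return "found but failed to link: " + e;
        }
    }

    public static void main(String[] args) {
        String[] names = {
            "org.jboss.netty.channel.Channel",                        // Netty 3.x
            "io.netty.channel.Channel",                               // Netty 4.x
            "org.apache.flink.runtime.io.network.netty.NettyMessage"  // class named in the stack trace
        };
        for (String name : names) {
            System.out.println(name + " -> " + locate(name));
        }
    }
}
```

If both Netty generations resolve to jars on the same classpath, that would be consistent with the
convergence problem described above, although it does not prove it.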

> Netty Exception when submitting batch job repeatedly
> ----------------------------------------------------
>                 Key: FLINK-7845
>                 URL: https://issues.apache.org/jira/browse/FLINK-7845
>             Project: Flink
>          Issue Type: Bug
>          Components: Core, Network
>    Affects Versions: 1.3.2
>            Reporter: Flavio Pompermaier
> We had some problems with Flink and Netty, so we wrote a small unit test to reproduce
the memory issues we have in production. We end up having to restart the Flink cluster
because memory usage keeps increasing from job to job.
> The github project is https://github.com/okkam-it/flink-memory-leak and the JUnit test
is contained in the MemoryLeakTest class (within src/main/test).
> I don't know if this is the root of our problems, but at some point, usually around the
28th loop, the job fails with the following exception (we have never actually seen it in production,
but maybe it is related to the memory issue somehow...):
> {code:java}
> Caused by: java.lang.IllegalAccessError: org/apache/flink/runtime/io/network/netty/NettyMessage
> 	at io.netty.util.internal.__matchers__.org.apache.flink.runtime.io.network.netty.NettyMessageMatcher.match(NoOpTypeParameterMatcher.java)
> 	at io.netty.channel.SimpleChannelInboundHandler.acceptInboundMessage(SimpleChannelInboundHandler.java:95)
> 	at io.netty.channel.SimpleChannelInboundHandler.channelRead(SimpleChannelInboundHandler.java:102)
> 	... 16 more
> {code}

This message was sent by Atlassian JIRA