flink-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Heller, David" <david.hel...@student.hpi.de>
Subject Re: Channel to path could not be opened, temporary files missing?
Date Thu, 09 Jul 2015 10:20:40 GMT
Thanks a lot Stephan, as far as we observed the error, it always occurred in an iteration later
than the first one (mostly iteration 2). I filed a bug report here: https://issues.apache.org/jira/browse/FLINK-2334

And thanks for pointing out the workaround. We will try forcing a sort-merge join.

> On 09 Jul 2015, at 10:37, Stephan Ewen <sewen@apache.org <mailto:sewen@apache.org>>
> Can you verify that this always happens in an iteration later than the first one? It
may be that the cached build-side files are accidentally removed too early.
> Can you file a bug report for this? Please make sure you add the stack trace and the
suspicion that cached build partitions are deleted to early.
> A workaround could be to force a sort-merge join for the time being.
> On Thu, Jul 9, 2015 at 12:17 AM, Heller, David <david.heller@student.hpi.de <mailto:david.heller@student.hpi.de>>
> Hello,
> we are implementing a text mining algorithm on Flink (0.9.0) and encountered a mysterious
IOException (see below for detailed stacktrace). Flink seems to have problems finding some
temporary file even though there is enough free space on the disk. Do you have an idea why
the file cannot be found?
> We've encountered the exception both in local and cluster execution and on MacOS as well
as on linux.
> Data size does not seem to be the reason: the error occurred on a 6.5GB dataset as well
as on a small 400MB dataset
> Our algorithm works iteratively and uses Bulk iterations. Interestingly, on one occasion
the error disappeared consistently when setting the iteration number higher (from 2 to 6).
> On another occasion, the exception appeared when adding a simple map operator at the
end (holding the identity function).
> Generally, the error is quite hard to reproduce.
> Thanks in advance for any ideas and your time!
> David
> Stacktrace:
> java.io.IOException: Channel to path '/var/folders/xx/0dd3w4jd7fbb4ytmhqxm157h0000gn/T/flink-io-f5061483-ff59-43dc-883f-79af813d5804/19a70637e025c7ee3919b30239060895.000023.channel'
could not be opened.
>         at org.apache.flink.runtime.io.disk.iomanager.AbstractFileIOChannel.<init>(AbstractFileIOChannel.java:61)
>         at org.apache.flink.runtime.io.disk.iomanager.AsynchronousFileIOChannel.<init>(AsynchronousFileIOChannel.java:86)
>         at org.apache.flink.runtime.io.disk.iomanager.AsynchronousBulkBlockReader.<init>(AsynchronousBulkBlockReader.java:46)
>         at org.apache.flink.runtime.io.disk.iomanager.AsynchronousBulkBlockReader.<init>(AsynchronousBulkBlockReader.java:39)
>         at org.apache.flink.runtime.io.disk.iomanager.IOManagerAsync.createBulkBlockChannelReader(IOManagerAsync.java:263)
>         at org.apache.flink.runtime.operators.hash.MutableHashTable.buildTableFromSpilledPartition(MutableHashTable.java:751)
>         at org.apache.flink.runtime.operators.hash.MutableHashTable.prepareNextPartition(MutableHashTable.java:508)
>         at org.apache.flink.runtime.operators.hash.ReOpenableMutableHashTable.prepareNextPartition(ReOpenableMutableHashTable.java:167)
>         at org.apache.flink.runtime.operators.hash.MutableHashTable.nextRecord(MutableHashTable.java:544)
>         at org.apache.flink.runtime.operators.hash.NonReusingBuildFirstHashMatchIterator.callWithNextKey(NonReusingBuildFirstHashMatchIterator.java:104)
>         at org.apache.flink.runtime.operators.AbstractCachedBuildSideMatchDriver.run(AbstractCachedBuildSideMatchDriver.java:155)
>         at org.apache.flink.runtime.operators.RegularPactTask.run(RegularPactTask.java:496)
>         at org.apache.flink.runtime.iterative.task.AbstractIterativePactTask.run(AbstractIterativePactTask.java:139)
>         at org.apache.flink.runtime.iterative.task.IterationIntermediatePactTask.run(IterationIntermediatePactTask.java:92)
>         at org.apache.flink.runtime.operators.RegularPactTask.invoke(RegularPactTask.java:362)
>         at org.apache.flink.runtime.taskmanager.Task.run(Task.java:559)
>         at java.lang.Thread.run(Thread.java:745)
> Caused by: java.io.FileNotFoundException: /var/folders/xx/0dd3w4jd7fbb4ytmhqxm157h0000gn/T/flink-io-f5061483-ff59-43dc-883f-79af813d5804/19a70637e025c7ee3919b30239060895.000023.channel
(No such file or directory)
>         at java.io.RandomAccessFile.open0(Native Method)
>         at java.io.RandomAccessFile.open(RandomAccessFile.java:316)
>         at java.io.RandomAccessFile.<init>(RandomAccessFile.java:243)
>         at java.io.RandomAccessFile.<init>(RandomAccessFile.java:124)
>         at org.apache.flink.runtime.io.disk.iomanager.AbstractFileIOChannel.<init>(AbstractFileIOChannel.java:57)
>         ... 16 more

View raw message