tez-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Nathan Roberts (JIRA)" <j...@apache.org>
Subject [jira] [Created] (TEZ-3440) Shuffling to memory can get out-of-sync when fetching multiple compressed map outputs
Date Wed, 21 Sep 2016 17:52:21 GMT
Nathan Roberts created TEZ-3440:
-----------------------------------

             Summary: Shuffling to memory can get out-of-sync when fetching multiple compressed
map outputs
                 Key: TEZ-3440
                 URL: https://issues.apache.org/jira/browse/TEZ-3440
             Project: Apache Tez
          Issue Type: Bug
            Reporter: Nathan Roberts


Haven't verified yet but certainly looks like tez needs same fix as MAPREDUCE-5308 in IFile.

Specifically saw this because downstream tasks were reporting enough fetch failures that long-running
upstream tasks had to be re-run, which makes job run for much longer than it needs.

Usually shows itself as an "Invalid map id" error on a multi-map fetch on part 2-n (i.e. never
the first one).
 




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message