hadoop-mapreduce-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jason Lowe (JIRA)" <j...@apache.org>
Subject [jira] [Created] (MAPREDUCE-5042) Reducer unable to fetch for a map task that was recovered
Date Fri, 01 Mar 2013 23:11:13 GMT
Jason Lowe created MAPREDUCE-5042:

             Summary: Reducer unable to fetch for a map task that was recovered
                 Key: MAPREDUCE-5042
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5042
             Project: Hadoop Map/Reduce
          Issue Type: Bug
          Components: mr-am, security
    Affects Versions: 0.23.7, 2.0.4-beta
            Reporter: Jason Lowe
            Priority: Blocker

If an application attempt fails and is relaunched the AM will try to recover previously completed
tasks.  If a reducer needs to fetch the output of a map task attempt that was recovered then
it will fail with a 401 error like this:

java.io.IOException: Server returned HTTP response code: 401 for URL: http://xx:xx/mapOutput?job=job_1361569180491_21845&reduce=0&map=attempt_1361569180491_21845_m_000016_0
	at sun.net.www.protocol.http.HttpURLConnection.getInputStream(HttpURLConnection.java:1615)
	at org.apache.hadoop.mapreduce.task.reduce.Fetcher.copyFromHost(Fetcher.java:231)
	at org.apache.hadoop.mapreduce.task.reduce.Fetcher.run(Fetcher.java:156)

Looking at the corresponding NM's logs, we see the shuffle failed due to "Verification of
the hashReply failed".

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

View raw message