hadoop-mapreduce-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Mingzhe Hao (JIRA)" <j...@apache.org>
Subject [jira] [Created] (MAPREDUCE-5863) Killing task attempts while speculation is enabled can cause the job to fail
Date Sun, 27 Apr 2014 04:23:14 GMT
Mingzhe Hao created MAPREDUCE-5863:
--------------------------------------

             Summary: Killing task attempts while speculation is enabled can cause the job
to fail
                 Key: MAPREDUCE-5863
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5863
             Project: Hadoop Map/Reduce
          Issue Type: Bug
    Affects Versions: 2.4.0
            Reporter: Mingzhe Hao


There could be a race condition when a T_ADD_SPEC_ATTEMPT is being fired, the task gets succeeded
and then killed by the client. In that case, the task state changes from SUCCEEDED to SCHEDULED,
and then task gets a T_ADD_SPEC_ATTEMPT event, which is invalid for SCHEDULED state.

1. Task is running.
2. Speculator fires a T_ADD_SPEC_ATTEMPT
3. Before task receives T_ADD_SPEC_ATTEMPT, it succeeds
4. Succeeded TA receives TA_KILL from client. Now the task is at SCHEDULED state.
5. Task receives T_ADD_SPEC_ATTEMPT, since this is an unexpected event, the job fails.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message