hadoop-yarn-dev mailing list archives

From "Eric Payne (JIRA)" <j...@apache.org>
Subject [jira] [Created] (YARN-4217) Failed AM attempt retries on same failed host
Date Thu, 01 Oct 2015 16:04:28 GMT
Eric Payne created YARN-4217:

             Summary: Failed AM attempt retries on same failed host
                 Key: YARN-4217
                 URL: https://issues.apache.org/jira/browse/YARN-4217
             Project: Hadoop YARN
          Issue Type: Improvement
          Components: applications
    Affects Versions: 2.7.1
            Reporter: Eric Payne

This happens when the cluster is maxed out. One node is going bad, so everything that runs on
it fails, and the node is therefore never busy. Because the rest of the cluster is full, whenever
the RM looks for a node with available resources it always finds the failing node: nothing can
run on it, so its resources are always free. As a result, a failed AM attempt is retried on the
same bad host.
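
The failure mode can be illustrated with a small toy model. This is only a sketch under
simplifying assumptions and is not YARN scheduler code: the Node class, pick_node, and
run_am_attempts are hypothetical names, and the "first node with free capacity" rule stands in
for whatever placement logic the RM actually applies.

```python
from dataclasses import dataclass


@dataclass
class Node:
    """Hypothetical stand-in for a cluster node; not a YARN class."""
    name: str
    free_slots: int
    healthy: bool = True


def pick_node(nodes):
    """Return the first node with free capacity, or None (assumed placement rule)."""
    for node in nodes:
        if node.free_slots > 0:
            return node
    return None


def run_am_attempts(nodes, max_attempts):
    """Launch AM attempts until one succeeds; return the host chosen for each attempt."""
    hosts = []
    for _ in range(max_attempts):
        node = pick_node(nodes)
        if node is None:
            break  # no capacity anywhere in the cluster
        hosts.append(node.name)
        if node.healthy:
            node.free_slots -= 1  # the AM keeps running and occupies the slot
            break
        # On the bad node the attempt fails immediately, so the slot stays free.
    return hosts


# Two healthy nodes are full; the failing node is the only one with free capacity.
cluster = [Node("node1", 0), Node("node2", 0), Node("bad-node", 1, healthy=False)]
attempt_hosts = run_am_attempts(cluster, max_attempts=3)
print(attempt_hosts)  # ['bad-node', 'bad-node', 'bad-node']
```

In this model every attempt lands on the failing node, because it is the only node that ever
reports free resources.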

