hadoop-yarn-dev mailing list archives

From "BELUGA BEHR (JIRA)" <j...@apache.org>
Subject [jira] [Created] (YARN-7368) Yarn Work-Preserving Better Handling Failed Disk
Date Thu, 19 Oct 2017 19:55:00 GMT
BELUGA BEHR created YARN-7368:

             Summary: Yarn Work-Preserving Better Handling Failed Disk
                 Key: YARN-7368
                 URL: https://issues.apache.org/jira/browse/YARN-7368
             Project: Hadoop YARN
          Issue Type: Improvement
          Components: nodemanager, yarn
    Affects Versions: 2.8.1
            Reporter: BELUGA BEHR

If the drive that hosts {{yarn.nodemanager.recovery.dir}} has failed, the NodeManager does not start at all.
Please change this so that when the directory cannot be created or accessed, the NM skips
recovery and otherwise continues to operate as normal.
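
The requested fallback could look roughly like the sketch below. It is not the NodeManager's actual start-up code: the class RecoveryDirProbe and its methods isUsable and shouldEnableRecovery are hypothetical names, and only java.nio.file calls are used. The idea is to probe the directory first and, on failure, log a warning and disable recovery rather than abort.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper, not part of Hadoop.
public class RecoveryDirProbe {

    /** Returns true if the directory exists (or can be created) and accepts writes. */
    public static boolean isUsable(Path dir) {
        try {
            Files.createDirectories(dir);
            // Create and delete a probe file to catch read-only or failing disks.
            Path probe = Files.createTempFile(dir, "probe", ".tmp");
            Files.delete(probe);
            return true;
        } catch (IOException | SecurityException e) {
            return false;
        }
    }

    /** Decides whether recovery should be active for this start-up. */
    public static boolean shouldEnableRecovery(boolean configured, Path dir) {
        if (!configured) {
            return false;
        }
        if (isUsable(dir)) {
            return true;
        }
        // Degrade gracefully: warn and continue without recovery.
        System.err.println("Recovery dir " + dir + " is unusable; starting without recovery");
        return false;
    }

    public static void main(String[] args) throws IOException {
        Path good = Files.createTempDirectory("nm-recovery");
        System.out.println(shouldEnableRecovery(true, good));

        // A directory cannot be created underneath a regular file,
        // which stands in for a broken disk here.
        Path file = Files.createTempFile("not-a-dir", ".tmp");
        System.out.println(shouldEnableRecovery(true, file.resolve("recovery")));
    }
}
```

The trade-off is that containers running before the restart would not be recovered on that node, but the node stays available for new work.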

It may also be useful to allow multiple recovery directories, as is done for the YARN logging directories,
so that a single drive failure does not lose all of the recovery data.
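
One way to picture the multi-directory idea is the sketch below. It assumes the property would accept a comma-separated list, as {{yarn.nodemanager.log-dirs}} does; that format is an assumption of this example, not something the property supports today. The class RecoveryDirSelector and method usableDirs are hypothetical names.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Hadoop.
public class RecoveryDirSelector {

    /** Returns the configured directories that can be created and written to, in order. */
    public static List<Path> usableDirs(String commaSeparated) {
        List<Path> usable = new ArrayList<>();
        for (String entry : commaSeparated.split(",")) {
            String trimmed = entry.trim();
            if (trimmed.isEmpty()) {
                continue;
            }
            try {
                Path dir = Paths.get(trimmed);
                Files.createDirectories(dir);
                if (Files.isWritable(dir)) {
                    usable.add(dir);
                }
            } catch (IOException | RuntimeException e) {
                // Treat this entry as a failed disk and try the next one.
            }
        }
        return usable;
    }

    public static void main(String[] args) throws IOException {
        Path ok = Files.createTempDirectory("recovery-a");
        Path file = Files.createTempFile("broken", ".tmp");
        // The first entry cannot be created because its parent is a regular file.
        String configured = file.resolve("recovery-b") + "," + ok;
        System.out.println(usableDirs(configured));
    }
}
```

An empty result would correspond to the case above where recovery is skipped entirely. How state would be replicated or split across the surviving directories is left open here.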

