hadoop-yarn-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Morty Zhong (JIRA)" <j...@apache.org>
Subject [jira] [Created] (YARN-9380) FederationInterceptor get Containers from RM may return empty list when RM/NM restart
Date Tue, 12 Mar 2019 04:06:00 GMT
Morty Zhong created YARN-9380:
---------------------------------

             Summary: FederationInterceptor get Containers from RM may return empty list when
RM/NM restart
                 Key: YARN-9380
                 URL: https://issues.apache.org/jira/browse/YARN-9380
             Project: Hadoop YARN
          Issue Type: Bug
          Components: federation
            Reporter: Morty Zhong


FederationInterceptor will recover the map of containerId to subClusterId(field named

containerIdToSubClusterIdMap) by getting containers from RMs(home and secondary RM) when
recover is enabled.However, this may fail in follow condition(RM NM both restart):
 # RM is restart(recover is enabled),recover tokens, apps, but no containers(waiting NM reporting
containers when rsync)
 # RM waiting NM rsync, but before NM rsync, NM is restart. 
 # before NM rsync to RM, NM recover itself, and FederationInterceptor pull containers from
RM(RM has no containers in this moment) and will return empty list

maybe the containerId to subClusterId map store in NMStateStore can solve this?



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-dev-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-dev-help@hadoop.apache.org


Mime
View raw message