giraph-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Roman Shaposhnik (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (GIRAPH-747) BspServiceMaster finishes ZooKeeper cleanup without waiting for all workers to complete
Date Thu, 22 May 2014 00:15:39 GMT

    [ https://issues.apache.org/jira/browse/GIRAPH-747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14005435#comment-14005435
] 

Roman Shaposhnik commented on GIRAPH-747:
-----------------------------------------

[~initialcontext] Any chance you can pick this up for 1.1.0? You're our last hope ;-)

> BspServiceMaster finishes ZooKeeper cleanup without waiting for all workers to complete
> ---------------------------------------------------------------------------------------
>
>                 Key: GIRAPH-747
>                 URL: https://issues.apache.org/jira/browse/GIRAPH-747
>             Project: Giraph
>          Issue Type: Bug
>    Affects Versions: 1.0.0
>            Reporter: Chuan Lei
>            Assignee: Chuan Lei
>             Fix For: 1.1.0
>
>         Attachments: GIRAPH-747.v1.patch
>
>
> In BspServiceMaster, the function cleanUpZooKeeper should wait for the number of workers
and masters to complete. However, it appears that maxTasks only takes workers into consideration.
Consequently, the worker straggler may fail to report to the ZooKeeper due to the path gets
removed too early. This will cause No lease on path File does not exist exception at runtime.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message