uima-dev mailing list archives

From "Jerry Cwiklik (JIRA)" <...@uima.apache.org>
Subject [jira] [Closed] (UIMA-3737) DUCC Agent should not try to remove cgroup when kill -9 fails to stop a process
Date Fri, 11 Apr 2014 19:57:16 GMT

     [ https://issues.apache.org/jira/browse/UIMA-3737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jerry Cwiklik closed UIMA-3737.
-------------------------------

    Resolution: Fixed

Don't remove a cgroup if it contains zombie processes, as the removal attempt may lead to a thread hang.

> DUCC Agent should not try to remove cgroup when kill -9 fails to stop a process
> -------------------------------------------------------------------------------
>
>                 Key: UIMA-3737
>                 URL: https://issues.apache.org/jira/browse/UIMA-3737
>             Project: UIMA
>          Issue Type: Bug
>          Components: DUCC
>    Affects Versions: 1.0-Ducc
>            Reporter: Jerry Cwiklik
>            Assignee: Jerry Cwiklik
>
> When an agent starts up, it tries to clean up the node. It first waits for the Orchestrator
> (OR) publications and then checks the node for running processes. It then reconciles what is
> running against what should be running (using OR state). Any process not in the OR state
> should be evicted with kill -9 and its cgroup removed.
> If there is a defunct process on a node, it cannot be killed. Such a process still has an
> associated cgroup, which cannot be removed. It's been observed that the agent's attempt to
> remove the cgroup of a defunct process causes the CPU utilization to spike for a couple of
> minutes.
> The agent should detect whether a process is defunct and, if so, not try to remove its cgroup.
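
For illustration only, here is a minimal sketch of one way a defunct process could be detected on Linux before attempting cgroup removal. It is not the actual DUCC Agent fix. It assumes a Linux /proc filesystem, where the state field in /proc/<pid>/stat follows the parenthesized command name and is 'Z' for a zombie. The class name DefunctCheck and its methods are hypothetical.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical helper, not part of DUCC: reports whether a process is defunct.
final class DefunctCheck {

    // Extracts the state character from one line of /proc/<pid>/stat.
    // The line looks like "1234 (java) Z 1 ...". The command name may itself
    // contain spaces or parentheses, so we locate the LAST ')' and read the
    // character two positions after it.
    static char stateOf(String statLine) {
        int close = statLine.lastIndexOf(')');
        if (close < 0 || close + 2 >= statLine.length()) {
            throw new IllegalArgumentException("unexpected stat format: " + statLine);
        }
        return statLine.charAt(close + 2);
    }

    // True when the stat line describes a zombie (defunct) process.
    static boolean isDefunct(String statLine) {
        return stateOf(statLine) == 'Z';
    }

    // Reads /proc/<pid>/stat for a live pid (Linux only, an assumption here).
    static boolean isDefunct(long pid) throws IOException {
        Path stat = Paths.get("/proc", Long.toString(pid), "stat");
        String line = new String(Files.readAllBytes(stat), StandardCharsets.UTF_8);
        return isDefunct(line);
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java DefunctCheck <pid>");
            return;
        }
        long pid = Long.parseLong(args[0]);
        if (isDefunct(pid)) {
            System.out.println("pid " + pid + " is defunct; skip cgroup removal");
        } else {
            System.out.println("pid " + pid + " is not defunct");
        }
    }
}
```

A caller would run such a check after kill -9 and skip the cgroup removal for any pid it reports as defunct.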



--
This message was sent by Atlassian JIRA
(v6.2#6252)
