helix-commits mailing list archives

From "Hunter L (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HELIX-783) TASK: Fix JobQueue's job state-related bug
Date Thu, 01 Nov 2018 23:56:00 GMT
Hunter L created HELIX-783:

             Summary: TASK: Fix JobQueue's job state-related bug
                 Key: HELIX-783
                 URL: https://issues.apache.org/jira/browse/HELIX-783
             Project: Apache Helix
          Issue Type: Improvement
            Reporter: Hunter L
            Assignee: Hunter L

The bug was observed in TestTaskRebalancerStopResume:stopAndResumeNamedQueue(), which was
unstable. For JobQueues with multiple jobs, the second job could get marked as IN_PROGRESS
even though the first job had not yet completed or failed, especially when the queue was
being stopped and resumed. The cause was a bug in getIncompleteJobCount(): it did not count
jobs in STOPPING state. This has been fixed, and a check was added right before JobDispatcher
marks a job as STOPPED so that a job whose state is NOT_STARTED is not marked STOPPED.

Changelist:
1. Fix getIncompleteJobCount()
2. Add a check so that we don't mark NOT_STARTED jobs as STOPPED
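
The two changes described above can be illustrated with a minimal, self-contained sketch.
It is not the actual Helix code: the JobState enum, the JobQueueSketch class, and both method
names are hypothetical stand-ins, and the exact set of states treated as "incomplete" is an
assumption (the report states only that STOPPING was missing from the count).

```java
import java.util.List;

// Hypothetical stand-in for the job states named in the report; not a Helix type.
enum JobState { NOT_STARTED, IN_PROGRESS, STOPPING, STOPPED, COMPLETED, FAILED }

// Hypothetical illustration of the two fixes; not the Helix implementation.
final class JobQueueSketch {

    private JobQueueSketch() {}

    // Fix 1: count jobs that have started but not finished.
    // Assumption: IN_PROGRESS, STOPPING and STOPPED all count as incomplete.
    // Leaving STOPPING out would undercount, so a later job could be scheduled
    // while an earlier one is still winding down.
    static int incompleteJobCount(List<JobState> jobStates) {
        int count = 0;
        for (JobState state : jobStates) {
            switch (state) {
                case IN_PROGRESS:
                case STOPPING:
                case STOPPED:
                    count++;
                    break;
                default:
                    break;
            }
        }
        return count;
    }

    // Fix 2: when the queue is stopped, a job that never started keeps its
    // NOT_STARTED state instead of being marked STOPPED.
    static JobState stateAfterStop(JobState current) {
        if (current == JobState.NOT_STARTED) {
            return current;
        }
        return JobState.STOPPED;
    }
}
```

With these definitions, a queue whose first job is STOPPING and whose second job is
NOT_STARTED reports one incomplete job, and stopping the queue leaves the second job
NOT_STARTED.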

This message was sent by Atlassian JIRA