airavata-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Eroma (JIRA)" <j...@apache.org>
Subject [jira] [Created] (AIRAVATA-2944) Job failures due to wall-time exceed should display/send the failure reason to users
Date Tue, 13 Nov 2018 20:36:01 GMT
Eroma created AIRAVATA-2944:
-------------------------------

             Summary: Job failures due to wall-time exceed should display/send the failure
reason to users
                 Key: AIRAVATA-2944
                 URL: https://issues.apache.org/jira/browse/AIRAVATA-2944
             Project: Airavata
          Issue Type: Improvement
          Components: helix implementation
    Affects Versions: 0.18
         Environment: https://staging.ultrascan.scigap.org
            Reporter: Eroma
            Assignee: Dimuthu Upeksha
             Fix For: 0.18


When jobs fail due to wall time exceed the STDERR has message 'slurmstepd: error: *** JOB
2305055 ON c413-043 CANCELLED AT 2018-10-29T02:46:27 DUE TO TIME LIMIT ***' 

and

the job emails comes with subject '....Run time 13:00:11, TIMEOUT, ExitCode 0'

The email subject can be processed and display/send the TIMOUT as the reson for job FAIL.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message