hadoop-yarn-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Adam Antal (Jira)" <j...@apache.org>
Subject [jira] [Created] (YARN-10406) YARN log processor
Date Mon, 24 Aug 2020 14:21:00 GMT
Adam Antal created YARN-10406:

             Summary: YARN log processor
                 Key: YARN-10406
                 URL: https://issues.apache.org/jira/browse/YARN-10406
             Project: Hadoop YARN
          Issue Type: New Feature
          Components: yarn
            Reporter: Adam Antal
            Assignee: Hudáky Márton Gyula

YARN currently does not have any utility that would enable cluster administrators to re-play
actions in a Hadoop YARN cluster in an offline fashion. 

HDFS has the OIV/OEV which does not require a running cluster to look and modify the filesystem.
A corresponding tool would be very helpful in the context of YARN.

Since ATS is not widespread (is not available for older clusters) and there isn't a single
file or entity that would collect all the application/container etc. related information,
we thought our best option to parse and process the output of the YARN daemon log files and
reconstruct the history of the cluster from that. We designed and implemented a CLI based
solution that after parsing the log file enables users to query app/container related information
(listing, filtering by certain properties) and search for common errors like CE failures/error
codes, AM preemption or stack traces. The tool can be integrated into the YARN project as
a sub-project.

This message was sent by Atlassian Jira

To unsubscribe, e-mail: yarn-dev-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-dev-help@hadoop.apache.org

View raw message