Rajesh Balamohan created TEZ-4139:
-------------------------------------
Summary: Tez should consider node information for computing failure fraction
Key: TEZ-4139
URL: https://issues.apache.org/jira/browse/TEZ-4139
Project: Apache Tez
Issue Type: Improvement
Reporter: Rajesh Balamohan
When lots of downstream attempts fail to pull the information from source task, source task
is marked as failed and it is retried. Currently failure fraction is handled by looking at
unique task attempts from downstream. However, it should consider taking into account node
information for computing "failureFraction".
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
|