Date: Tue, 25 Aug 2020 17:01:00 +0000 (UTC)
From: "Prasanth Jayachandran (Jira)"
To: issues@hive.apache.org
Reply-To: dev@hive.apache.org
Mailing-List: contact issues-help@hive.apache.org; run by ezmlm
Subject: [jira] [Updated] (HIVE-24068) Add re-execution plugin for handling DAG submission and unmanaged AM failures

     [ https://issues.apache.org/jira/browse/HIVE-24068?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Prasanth Jayachandran updated HIVE-24068:
-----------------------------------------
    Summary: Add re-execution plugin for handling DAG submission and unmanaged AM failures  (was: Add re-execution plugin for handling DAG submission failures)

> Add re-execution plugin for handling DAG submission and unmanaged AM failures
> -----------------------------------------------------------------------------
>
>                 Key: HIVE-24068
>                 URL: https://issues.apache.org/jira/browse/HIVE-24068
>             Project: Hive
>          Issue Type: Bug
>   Affects Versions: 4.0.0
>            Reporter: Prasanth Jayachandran
>            Assignee: Prasanth Jayachandran
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 20m
>  Remaining Estimate: 0h
>
> DAG submission failures can also happen in environments where the AM container has died, causing DNS issues. DAG submissions are safe to retry because the DAG has not started executing yet. getSession and submitDAG each have their own retries, but some submitDAG failures must also retry getSession because the AM could be unreachable. This can be handled in a re-execution plugin.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
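
The retry shape described in the issue (re-acquire the session when a submission fails, instead of retrying the submission against the same unreachable AM) can be sketched roughly as below. This is only an illustration and not the Hive or Tez implementation: the class DagResubmitSketch, the Session interface, and the method submitWithSessionRetry are hypothetical names made up for this example, and the sketch retries on any exception, whereas a real plugin would presumably retry only failures known to occur before the DAG starts executing.

```java
import java.util.concurrent.Callable;

public class DagResubmitSketch {

    /** Hypothetical stand-in for a session bound to one application master (AM). */
    public interface Session {
        String submitDag(String dag) throws Exception;
    }

    /**
     * Treats "get session, then submit" as one retryable unit. After a failed
     * attempt the old session is discarded and a new one is requested, so a
     * dead or unreachable AM is not reused for the next attempt.
     */
    public static String submitWithSessionRetry(
            Callable<Session> sessionSource, String dag, int maxAttempts) throws Exception {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be at least 1");
        }
        Exception last = null;
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            try {
                Session session = sessionSource.call(); // fresh session on every attempt
                return session.submitDag(dag);          // DAG has not run yet, so a retry is safe
            } catch (Exception e) {
                last = e; // remember the failure and try again with a new session
            }
        }
        throw last; // every attempt failed; surface the most recent error
    }

    public static void main(String[] args) throws Exception {
        int[] sessionsOpened = {0};
        // Simulated session source: the first session points at a dead AM.
        Callable<Session> source = () -> {
            int id = ++sessionsOpened[0];
            return dag -> {
                if (id == 1) {
                    throw new java.io.IOException("AM unreachable");
                }
                return dag + " accepted by session " + id;
            };
        };
        System.out.println(submitWithSessionRetry(source, "dag-1", 3));
    }
}
```

In the simulated run, the first session fails on submit, the second session is opened, and the submission succeeds there. Retrying submitDag alone against the first session would keep failing, which is the gap the issue describes.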