hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Prasanth Jayachandran (JIRA)" <>
Subject [jira] [Commented] (HIVE-13226) Improve tez print summary to print query execution breakdown
Date Tue, 08 Mar 2016 08:43:40 GMT


Prasanth Jayachandran commented on HIVE-13226:

The problem is from user perspective, printing methods is not really helpful. "Analyze" for
example have no context. Its also combination of semantic analyze, logical optimization and
task compilation. Also it misses some steps in-between which will be useful to find where
time is spent. For example, time between TezBuildDag and TezSubmitToRunningDag is not accounted
which is the time taken for resource localization, session restart etc. 

"DAG Submit to DAG Accept" -> "DAG Submit to Accept".. is that any better?

> Improve tez print summary to print query execution breakdown
> ------------------------------------------------------------
>                 Key: HIVE-13226
>                 URL:
>             Project: Hive
>          Issue Type: Improvement
>    Affects Versions: 2.1.0
>            Reporter: Prasanth Jayachandran
>            Assignee: Prasanth Jayachandran
>         Attachments: HIVE-13226.1.patch, sampleoutput.png
> When tez print summary is enabled, methods summary is printed which are difficult to
correlate with the actual execution time. We can improve that to print  the execution times
in the sequence of operations that happens behind the scenes.
> Instead of printing the methods name it will be useful to print something like below
> 1) Query Compilation time
> 2) Query Submit to DAG Submit time
> 3) DAG Submit to DAG Accept time
> 4) DAG Accept to DAG Start time
> 5) DAG Start to DAG End time
> With this it will be easier to find out where the actual time is spent. 

This message was sent by Atlassian JIRA

View raw message