hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Prasanth Jayachandran (JIRA)" <>
Subject [jira] [Updated] (HIVE-13226) Improve tez print summary to print query execution breakdown
Date Sat, 12 Mar 2016 20:34:03 GMT


Prasanth Jayachandran updated HIVE-13226:
    Attachment: HIVE-13226.3.patch

Contains fix for ROWGROUPS counter. Also, in the previous patch Prepare time corresponds to
total prepare time of which compilation was part of it. Now the time breakdown looks like

Compile -> From start of driver run to compilation end
Prepare -> From compilation end to start of DAG submit
Submit -> From start of DAG submit to start of DAG accept
Start -> From start of DAG accept to start of run
Finish -> From start of DAG run to finish

Remaining time will represent the time spent fetching the results.

> Improve tez print summary to print query execution breakdown
> ------------------------------------------------------------
>                 Key: HIVE-13226
>                 URL:
>             Project: Hive
>          Issue Type: Improvement
>    Affects Versions: 2.1.0
>            Reporter: Prasanth Jayachandran
>            Assignee: Prasanth Jayachandran
>         Attachments: HIVE-13226.1.patch, HIVE-13226.2.patch, HIVE-13226.3.patch, sampleoutput.png
> When tez print summary is enabled, methods summary is printed which are difficult to
correlate with the actual execution time. We can improve that to print  the execution times
in the sequence of operations that happens behind the scenes.
> Instead of printing the methods name it will be useful to print something like below
> 1) Query Compilation time
> 2) Query Submit to DAG Submit time
> 3) DAG Submit to DAG Accept time
> 4) DAG Accept to DAG Start time
> 5) DAG Start to DAG End time
> With this it will be easier to find out where the actual time is spent. 

This message was sent by Atlassian JIRA

View raw message