hive-issues mailing list archives

From "Vineet Garg (JIRA)" <>
Subject [jira] [Commented] (HIVE-22045) HIVE-21711 introduced regression in data load
Date Tue, 30 Jul 2019 18:34:00 GMT


Vineet Garg commented on HIVE-22045:

bq. Is that going to be another perf regression?
I suppose not. Because of {{isBlobStorage}}, LOAD data regressed significantly, but it helped CTAS/CM
statements. With this patch, behavior for CTAS/CM should be the same as with {{isBlobStorage}}.

bq. The FileSinkDesc:isCTas* method needs more docs in the declaration.
Uploaded a patch that should address this.

> HIVE-21711 introduced regression in data load
> ---------------------------------------------
>                 Key: HIVE-22045
>                 URL:
>             Project: Hive
>          Issue Type: Bug
>    Affects Versions: 4.0.0
>            Reporter: Vineet Garg
>            Assignee: Vineet Garg
>            Priority: Major
>         Attachments: HIVE-22045.1.patch, HIVE-22045.2.patch, HIVE-22045.3.patch, HIVE-22045.4.patch
> A better fix for HIVE-21711 is to specialize the handling of CTAS/Create MV statements
> so that they avoid the intermediate rename operation, while INSERT etc. statements keep doing
> the intermediate rename, since otherwise the final move by file operation is significantly
> slow for such statements.
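
To illustrate the split described above, here is a minimal sketch of a per-statement policy. All names in it ({{RenamePolicy}}, {{StatementKind}}, {{skipsRename}}) are hypothetical and do not come from the Hive code base or from the attached patches; it only shows the shape of the decision, assuming the statement type is known when the file sink is configured.

```java
// Hypothetical sketch only: these names are illustrative, not Hive APIs.
public class RenamePolicy {

    // Statement kinds named in the issue description (assumed enum).
    enum StatementKind { CTAS, CREATE_MATERIALIZED_VIEW, INSERT }

    // CTAS and Create MV skip the rename step; every other statement keeps it,
    // because its final move would otherwise be slow.
    static boolean skipsRename(StatementKind kind) {
        switch (kind) {
            case CTAS:
            case CREATE_MATERIALIZED_VIEW:
                return true;
            default:
                return false;
        }
    }

    public static void main(String[] args) {
        for (StatementKind kind : StatementKind.values()) {
            System.out.println(kind + " skips rename: " + skipsRename(kind));
        }
    }
}
```

Keeping the rule in a single predicate means the special-casing stays limited to the two statement types named in the description, and any other statement falls through to the existing behavior.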

