hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Matt McCline (JIRA)" <j...@apache.org>
Subject [jira] [Assigned] (HIVE-20381) Vectorization: Reduce dedup of GroupBy + PTF turns off vectorization
Date Tue, 04 Sep 2018 18:53:00 GMT

     [ https://issues.apache.org/jira/browse/HIVE-20381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Matt McCline reassigned HIVE-20381:
-----------------------------------

    Assignee:     (was: Matt McCline)

> Vectorization: Reduce dedup of GroupBy + PTF turns off vectorization
> --------------------------------------------------------------------
>
>                 Key: HIVE-20381
>                 URL: https://issues.apache.org/jira/browse/HIVE-20381
>             Project: Hive
>          Issue Type: Bug
>          Components: Vectorization
>    Affects Versions: 3.1.0, 4.0.0
>            Reporter: Gopal V
>            Priority: Major
>
> One of the PTF Reducers in Query51 is not vectorized because there's a reduce deduplication
which combines a group-by and a windowing shuffle.
> {code}
> |         Reducer 8                                  |
> |             Execution mode: llap                   |
> |             Reduce Vectorization:                  |
> |                 enabled: true                      |
> |                 enableConditionsMet: hive.vectorized.execution.reduce.enabled IS true,
hive.execution.engine tez IN [tez, spark] IS true |
> |                 notVectorizedReason: PTF operator: Only PTF directly under reduce-shuffle
is supported |
> |                 vectorized: false   
> {code}
> It vectorizes all PTF vertices (after HIVE-20367), with {{ set hive.optimize.reducededuplication=false;}}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message