hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF GitHub Bot (Jira)" <j...@apache.org>
Subject [jira] [Work logged] (HIVE-22538) RS deduplication does not always enforce hive.optimize.reducededuplication.min.reducer
Date Mon, 27 Jan 2020 04:38:00 GMT

     [ https://issues.apache.org/jira/browse/HIVE-22538?focusedWorklogId=377469&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-377469
]

ASF GitHub Bot logged work on HIVE-22538:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 27/Jan/20 04:37
            Start Date: 27/Jan/20 04:37
    Worklog Time Spent: 10m 
      Work Description: jcamachor commented on pull request #877: HIVE-22538: RS deduplication
does not always enforce hive.optimize.reducededuplication.min.reducer
URL: https://github.com/apache/hive/pull/877#discussion_r371061414
 
 

 ##########
 File path: ql/src/test/results/clientpositive/acid_table_directories_test.q.out
 ##########
 @@ -154,6 +154,7 @@ POSTHOOK: Input: default@acidparttbl@p=200
 ### ACID BASE DIR ###
 ### ACID BASE DIR ###
 ### ACID BASE DIR ###
+### ACID BASE DIR ###
 
 Review comment:
   Is this expected?
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 377469)
    Time Spent: 0.5h  (was: 20m)

> RS deduplication does not always enforce hive.optimize.reducededuplication.min.reducer
> --------------------------------------------------------------------------------------
>
>                 Key: HIVE-22538
>                 URL: https://issues.apache.org/jira/browse/HIVE-22538
>             Project: Hive
>          Issue Type: Bug
>          Components: Physical Optimizer
>            Reporter: Jesus Camacho Rodriguez
>            Assignee: Krisztian Kasa
>            Priority: Major
>              Labels: pull-request-available
>         Attachments: HIVE-22538.2.patch, HIVE-22538.3.patch, HIVE-22538.4.patch, HIVE-22538.5.patch,
HIVE-22538.6.patch, HIVE-22538.6.patch, HIVE-22538.patch
>
>          Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> For transactional tables, that property might be overriden to 1, which can lead to merging
final aggregation into a single stage (hence leading to performance degradation). For instance,
when autogather column stats is enabled, this can happen for the following query:
> {code}
> set hive.support.concurrency=true;
> set hive.txn.manager=org.apache.hadoop.hive.ql.lockmgr.DbTxnManager;
> EXPLAIN
> CREATE TABLE x STORED AS ORC TBLPROPERTIES('transactional'='true') AS
> SELECT * FROM SRC x CLUSTER BY x.key;
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Mime
View raw message