hive-issues mailing list archives

From "ASF GitHub Bot (Jira)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-15848) count or sum distinct incorrect when hive.optimize.reducededuplication set to true
Date Tue, 16 Jun 2020 16:58:02 GMT

     [ https://issues.apache.org/jira/browse/HIVE-15848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ASF GitHub Bot updated HIVE-15848:
----------------------------------
    Labels: pull-request-available  (was: )

> count or sum distinct incorrect when hive.optimize.reducededuplication set to true
> ----------------------------------------------------------------------------------
>
>                 Key: HIVE-15848
>                 URL: https://issues.apache.org/jira/browse/HIVE-15848
>             Project: Hive
>          Issue Type: Bug
>    Affects Versions: 0.13.0
>            Reporter: Biao Wu
>            Assignee: Zoltan Haindrich
>            Priority: Critical
>              Labels: pull-request-available
>             Fix For: 2.3.0
>
>         Attachments: HIVE-15848.1.patch, HIVE-15848.2.patch
>
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> Test Table:
> {code:sql}
> create table count_distinct_test(id int,key int,name int);
> {code}
> Data:
> ||id||key||name||
> |1|1|2|
> |1|2|3|
> |1|3|2|
> |1|4|2|
> |1|5|3|
> Test SQL1:
> {code:sql}
> select id,count(Distinct key),count(Distinct name)
> from (select id,key,name from count_distinct_test group by id,key,name)m
> group by id;
> {code}
> Result:
> |1|5|4|
> Expected:
> |1|5|2|
> Test SQL2:
> {code:sql}
> select id,count(Distinct name),count(Distinct key)
> from (select id,key,name from count_distinct_test group by id,name,key)m
> group by id;
> {code}
> Result:
> |1|2|5|
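
The expected counts in the report can be cross-checked outside Hive. The following is only a sketch: it loads the five rows from the report into an in-memory SQLite database (a substitute engine, not Hive) and runs both queries against the table name `count_distinct_test` used by the queries. It does not reproduce the bug and says nothing about `hive.optimize.reducededuplication`; it only confirms which answer is correct. The helper name `run_query` is hypothetical, and `key` is double-quoted for SQLite's benefit.

```python
import sqlite3

# The five rows from the report: (id, key, name).
ROWS = [(1, 1, 2), (1, 2, 3), (1, 3, 2), (1, 4, 2), (1, 5, 3)]

# Test SQL1 from the report, with "key" quoted for SQLite.
SQL1 = """
select id, count(distinct "key"), count(distinct name)
from (select id, "key", name from count_distinct_test
      group by id, "key", name) m
group by id
"""

# Test SQL2 from the report: aggregates and inner group-by keys swapped.
SQL2 = """
select id, count(distinct name), count(distinct "key")
from (select id, "key", name from count_distinct_test
      group by id, name, "key") m
group by id
"""


def run_query(sql):
    """Run one query against a fresh in-memory copy of the test table."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute(
            'create table count_distinct_test (id int, "key" int, name int)'
        )
        conn.executemany(
            "insert into count_distinct_test values (?, ?, ?)", ROWS
        )
        return conn.execute(sql).fetchall()
    finally:
        conn.close()


if __name__ == "__main__":
    print(run_query(SQL1))  # [(1, 5, 2)]
    print(run_query(SQL2))  # [(1, 2, 5)]
```

There are five distinct `key` values (1 to 5) and two distinct `name` values (2 and 3), so SQL1 should return `1|5|2`, matching the expected row in the report, while SQL2's reported `1|2|5` is already correct.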



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
