hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Zoltan Haindrich (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-16495) ColumnStats merge should consider the accuracy of the current stats
Date Tue, 05 Sep 2017 15:03:01 GMT

     [ https://issues.apache.org/jira/browse/HIVE-16495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Zoltan Haindrich updated HIVE-16495:
------------------------------------
    Attachment: HIVE-16495.03.patch

#3) minor changes:

* removed setting stats to 0 - sometimes misinterpreted as good data
* removed resetting partition props to empty...seemed out of place
* more restrictive columnstats task lookup

Note: After alter table there are no statistics for the new columns - I think this is ok;
since if the user have used {{load data}} to add the data; there might be existing values
in these "new" columns.
https://github.com/apache/hive/blob/eb0034c0cdcc5f10fd5d7382e2caf787a8003e7a/ql/src/test/queries/clientpositive/autoColumnStats_5.q#L27

> ColumnStats merge should consider the accuracy of the current stats
> -------------------------------------------------------------------
>
>                 Key: HIVE-16495
>                 URL: https://issues.apache.org/jira/browse/HIVE-16495
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Pengcheng Xiong
>            Assignee: Zoltan Haindrich
>         Attachments: HIVE-16495.01.patch, HIVE-16495.02.patch, HIVE-16495.03.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message