hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jesus Camacho Rodriguez (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-22046) Differentiate among column stats computed by different engines
Date Fri, 26 Jul 2019 06:24:00 GMT

     [ https://issues.apache.org/jira/browse/HIVE-22046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Jesus Camacho Rodriguez updated HIVE-22046:
-------------------------------------------
    Attachment: HIVE-22046.03.patch

> Differentiate among column stats computed by different engines
> --------------------------------------------------------------
>
>                 Key: HIVE-22046
>                 URL: https://issues.apache.org/jira/browse/HIVE-22046
>             Project: Hive
>          Issue Type: Improvement
>          Components: Metastore
>            Reporter: Jesus Camacho Rodriguez
>            Assignee: Jesus Camacho Rodriguez
>            Priority: Major
>         Attachments: HIVE-22046.01.patch, HIVE-22046.02.patch, HIVE-22046.03.patch, HIVE-22046.patch
>
>
> The goal is to avoid computation of column stats by engines to step on each other, e.g.,
Hive and Impala. In longer term, we may introduce a common representation for the column statistics
stored by different engines.
> For this issue, we will add a new column 'engine' to TAB_COL_STATS HMS table (unpartitioned
tables) and to PART_COL_STATS HMS table (partitioned tables). This will prevent conflicts
at the column level stats.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

Mime
View raw message