hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Peter Vary (JIRA)" <>
Subject [jira] [Commented] (HIVE-17824) msck repair table should drop the missing partitions from metastore
Date Wed, 11 Apr 2018 09:15:00 GMT


Peter Vary commented on HIVE-17824:

This one looks good to me +1 (pending tests)
On one condition :), on a follow up jira we get rid of the expression based solution.
The metastore thrift interface provides the possibility to drop partitions based on names,
so we have to add this possibility to the IMetaStoreClient interface too, and use this here.
And probably at

> msck repair table should drop the missing partitions from metastore
> -------------------------------------------------------------------
>                 Key: HIVE-17824
>                 URL:
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Vihang Karajgaonkar
>            Assignee: Janaki Lahorani
>            Priority: Major
>         Attachments: HIVE-17824.1.patch, HIVE-17824.2.patch, HIVE-17824.3.patch
> {{msck repair table <tablename>}} is often used in environments where the new partitions
are loaded as directories on HDFS or S3 and users want to create the missing partitions in
bulk. However, currently it only supports addition of missing partitions. If there are any
partitions which are present in metastore but not on the FileSystem, it should also delete
them so that it truly repairs the table metadata.
> We should be careful not to break backwards compatibility so we should either introduce
a new config or keyword to add support to delete unnecessary partitions from the metastore.
This way users who want the old behavior can easily turn it off. 

This message was sent by Atlassian JIRA

View raw message