hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Work logged] (HIVE-21269) Mandate -update and -delete as DistCp options to sync data files for external tables replication.
Date Fri, 15 Feb 2019 10:22:00 GMT

     [ https://issues.apache.org/jira/browse/HIVE-21269?focusedWorklogId=199155&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-199155
]

ASF GitHub Bot logged work on HIVE-21269:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 15/Feb/19 10:21
            Start Date: 15/Feb/19 10:21
    Worklog Time Spent: 10m 
      Work Description: sankarh commented on pull request #535: HIVE-21269: Mandate -update
and -delete as DistCp options to sync data files for external tables replication.
URL: https://github.com/apache/hive/pull/535
 
 
   
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 199155)
    Time Spent: 20m  (was: 10m)

>  Mandate -update and -delete as DistCp options to sync data files for external tables
replication.
> --------------------------------------------------------------------------------------------------
>
>                 Key: HIVE-21269
>                 URL: https://issues.apache.org/jira/browse/HIVE-21269
>             Project: Hive
>          Issue Type: Bug
>          Components: repl
>    Affects Versions: 4.0.0
>            Reporter: Sankar Hariappan
>            Assignee: Sankar Hariappan
>            Priority: Major
>              Labels: DR, pull-request-available, replication
>             Fix For: 4.0.0
>
>         Attachments: HIVE-21269.01.patch, HIVE-21269.02.patch
>
>          Time Spent: 20m
>  Remaining Estimate: 0h
>
> Currently, external tables replication, copies the data in directory level. So, if target
directory exist, then DistCp should compare and update or skip data files in the directory
instead of creating new directory inside pre-existing target directory.
> This can be achieved using -update.
> Also, -delete option is needed to delete the files missing in source directory but present
in target.
> Hive should mandate these DistCp options even if user passes other options.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message