sqoop-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Attila Szabo (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (SQOOP-2585) merging hive tables using sqoop
Date Mon, 30 Oct 2017 13:55:00 GMT

     [ https://issues.apache.org/jira/browse/SQOOP-2585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Attila Szabo updated SQOOP-2585:
--------------------------------
    Fix Version/s:     (was: 1.4.7)
                   1.5.0

> merging hive tables using sqoop
> -------------------------------
>
>                 Key: SQOOP-2585
>                 URL: https://issues.apache.org/jira/browse/SQOOP-2585
>             Project: Sqoop
>          Issue Type: New Feature
>    Affects Versions: 1.4.6
>            Reporter: Rakesh Sharma
>            Assignee: Rakesh Sharma
>             Fix For: 1.5.0
>
>         Attachments: SQOOP-2585_0.patch, SQOOP-2585_1.patch, SQOOP-2585_2.patch
>
>
> Sqoop currently doesn't support merging of two hive tables. Implement a new tool for
Sqoop which :
> 1. merges two hive tables. (This will be most commonly used to merge new incremental
data into an existing hive table)
> 2. should support both partitioned and non partitioned tables
> 3. supports merge on composite keys
> 4. For partitioned tables, merges old partitions and also add new partitions.
> 5. Supports Text/RC/ORC/Seq file formats
> 6. Should make sure that only process is performing merge.
> 7. should be an atomic operation. If it fails at any point , it should revert the target
hive table to its original state.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message