sqoop-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF subversion and git services (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SQOOP-3178) SQOOP PARQUET INCREMENTAL MERGE
Date Fri, 21 Jul 2017 19:30:00 GMT

    [ https://issues.apache.org/jira/browse/SQOOP-3178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16096739#comment-16096739
] 

ASF subversion and git services commented on SQOOP-3178:
--------------------------------------------------------

Commit 46f9e2d9d8a6d65b9363ef48f4357f4cb039ea8d in sqoop's branch refs/heads/trunk from [~anna.szonyi]
[ https://git-wip-us.apache.org/repos/asf?p=sqoop.git;h=46f9e2d ]

SQOOP-3178: Incremental Merging for Parquet File Format

(Sandish Kumar HN via Anna Szonyi)


> SQOOP PARQUET INCREMENTAL MERGE 
> --------------------------------
>
>                 Key: SQOOP-3178
>                 URL: https://issues.apache.org/jira/browse/SQOOP-3178
>             Project: Sqoop
>          Issue Type: Improvement
>          Components: build, codegen, connectors
>         Environment: None
>            Reporter: Sandish Kumar HN
>            Assignee: Sandish Kumar HN
>            Priority: Blocker
>              Labels: features, newbie, sqoop
>
> Currently, sqoop-1 only supports merging of two Parquet format data sets but it doesn't
support to do incremental merge, so I have written a Sqoop Incremental Merge MR for Parquet
File Format and I have tested with million records of data with N number of iterations.
> blocked by issue https://issues.apache.org/jira/browse/SQOOP-3192



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message