sqoop-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sandish Kumar HN (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SQOOP-3178) SQOOP PARQUET INCREMENTAL MERGE
Date Sat, 22 Jul 2017 05:27:00 GMT

    [ https://issues.apache.org/jira/browse/SQOOP-3178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16097117#comment-16097117
] 

Sandish Kumar HN commented on SQOOP-3178:
-----------------------------------------

Thanks, [~anna.szonyi]. Thanks for accepting My patch. I'm looking to upload more patches
in future. 

> SQOOP PARQUET INCREMENTAL MERGE 
> --------------------------------
>
>                 Key: SQOOP-3178
>                 URL: https://issues.apache.org/jira/browse/SQOOP-3178
>             Project: Sqoop
>          Issue Type: Improvement
>          Components: build, codegen, connectors
>         Environment: None
>            Reporter: Sandish Kumar HN
>            Assignee: Sandish Kumar HN
>            Priority: Blocker
>              Labels: features, newbie, sqoop
>
> Currently, sqoop-1 only supports merging of two Parquet format data sets but it doesn't
support to do incremental merge, so I have written a Sqoop Incremental Merge MR for Parquet
File Format and I have tested with million records of data with N number of iterations.
> blocked by issue https://issues.apache.org/jira/browse/SQOOP-3192



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message