sqoop-dev mailing list archives

From "Brock Noland (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SQOOP-3178) SQOOP PARQUET INCREMENTAL MERGE
Date Wed, 31 May 2017 18:24:04 GMT

    [ https://issues.apache.org/jira/browse/SQOOP-3178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16031629#comment-16031629 ]

Brock Noland commented on SQOOP-3178:
-------------------------------------

[~sanysandish@gmail.com]

1. I still cannot assign the issue to you; you might want to ping the dev@ address about being added as a contributor.
2. Can you post a link to the RB item? Once we add some tests, I think the patch looks good.
3. Since we found that the problems with the current tests are fixed by a newer version of Parquet,
I think you should open another SQOOP item, "upgrade parquet", and mark this issue
as blocked by that one. Hopefully it's not too difficult.

> SQOOP PARQUET INCREMENTAL MERGE 
> --------------------------------
>
>                 Key: SQOOP-3178
>                 URL: https://issues.apache.org/jira/browse/SQOOP-3178
>             Project: Sqoop
>          Issue Type: Improvement
>          Components: build, codegen, connectors
>         Environment: None
>            Reporter: Sandish Kumar HN
>            Priority: Critical
>
> Currently, Sqoop 1 only supports merging two Parquet data sets; it does not support
> incremental merge. I have therefore written a Sqoop incremental merge MR job for the Parquet
> file format and tested it with million-record data sets over N iterations.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
