sqoop-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Brock Noland (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SQOOP-3178) SQOOP PARQUET INCREMENTAL MERGE
Date Mon, 29 May 2017 01:09:04 GMT

    [ https://issues.apache.org/jira/browse/SQOOP-3178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16027983#comment-16027983
] 

Brock Noland commented on SQOOP-3178:
-------------------------------------

[~sanysandish@gmail.com] - can you ask on the dev list if you can be made a contributor to
the Sqoop project? Then the JIRA can be assigned to you.

> SQOOP PARQUET INCREMENTAL MERGE 
> --------------------------------
>
>                 Key: SQOOP-3178
>                 URL: https://issues.apache.org/jira/browse/SQOOP-3178
>             Project: Sqoop
>          Issue Type: Improvement
>          Components: build, codegen, connectors
>         Environment: None
>            Reporter: Sandish Kumar HN
>            Priority: Critical
>
> Currently, sqoop-1 only supports merging of two parquet format data sets but it doesn't
support to do incremental merge, so I have written a Sqoop Incremental Merge MR for Parquet
File Format and I have tested with million records of data with N number of iterations.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message