sqoop-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "jiraposter@reviews.apache.org (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SQOOP-443) Calling sqoop with hive import is not working multiple times due to kept output directory
Date Fri, 04 May 2012 06:50:55 GMT

    [ https://issues.apache.org/jira/browse/SQOOP-443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13268175#comment-13268175
] 

jiraposter@reviews.apache.org commented on SQOOP-443:
-----------------------------------------------------



bq.  On 2012-05-04 06:44:10, Cheolsoo Park wrote:
bq.  > This patch has been posted for a while. It would be nice if someone could commit
this patch.
bq.  > 
bq.  > The jira SQOOP-483 will be likely to touch the same area of code, so it will be
nice if we can avoid any merge conflicts.

Hi Cheolsoo,
thank you very much for your review! However I believe that we have the "two committer" policy
in sqoop, so that I'm not allowed to commit my own patch :-(

Jarcec


- Jarek


-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/4798/#review7548
-----------------------------------------------------------


On 2012-04-19 05:56:56, Jarek Cecho wrote:
bq.  
bq.  -----------------------------------------------------------
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/4798/
bq.  -----------------------------------------------------------
bq.  
bq.  (Updated 2012-04-19 05:56:56)
bq.  
bq.  
bq.  Review request for Sqoop, Arvind Prabhakar and Cheolsoo Park.
bq.  
bq.  
bq.  Summary
bq.  -------
bq.  
bq.  I've added code that is removing export directory in case that it's empty.
bq.  
bq.  (Recreating review on moved SVN repository)
bq.  
bq.  
bq.  This addresses bug SQOOP-443.
bq.      https://issues.apache.org/jira/browse/SQOOP-443
bq.  
bq.  
bq.  Diffs
bq.  -----
bq.  
bq.    /src/java/org/apache/sqoop/hive/HiveImport.java 1327832 
bq.  
bq.  Diff: https://reviews.apache.org/r/4798/diff
bq.  
bq.  
bq.  Testing
bq.  -------
bq.  
bq.  ant -Dhadoopversion={20, 23, 100} test
bq.  real testing environment based on CDH3
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Jarek
bq.  
bq.


                
> Calling sqoop with hive import is not working multiple times due to  kept output directory
> ------------------------------------------------------------------------------------------
>
>                 Key: SQOOP-443
>                 URL: https://issues.apache.org/jira/browse/SQOOP-443
>             Project: Sqoop
>          Issue Type: Improvement
>    Affects Versions: 1.4.0-incubating, 1.4.1-incubating
>            Reporter: Jarek Jarcec Cecho
>            Assignee: Jarek Jarcec Cecho
>            Priority: Minor
>         Attachments: SQOOP-443.patch
>
>
> Hive is not removing input directory when doing "LOAD DATA" command in all cases. This
input directory is actually sqoop's export directory. Because this directory is kept, calling
same sqoop command twice is failing on exception "org.apache.hadoop.mapred.FileAlreadyExistsException:
Output directory $table already exists".
> This issue might be easily overcome by manual directory removal, however it's putting
unnecessary burden on users. It's also complicating executing saved jobs as there is additional
script execution needed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message