sqoop-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Benjamin BONNET (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SQOOP-2942) SQOOP import - do we have control over the target hdfs file name(s)?
Date Tue, 02 Aug 2016 08:38:20 GMT

    [ https://issues.apache.org/jira/browse/SQOOP-2942?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15403627#comment-15403627

Benjamin BONNET commented on SQOOP-2942:

I don't think it is possible but you can at least change your files prefix. Try to add -Dmapreduce.output.basename=employee
in your sqoop import command line and you wll get files such as : employee-m-0000, employee-m-0001,

> SQOOP import - do we have control over the target hdfs file name(s)?
> --------------------------------------------------------------------
>                 Key: SQOOP-2942
>                 URL: https://issues.apache.org/jira/browse/SQOOP-2942
>             Project: Sqoop
>          Issue Type: Test
>          Components: connectors/hdfs
>            Reporter: Prasenjit Patra
> I am importing data from RDBMS to HDFS via sqoop import. Say for example I am extracting
for table name: employee. I provided --target-dir /user/ppatra/employee.
> After successful execution I see files are generated like below under /user/ppatra/employee
> part-m-0000
> part-m-0001
> part-m-0002 and so on.
> but I want these file names to be like below:
> employee_0000.txt
> employee_0001.txt
> employee_0002.txt and so on
> As I am new to this and did some research but couldn't get it. 
> Could you please let me know if it is possible and how?

This message was sent by Atlassian JIRA

View raw message