sqoop-dev mailing list archives

From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SQOOP-319) The --hive-drop-import-delims option should accept a replacement string
Date Thu, 25 Aug 2011 05:08:29 GMT

    [ https://issues.apache.org/jira/browse/SQOOP-319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13090778#comment-13090778 ]

Hudson commented on SQOOP-319:
------------------------------

Integrated in Sqoop-jdk-1.6 #17 (See [https://builds.apache.org/job/Sqoop-jdk-1.6/17/])
    SQOOP-319. Support for replacing Hive delimiters.

(Joey Echeverria via Arvind Prabhakar)

arvind : http://svn.apache.org/viewvc/?view=rev&rev=1161382
Files : 
* /incubator/sqoop/trunk/src/docs/user/hive.txt
* /incubator/sqoop/trunk/src/java/com/cloudera/sqoop/SqoopOptions.java
* /incubator/sqoop/trunk/src/java/com/cloudera/sqoop/lib/FieldFormatter.java
* /incubator/sqoop/trunk/src/java/com/cloudera/sqoop/tool/BaseSqoopTool.java
* /incubator/sqoop/trunk/src/docs/user/hive-args.txt
* /incubator/sqoop/trunk/src/java/com/cloudera/sqoop/orm/ClassWriter.java
* /incubator/sqoop/trunk/src/test/com/cloudera/sqoop/hive/TestHiveImport.java
* /incubator/sqoop/trunk/testdata/hive/scripts/fieldWithNewlineReplacementImport.q
* /incubator/sqoop/trunk/src/java/com/cloudera/sqoop/tool/ImportTool.java


> The --hive-drop-import-delims option should accept a replacement string
> -----------------------------------------------------------------------
>
>                 Key: SQOOP-319
>                 URL: https://issues.apache.org/jira/browse/SQOOP-319
>             Project: Sqoop
>          Issue Type: Bug
>          Components: hive-integration
>    Affects Versions: 1.3.0
>            Reporter: Joey Echeverria
>            Assignee: Joey Echeverria
>            Priority: Minor
>             Fix For: 1.4.0
>
>         Attachments: SQOOP-319-1.patch, SQOOP-319-2.patch
>
>
> When importing data into Hive, you have the option of dropping the Hive delimiters in
data fields. It would be more useful to replace the delimiters with a user-defined string.
Oftentimes the dropped delimiters (like \n) are separating words. If I want to split on
whitespace in my Hive queries, I'll now get two words merged together. A more desirable behavior
would be to replace each delimiter with a space. Making the replacement user-configurable gives the most flexibility.
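
To illustrate the difference described above, here is a minimal Python sketch that contrasts dropping delimiters with replacing them. It is not Sqoop's implementation (the actual change is in the Java files listed in the commit); the function names `drop_delims` and `replace_delims` are hypothetical, and the delimiter set (\n, \r, and \01) is an assumption based on Hive's default delimiters.

```python
import re

# Assumption: the characters treated as Hive delimiters are newline,
# carriage return, and Ctrl-A (\x01), Hive's defaults.
HIVE_DELIMS = re.compile("[\n\r\x01]")


def drop_delims(field: str) -> str:
    """Remove every Hive delimiter character from a string field."""
    return HIVE_DELIMS.sub("", field)


def replace_delims(field: str, replacement: str) -> str:
    """Replace every Hive delimiter character with a user-supplied string."""
    # A function is used as the replacement so that backslashes in
    # `replacement` are inserted literally rather than parsed as escapes.
    return HIVE_DELIMS.sub(lambda _match: replacement, field)


field = "hello\nworld"

# Dropping the newline merges the two words into one token.
print(drop_delims(field).split())          # ['helloworld']

# Replacing the newline with a space keeps the words separable.
print(replace_delims(field, " ").split())  # ['hello', 'world']
```

Each delimiter character is replaced individually in this sketch, so a `\r\n` pair yields two copies of the replacement string.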

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
