sqoop-dev mailing list archives

From "VISHNU S NAIR (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (SQOOP-2561) Special Character removal from Column name as avro data results in duplicate column and fails the import
Date Mon, 28 Mar 2016 09:16:25 GMT

     [ https://issues.apache.org/jira/browse/SQOOP-2561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

VISHNU S NAIR updated SQOOP-2561:
---------------------------------
    Attachment: 0001-SQOOP-2561.patch

Removing special characters from column names during an Avro import can
produce duplicate column names, which fails the import.
Solution: Instead of removing the special character, replace it with an
underscore.

> Special Character removal from Column name as avro data results in duplicate column and fails the import
> --------------------------------------------------------------------------------------------------------
>
>                 Key: SQOOP-2561
>                 URL: https://issues.apache.org/jira/browse/SQOOP-2561
>             Project: Sqoop
>          Issue Type: Bug
>    Affects Versions: 1.4.6
>         Environment: cdh5.3.2
>            Reporter: Suresh
>            Assignee: VISHNU S NAIR
>              Labels: AVRO, SQOOP
>             Fix For: 1.4.7
>
>         Attachments: 0001-SQOOP-2561.patch
>
>
> When a special character such as '$' or '#' is present in a column name, Sqoop/Avro removes
> it. In some cases this leads to duplicate column names.
> e.g. if the schema contains COL$1 and COL1$, the special character is removed from both,
> producing two columns named COL1, and the Sqoop import job fails when importing as an Avro
> data file. The same table can be loaded without the --as-avrodatafile flag.
> A similar issue was raised in https://issues.apache.org/jira/browse/SQOOP-1361, which
> I suppose is fixed, and that fix is causing this new issue.
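
The collision described above, and the effect of the proposed underscore replacement, can be sketched as follows. This is an illustration only, not the code from the attached patch or from Sqoop itself. The function names are hypothetical, and the definition of "special character" (anything outside letters, digits and underscore) is an assumption based on the characters Avro allows in names.

```python
import re

# Assumption: a "special" character is anything outside [A-Za-z0-9_].
SPECIAL = re.compile(r"[^A-Za-z0-9_]")


def strip_special(name):
    """Reported behaviour: drop special characters from the column name."""
    return SPECIAL.sub("", name)


def replace_special(name):
    """Proposed behaviour: replace each special character with an underscore."""
    return SPECIAL.sub("_", name)


def find_duplicates(names):
    """Return the names that occur more than once, in first-seen order."""
    seen = set()
    dups = []
    for n in names:
        if n in seen and n not in dups:
            dups.append(n)
        seen.add(n)
    return dups


columns = ["COL$1", "COL1$"]

stripped = [strip_special(c) for c in columns]
replaced = [replace_special(c) for c in columns]

print(stripped, find_duplicates(stripped))  # both columns collapse to COL1
print(replaced, find_duplicates(replaced))  # COL_1 and COL1_ stay distinct
```

Replacement keeps the position of the special character, so COL$1 and COL1$ no longer collide. Names that differ only in which special character they use (for example COL$1 and COL#1) would still map to the same name under this sketch.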



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
