sqoop-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Rohan More (JIRA)" <j...@apache.org>
Subject [jira] [Issue Comment Deleted] (SQOOP-2878) Sqoop import into Hive transactional tables
Date Tue, 08 Mar 2016 08:48:40 GMT

     [ https://issues.apache.org/jira/browse/SQOOP-2878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Rohan More updated SQOOP-2878:
------------------------------
    Comment: was deleted

(was: We have made changes for this. The approach used is as follows:

1.  For transactional support, the hive table should be bucketed and should be in ORC format.
So modified the create table statement to create hive bucketed orc table.

2. As the table is in ORC format, we can't load data directly into it, so we used temporary
text table as follows:
   a. First created a temporary text table and loaded data into it using "LOAD DATA INPATH..."
command.
   b. Loaded data from this temporary text table into ORC table using "Insert into table <orc_table_name>
select * from <txt_table_name>". After this deleted the temporary text table.

Please let us know if we should submit the patch for this.)

> Sqoop import into Hive transactional tables
> -------------------------------------------
>
>                 Key: SQOOP-2878
>                 URL: https://issues.apache.org/jira/browse/SQOOP-2878
>             Project: Sqoop
>          Issue Type: Improvement
>    Affects Versions: 1.4.6
>            Reporter: Rohan More
>            Priority: Minor
>
> Hive has introduced support for transactions from version 0.13. For transactional support,
the hive table should be bucketed and should be in ORC format.
> This improvement is to import data directly into hive transactional table using sqoop.




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message