sqoop-dev mailing list archives

From "Karel (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (SQOOP-472) SQOOP CLI needs to support importing data into more than one column family into HBase.
Date Tue, 30 Jan 2018 21:29:00 GMT

    [ https://issues.apache.org/jira/browse/SQOOP-472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16345842#comment-16345842 ]

Karel edited comment on SQOOP-472 at 1/30/18 9:28 PM:
------------------------------------------------------

I'd very much like to see support for multiple column families implemented.

For example, in one project we wanted to use Sqoop to import into "data" and "metadata" column
families. But because Sqoop cannot define multiple CFs, a custom solution prevailed in the
end.

Maybe there could be an option in the form of a dictionary, like

{{--hbase-cf-mappings="cf1:col1;cf2:col2,col3,col4;cf3:"}}

where
 * col1 would belong to column family cf1,
 * col2, col3 and col4 would belong to column family cf2, and
 * cf3 would be the default column family
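
To make the idea concrete, here is a minimal sketch of how such a mapping string could be parsed. It is only an illustration: {{--hbase-cf-mappings}} is the option proposed above, not an existing Sqoop flag, and the function names, the error handling, and the reading of "default" as "family for any column not listed explicitly" are all assumptions of this sketch.

```python
def parse_cf_mappings(spec):
    """Parse a string like 'cf1:col1;cf2:col2,col3;cf3:'.

    Returns (column_to_family, default_family). An entry with no columns
    after the colon is treated as the default family (an assumption).
    """
    column_to_family = {}
    default_family = None
    for entry in spec.split(";"):
        entry = entry.strip()
        if not entry:
            continue  # tolerate a trailing ';'
        family, sep, columns = entry.partition(":")
        family = family.strip()
        if not sep or not family:
            raise ValueError(f"malformed entry: {entry!r}")
        names = [c.strip() for c in columns.split(",") if c.strip()]
        if not names:
            # 'cf3:' style entry: no columns listed, so this is the default
            if default_family is not None:
                raise ValueError("more than one default column family")
            default_family = family
            continue
        for name in names:
            if name in column_to_family:
                raise ValueError(f"column {name!r} mapped twice")
            column_to_family[name] = family
    return column_to_family, default_family


def family_for(column, column_to_family, default_family):
    """Look up the family for a column, falling back to the default."""
    family = column_to_family.get(column, default_family)
    if family is None:
        raise KeyError(f"no column family for {column!r} and no default")
    return family


mapping, default = parse_cf_mappings("cf1:col1;cf2:col2,col3,col4;cf3:")
print(mapping, default)
```

With the example string, col1 resolves to cf1, col2 through col4 resolve to cf2, and any other source column falls back to cf3.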

Just brainstorming here..


was (Author: elkarel):
I'd like very much to see column families implemented.

For example, in a project, we'd like to use Sqoop for importing "data" and "metadata" column
families. But, because of Sqoop cannot define multiple CFs, finally other custom solution
prevailed.

Maybe there could be some option in a form of a dictionary, like

{{--hbase-cf-mappings="cf1:col1;cf2:col2,col3,col4;cf3:"}}

where
 * col1 is meant to be column family cf1,
 * col2,col3 and col4 would pertain to column family cf2, and
 * cf3 would be the default column family

Just brainstorming here..

> SQOOP CLI needs to support importing data into more than one column family into HBase.
> --------------------------------------------------------------------------------------
>
>                 Key: SQOOP-472
>                 URL: https://issues.apache.org/jira/browse/SQOOP-472
>             Project: Sqoop
>          Issue Type: Improvement
>          Components: hbase-integration
>    Affects Versions: 1.4.0-incubating
>         Environment: Unix
>            Reporter: Anil Gupta
>            Priority: Major
>
> At present, Sqoop doesn't support importing data into more than one column family in HBase.
> At present, if I need to import data into two column families, I need to run Sqoop
> twice. Two column families could be imported by invoking Sqoop once rather than twice.
> An example of loading data into HBase with more than one column family using the HBase Java
> API can be found in the HBase Bulk Loader.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
