Yes, that is always an option. Thanks for suggestion.
I am a beginner at HBase. However, I was thinking of cutting down the time to dump the data from Database. If i do it twice(assuming i have 2 column families) then it increases the time of load the entire HBase table.
AFAIK, Sqoop generates put statements to import data into HBase. If we can generate put statements for more than one column family. Would it violate the atomicity principle of HBase? I went through the atomicity section of http://hbase.apache.org/acid-semantics.html and I cant find anything which would stop sqoop loading more than one column family and Hbase bulk load also allows more than one column family although the approach of HBase bulk loading might be different from Sqoop. Could you provide me more insight? Sorry, if my question is dumb.
Hi Anil,Sqoop does not support multiple column families because HBase only permits atomic operations.One workaround is to run two imports, specifying a different column family each time.Regards,KathleenOn Wed, Feb 22, 2012 at 11:31 AM, anil gupta <email@example.com> wrote:
I went through the User guide of Sqoop but i could not find anything for importing more than one columnfamily in HBase. Am i missing something? Is it planned for future release?
Thanks & Regards,