sqoop-user mailing list archives

From Ken Krugler <kkrugler_li...@transpac.com>
Subject Support for partitioning during export into HDFS
Date Wed, 31 Aug 2011 22:47:44 GMT
Hi there,

For maximum performance when pulling data, it seems like we'd want to run multiple Sqoops
in parallel against the available partitions in a table.

That would require adding a PARTITION clause to the select statement, something like:

select * from <table_name> PARTITION (<partition_name>) where <condition>;
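
To make the idea concrete, here is a rough sketch of generating one such statement per partition, so that each could be handed to its own Sqoop run. It assumes Oracle's syntax, which puts the partition name in parentheses. The helper name `partition_queries`, the table, the partition names, and the condition are all made up for illustration, and nothing here says Sqoop or OraOop can accept these statements today.

```python
def partition_queries(table, partitions, condition):
    """Build one partition-restricted SELECT per partition name.

    Hypothetical helper: it only formats SQL text and does not
    connect to a database or invoke Sqoop.
    """
    return [
        f"SELECT * FROM {table} PARTITION ({name}) WHERE {condition}"
        for name in partitions
    ]


if __name__ == "__main__":
    # Illustrative values only; real partition names would come from
    # the database's own metadata.
    for query in partition_queries("sales", ["p2011_q1", "p2011_q2"], "amount > 0"):
        print(query)
```

Each generated statement touches exactly one partition, so the parallel jobs would not overlap in the rows they read.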

1. Does this make sense, both for general Sqoop and specifically OraOop?

2. Is there a way to do this now, or would Sqoop (and OraOop) need to be extended?


-- Ken

Ken Krugler
+1 530-210-6378
custom data mining solutions
