sqoop-user mailing list archives

From Saurabh Sehgal <saurabh....@gmail.com>
Subject Re: Hadoop 0.20.append
Date Tue, 06 Dec 2011 18:39:58 GMT
Thanks for your reply.

I have a small cluster of 15 machines running 0.20.append. Would
upgrading it to 0.21 be an option? Is 0.21 still considered too unstable
for production environments?

Thank you,


On Mon, Dec 5, 2011 at 11:20 PM, Eric Wadsworth <eric@wadhome.org> wrote:
> Saurabh,
> I am also trying to figure out how best to use Sqoop on a non-CDH cluster.
> From what I have learned, they are currently trying to get Sqoop 1.4.x stable
> on Apache Hadoop 0.23.x, which is still in alpha. Sqoop doesn't work at all
> with anything before 0.21 (except for Cloudera distributions).
> So we are kind of on our own, as far as I can tell, until Hadoop 0.23 is
> released and we upgrade to it, or until they resolve
> https://issues.apache.org/jira/browse/SQOOP-384
> --- wad
> On 12/05/2011 07:18 PM, Saurabh Sehgal wrote:
>> Hi,
>> I am evaluating Sqoop for DB extracts from our relational stores.
>> Our production Hadoop cluster runs Hadoop 0.20.append.
>> According to the Sqoop introduction page on GitHub:
>> "Sqoop relies on advanced features of Apache Hadoop. As such, it
>> requires the latest beta of Cloudera’s Distribution for Hadoop (CDH3
>> beta 2). Sqoop may be compatible with the Apache 0.21.0 release, but
>> this is considered experimental and should not be used in production.
>> The COMPILING.txt file describes how to select a Hadoop distribution
>> to target at compilation time."
>> Does this still hold true? All I want to do is incrementally import
>> tables from an Oracle database. Can someone explain which features are
>> missing from the non-Cloudera distributions and why it is unsafe to
>> use them in production?
>> Thank you.
