whirr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Paul Baclace <paul.bacl...@gmail.com>
Subject Re: hadoop core cluster launch failure
Date Sat, 12 Nov 2011 19:02:34 GMT
Here is a guess:  a remote depo went missing during an install, and the 
package system was left in a locked state, never to be cleared again.

What if Whirr forced the dpkg lock clear?  Does it rely on that lock for 
serialization?

Paul


On 20111112 10:44 , Paul Baclace wrote:
> I am seeing this error, not due to any change I made:
>
> E: Could not get lock /var/lib/dpkg/lock - open (11: Resource 
> temporarily unavailable)
> E: Unable to lock the administration directory (/var/lib/dpkg/), is 
> another process using it?
>
> What causes this intermittent problem?  At the moment, it is very 
> repeatable.
>
>
> Paul
>
> On 20111111 22:23 , Andrei Savu wrote:
>> Can you make the S3 files public? Is this happening on all machines?
>>
>> You should probably consider 
>> using whirr.instance-templates-max-percent-failures as described here:
>> http://whirr.apache.org/docs/0.6.0/configuration-guide.html
>>
>> Cheers,
>>
>> -- Andrei Savu / andreisavu.ro <http://andreisavu.ro>
>>
>> On Sat, Nov 12, 2011 at 2:22 AM, Arun Ramakrishnan 
>> <sinchronized.arun@gmail.com <mailto:sinchronized.arun@gmail.com>> wrote:
>>
>>     Guys,
>>
>>     It looks like the apt hadoop packages aren't getting installed.
>>     Any ideas ?
>>
>>     ###################################################
>>
>>     2011-11-11 12:31:31,893 DEBUG [jclouds.compute] (user thread 6)
>>     << stderr from jclouds-script-1321043482986 as arun@107.20.122.48
>>     <mailto:arun@107.20.122.48>
>>     sed: can't read /etc/hadoop-0.20/conf.dist/hadoop-env.sh: No such
>>     file or directory
>>     sed: can't read /etc/hadoop-0.20/conf.dist/hadoop-env.sh: No such
>>     file or directory
>>     chgrp: invalid group: `hadoop'
>>     chgrp: invalid group: `hadoop'
>>     E: Could not get lock /var/lib/dpkg/lock - open (11: Resource
>>     temporarily unavailable)
>>     E: Unable to lock the administration directory (/var/lib/dpkg/),
>>     is another process using it?
>>     hadoop-0.20-datanode: unrecognized service
>>     E: Could not get lock /var/lib/dpkg/lock - open (11: Resource
>>     temporarily unavailable)
>>     E: Unable to lock the administration directory (/var/lib/dpkg/),
>>     is another process using it?
>>     hadoop-0.20-tasktracker: unrecognized service
>>
>>     ##################################################
>>
>>     I am using a binaries that i built form 0.7 a few weeks back.
>>
>>
>>     Full log : http://incentica-public.s3.amazonaws.com/whirr-ccore44.log
>>     Config  :
>>     http://incentica-public.s3.amazonaws.com/whirr_cdh.properties
>>
>>
>>     This seems to happen non-deterministically and more so for larger
>>     clusters 10+
>>
>>
>>     thanks
>>     Arun
>>
>>
>


Mime
View raw message