cloudstack-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Daan Hoogland <daan.hoogl...@gmail.com>
Subject Re: XenServer 6.0.2 won't stay connected after upgrading to CS 4.2
Date Fri, 18 Oct 2013 10:54:06 GMT
Ryan,

A null pointer exception is most definitely a bug. Can you file a jira ticket?

thanks,
Daan

On Thu, Oct 10, 2013 at 5:01 PM, Ryan James <Ryan@colocateusa.net> wrote:
> We just upgraded to CloudStack 4.2 (from 4.0.2) and now our Xen Cluster will not stay
connected and the host are in alert states.
>
>
> Here is a snip it out of the management-server.log
>
>
> 2013-10-10 09:22:13,995 DEBUG [cloud.capacity.CapacityManagerImpl] (AgentTaskPool-1:null)
Found 6 VMs on host 4
>
> 2013-10-10 09:22:14,003 ERROR [agent.manager.AgentManagerImpl] (AgentTaskPool-1:null)
Monitor ComputeCapacityListener says there is an error in the connect process for 4 due to
null
>
> java.lang.NullPointerException
>
> at com.cloud.capacity.CapacityManagerImpl.updateCapacityForHost(CapacityManagerImpl.java:543)
>
> at com.cloud.utils.component.ComponentInstantiationPostProcessor$InterceptorDispatcher.intercept(ComponentInstantiationPostProcessor.java:125)
>
> at com.cloud.capacity.ComputeCapacityListener.processConnect(ComputeCapacityListener.java:78)
>
> at com.cloud.agent.manager.AgentManagerImpl.notifyMonitorsOfConnection(AgentManagerImpl.java:587)
>
> at com.cloud.agent.manager.AgentManagerImpl.handleDirectConnectAgent(AgentManagerImpl.java:1479)
>
> at com.cloud.resource.ResourceManagerImpl.createHostAndAgent(ResourceManagerImpl.java:1762)
>
> at com.cloud.resource.ResourceManagerImpl.createHostAndAgent(ResourceManagerImpl.java:1924)
>
> at com.cloud.agent.manager.AgentManagerImpl$SimulateStartTask.run(AgentManagerImpl.java:1130)
>
> at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1146)
>
> at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>
> at java.lang.Thread.run(Thread.java:679)
>
> 2013-10-10 09:22:14,004 INFO  [agent.manager.AgentManagerImpl] (AgentTaskPool-1:null)
Host 4 is disconnecting with event AgentDisconnected
>
> 2013-10-10 09:22:14,008 DEBUG [agent.manager.AgentManagerImpl] (AgentTaskPool-1:null)
The next status of agent 4is Alert, current status is Connecting
>
> 2013-10-10 09:22:14,008 DEBUG [agent.manager.AgentManagerImpl] (AgentTaskPool-1:null)
Deregistering link for 4 with state Alert
>
> 2013-10-10 09:22:14,008 DEBUG [agent.manager.AgentManagerImpl] (AgentTaskPool-1:null)
Remove Agent : 4
>
> 2013-10-10 09:22:14,009 DEBUG [agent.manager.DirectAgentAttache] (AgentTaskPool-1:null)
Processing disconnect 4
>
> 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] (AgentTaskPool-1:null)
Sending Disconnect to listener: com.cloud.hypervisor.xen.discoverer.XcpServerDiscoverer_EnhancerByCloudStack_434ade97
>
> 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] (AgentTaskPool-1:null)
Sending Disconnect to listener: com.cloud.deploy.DeploymentPlanningManagerImpl_EnhancerByCloudStack_a0f690d
>
> 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] (AgentTaskPool-1:null)
Sending Disconnect to listener: com.cloud.network.NetworkManagerImpl_EnhancerByCloudStack_1ba07aa0
>
> 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] (AgentTaskPool-1:null)
Sending Disconnect to listener: com.cloud.storage.secondary.SecondaryStorageListener
>
> 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] (AgentTaskPool-1:null)
Sending Disconnect to listener: com.cloud.hypervisor.vmware.manager.VmwareManagerImpl_EnhancerByCloudStack_b315799a
>
> 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] (AgentTaskPool-1:null)
Sending Disconnect to listener: com.cloud.network.security.SecurityGroupListener
>
> 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] (AgentTaskPool-1:null)
Sending Disconnect to listener: com.cloud.storage.listener.StoragePoolMonitor
>
> 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] (AgentTaskPool-1:null)
Sending Disconnect to listener: com.cloud.vm.ClusteredVirtualMachineManagerImpl_EnhancerByCloudStack_48612ba4
>
> 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] (AgentTaskPool-1:null)
Sending Disconnect to listener: com.cloud.storage.LocalStoragePoolListener
>
> 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] (AgentTaskPool-1:null)
Sending Disconnect to listener: com.cloud.network.SshKeysDistriMonitor
>
> 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] (AgentTaskPool-1:null)
Sending Disconnect to listener: com.cloud.network.router.VirtualNetworkApplianceManagerImpl_EnhancerByCloudStack_e1d29845
>
> 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] (AgentTaskPool-1:null)
Sending Disconnect to listener: com.cloud.network.SshKeysDistriMonitor
>
> 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] (AgentTaskPool-1:null)
Sending Disconnect to listener: com.cloud.network.router.VpcVirtualNetworkApplianceManagerImpl_EnhancerByCloudStack_5cb66068
>
> 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] (AgentTaskPool-1:null)
Sending Disconnect to listener: com.cloud.storage.upload.UploadListener
>
> 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] (AgentTaskPool-1:null)
Sending Disconnect to listener: com.cloud.storage.download.DownloadListener
>
> 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] (AgentTaskPool-1:null)
Sending Disconnect to listener: com.cloud.agent.manager.AgentMonitor
>
> 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] (AgentTaskPool-1:null)
Sending Disconnect to listener: com.cloud.capacity.StorageCapacityListener
>
> 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] (AgentTaskPool-1:null)
Sending Disconnect to listener: com.cloud.capacity.ComputeCapacityListener
>
> 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] (AgentTaskPool-1:null)
Sending Disconnect to listener: com.cloud.network.NetworkUsageManagerImpl$DirectNetworkStatsListener
>
> 2013-10-10 09:22:14,009 DEBUG [cloud.network.NetworkUsageManagerImpl] (AgentTaskPool-1:null)
Disconnected called on 4 with status Alert
>
> 2013-10-10 09:22:14,009 DEBUG [agent.manager.AgentManagerImpl] (AgentTaskPool-1:null)
Sending Disconnect to listener: com.cloud.consoleproxy.ConsoleProxyListener
>
> 2013-10-10 09:22:14,014 DEBUG [cloud.host.Status] (AgentTaskPool-1:null) Transition:[Resource
state = Enabled, Agent event = AgentDisconnected, Host id = 4, name = c14-c1-3]
>
> 2013-10-10 09:22:14,026 DEBUG [cloud.host.Status] (AgentTaskPool-1:null) Agent status
update: [id = 4; name = c14-c1-3; old status = Connecting; event = AgentDisconnected; new
status = Alert; old update count = 2314; new update count = 2315]
>
> 2013-10-10 09:22:14,026 DEBUG [agent.manager.ClusteredAgentManagerImpl] (AgentTaskPool-1:null)
Notifying other nodes of to disconnect
>
> 2013-10-10 09:22:14,029 WARN  [cloud.resource.ResourceManagerImpl] (AgentTaskPool-1:null)
Unable to connect due to
>
> com.cloud.utils.exception.CloudRuntimeException: Unable to connect 4
>
> at com.cloud.agent.manager.AgentManagerImpl.notifyMonitorsOfConnection(AgentManagerImpl.java:606)
>
> at com.cloud.agent.manager.AgentManagerImpl.handleDirectConnectAgent(AgentManagerImpl.java:1479)
>
> at com.cloud.resource.ResourceManagerImpl.createHostAndAgent(ResourceManagerImpl.java:1762)
>
> at com.cloud.resource.ResourceManagerImpl.createHostAndAgent(ResourceManagerImpl.java:1924)
>
> at com.cloud.agent.manager.AgentManagerImpl$SimulateStartTask.run(AgentManagerImpl.java:1130)
>
> at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1146)
>
> at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>
> at java.lang.Thread.run(Thread.java:679)
>
> Caused by: java.lang.NullPointerException
>
> at com.cloud.capacity.CapacityManagerImpl.updateCapacityForHost(CapacityManagerImpl.java:543)
>
> at com.cloud.utils.component.ComponentInstantiationPostProcessor$InterceptorDispatcher.intercept(ComponentInstantiationPostProcessor.java:125)
>
> at com.cloud.capacity.ComputeCapacityListener.processConnect(ComputeCapacityListener.java:78)
>
> at com.cloud.agent.manager.AgentManagerImpl.notifyMonitorsOfConnection(AgentManagerImpl.java:587)
>
> ... 7 more
>
> 2013-10-10 09:22:14,030 DEBUG [cloud.host.Status] (AgentTaskPool-1:null) Transition:[Resource
state = Enabled, Agent event = AgentDisconnected, Host id = 4, name = c14-c1-3]
>
> 2013-10-10 09:22:14,041 DEBUG [cloud.host.Status] (AgentTaskPool-1:null) Agent status
update: [id = 4; name = c14-c1-3; old status = Alert; event = AgentDisconnected; new status
= Alert; old update count = 2315; new update count = 2316]
>
>
> I have not been able to find any information online about this error or how to get the
cluster to connect again. The Cluster is up to date on Hot Fixes and was working fine before
the upgrade.
>
> The cluster is a 3 node cluster with fiber luns.
>
> Any help on this is greatly appreciated.
>
> --
> Ryan James
> ColocateUSA
> http://www.colocateUSA.net
> Ryan@colocateUSA.net

Mime
View raw message