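Environment: KVM host, CS 4.0.2, basic networking mode
Problem: When I add a host, the management node reports an error; the log is below. Every time the add fails, the host's SELinux configuration is restored to its default and the libvirt service is stopped as well.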
2014-11-28 13:13:26,816 INFO [cloud.resource.ResourceManagerImpl] (catalina-exec-15:null)
Trying to add a new host at http://10.6.31.4 in data center 1
2014-11-28 13:13:27,204 DEBUG [utils.ssh.SSHCmdHelper] (catalina-exec-15:null) Executing cmd:
lsmod|grep kvm
2014-11-28 13:13:28,324 DEBUG [utils.ssh.SSHCmdHelper] (catalina-exec-15:null) lsmod|grep
kvm output:kvm_intel 52570 0
kvm 314739 1 kvm_intel
2014-11-28 13:13:29,330 DEBUG [utils.ssh.SSHCmdHelper] (catalina-exec-15:null) Executing
cmd: cloud-setup-agent -m 10.6.27.103 -z 1 -p 1 -c 1 -g f20023a3-34a2-3ac5-91bd-f99a046ae76a
-a --pubNic=cloudbr0 --prvNic=cloudbr0 --guestNic=cloudbr0
2014-11-28 13:13:31,394 DEBUG [cloud.server.StatsCollector] (StatsCollector-1:null) HostStatsCollector
is running...
2014-11-28 13:13:37,120 DEBUG [cloud.consoleproxy.ConsoleProxyManagerImpl] (consoleproxy-1:null)
Skip capacity scan due to there is no Primary Storage UPintenance mode
2014-11-28 13:13:37,517 DEBUG [network.router.VirtualNetworkApplianceManagerImpl] (RouterStatusMonitor-1:null)
Found 0 routers.
2014-11-28 13:13:44,931 DEBUG [utils.ssh.SSHCmdHelper] (catalina-exec-15:null) cloud-setup-agent
-m 10.6.27.103 -z 1 -p 1 -c 1 -g f20023a3-34a2-3ac5-91bd-f99a046ae76a -a --pubNic=cloudbr0
--prvNic=cloudbr0 --guestNic=cloudbr0 output:[Failed]
ore Libvirt ... bvirt
Try to restore your system:
Restore SElinux ...
2014-11-28 13:14:07,120 DEBUG [cloud.consoleproxy.ConsoleProxyManagerImpl] (consoleproxy-1:null)
Skip capacity scan due to there is no Primary Storage UPintenance mode
2014-11-28 13:14:07,517 DEBUG [network.router.VirtualNetworkApplianceManagerImpl] (RouterStatusMonitor-1:null)
Found 0 routers.
2014-11-28 13:14:24,049 DEBUG [cloud.server.StatsCollector] (StatsCollector-1:null) VmStatsCollector
is running...
2014-11-28 13:14:24,865 DEBUG [cloud.server.StatsCollector] (StatsCollector-1:null) StorageCollector
is running...
2014-11-28 13:14:31,395 DEBUG [cloud.server.StatsCollector] (StatsCollector-3:null) HostStatsCollector
is running...
2014-11-28 13:14:37,120 DEBUG [cloud.consoleproxy.ConsoleProxyManagerImpl] (consoleproxy-1:null)
Skip capacity scan due to there is no Primary Storage UPintenance mode
2014-11-28 13:14:37,517 DEBUG [network.router.VirtualNetworkApplianceManagerImpl] (RouterStatusMonitor-1:null)
Found 0 routers.
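The add-host attempt runs on thread catalina-exec-15, and its lines are interleaved with unrelated StatsCollector and ConsoleProxy output. As a minimal sketch (not part of the original log, and assuming the log is wrapped the way it is pasted above), the following joins wrapped lines back into records and keeps only one thread. The names `join_records` and `records_for_thread` are hypothetical helpers.

```python
import re

# A log record starts with a timestamp such as "2014-11-28 13:13:26,816".
TIMESTAMP = re.compile(r"^\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3} ")


def join_records(lines):
    """Merge wrapped continuation lines back into one string per log record."""
    records = []
    for line in lines:
        line = line.rstrip("\n")
        if TIMESTAMP.match(line) or not records:
            # A new record begins (or this is the very first line).
            records.append(line)
        else:
            # Continuation of the previous record.
            records[-1] += " " + line.strip()
    return records


def records_for_thread(lines, thread):
    """Keep only the records emitted by the given thread name."""
    marker = "(" + thread + ":"
    return [r for r in join_records(lines) if marker in r]


# Small inline sample taken from the log above.
sample = [
    "2014-11-28 13:13:27,204 DEBUG [utils.ssh.SSHCmdHelper] (catalina-exec-15:null) Executing cmd:",
    "lsmod|grep kvm",
    "2014-11-28 13:13:31,394 DEBUG [cloud.server.StatsCollector] (StatsCollector-1:null) HostStatsCollector",
    "is running...",
]
for record in records_for_thread(sample, "catalina-exec-15"):
    print(record)
```

To run it against a real log, pass an open file object instead of `sample`; the file path depends on the installation and is not given here.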
On the host side, the log keeps printing the following:
2014-11-28 13:18:13,560 INFO [utils.nio.NioClient] (Agent-Selector:null) Connecting to localhost:8250
2014-11-28 13:18:13,560 ERROR [utils.nio.NioConnection] (Agent-Selector:null) Unable to connect
to remote
2014-11-28 13:18:18,561 INFO [utils.nio.NioClient] (Agent-Selector:null) Connecting to localhost:8250
2014-11-28 13:18:18,561 ERROR [utils.nio.NioConnection] (Agent-Selector:null) Unable to connect
to remote
2014-11-28 13:18:23,562 INFO [utils.nio.NioClient] (Agent-Selector:null) Connecting to localhost:8250
2014-11-28 13:18:23,563 ERROR [utils.nio.NioConnection] (Agent-Selector:null) Unable to connect
to remote
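The log above shows the agent dialing localhost:8250, while cloud-setup-agent was invoked with -m 10.6.27.103. One check that might help narrow this down is to test, from the host, which of the two addresses accepts a TCP connection on port 8250. This is only a sketch: `can_connect` is a hypothetical helper, and a successful connection says nothing about whether the agent configuration itself is right.

```python
import socket


def can_connect(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False


if __name__ == "__main__":
    # Addresses taken from the logs above: the agent tries localhost, while
    # cloud-setup-agent was given -m 10.6.27.103 as the management server.
    for host in ("localhost", "10.6.27.103"):
        state = "reachable" if can_connect(host, 8250) else "unreachable"
        print(host, 8250, state)
```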
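My initial analysis is that some configuration file on the host is wrong, but I don't know how to narrow it down any further. Any guidance from the experts would be appreciated.
What has been configured on the host so far:
One bridge, shared by management, storage, and guest traffic
hostname: already configured in /etc/hosts
SELINUX=permissive
Firewall rules added: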
-A INPUT -p tcp -m tcp --dport 22 -j ACCEPT
-A INPUT -p tcp -m tcp --dport 1798 -j ACCEPT
-A INPUT -p tcp -m tcp --dport 16509 -j ACCEPT
-A INPUT -p tcp -m tcp --dport 5900:6100 -j ACCEPT
-A INPUT -p tcp -m tcp --dport 49152:49216 -j ACCEPT
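To double-check which TCP ports these five rules actually cover, here is a minimal sketch that parses the `--dport` values and answers whether a given port falls inside one of them. It only models the rules pasted above as text; it does not look at other chains, rule order, or the default policy, so treat it as an illustration rather than a firewall audit. `accepted_ranges` and `is_accepted` are hypothetical names.

```python
import re

# The rules exactly as listed in this post.
RULES = """\
-A INPUT -p tcp -m tcp --dport 22 -j ACCEPT
-A INPUT -p tcp -m tcp --dport 1798 -j ACCEPT
-A INPUT -p tcp -m tcp --dport 16509 -j ACCEPT
-A INPUT -p tcp -m tcp --dport 5900:6100 -j ACCEPT
-A INPUT -p tcp -m tcp --dport 49152:49216 -j ACCEPT
"""


def accepted_ranges(rules_text):
    """Return (low, high) port ranges from '--dport N' or '--dport N:M' ACCEPT rules."""
    ranges = []
    for line in rules_text.splitlines():
        match = re.search(r"--dport (\d+)(?::(\d+))?\b.*-j ACCEPT", line)
        if match:
            low = int(match.group(1))
            # A single port is treated as a range of length one.
            high = int(match.group(2)) if match.group(2) else low
            ranges.append((low, high))
    return ranges


def is_accepted(port, ranges):
    """Return True if the port lies inside any of the accepted ranges."""
    return any(low <= port <= high for low, high in ranges)


ranges = accepted_ranges(RULES)
# 8250 is used as a sample query because it is the port shown in the agent log.
for port in (22, 6000, 8250, 16509):
    print(port, "accepted" if is_accepted(port, ranges) else "not listed")
```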
The configuration files
/etc/libvirt/qemu.conf
/etc/libvirt/libvirtd.conf
/etc/sysconfig/libvirtd
have all been modified, and I am confident they are correct.
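So that others can review the effective settings, here is a small sketch that prints only the active (non-blank, non-comment) lines of the three files. It assumes all three use `#` for comments; `effective_lines` and `dump` are hypothetical helpers and nothing here changes the files.

```python
def effective_lines(text):
    """Return the non-blank, non-comment lines of a '#'-commented config file."""
    result = []
    for line in text.splitlines():
        stripped = line.strip()
        if stripped and not stripped.startswith("#"):
            result.append(stripped)
    return result


# The three files named in this post.
PATHS = [
    "/etc/libvirt/qemu.conf",
    "/etc/libvirt/libvirtd.conf",
    "/etc/sysconfig/libvirtd",
]


def dump(paths=PATHS):
    """Print the active settings of each file, or a note if it cannot be read."""
    for path in paths:
        print("==", path)
        try:
            with open(path, encoding="utf-8", errors="replace") as fh:
                for line in effective_lines(fh.read()):
                    print(line)
        except OSError as exc:
            print("cannot read:", exc)

# On the host, call dump() and paste the output into the thread.
```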