Nodes leaving and re-joining intermittently

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]




Hi all,

We are trying to get to the bottom of some odd intermittent behavior on a cluster. We are intermittently seeing nodes leave and rejoin clusters, without being fenced. Further the gap between leaving on re-joining is 8 minutes. We are monitoring the latency between boxes, and it is acceptable (<5ms).

How can nodes exhibit this behavior? There seem to be no impact on the services running on the box, just this leaving and re-joining. The SNMP messages are below.

All help decoding this gratefully received! :)

Thanks,

Matt


Sat Dec 10 15:22:00 GMT 2011: cluster3.localdomain DISMAN-EVENT-MIB::
sysUpTimeInstance = 3:2:52:23.35, SNMPv2-MIB::snmpTrapOID.0 = COROSYNC-MIB::corosyncNoticesNodeStatus, COROSYNC-MIB::corosyncObjectsNodeName.0 = "cluster1.localdomain", COROSYNC-MIB::corosyncObjectsNodeID.0 = 1, COROSYNC-MIB::corosyncObjectsNodeAddress.0 = "10.79.202.1", COROSYNC-MIB::corosyncObjectsNodeStatus.0 = "left"

Sat Dec 10 15:30:25 GMT 2011: cluster3.localdomain DISMAN-EVENT-MIB::sysUpTimeInstance = 3:3:00:48.75, SNMPv2-MIB::snmpTrapOID.0 = COROSYNC-MIB::corosyncNoticesNodeStatus, COROSYNC-MIB::corosyncObjectsNodeName.0 = "cluster1.localdomain", COROSYNC-MIB::corosyncObjectsNodeID.0 = 1, COROSYNC-MIB::corosyncObjectsNodeAddress.0 = "10.79.202.1", COROSYNC-MIB::corosyncObjectsNodeStatus.0 = "joined"

--
Linux-cluster mailing list
Linux-cluster@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/linux-cluster

[Corosync Cluster Engine]     [Linux RAID]     [Fedora Users]     [Fedora Legacy List]     [Fedora Desktop]     [Fedora SELinux]     [Big List of Linux Books]     [Yosemite News]     [Yosemite Photos]     [KDE Users]

Add to Google Powered by Linux