Invention Grant
US07941690B2 Reliable fault resolution in a cluster 有权
群集中可靠的故障解决方案

Reliable fault resolution in a cluster
Abstract:
A method and system for localizing and resolving a fault in a cluster environment. The cluster is configured with at least one multi-homed node, and at least one gateway for each network interface. Heartbeat messages are sent between peer nodes and the gateway in predefined periodic intervals. In the event of loss of a heartbeat message by any node or gateway, an ICMP echo is issued to each node and gateway in the cluster for each network interface. If neither a node loss not a network loss is validated in response to the ICMP echo, an application level ping is issued to determine if the fault associated with the absence of the heartbeat message is a transient error condition or an application software fault.
Public/Granted literature
Information query
Patent Agency Ranking
0/0