Node down in Ganglia
If a node shows as down in Ganglia, do the following (this assumes the Ganglia configuration is OK):
From frontend, ping private interface
OK: try ssh to node
OK: test the public interface and, if it is OK, run service gmond restart
NOPE: ping public interface
NOPE: ping IPMI interface
NOPE: investigate console; power cycle machine
Node has crashed
run Dell diagnostics
dump logs
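The checks above can be sketched as a small script. This is only an illustration, not the site's actual tooling: the nesting of the OK/NOPE branches is an assumed reading of the checklist, the host names are hypothetical placeholders, and the ping flags assume Linux iputils. Branches the checklist does not spell out are reported as such rather than guessed.

```python
import subprocess


def ping(host, count=1, timeout_s=2):
    """Return True if host answers a ping (assumes Linux iputils flags)."""
    result = subprocess.run(
        ["ping", "-c", str(count), "-W", str(timeout_s), host],
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0


def can_ssh(host, timeout_s=5):
    """Return True if a non-interactive ssh login can run a trivial command."""
    result = subprocess.run(
        ["ssh", "-o", "BatchMode=yes", "-o", f"ConnectTimeout={timeout_s}",
         host, "true"],
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0


def next_step(private_ok, ssh_ok=False, public_ok=False, ipmi_ok=False):
    """Map check results to the next checklist step (assumed branch nesting)."""
    if private_ok:
        if ssh_ok:
            if public_ok:
                return "run: service gmond restart"
            return "public interface not OK (no further step in checklist)"
        return "ping public interface"
    if ipmi_ok:
        return "IPMI answers (no further step in checklist)"
    return "investigate console; power cycle machine"


if __name__ == "__main__":
    # Hypothetical names; substitute the node's real private, public and IPMI names.
    private, public, ipmi = "node01-priv", "node01", "node01-ipmi"
    private_ok = ping(private)
    ssh_ok = private_ok and can_ssh(private)
    public_ok = ssh_ok and ping(public)
    ipmi_ok = (not private_ok) and ping(ipmi)
    print(next_step(private_ok, ssh_ok, public_ok, ipmi_ok))
```

Run from the frontend, the script prints the next step suggested by the checklist; the gmond restart itself is left as a manual action on the node.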
-- TomRockwell - 01 May 2008