Node down in Ganglia

If a node is down in ganglia do this (assumes ganglia config is ok...):

  • From frontend, ping private interface
    • OK: try ssh to node
      • OK: test public interface and if ok, run service gmond restart
    • NOPE: ping public interface
    • NOPE: ping ipmi interface
      • NOPE: investigate console; power cycle machine

node has crashed

  • run dell diagnostics
  • dump logs

-- TomRockwell - 01 May 2008
Topic revision: r2 - 01 May 2008, TomRockwell
This site is powered by FoswikiCopyright © by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding Foswiki? Send feedback