ResourceManager[31705]: CRIT: Resource STOP failure. Reboot required!
This happened on a cluster I am running with heartbeat, for no particular reason that I can figure out.
The box ended up rebooting itself. It was not a big deal, in the sense that the other servers in the cluster kept running, but it would be nice to find the cause of this.
heartbeat is stopped for some reason
Anyway, hnode2 was active and the services are running fine, but I see heartbeat has been stopped somehow.
Here is the last log output I see from heartbeat:
[quote:23c84415f5]
Sep 9 17:15:32 hnode2 heartbeat: [16738]: info: MSG stats: 9/1762471 ms age 0 [pid16738/MST_CONTROL]
Sep 9 17:15:32 hnode2 heartbeat: [16738]: info: cl_malloc stats: 716/51784021 152624/74519 [pid16738/MST_CONTROL]
Sep 9 17:15:32........