[LEAPSECS] Coding this week, and a trick for timeouts over leap seconds.

Paul J. Ste. Marie paul_j at ste-marie.org
Sat Nov 5 21:31:33 EDT 2011



> On 10/1/2011 5:16 AM, Paul Sheer wrote:

>> I am busy implementing some heartbeat monitoring code between two

>> machines. The spec calls for a 1 second recovery.

>>

>> Basically if I get no heartbeats for 1 full second then I should

>> consider the peer system to have failed.


Sounds like you're generating once per second heartbeats, and failing if
a second elapses between heartbeats. This is a recipe for failure.

Increase either the timeout or the heartbeat pulse rate.

--
--Paul


More information about the LEAPSECS mailing list