On Thu, Apr 24, 2008 at 02:21:39PM -0500, John Hasler wrote: > Gabor writes: > > That will be difficult since sometimes the bug does not hit for weeks and > > then suddenly chrony starts to loop all the time. > > Are you saying that you have seen the bug?
Yes, see bug #447011. In fact, #474294 is a duplicate of #447011... > > So I'd say go ahead and upload the new version to unstable, and if there > > are no new occurances of the bug for 1-2 months then you can close it. > > Which would probably result in Chrony being removed from Lenny. Well, then someone should start debugging it. The gdb trace sent by Goshwin is quite promising. If UTI_NormaliseTimeval() is called with x->tv_usec being a very large value (say LONG_MAX), that would clearly explain the hang, and it would also explain why i386 does not seem to be affected even if it is just as buggy as amd64: on i386, the while {} loops execute at most 2147 times which is basically unnoticable, while on amd64 that can be 2^32 times more. So, IMHO turning the two while {} loops in UTI_NormaliseTimeval() into divide/remainder operations should fix the hang. However, it still needs investigation _why_ UTI_NormaliseTimeval() is being called with such a bad time value, as it may be a result of a more severe bug like memory corruption. Maybe upstream could help here. Gabor -- --------------------------------------------------------- MTA SZTAKI Computer and Automation Research Institute Hungarian Academy of Sciences, Laboratory of Parallel and Distributed Systems Address : H-1132 Budapest Victor Hugo u. 18-22. Hungary Phone/Fax : +36 1 329-78-64 (secretary) W3 : http://www.lpds.sztaki.hu --------------------------------------------------------- -- To UNSUBSCRIBE, email to [EMAIL PROTECTED] with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]