We use servers witch motherboard Supermicro X8DTT-H and meet with such problem:
when watchdogd started, server is rebooted by IPMI watchdog several times per 
week.

After some debugging I've found, that sometimes IPMI driver entered endless loop, and watchdogd have no chances to reset watchdog timer.
In such situation top show:

PID USERNAME      PRI NICE   SIZE    RES STATE   C   TIME   WCPU COMMAND
...
113 root          -16    -     0K    16K CPU4    4  17:18 99.17% ipmi0: kcs

Endless loop located in file /sys/dev/ipmi/ipmi_kcs.c and function kcs_wait_for_obf():

        int status, start = ticks;

        status = INB(sc, KCS_CTL_STS);
        if (state == 0) {
                /* WAIT FOR OBF = 0 */
                while (ticks - start < MAX_TIMEOUT && status & KCS_STATUS_OBF) {
                        DELAY(100);
                        status = INB(sc, KCS_CTL_STS);
                }
        } else {
                /* WAIT FOR OBF = 1 */
                while (ticks - start < MAX_TIMEOUT &&
                    !(status & KCS_STATUS_OBF)) {
                        DELAY(100);
                        status = INB(sc, KCS_CTL_STS);
                }
        }

It seems to be, that this loop intended to run no more than MAX_TIMEOUT ticks.
but by some reason this timeout does not works and loop runs until reboot.

Questions:
1. Is it correct to check ticks to implement timeout here?
2. how to fix this timeout?

--
 Anton Yuzhaninov
_______________________________________________
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Reply via email to