On Sun, 13 Nov 2011 20:32:07 -0500 Chet Ramey <chet.ra...@case.edu> wrote:
> > This script fails roughly 3% of the time for me, only when running on a
> > loaded system, and then again only when run through "solar". I've tried to
> > run bash manually reproducing the *exact* environment by dumping "env" but
> > no luck.
>
> It appears to me that this is a problem with the environment solar provides
> to the shell script, and the included version of libintl chokes on
> something about it (the value of $LC_IDENTIFICATION?). You've tried
> several different versions of bash, and they all seem to fail inside
> libintl.

Apparently, yes. Though I also get weird "xalloc" failures before that
(although more rarely).

> Without knowing more about the data in the stack traceback, like the
> values passed to strlen, it's hard to say more. It might be an
> interaction between libc and the version of libintl included with bash;
> it might be that libc isn't calling out to the bash libintl at all.

I don't know what exactly is going on, but I've built everything with
-O0 -ggdb and still cannot get a decent stack trace (maybe there are some
hardcoded flags in the makefiles or whatnot). I didn't spend too much time
on it, however.

Since I'd like to investigate this issue further, what would you suggest
I do to dig deeper into it?