[sage-devel] Re: (cy)PARI in Parallel, Heisenbugs, and Merging Policy

dwb...@gmail.com Sat, 14 May 2022 22:15:29 -0700

A bit more information: as far as we know there are problems only on Linux: 
the logs badlog, badlog1, badlog2 and badlog3 are made by one machine (a 
Xeon box running Ubuntu 18.04) and badlog-match is another machne (an i7 
also running Ubuntu 18.04).
In all the logs except badlog, there is a segmentation fault.
In badlog3, gdb attaches the running process and produces a backtrace.
We are currently not seeing crashes on MacOS.


Daniel Bump
On Saturday, May 14, 2022 at 8:06:18 PM UTC-7 Travis Scrimshaw wrote:

> Hi everyone,
>    On ticket #30423 <https://trac.sagemath.org/ticket/30423>, Dan, 
> Willie, and I have been working on a parallel-computation based 
> implementation for computing F-matrices that are used in math physics. 
> However, we have been seeing some doctest failures sporadically that 
> involve segfaults and linked-list corruption from (cy)PARI. Here are the 
> logs from testing with the first and the last having full tracebacks.
>
> http://sporadic.stanford.edu/badlog-match
> http://sporadic.stanford.edu/badlog
> http://sporadic.stanford.edu/badlog1
> http://sporadic.stanford.edu/badlog2
> http://sporadic.stanford.edu/badlog3
>
> The first question would be if anyone has an idea about what is causing 
> this. I have this impression that PARI is thread-safe, but I am wondering 
> if cypari is also thread/parallel-safe or if there are any specific things 
> that we should be careful about. (We’ve already had to work around a 
> pickling issue with polynomials IIRC.)
>
> Second question is that because this is a Heisenbug and I suspect it is 
> something upstream (and so far, nobody has been able to reproducing it 
> during an interactive version of Sage), I was wondering what the policy 
> would be for merging the ticket. I recall in the past that we have merged 
> tickets with Heisenbugs with followup tickets noting the behavior, but I am 
> not 100% sure about that (and I wouldn’t necessarily know how to find any 
> explicit examples). I was wondering if we could merge the ticket in an 
> early beta version so that many people/systems can test it to see if it 
> becomes more reproducible; of course this is assuming that the build bots 
> are not consistent in reproducing this. Should we just mark any offending 
> test(s) as “# known bug” and is there some general policy about this?
>
> Thanks,
> Travis
>
>
>

-- 
You received this message because you are subscribed to the Google Groups 
"sage-devel" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to sage-devel+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/sage-devel/ed1a0c3a-ed0e-41c3-825a-153b738a7afcn%40googlegroups.com.

[sage-devel] Re: (cy)PARI in Parallel, Heisenbugs, and Merging Policy

Reply via email to