On Wed, 14 Aug 2024 19:34:26 GMT, Leonid Mesnik <lmes...@openjdk.org> wrote:

> The summary of the problem. Test is intermittently failing because can't get 
> expected field watch event.
> The test is failing to get event in the 'setfmodw001b' thread with Xcomp only.
> I verified that frame with the method 'run' of setfmodw001b is compiled when 
> line
> 118: setfmodw001.setWatch(4);
> is executed, however the thread is in interp_only mode. The watch events are 
> supported by interpreter only and just ignored by compiled code.
> 
> The reason of failure is race beteween setting interp_only mode in line
> 
> https://github.com/openjdk/jdk/blob/master/test/hotspot/jtreg/vmTestbase/nsk/jvmti/SetFieldModificationWatch/setfmodw001.java#L75
> 
>  and calling method call_helper while
>  the method run()
> https://github.com/openjdk/jdk/blob/master/test/hotspot/jtreg/vmTestbase/nsk/jvmti/SetFieldModificationWatch/setfmodw001.java#L116
> 
>  in newly created thread 'setfmodw001b' is invoked.
> 
> The javaCalls:call are used to invoke methods from hotspot, so it might be 
> rare issues. But still, synchronization might be improved.
> The
> void JavaCalls::call_helper(JavaValue* result, const methodHandle& method, 
> JavaCallArguments* args, TRAPS)
> 
> checks if interp_only mode is set and use 'Method::from_interpreted_entry()' 
> if not. However the interp_only might be set later before compiled method is 
> called (or enter first safe point?). This might happens in safepoint during 
> transition via handshake.
> So the running thread is in interp_only mode however the run() method is 
> compiled and executed already and never going to be deoptimized.
> 
> The additional setWatch calls don't try to deptimize method that are already 
> in interp_only mode.
> 
> BTW, the when JVMCI is enabled and verified adapter exists it is also will be 
> loaded even in interp_only mode set. There is not race here, just ignoring of 
> interp_only mode.
> 
> I run failing test with Xcomp ~1000 times and tiers1-5.

I was able to reproduce failure in "-Xcomp +ZGC'' once in a couple of several 
hundred runs. Not reproduced anymore with thousands of executions. 
Run tier1-5 to ensure all svc tests passed.

There is no easy way to develop regression test. It should provoke call_helper 
for compiled method with breakpoint and setting the only single setWatch. Not 
sure it can find anything in reasonable time.

The call_helper is used only to call methods from hotspot directly, like 
<clinit> , run() for Thread.start and similar not very common methods. 
I think to review any other possible cases and and thread stack verification 
that check that interp_only thread don't contain compiled frames on the stack. 
So we could find similar issues using our testing. Need to find the good place 
to inject this self-check. 
There are also a couple of places for improvements in this events handling. 
They are not directly rtelated to this bug. Will file them separately.

-------------

PR Comment: https://git.openjdk.org/jdk/pull/20587#issuecomment-2299317622

Reply via email to