> Also, I am very curious about this new runner.. mind sharing?

It's a runner for HPC environments orchestrated by Schrodinger Job Server.

On Fri, Oct 4, 2024 at 1:57 PM Priyans Desai via dev <dev@beam.apache.org> wrote:

> You need to make sure you have some kind of progress check of the SDK that
> tells you it is alive and doing work as needed. I had to do that via the
> control channel.
>
> Also, I am very curious about this new runner.. mind sharing?
>
> On Fri, Oct 4, 2024 at 1:46 PM Joey Tran <joey.t...@schrodinger.com>
> wrote:
>
>> Hey all,
>>
>> I have a Python runner I've written and I'm debugging the case when an SDK
>> worker crashes. Currently my driver/runner starts the SDK worker and then
>> pushes data / instructions through the data and control connections, but
>> this just silently waits forever if the worker has actually crashed for
>> whatever reason.
>>
>> Is there a way to poll SDK workers to see if they're even alive?
>>
>> Cheers,
>> Joey
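
For reference, here is a rough sketch of the simplest kind of liveness check discussed in this thread. It assumes the runner launches the SDK worker itself as a local subprocess and keeps the `subprocess.Popen` handle. The names `wait_for_worker`, `WorkerDied`, and `is_done` are hypothetical and not part of Beam. `is_done` stands in for whatever the runner uses to detect that the expected response arrived on the control channel. The timeout is only a coarse stand-in for a real progress check over the control channel, which would also catch a worker that is alive but stuck.

```python
import subprocess
import sys
import time


class WorkerDied(RuntimeError):
    """Raised when the worker process exits before the work completes."""


def wait_for_worker(proc, is_done, poll_interval=0.1, timeout=30.0):
    """Block until is_done() is true, failing fast if the worker has exited.

    proc:     subprocess.Popen handle for the SDK worker (assumption: the
              runner started the worker itself and kept the handle).
    is_done:  hypothetical callable that returns True once the expected
              response has arrived from the worker.
    """
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        if is_done():
            return
        # Popen.poll() returns None while the process is still running,
        # otherwise the exit code.
        returncode = proc.poll()
        if returncode is not None:
            raise WorkerDied(f"worker exited with code {returncode}")
        time.sleep(poll_interval)
    raise TimeoutError(f"no progress from worker within {timeout} seconds")
```

With something like this in place of a bare blocking wait, a worker that has crashed surfaces as a `WorkerDied` error and a worker that has hung surfaces as a `TimeoutError`, instead of the runner waiting forever. If the workers run on remote nodes there is no local process handle to poll, so the check would have to go through the control channel or the job scheduler instead.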