On 3/21/19 4:40 PM, Reuti wrote:

Am 21.03.2019 um 16:26 schrieb Prentice Bisbal <pbis...@pppl.gov>:


On 3/20/19 1:58 PM, Christopher Samuel wrote:
On 3/20/19 4:20 AM, Frava wrote:

Hi Chris, thank you for the reply.
The team that manages that cluster is not very fond of upgrading SLURM, which I 
understand.
As a system admin who manages clusters myself, I don't understand this. Our job 
is to provide and maintain resources for our users. Part of that maintenance is 
to provide updates for security, performance, and functionality (new features) 
reasons. HPC has always been a leading-edge kind if field, so I feel this is 
even more important for HPC admins.

Yes, there can be issues caused by updates, but those can be with proper 
planning: Have a plan to do the actual upgrade, have a plan to test for issues, 
and have a plan to revert to an earlier version if issues are discovered. This 
is work, but it's really not all that much work, and this is exactly the work 
we are being paid to do as cluster admins.
Besides the work on the side of the admins, also the users are involved: 
exchanging libraries also means to run the test suites of their applications 
again.

-- Reuti

That implies the users actually wrote test suites. ;-)



Reply via email to