Hi all,

we're using OpenHPC packages to run SLURM. The current OpenHPC version is 1.3.8 (SLURM 18.08.8), though we're still on 1.3.3 (SLURM 17.02.7) for now.

I've successfully tested the upgrade in a separate testing environment, and it works fine as long as you follow the upgrade notes, so the upgrade itself is not the issue here. However, I do see that the SLURM job ID gets reset to 1 instead of continuing the sequence, whereas job_db_inx keeps incrementing as before. This is visible, for example, when looking at the job queue. From the database's perspective it looks like this:

MariaDB [slurm_acct_db]> select job_db_inx,id_job,pack_job_id,job_name from clustername_job_table limit 96070,96100;
+------------+--------+-------------+--------------+
| job_db_inx | id_job | pack_job_id | job_name     |
+------------+--------+-------------+--------------+
|     107116 |  96155 |           0 | bt           |
|     107118 |  96156 |           0 | bt           |
|     107119 |  96157 |           0 | bt           |
|     107120 |  96158 |           0 | cs_01        |
|     107121 |  96159 |           0 | cs_01        |
|     107123 |  96160 |           0 | cs_01        |
|     107124 |  96161 |           0 | cs_01        |
|     107125 |  96162 |           0 | cs_01        |
|     107126 |  96163 |           0 | cs_01        |
|     107127 |  96164 |           0 | cs_01        | <--- Last job on old version
|     107128 |      2 |           0 | hostname     | <--- Jobs after upgrade
|     107130 |      3 |           0 | hostname     |
|     107131 |      4 |           0 | hostname     |
|     107133 |      5 |           0 | hostname     |
|     107135 |      6 |           0 | hostname     |
|     107137 |      7 |           0 | hostname     |
|     107138 |      8 |           0 | hostname     |
|     107140 |      9 |           0 | hostname     |
|     107142 |     10 |           0 | hostname     |
|     107144 |     11 |           0 | test         |
|     107145 |     12 |           0 | test         |
|     107146 |     13 |           0 | test         |
|     107147 |     14 |           0 | test         |
|     107148 |     15 |           0 | test         |
|     107149 |     16 |           0 | testzilloooo |
|     107150 |     17 |           0 | testzilloooo |
|     107151 |     18 |           0 | testzilloooo |
|     107152 |     19 |           0 | testzilloooo |
|     107153 |     20 |           0 | testzilloooo |
|     107154 |     21 |           0 | testzilloooo |
+------------+--------+-------------+--------------+
30 rows in set (0.134 sec)

Question: is there a way to either
a) let SLURM continue the job IDs as usual, or
b) set the next job ID to an arbitrary number?

If this is a known issue, I failed to find it.

Thx!
Florian