I had a similar issue in v9.x. I added "cloud storage=cloud-sd allpools allfrompool upload"
as a run after Console command in the cloud job resource. This seems to do the trick as I've not seen any orphaned volumes in the cache since. Chris On Wed, 6 Mar 2024, 18:53 K. M. Peterson, <kmp.lists+bacula-us...@gmail.com> wrote: > Hi all, > > I've been using the S3 driver for some time now, and working around > various idiosyncrasies - the most lasting one being where occasionally due > to something being shut down there are backup parts in the cache that > haven't been uploaded for some reason. > > I started using a job that ran at boot to run cloud upload > storage=$STORAGE pool=$POOL allfrompool after an interval just as a > catchall (POOL and STORAGE are set to the Cloud pool and Storage > respectively. A couple of months ago (approximately) I realized that the > Bacula Director was inoperative - hung - and I tracked this down to the > Storage Director hanging when this command is executed. Restarting the SD > clears the problem (though it also kills the upload). Otherwise, the > Cloud-based storage continues to work fine. > > I don't see any error messages anywhere, and activity in the Director and > File Daemon seem to return to normal when the Storage Daemon is > killed/restarted. I did run the SD with debug level 100, but I suspect > it's the absence of something I'm looking for. Here's the debug output > (somewhat anonymized): > > Mar 05 16:10:46 four bacula-sd[2263159]: List plugins. Hook count=0 > Mar 05 16:10:46 four bacula-sd[2263159]: four-sd: bsock.c:861-0 socket=4 > who=client host=[2001:0DB8::4] port=9103 > Mar 05 16:10:46 four bacula-sd[2263159]: four-sd: bnet_server.c:235-0 > Accept socket=2001:0DB8::4.9103:2001:0DB8::4.37168 s=0x5636929cedf8 > Mar 05 16:10:46 four bacula-sd[2263159]: four-sd: dircmd.c:196-0 Got a DIR > connection at 05-Mar-2024 16:10:46 > Mar 05 16:10:46 four bacula-sd[2263159]: four-sd: > authenticatebase.cc:365-0 TLSPSK Remote need 100 > Mar 05 16:10:46 four bacula-sd[2263159]: four-sd: > authenticatebase.cc:335-0 TLSPSK Local need 100 > Mar 05 16:10:46 four bacula-sd[2263159]: four-sd: > authenticatebase.cc:563-0 TLSPSK Start PSK > Mar 05 16:10:46 four bacula-sd[2263159]: four-sd: bnet.c:96-0 TLS server > negotiation established. > Mar 05 16:10:46 four bacula-sd[2263159]: four-sd: cram-md5.c:68-0 send: > auth cram-md5 challenge <104636656.1709673046@four-sd> ssl=0 > Mar 05 16:10:46 four bacula-sd[2263159]: four-sd: cram-md5.c:132-0 > cram-get received: auth cram-md5 <111733638.1709673046@four-dir> ssl=0 > Mar 05 16:10:46 four bacula-sd[2263159]: four-sd: cram-md5.c:156-0 sending > resp to challenge: CR+Psy+qh9NHh1+BA++VSA > Mar 05 16:10:46 four bacula-sd[2263159]: four-sd: dircmd.c:227-0 Message > channel init completed. > Mar 05 16:10:46 four bacula-sd[2263159]: four-sd: status.c:1153-0 > cmd=devices > Mar 05 16:10:46 four bacula-sd[2263159]: four-sd: bsock.c:861-0 socket=4 > who=client host=[2001:0DB8::4] port=9103 > Mar 05 16:10:46 four bacula-sd[2263159]: four-sd: bnet_server.c:235-0 > Accept socket=2001:0DB8::4.9103:2001:0DB8::4.37182 s=0x5636929f14b8 > Mar 05 16:10:46 four bacula-sd[2263159]: four-sd: dircmd.c:196-0 Got a DIR > connection at 05-Mar-2024 16:10:46 > Mar 05 16:10:46 four bacula-sd[2263159]: four-sd: > authenticatebase.cc:365-0 TLSPSK Remote need 100 > Mar 05 16:10:46 four bacula-sd[2263159]: four-sd: > authenticatebase.cc:335-0 TLSPSK Local need 100 > Mar 05 16:10:46 four bacula-sd[2263159]: four-sd: > authenticatebase.cc:563-0 TLSPSK Start PSK > Mar 05 16:10:46 four bacula-sd[2263159]: four-sd: bnet.c:96-0 TLS server > negotiation established. > Mar 05 16:10:46 four bacula-sd[2263159]: four-sd: cram-md5.c:68-0 send: > auth cram-md5 challenge <934142435.1709673046@four-sd> ssl=0 > Mar 05 16:10:46 four bacula-sd[2263159]: four-sd: cram-md5.c:132-0 > cram-get received: auth cram-md5 <2101906992.1709673046@four-dir> ssl=0 > Mar 05 16:10:46 four bacula-sd[2263159]: four-sd: cram-md5.c:156-0 sending > resp to challenge: b6/yPWVPOzU/cm/0C9/+/B > Mar 05 16:10:46 four bacula-sd[2263159]: four-sd: dircmd.c:227-0 Message > channel init completed. > Mar 05 16:10:47 four bacula-sd[2263159]: List plugins. Hook count=0 > Mar 05 16:10:47 four bacula-sd[2263159]: four-sd: bsock.c:861-0 socket=4 > who=client host=[2001:0DB8::4] port=9103 > Mar 05 16:10:47 four bacula-sd[2263159]: four-sd: bnet_server.c:235-0 > Accept socket=2001:0DB8::4.9103:2001:0DB8::4.37186 s=0x563692a08dd8 > Mar 05 16:10:47 four bacula-sd[2263159]: four-sd: dircmd.c:196-0 Got a DIR > connection at 05-Mar-2024 16:10:47 > Mar 05 16:10:47 four bacula-sd[2263159]: four-sd: > authenticatebase.cc:365-0 TLSPSK Remote need 100 > Mar 05 16:10:47 four bacula-sd[2263159]: four-sd: > authenticatebase.cc:335-0 TLSPSK Local need 100 > Mar 05 16:10:47 four bacula-sd[2263159]: four-sd: > authenticatebase.cc:563-0 TLSPSK Start PSK > Mar 05 16:10:47 four bacula-sd[2263159]: four-sd: bnet.c:96-0 TLS server > negotiation established. > Mar 05 16:10:47 four bacula-sd[2263159]: four-sd: cram-md5.c:68-0 send: > auth cram-md5 challenge <1669111798.1709673047@four-sd> ssl=0 > Mar 05 16:10:47 four bacula-sd[2263159]: four-sd: cram-md5.c:132-0 > cram-get received: auth cram-md5 <1290027090.1709673047@four-dir> ssl=0 > Mar 05 16:10:47 four bacula-sd[2263159]: four-sd: cram-md5.c:156-0 sending > resp to challenge: F/oa1Ssdqw1iEwcd6j/CKA > Mar 05 16:10:47 four bacula-sd[2263159]: four-sd: dircmd.c:227-0 Message > channel init completed. > Mar 05 16:10:47 four bacula-sd[2263159]: four-sd: dircmd.c:1210-0 Found > device AWS_S3_Cloud1 > Mar 05 16:10:47 four bacula-sd[2263159]: four-sd: dircmd.c:1254-0 Found > device AWS_S3_Cloud1 > Mar 05 16:10:47 four bacula-sd[2263159]: four-sd: acquire.c:671-0 Attach > 0x9401bbf8 to dev "AWS_S3_Cloud1" (/data/Backup/bstor_aws_cache) > Mar 05 16:10:48 four bacula-sd[2263159]: four-sd: bsock.c:861-0 socket=6 > who=client host=[2001:0DB8::4] port=9103 > Mar 05 16:10:48 four bacula-sd[2263159]: four-sd: bnet_server.c:235-0 > Accept socket=2001:0DB8::4.9103:2001:0DB8::4.37198 s=0x5636929cedf8 > Mar 05 16:10:48 four bacula-sd[2263159]: four-sd: dircmd.c:196-0 Got a DIR > connection at 05-Mar-2024 16:10:48 > Mar 05 16:10:48 four bacula-sd[2263159]: four-sd: > authenticatebase.cc:365-0 TLSPSK Remote need 100 > Mar 05 16:10:48 four bacula-sd[2263159]: four-sd: > authenticatebase.cc:335-0 TLSPSK Local need 100 > Mar 05 16:10:48 four bacula-sd[2263159]: four-sd: > authenticatebase.cc:563-0 TLSPSK Start PSK > Mar 05 16:10:48 four bacula-sd[2263159]: four-sd: bnet.c:96-0 TLS server > negotiation established. > Mar 05 16:10:48 four bacula-sd[2263159]: four-sd: cram-md5.c:68-0 send: > auth cram-md5 challenge <76485448.1709673048@four-sd> ssl=0 > Mar 05 16:10:48 four bacula-sd[2263159]: four-sd: cram-md5.c:132-0 > cram-get received: auth cram-md5 <76485448.1709673048@four-dir> ssl=0 > Mar 05 16:10:48 four bacula-sd[2263159]: four-sd: cram-md5.c:156-0 sending > resp to challenge: HC+BF8+G9mVUIlAV99+5LC > Mar 05 16:10:48 four bacula-sd[2263159]: four-sd: dircmd.c:227-0 Message > channel init completed. > Mar 05 16:14:40 four bacula-sd[2263159]: four-sd: bsock.c:861-0 socket=7 > who=client host=[2001:0DB8::4] port=9103 > Mar 05 16:14:40 four bacula-sd[2263159]: four-sd: bnet_server.c:235-0 > Accept socket=2001:0DB8::4.9103:2001:0DB8::4.55784 s=0x5636929f14b8 > Mar 05 16:14:40 four bacula-sd[2263159]: four-sd: dircmd.c:196-0 Got a DIR > connection at 05-Mar-2024 16:14:40 > Mar 05 16:14:40 four bacula-sd[2263159]: four-sd: > authenticatebase.cc:365-0 TLSPSK Remote need 100 > Mar 05 16:14:40 four bacula-sd[2263159]: four-sd: > authenticatebase.cc:335-0 TLSPSK Local need 100 > Mar 05 16:14:40 four bacula-sd[2263159]: four-sd: > authenticatebase.cc:563-0 TLSPSK Start PSK > Mar 05 16:14:40 four bacula-sd[2263159]: four-sd: bnet.c:96-0 TLS server > negotiation established. > Mar 05 16:14:40 four bacula-sd[2263159]: four-sd: cram-md5.c:68-0 send: > auth cram-md5 challenge <1104246352.1709673280@four-sd> ssl=0 > Mar 05 16:14:40 four bacula-sd[2263159]: four-sd: cram-md5.c:132-0 > cram-get received: auth cram-md5 <78526969.1709673280@four-dir> ssl=0 > Mar 05 16:14:40 four bacula-sd[2263159]: four-sd: cram-md5.c:156-0 sending > resp to challenge: 49UP5RldJ5ZVEg/gS++skD > Mar 05 16:14:40 four bacula-sd[2263159]: four-sd: dircmd.c:227-0 Message > channel init completed. > Mar 05 16:32:16 four systemd[1]: bacula-sd.service: Main process exited, > code=killed, status=9/KILL > Mar 05 16:32:16 four systemd[1]: bacula-sd.service: Failed with result > 'signal'. > Mar 05 16:32:51 four systemd[1]: Started Bacula Storage Daemon service. > > In the case above, I killed the daemon after 20+ minutes, but it went for > days a couple of times before I realized that it was toasted. No output > from the upload command. The SD does connect to AWS, but times out. Also, > the bconsole process can't be killed by ^C. > > I'm not sure this is going to help anyone with debugging - I'm posting to > ask whether anyone has any ideas on either what might be going wrong here, > or where I could develop more information before trying to file a bug. > > BTW, this is RHEL 8, AMD, and Bacula 13.0.2, 18Feb23. Thanks for any > ideas! > _______________________________________________ > Bacula-users mailing list > Bacula-users@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/bacula-users >
_______________________________________________ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users