On Do, 28 Jul 2005, Kern Sibbald <[EMAIL PROTECTED]> wrote: > On Thursday 28 July 2005 16:17, Volker Sauer wrote: > > > I'll run the dirctor under the debugger and we'll see..
Hi Kern, it happend again ;-) Surprisingly this time the bconsole was not locked - I could still connect to bacula-dir where I saw this: [EMAIL PROTECTED]: ~ > bconsole Connecting to Director backup:9101 1000 OK: backup-dir Version: 1.36.3 (22 April 2005) Enter a period to cancel a command. *stat dir Using default Catalog name=MyCatalog DB=bacula backup-dir Version: 1.36.3 (22 April 2005) i386-pc-linux-gnu debian 3.1 Daemon started 28-Jul-05 16:25, 5 Jobs run since started. Scheduled Jobs: [ ... deleted ... ] Running Jobs: JobId Level Name Status ====================================================================== 1304 Full BackupCatalog.2005-07-29_05.00.00 is waiting execution 1303 Increme tokyo-sap.2005-07-28_23.00.12 is waiting on max Storage jobs 1302 Increme hanau-web.2005-07-28_23.00.11 is running 1301 Increme gelnhausen-export_tmp_vsauer.2005-07-28_23.00.10 is running 1299 Increme donar-home.2005-07-28_23.00.08 is running 1298 Increme caracas.2005-07-28_23.00.07 is running 1297 Increme paris-varmail.2005-07-28_23.00.06 is waiting on max Client jobs 1296 Increme paris-shared.2005-07-28_23.00.05 is waiting on max Client jobs 1295 Increme paris-home.prak.2005-07-28_23.00.04 is waiting on max Client jobs 1294 Increme paris-home.staff.3.2005-07-28_23.00.03 is waiting on max Client jobs 1293 Increme paris-home.staff.2.2005-07-28_23.00.02 is waiting on max Client jobs 1292 Increme paris-home.staff.1.2005-07-28_23.00.01 is running 1291 Increme paris-home.guest.2005-07-28_23.00.00 is running ==== Terminated Jobs: [ ... deleted ... ] Again, all jobs were locked. nothing was going on. Here the output of gdb: (gdb) run -s -f -c /etc/bacula/bacula-dir.conf The program being debugged has been started already. Start it from the beginning? (y or n) y Starting program: /usr/sbin/bacula-dir -s -f -c /etc/bacula/bacula-dir.conf [Thread debugging using libthread_db enabled] [New Thread 1078020896 (LWP 25378)] [New Thread 1086450608 (LWP 25380)] [New Thread 1094839216 (LWP 25381)] [New Thread 1103227824 (LWP 25383)] [Thread 1103227824 (LWP 25383) exited] [New Thread 1103227824 (LWP 25399)] backup-dir: dird.c:438 Director's configuration file reread. [Thread 1103227824 (LWP 25399) exited] [New Thread 1103227824 (LWP 26367)] [New Thread 1111620528 (LWP 26368)] [New Thread 1120074672 (LWP 26370)] [New Thread 1128463280 (LWP 26371)] [New Thread 1136860080 (LWP 26374)] [New Thread 1145248688 (LWP 26375)] [New Thread 1153637296 (LWP 26377)] [New Thread 1162025904 (LWP 26378)] [New Thread 1170422704 (LWP 26380)] [New Thread 1178819504 (LWP 26382)] [New Thread 1187216304 (LWP 26385)] [New Thread 1195613104 (LWP 26388)] [Thread 1187216304 (LWP 26385) exited] [New Thread 1187216304 (LWP 26494)] [New Thread 1204001712 (LWP 28543)] [Thread 1204001712 (LWP 28543) exited] [New Thread 1204001712 (LWP 29205)] [Thread 1204001712 (LWP 29205) exited] [New Thread 1204001712 (LWP 29206)] [Thread 1204001712 (LWP 29206) exited] [New Thread 1204001712 (LWP 29803)] Program received signal SIGINT, Interrupt. [Switching to Thread 1078020896 (LWP 25378)] 0x401a6dfc in __nanosleep_nocancel () from /lib/tls/libpthread.so.0 (gdb) thread apply all bt Thread 22 (Thread 1204001712 (LWP 29803)): #0 0x401a66a1 in __read_nocancel () from /lib/tls/libpthread.so.0 #1 0x08084d4c in read_nbytes (bsock=0x80f4e70, ptr=0x47c39a5c "wm\b\bp]\017\bX]\017\b\210\232ÃGúF\a\bpN\017\bÿÿÿÿi:[EMAIL PROTECTED] Y\f\bØ\232ÃGÛä\t\bpN\017\bZ\001", nbytes=4) at bnet.c:72 #2 0x08085067 in bnet_recv (bsock=0x80f4e70) at bnet.c:175 #3 0x080746fa in handle_UA_client_request (arg=0x80f5d70) at ua_server.c:133 #4 0x0809e4db in workq_server (arg=0x80c5920) at workq.c:347 #5 0x401a1b63 in start_thread () from /lib/tls/libpthread.so.0 #6 0x4037318a in clone () from /lib/tls/libc.so.6 Thread 18 (Thread 1187216304 (LWP 26494)): #0 0x401a66a1 in __read_nocancel () from /lib/tls/libpthread.so.0 #1 0x08084d4c in read_nbytes (bsock=0x80d30f8, ptr=0x46c3782c "6", nbytes=4) at bnet.c:72 #2 0x08085067 in bnet_recv (bsock=0x80d30f8) at bnet.c:175 #3 0x08055d88 in bget_dirmsg (bs=0x80d30f8) at getmsg.c:79 #4 0x0805e508 in msg_thread (arg=0x80d60a0) at msgchan.c:235 #5 0x401a1b63 in start_thread () from /lib/tls/libpthread.so.0 #6 0x4037318a in clone () from /lib/tls/libc.so.6 Thread 17 (Thread 1195613104 (LWP 26388)): #0 0x401a66a1 in __read_nocancel () from /lib/tls/libpthread.so.0 #1 0x08084d4c in read_nbytes (bsock=0x80f30c8, ptr=0x4743982c "I", nbytes=4) at bnet.c:72 #2 0x08085067 in bnet_recv (bsock=0x80f30c8) at bnet.c:175 #3 0x08055d88 in bget_dirmsg (bs=0x80f30c8) at getmsg.c:79 #4 0x0805e508 in msg_thread (arg=0x80d4c40) at msgchan.c:235 #5 0x401a1b63 in start_thread () from /lib/tls/libpthread.so.0 #6 0x4037318a in clone () from /lib/tls/libc.so.6 Thread 15 (Thread 1178819504 (LWP 26382)): #0 0x401a66a1 in __read_nocancel () from /lib/tls/libpthread.so.0 #1 0x08084d4c in read_nbytes (bsock=0x80d46f0, ptr=0x4643582c "7", nbytes=4) at bnet.c:72 #2 0x08085067 in bnet_recv (bsock=0x80d46f0) at bnet.c:175 #3 0x08055d88 in bget_dirmsg (bs=0x80d46f0) at getmsg.c:79 #4 0x0805e508 in msg_thread (arg=0x80d0a60) at msgchan.c:235 #5 0x401a1b63 in start_thread () from /lib/tls/libpthread.so.0 #6 0x4037318a in clone () from /lib/tls/libc.so.6 Thread 14 (Thread 1170422704 (LWP 26380)): #0 0x401a66a1 in __read_nocancel () from /lib/tls/libpthread.so.0 #1 0x08084d4c in read_nbytes (bsock=0x80d2240, ptr=0x45c3382c "4", nbytes=4) at bnet.c:72 #2 0x08085067 in bnet_recv (bsock=0x80d2240) at bnet.c:175 #3 0x08055d88 in bget_dirmsg (bs=0x80d2240) at getmsg.c:79 #4 0x0805e508 in msg_thread (arg=0x80cfcd0) at msgchan.c:235 #5 0x401a1b63 in start_thread () from /lib/tls/libpthread.so.0 #6 0x4037318a in clone () from /lib/tls/libc.so.6 Thread 13 (Thread 1162025904 (LWP 26378)): #0 0x401a66a1 in __read_nocancel () from /lib/tls/libpthread.so.0 #1 0x08084d4c in read_nbytes (bsock=0x80d5bc0, ptr=0x4543108c "[EMAIL PROTECTED]<@[EMAIL PROTECTED]", nbytes=4) at bnet.c:72 #2 0x08085067 in bnet_recv (bsock=0x80d5bc0) at bnet.c:175 #3 0x08055d88 in bget_dirmsg (bs=0x80d5bc0) at getmsg.c:79 #4 0x0804daf8 in wait_for_job_termination (jcr=0x80d0a60) at backup.c:243 #5 0x0804da23 in do_backup (jcr=0x80d0a60) at backup.c:207 #6 0x08058946 in job_thread (arg=0x80d0a60) at job.c:215 #7 0x0805c08a in jobq_server (arg=0x80c57a0) at jobq.c:444 #8 0x401a1b63 in start_thread () from /lib/tls/libpthread.so.0 #9 0x4037318a in clone () from /lib/tls/libc.so.6 Thread 12 (Thread 1153637296 (LWP 26377)): #0 0x401a66a1 in __read_nocancel () from /lib/tls/libpthread.so.0 #1 0x08084d4c in read_nbytes (bsock=0x80d38f0, ptr=0x44c3108c "[EMAIL PROTECTED]<@[EMAIL PROTECTED]", nbytes=4) at bnet.c:72 #2 0x08085067 in bnet_recv (bsock=0x80d38f0) at bnet.c:175 #3 0x08055d88 in bget_dirmsg (bs=0x80d38f0) at getmsg.c:79 #4 0x0804daf8 in wait_for_job_termination (jcr=0x80cfcd0) at backup.c:243 #5 0x0804da23 in do_backup (jcr=0x80cfcd0) at backup.c:207 #6 0x08058946 in job_thread (arg=0x80cfcd0) at job.c:215 #7 0x0805c08a in jobq_server (arg=0x80c57a0) at jobq.c:444 #8 0x401a1b63 in start_thread () from /lib/tls/libpthread.so.0 #9 0x4037318a in clone () from /lib/tls/libc.so.6 Thread 11 (Thread 1145248688 (LWP 26375)): #0 0x401a66a1 in __read_nocancel () from /lib/tls/libpthread.so.0 #1 0x08084d4c in read_nbytes (bsock=0x80d9bc0, ptr=0x4443108c "9Q\b\b `\r\bÀ\233\r\bX\022CD\210]\005\bÀ\233\r\b [EMAIL PROTECTED],w\r\b\200Î<@[EMAIL PROTECTED]", nbytes=4) at bnet.c:72 #2 0x08085067 in bnet_recv (bsock=0x80d9bc0) at bnet.c:175 #3 0x08055d88 in bget_dirmsg (bs=0x80d9bc0) at getmsg.c:79 #4 0x0804daf8 in wait_for_job_termination (jcr=0x80d60a0) at backup.c:243 #5 0x0804da23 in do_backup (jcr=0x80d60a0) at backup.c:207 #6 0x08058946 in job_thread (arg=0x80d60a0) at job.c:215 #7 0x0805c08a in jobq_server (arg=0x80c57a0) at jobq.c:444 #8 0x401a1b63 in start_thread () from /lib/tls/libpthread.so.0 #9 0x4037318a in clone () from /lib/tls/libc.so.6 Thread 10 (Thread 1136860080 (LWP 26374)): #0 0x401a66a1 in __read_nocancel () from /lib/tls/libpthread.so.0 #1 0x08084d4c in read_nbytes (bsock=0x80d8da8, ptr=0x43c3182c "?", nbytes=4) at bnet.c:72 #2 0x08085067 in bnet_recv (bsock=0x80d8da8) at bnet.c:175 #3 0x08055d88 in bget_dirmsg (bs=0x80d8da8) at getmsg.c:79 #4 0x0805e508 in msg_thread (arg=0x80c7230) at msgchan.c:235 #5 0x401a1b63 in start_thread () from /lib/tls/libpthread.so.0 #6 0x4037318a in clone () from /lib/tls/libc.so.6 Thread 9 (Thread 1128463280 (LWP 26371)): #0 0x401a66a1 in __read_nocancel () from /lib/tls/libpthread.so.0 #1 0x08084d4c in read_nbytes (bsock=0x80d9588, ptr=0x4342f08c "[EMAIL PROTECTED]<@[EMAIL PROTECTED]", nbytes=4) at bnet.c:72 #2 0x08085067 in bnet_recv (bsock=0x80d9588) at bnet.c:175 #3 0x08055d88 in bget_dirmsg (bs=0x80d9588) at getmsg.c:79 #4 0x0804daf8 in wait_for_job_termination (jcr=0x80c7230) at backup.c:243 #5 0x0804da23 in do_backup (jcr=0x80c7230) at backup.c:207 #6 0x08058946 in job_thread (arg=0x80c7230) at job.c:215 #7 0x0805c08a in jobq_server (arg=0x80c57a0) at jobq.c:444 #8 0x401a1b63 in start_thread () from /lib/tls/libpthread.so.0 #9 0x4037318a in clone () from /lib/tls/libc.so.6 Thread 8 (Thread 1120074672 (LWP 26370)): #0 0x401a66a1 in __read_nocancel () from /lib/tls/libpthread.so.0 #1 0x08084d4c in read_nbytes (bsock=0x80ceae8, ptr=0x42c2f82c "=", nbytes=4) at bnet.c:72 #2 0x08085067 in bnet_recv (bsock=0x80ceae8) at bnet.c:175 #3 0x08055d88 in bget_dirmsg (bs=0x80ceae8) at getmsg.c:79 #4 0x0805e508 in msg_thread (arg=0x80dcc48) at msgchan.c:235 #5 0x401a1b63 in start_thread () from /lib/tls/libpthread.so.0 #6 0x4037318a in clone () from /lib/tls/libc.so.6 Thread 7 (Thread 1111620528 (LWP 26368)): #0 0x401a66a1 in __read_nocancel () from /lib/tls/libpthread.so.0 #1 0x08084d4c in read_nbytes (bsock=0x80d8128, ptr=0x4241f08c "9Q\b\bHÌ\r\b(\201\r\bXòAB\210]\005\b([EMAIL PROTECTED]<@[EMAIL PROTECTED]", nbytes=4) at bnet.c:72 #2 0x08085067 in bnet_recv (bsock=0x80d8128) at bnet.c:175 #3 0x08055d88 in bget_dirmsg (bs=0x80d8128) at getmsg.c:79 #4 0x0804daf8 in wait_for_job_termination (jcr=0x80dcc48) at backup.c:243 #5 0x0804da23 in do_backup (jcr=0x80dcc48) at backup.c:207 #6 0x08058946 in job_thread (arg=0x80dcc48) at job.c:215 #7 0x0805c08a in jobq_server (arg=0x80c57a0) at jobq.c:444 #8 0x401a1b63 in start_thread () from /lib/tls/libpthread.so.0 #9 0x4037318a in clone () from /lib/tls/libc.so.6 Thread 6 (Thread 1103227824 (LWP 26367)): #0 0x401a66a1 in __read_nocancel () from /lib/tls/libpthread.so.0 #1 0x08084d4c in read_nbytes (bsock=0x80f37b0, ptr=0x41c1e08c "[EMAIL PROTECTED]@[EMAIL PROTECTED]<@[EMAIL PROTECTED]", nbytes=4) at bnet.c:72 #2 0x08085067 in bnet_recv (bsock=0x80f37b0) at bnet.c:175 #3 0x08055d88 in bget_dirmsg (bs=0x80f37b0) at getmsg.c:79 #4 0x0804daf8 in wait_for_job_termination (jcr=0x80d4c40) at backup.c:243 #5 0x0804da23 in do_backup (jcr=0x80d4c40) at backup.c:207 #6 0x08058946 in job_thread (arg=0x80d4c40) at job.c:215 #7 0x0805c08a in jobq_server (arg=0x80c57a0) at jobq.c:444 #8 0x401a1b63 in start_thread () from /lib/tls/libpthread.so.0 #9 0x4037318a in clone () from /lib/tls/libc.so.6 Thread 3 (Thread 1094839216 (LWP 25381)): #0 0x401a4440 in pthread_cond_timedwait@@GLIBC_2.3.2 () from /lib/tls/libpthread.so.0 #1 0x0809dbd8 in watchdog_thread (arg=0x0) at watchdog.c:289 #2 0x401a1b63 in start_thread () from /lib/tls/libpthread.so.0 #3 0x4037318a in clone () from /lib/tls/libc.so.6 Thread 2 (Thread 1086450608 (LWP 25380)): #0 0x4036ca27 in select () from /lib/tls/libc.so.6 #1 0x080877e0 in bnet_thread_server (addrs=0x40c1eb90, max_clients=-514, client_wq=0x80c5920, handle_client_request=0xfffffdfe) at bnet_server.c:154 #2 0x08074569 in connect_thread (arg=0xfffffdfe) at ua_server.c:79 #3 0x401a1b63 in start_thread () from /lib/tls/libpthread.so.0 #4 0x4037318a in clone () from /lib/tls/libc.so.6 Thread 1 (Thread 1078020896 (LWP 25378)): #0 0x401a6dfc in __nanosleep_nocancel () from /lib/tls/libpthread.so.0 #1 0x08083c64 in bmicrosleep (sec=60, usec=0) at bsys.c:59 #2 0x08061d8d in wait_for_next_job (one_shot_job_to_run=0x0) at scheduler.c:101 #3 0x0804b368 in main (argc=135079760, argv=0x80a0a58) at dird.c:244 I'm not sure, but it could have to do with the Concurrent Client Jobs = 2 in one of my client-configs. I'm restarting the director under the debugger with Concurrent Client Jobs = 1 and see what happens. Does this help? Regards Volker -- Volker Sauer * Alexanderstrasse 39/217 * 64283 Darmstadt Telefon: 06151-154260 * Mobil: 0179-6901475 * ICQ#98164307 mailto:[EMAIL PROTECTED] * http://www.volker-sauer.de PGPKey-Fingerprint: DB2611C7B12E0B2739992E4F7E354E4D5DD5D0E0
signature.asc
Description: Digital signature