Hi,

If either of you has updated any packages in Debian recently I would do so again. I had some serious issues with a recent syslog-ng package that was causing multiple sub-systems to fail (bind, sendmail, SSH ... etc.) . With bind not responding the Bacula director also stopped responding. This happend a few times to me last week before updating the syslog-ng package.

I might be off on this one but double check your syslog-ng package for a newer version and install if you need it.

Sean

John Donagher wrote:

I've also started having this problem within the past few weeks (since I
upgraded to 1.36.3, I believe). It doesn't happen every night, but it
happens more than once a week. Before that I have never seen this
problem, and I've been using Bacula for 6 months..

I'm running Debian Sarge and had no problems with /lib/tls - but I moved
it aside anyway to see if it helped a couple weeks ago, but it doesn't
seem to make a difference.

I don't have any debugging output unfortunately - but I will try and get
some next time it happens.


On Tue, 2005-05-24 at 13:25 +0200, Masopust Christian wrote:
Yesterday in the evening, just when starting some jobs my director again freezes...

here's the output of btraceback (my system is Fedora Core 3, Bacula is
1.36.3):

From [EMAIL PROTECTED] Mon May 23 22:01:32 2005 Return-Path: <[EMAIL PROTECTED]> Received: from atpcc7fc.sie.siemens.at (atpcc7fc.sie.siemens.at [127.0.0.1]) by atpcc7fc.sie.siemens.at (8.13.1/8.13.1) with SMTP id j4NK1VFi027151 for <[EMAIL PROTECTED]>; Mon, 23 May 2005 22:01:31 +0200 Message-Id: <[EMAIL PROTECTED]> From: [EMAIL PROTECTED] Subject: Bacula GDB traceback of bacula-dir Sender: [EMAIL PROTECTED] To: [EMAIL PROTECTED] Date: Mon, 23 May 2005 22:01:31 +0200 Status: R

Using host libthread_db library "/lib/libthread_db.so.1". [Thread debugging using libthread_db enabled] [New Thread 16384 (LWP 3346)] [New Thread 32769 (LWP 3351)] [Thread debugging using libthread_db enabled] [New Thread 16384 (LWP 3346)] [New Thread 32769 (LWP 3351)] [Thread debugging using libthread_db enabled] [New Thread 16384 (LWP 3346)] [New Thread 32769 (LWP 3351)] [New Thread 16386 (LWP 3352)] [New Thread 32771 (LWP 3353)] [New Thread 19726340 (LWP 26151)] [New Thread 19742725 (LWP 26152)] [New Thread 19759110 (LWP 26164)] [New Thread 19775495 (LWP 26172)] [New Thread 19791880 (LWP 26180)] [New Thread 19808265 (LWP 26203)] [New Thread 19824650 (LWP 26267)] [New Thread 19841035 (LWP 26294)] [New Thread 19857420 (LWP 26320)] [New Thread 19873805 (LWP 26381)] [New Thread 19890190 (LWP 26411)] [New Thread 19906575 (LWP 26434)] 0x004c80d4 in __pthread_sigsuspend () from /lib/i686/libpthread.so.0 $1 = "atpcc7fc-dir", '\0' <repeats 17 times> $2 = 0x80b5230 "bacula-dir" $3 = 0x80b5dd0 "/opt/bacula/sbin/" $4 = "MySQL" $5 = 0x80a321c "1.36.3 (22 April 2005)" $6 = 0x809bfb8 "i686-redhat-linux-gnu" $7 = 0x809bfb1 "redhat" $8 = 0x809bfa4 "(Heidelberg)" #0 0x004c80d4 in __pthread_sigsuspend () from /lib/i686/libpthread.so.0 #1 0x004c7708 in __pthread_wait_for_restart_signal () from /lib/i686/libpthread.so.0 #2 0x004c9720 in __pthread_alt_lock () from /lib/i686/libpthread.so.0 #3 0x004c614e in pthread_mutex_lock () from /lib/i686/libpthread.so.0 #4 0x08057dab in jobq_add (jq=0x80b4300, jcr=0x80fc570) at jobq.c:240 #5 0x080566d8 in run_job (jcr=0x80fc570) at job.c:140 #6 0x0804c034 in main (argc=0, argv=0x8090b55) at dird.c:241

Thread 16 (Thread 19906575 (LWP 26434)): #0 0x004c80d4 in __pthread_sigsuspend () from /lib/i686/libpthread.so.0 #1 0x004c7708 in __pthread_wait_for_restart_signal () from /lib/i686/libpthread.so.0 #2 0x004c3fab in [EMAIL PROTECTED] () from /lib/i686/libpthread.so.0 #3 0x08087a7a in rwl_writelock (rwl=0x80b46c0) at rwlock.c:231 #4 0x0807fd00 in lock_jcr_chain () at jcr.c:544 #5 0x08080491 in new_jcr (size=-1254097504, daemon_free_jcr=0xfffffffc) at jcr.c:218 #6 0x0806d182 in new_control_jcr (base_name=0x809bf69 "*Console*", job_type=-4) at ua_server.c:90 #7 0x0806d38b in handle_UA_client_request (arg=0x8117e88) at ua_server.c:122 #8 0x0808ee1b in workq_server (arg=0x80b4480) at workq.c:347 #9 0x004c4ce1 in pthread_start_thread () from /lib/i686/libpthread.so.0 #10 0x0043661a in clone () from /lib/i686/libc.so.6

Thread 15 (Thread 19890190 (LWP 26411)): #0 0x004c80d4 in __pthread_sigsuspend () from /lib/i686/libpthread.so.0 #1 0x004c7708 in __pthread_wait_for_restart_signal () from /lib/i686/libpthread.so.0 #2 0x004c3fab in [EMAIL PROTECTED] () from /lib/i686/libpthread.so.0 #3 0x08087a7a in rwl_writelock (rwl=0x80b46c0) at rwlock.c:231 #4 0x0807fd00 in lock_jcr_chain () at jcr.c:544 #5 0x08080491 in new_jcr (size=-1252000352, daemon_free_jcr=0xfffffffc) at jcr.c:218 #6 0x0806d182 in new_control_jcr (base_name=0x809bf69 "*Console*", job_type=-4) at ua_server.c:90 #7 0x0806d38b in handle_UA_client_request (arg=0x8116c68) at ua_server.c:122 #8 0x0808ee1b in workq_server (arg=0x80b4480) at workq.c:347 #9 0x004c4ce1 in pthread_start_thread () from /lib/i686/libpthread.so.0 #10 0x0043661a in clone () from /lib/i686/libc.so.6

Thread 14 (Thread 19873805 (LWP 26381)): #0 0x004c80d4 in __pthread_sigsuspend () from /lib/i686/libpthread.so.0 #1 0x004c7708 in __pthread_wait_for_restart_signal () from /lib/i686/libpthread.so.0 #2 0x004c3fab in [EMAIL PROTECTED] () from /lib/i686/libpthread.so.0 #3 0x08087a7a in rwl_writelock (rwl=0x80b46c0) at rwlock.c:231 #4 0x0807fd00 in lock_jcr_chain () at jcr.c:544 #5 0x08080491 in new_jcr (size=-1249903200, daemon_free_jcr=0xfffffffc) at jcr.c:218 #6 0x0806d182 in new_control_jcr (base_name=0x809bf69 "*Console*", job_type=-4) at ua_server.c:90 #7 0x0806d38b in handle_UA_client_request (arg=0x8115a48) at ua_server.c:122 #8 0x0808ee1b in workq_server (arg=0x80b4480) at workq.c:347 #9 0x004c4ce1 in pthread_start_thread () from /lib/i686/libpthread.so.0 #10 0x0043661a in clone () from /lib/i686/libc.so.6

Thread 13 (Thread 19857420 (LWP 26320)): #0 0x004c80d4 in __pthread_sigsuspend () from /lib/i686/libpthread.so.0 #1 0x004c7708 in __pthread_wait_for_restart_signal () from /lib/i686/libpthread.so.0 #2 0x004c3fab in [EMAIL PROTECTED] () from /lib/i686/libpthread.so.0 #3 0x08087a7a in rwl_writelock (rwl=0x80b46c0) at rwlock.c:231 #4 0x0807fd00 in lock_jcr_chain () at jcr.c:544 #5 0x08080491 in new_jcr (size=-1247806048, daemon_free_jcr=0xfffffffc) at jcr.c:218 #6 0x0806d182 in new_control_jcr (base_name=0x809bf69 "*Console*", job_type=-4) at ua_server.c:90 #7 0x0806d38b in handle_UA_client_request (arg=0x80fdf48) at ua_server.c:122 #8 0x0808ee1b in workq_server (arg=0x80b4480) at workq.c:347 #9 0x004c4ce1 in pthread_start_thread () from /lib/i686/libpthread.so.0 #10 0x0043661a in clone () from /lib/i686/libc.so.6

Thread 12 (Thread 19841035 (LWP 26294)): #0 0x004c80d4 in __pthread_sigsuspend () from /lib/i686/libpthread.so.0 #1 0x004c7708 in __pthread_wait_for_restart_signal () from /lib/i686/libpthread.so.0 #2 0x004c3fab in [EMAIL PROTECTED] () from /lib/i686/libpthread.so.0 #3 0x08087a7a in rwl_writelock (rwl=0x80b46c0) at rwlock.c:231 #4 0x0807fd00 in lock_jcr_chain () at jcr.c:544 #5 0x08080491 in new_jcr (size=-1245708896, daemon_free_jcr=0xfffffffc) at jcr.c:218 #6 0x0806d182 in new_control_jcr (base_name=0x809bf69 "*Console*", job_type=-4) at ua_server.c:90 #7 0x0806d38b in handle_UA_client_request (arg=0x80fde78) at ua_server.c:122 #8 0x0808ee1b in workq_server (arg=0x80b4480) at workq.c:347 #9 0x004c4ce1 in pthread_start_thread () from /lib/i686/libpthread.so.0 #10 0x0043661a in clone () from /lib/i686/libc.so.6

Thread 11 (Thread 19824650 (LWP 26267)): #0 0x004c80d4 in __pthread_sigsuspend () from /lib/i686/libpthread.so.0 #1 0x004c7708 in __pthread_wait_for_restart_signal () from /lib/i686/libpthread.so.0 #2 0x004c3fab in [EMAIL PROTECTED] () from /lib/i686/libpthread.so.0 #3 0x08087a7a in rwl_writelock (rwl=0x80b46c0) at rwlock.c:231 #4 0x0807fd00 in lock_jcr_chain () at jcr.c:544 #5 0x08080491 in new_jcr (size=-1243611744, daemon_free_jcr=0xfffffffc) at jcr.c:218 #6 0x0806d182 in new_control_jcr (base_name=0x809bf69 "*Console*", job_type=-4) at ua_server.c:90 #7 0x0806d38b in handle_UA_client_request (arg=0x80fddf8) at ua_server.c:122 #8 0x0808ee1b in workq_server (arg=0x80b4480) at workq.c:347 #9 0x004c4ce1 in pthread_start_thread () from /lib/i686/libpthread.so.0 #10 0x0043661a in clone () from /lib/i686/libc.so.6

Thread 10 (Thread 19808265 (LWP 26203)): #0 0x004c80d4 in __pthread_sigsuspend () from /lib/i686/libpthread.so.0 #1 0x004c7708 in __pthread_wait_for_restart_signal () from /lib/i686/libpthread.so.0 #2 0x004c3fab in [EMAIL PROTECTED] () from /lib/i686/libpthread.so.0 #3 0x08087a7a in rwl_writelock (rwl=0x80b46c0) at rwlock.c:231 #4 0x0807fd00 in lock_jcr_chain () at jcr.c:544 #5 0x08080491 in new_jcr (size=-1241514592, daemon_free_jcr=0xfffffffc) at jcr.c:218 #6 0x0806d182 in new_control_jcr (base_name=0x809bf69 "*Console*", job_type=-4) at ua_server.c:90 #7 0x0806d38b in handle_UA_client_request (arg=0x80fdd78) at ua_server.c:122 #8 0x0808ee1b in workq_server (arg=0x80b4480) at workq.c:347 #9 0x004c4ce1 in pthread_start_thread () from /lib/i686/libpthread.so.0 #10 0x0043661a in clone () from /lib/i686/libc.so.6

Thread 9 (Thread 19791880 (LWP 26180)): #0 0x004c80d4 in __pthread_sigsuspend () from /lib/i686/libpthread.so.0 #1 0x004c7708 in __pthread_wait_for_restart_signal () from /lib/i686/libpthread.so.0 #2 0x004c3fab in [EMAIL PROTECTED] () from /lib/i686/libpthread.so.0 #3 0x08087a7a in rwl_writelock (rwl=0x80b46c0) at rwlock.c:231 #4 0x0807fd00 in lock_jcr_chain () at jcr.c:544 #5 0x08080491 in new_jcr (size=-1239417440, daemon_free_jcr=0xfffffffc) at jcr.c:218 #6 0x0806d182 in new_control_jcr (base_name=0x809bf69 "*Console*", job_type=-4) at ua_server.c:90 #7 0x0806d38b in handle_UA_client_request (arg=0x80fdcc8) at ua_server.c:122 #8 0x0808ee1b in workq_server (arg=0x80b4480) at workq.c:347 #9 0x004c4ce1 in pthread_start_thread () from /lib/i686/libpthread.so.0 #10 0x0043661a in clone () from /lib/i686/libc.so.6

Thread 8 (Thread 19775495 (LWP 26172)): #0 0x004c80d4 in __pthread_sigsuspend () from /lib/i686/libpthread.so.0 #1 0x004c7708 in __pthread_wait_for_restart_signal () from /lib/i686/libpthread.so.0 #2 0x004c3fab in [EMAIL PROTECTED] () from /lib/i686/libpthread.so.0 #3 0x08087a7a in rwl_writelock (rwl=0x80b46c0) at rwlock.c:231 #4 0x0807fd00 in lock_jcr_chain () at jcr.c:544 #5 0x08080491 in new_jcr (size=-1237320288, daemon_free_jcr=0xfffffffc) at jcr.c:218 #6 0x0806d182 in new_control_jcr (base_name=0x809bf69 "*Console*", job_type=-4) at ua_server.c:90 #7 0x0806d38b in handle_UA_client_request (arg=0x810b308) at ua_server.c:122 #8 0x0808ee1b in workq_server (arg=0x80b4480) at workq.c:347 #9 0x004c4ce1 in pthread_start_thread () from /lib/i686/libpthread.so.0 #10 0x0043661a in clone () from /lib/i686/libc.so.6

Thread 7 (Thread 19759110 (LWP 26164)): #0 0x004c80d4 in __pthread_sigsuspend () from /lib/i686/libpthread.so.0 #1 0x004c7708 in __pthread_wait_for_restart_signal () from /lib/i686/libpthread.so.0 #2 0x004c3fab in [EMAIL PROTECTED] () from /lib/i686/libpthread.so.0 #3 0x08087a7a in rwl_writelock (rwl=0x80b46c0) at rwlock.c:231 #4 0x0807fd00 in lock_jcr_chain () at jcr.c:544 #5 0x08080491 in new_jcr (size=-1235223136, daemon_free_jcr=0xfffffffc) at jcr.c:218 #6 0x0806d182 in new_control_jcr (base_name=0x809bf69 "*Console*", job_type=-4) at ua_server.c:90 #7 0x0806d38b in handle_UA_client_request (arg=0x810b288) at ua_server.c:122 #8 0x0808ee1b in workq_server (arg=0x80b4480) at workq.c:347 #9 0x004c4ce1 in pthread_start_thread () from /lib/i686/libpthread.so.0 #10 0x0043661a in clone () from /lib/i686/libc.so.6

Thread 6 (Thread 19742725 (LWP 26152)): #0 0x004c80d4 in __pthread_sigsuspend () from /lib/i686/libpthread.so.0 #1 0x004c7708 in __pthread_wait_for_restart_signal () from /lib/i686/libpthread.so.0 #2 0x004c3fab in [EMAIL PROTECTED] () from /lib/i686/libpthread.so.0 #3 0x08087a7a in rwl_writelock (rwl=0x80b4ee0) at rwlock.c:231 #4 0x0808e304 in wd_lock () at watchdog.c:305 #5 0x0808e5e4 in unregister_watchdog (wd=0x80da6b0) at watchdog.c:200 #6 0x0808f33d in stop_btimer (wid=0x80e7470) at btimers.c:246 #7 0x0804c63b in authenticate_storage_daemon (jcr=0x80ef9f0, store=0x80b85d8) at authenticate.c:103 #8 0x08059dc5 in connect_to_storage_daemon (jcr=0x80ef9f0, retry_interval=10, max_retry_time=1800, verbose=1) at msgchan.c:89 #9 0x0804da74 in do_backup (jcr=0x80ef9f0) at backup.c:145 #10 0x08056204 in job_thread (arg=0x80ef9f0) at job.c:215 #11 0x080583bd in jobq_server (arg=0x80b4300) at jobq.c:444 #12 0x004c4ce1 in pthread_start_thread () from /lib/i686/libpthread.so.0 #13 0x0043661a in clone () from /lib/i686/libc.so.6

Thread 5 (Thread 19726340 (LWP 26151)): #0 0x004c80d4 in __pthread_sigsuspend () from /lib/i686/libpthread.so.0 #1 0x004c7708 in __pthread_wait_for_restart_signal () from /lib/i686/libpthread.so.0 #2 0x004c3fab in [EMAIL PROTECTED] () from /lib/i686/libpthread.so.0 #3 0x08087a7a in rwl_writelock (rwl=0x80b46c0) at rwlock.c:231 #4 0x0807fd00 in lock_jcr_chain () at jcr.c:544 #5 0x080588e1 in jobq_server (arg=0x80b4300) at jobq.c:582 #6 0x004c4ce1 in pthread_start_thread () from /lib/i686/libpthread.so.0 #7 0x0043661a in clone () from /lib/i686/libc.so.6

Thread 4 (Thread 32771 (LWP 3353)): #0 0x004c80d4 in __pthread_sigsuspend () from /lib/i686/libpthread.so.0 #1 0x004c7708 in __pthread_wait_for_restart_signal () from /lib/i686/libpthread.so.0 #2 0x004c9720 in __pthread_alt_lock () from /lib/i686/libpthread.so.0 #3 0x004c614e in pthread_mutex_lock () from /lib/i686/libpthread.so.0 #4 0x080804f6 in get_next_jcr (prev_jcr=0xfffffffc) at jcr.c:581 #5 0x08080619 in jcr_timeout_check (self=0x80c3360) at jcr.c:615 #6 0x0808e533 in watchdog_thread (arg=0x0) at watchdog.c:257 #7 0x004c4ce1 in pthread_start_thread () from /lib/i686/libpthread.so.0 #8 0x0043661a in clone () from /lib/i686/libc.so.6

Thread 3 (Thread 16386 (LWP 3352)): #0 0x0042f251 in select () from /lib/i686/libc.so.6 #1 0x00000006 in ?? () #2 0x080cf47c in ?? () #3 0xb7f572f0 in ?? () #4 0x00000000 in ?? ()

Thread 2 (Thread 32769 (LWP 3351)): #0 0x0042cf7a in poll () from /lib/i686/libc.so.6 #1 0x004c54c0 in __pthread_manager () from /lib/i686/libpthread.so.0 #2 0x0043661a in clone () from /lib/i686/libc.so.6

Thread 1 (Thread 16384 (LWP 3346)): #0 0x004c80d4 in __pthread_sigsuspend () from /lib/i686/libpthread.so.0 #1 0x004c7708 in __pthread_wait_for_restart_signal () from /lib/i686/libpthread.so.0 #2 0x004c9720 in __pthread_alt_lock () from /lib/i686/libpthread.so.0 #3 0x004c614e in pthread_mutex_lock () from /lib/i686/libpthread.so.0 #4 0x08057dab in jobq_add (jq=0x80b4300, jcr=0x80fc570) at jobq.c:240 #5 0x080566d8 in run_job (jcr=0x80fc570) at job.c:140 #6 0x0804c034 in main (argc=0, argv=0x8090b55) at dird.c:241 #0 0x004c80d4 in __pthread_sigsuspend () from /lib/i686/libpthread.so.0 No symbol table info available. #1 0x004c7708 in __pthread_wait_for_restart_signal () from /lib/i686/libpthread.so.0 No symbol table info available. #2 0x004c9720 in __pthread_alt_lock () from /lib/i686/libpthread.so.0 No symbol table info available. #3 0x004c614e in pthread_mutex_lock () from /lib/i686/libpthread.so.0 No symbol table info available. #4 0x08057dab in jobq_add (jq=0x80b4300, jcr=0x80fc570) at jobq.c:240 240 if ((stat = pthread_mutex_lock(&jq->mutex)) != 0) { Current language: auto; currently c++ stat = 135251312 sched_pkt = (wait_pkt *) 0xfffffffc item = (jobq_item_t *) 0x80fc570 li = (jobq_item_t *) 0x7f wtime = -1 id = 135251948 #5 0x080566d8 in run_job (jcr=0x80fc570) at job.c:140 140 if ((stat = jobq_add(&job_queue, jcr)) != 0) { be = {<SMARTALLOC> = {<No data fields>}, buf_ = 0x80fb448 "ðù\016\bpÅ \017\b\001", berrno_ = 1} stat = 134822556 errstat = 134822556 JobId = 346 #6 0x0804c034 in main (argc=0, argv=0x8090b55) at dird.c:241 241 run_job(jcr); /* run job */ jcr = (JCR *) 0x80fc570 test_config = 0 ch = 135251312 no_signals = 0 uid = 0x0 gid = 0x0 #0 0x00000000 in ?? () No symbol table info available.


any idea??

what's your practice? do you restart bacula every day? should i?

Thanks a lot, Christian





-------------------------------------------------------
This SF.Net email is sponsored by Yahoo.
Introducing Yahoo! Search Developer Network - Create apps using Yahoo!
Search APIs Find out how you can build Yahoo! directly into your own
Applications - visit http://developer.yahoo.net/?fr=fad-ysdn-ostg-q22005
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users



-------------------------------------------------------
This SF.Net email is sponsored by Yahoo.
Introducing Yahoo! Search Developer Network - Create apps using Yahoo!
Search APIs Find out how you can build Yahoo! directly into your own
Applications - visit http://developer.yahoo.net/?fr=offad-ysdn-ostg-q22005
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

Reply via email to