I've also started having this problem within the past few weeks (since I
upgraded to 1.36.3, I believe). It doesn't happen every night, but it
happens more than once a week. Before that I have never seen this
problem, and I've been using Bacula for 6 months..

I'm running Debian Sarge and had no problems with /lib/tls - but I moved
it aside anyway to see if it helped a couple weeks ago, but it doesn't
seem to make a difference.

I don't have any debugging output unfortunately - but I will try and get
some next time it happens.


On Tue, 2005-05-24 at 13:25 +0200, Masopust Christian wrote:
> 
> Yesterday in the evening, just when starting some jobs my director 
> again freezes...
> 
> here's the output of btraceback (my system is Fedora Core 3, Bacula is
> 1.36.3):
> 
> From [EMAIL PROTECTED]  Mon May 23 22:01:32 2005 
> Return-Path: <[EMAIL PROTECTED]> 
> Received: from atpcc7fc.sie.siemens.at (atpcc7fc.sie.siemens.at
> [127.0.0.1]) 
>         by atpcc7fc.sie.siemens.at (8.13.1/8.13.1) with SMTP id
> j4NK1VFi027151 
>         for <[EMAIL PROTECTED]>; Mon, 23 May 2005 22:01:31 +0200 
> Message-Id: <[EMAIL PROTECTED]> 
> From: [EMAIL PROTECTED] 
> Subject: Bacula GDB traceback of bacula-dir 
> Sender: [EMAIL PROTECTED] 
> To: [EMAIL PROTECTED] 
> Date: Mon, 23 May 2005 22:01:31 +0200 
> Status: R
> 
> Using host libthread_db library "/lib/libthread_db.so.1". 
> [Thread debugging using libthread_db enabled] 
> [New Thread 16384 (LWP 3346)] 
> [New Thread 32769 (LWP 3351)] 
> [Thread debugging using libthread_db enabled] 
> [New Thread 16384 (LWP 3346)] 
> [New Thread 32769 (LWP 3351)] 
> [Thread debugging using libthread_db enabled] 
> [New Thread 16384 (LWP 3346)] 
> [New Thread 32769 (LWP 3351)] 
> [New Thread 16386 (LWP 3352)] 
> [New Thread 32771 (LWP 3353)] 
> [New Thread 19726340 (LWP 26151)] 
> [New Thread 19742725 (LWP 26152)] 
> [New Thread 19759110 (LWP 26164)] 
> [New Thread 19775495 (LWP 26172)] 
> [New Thread 19791880 (LWP 26180)] 
> [New Thread 19808265 (LWP 26203)] 
> [New Thread 19824650 (LWP 26267)] 
> [New Thread 19841035 (LWP 26294)] 
> [New Thread 19857420 (LWP 26320)] 
> [New Thread 19873805 (LWP 26381)] 
> [New Thread 19890190 (LWP 26411)] 
> [New Thread 19906575 (LWP 26434)] 
> 0x004c80d4 in __pthread_sigsuspend () from /lib/i686/libpthread.so.0 
> $1 = "atpcc7fc-dir", '\0' <repeats 17 times> 
> $2 = 0x80b5230 "bacula-dir" 
> $3 = 0x80b5dd0 "/opt/bacula/sbin/" 
> $4 = "MySQL" 
> $5 = 0x80a321c "1.36.3 (22 April 2005)" 
> $6 = 0x809bfb8 "i686-redhat-linux-gnu" 
> $7 = 0x809bfb1 "redhat" 
> $8 = 0x809bfa4 "(Heidelberg)" 
> #0  0x004c80d4 in __pthread_sigsuspend ()
> from /lib/i686/libpthread.so.0 
> #1  0x004c7708 in __pthread_wait_for_restart_signal ()
> from /lib/i686/libpthread.so.0 
> #2  0x004c9720 in __pthread_alt_lock ()
> from /lib/i686/libpthread.so.0 
> #3  0x004c614e in pthread_mutex_lock ()
> from /lib/i686/libpthread.so.0 
> #4  0x08057dab in jobq_add (jq=0x80b4300, jcr=0x80fc570) at
> jobq.c:240 
> #5  0x080566d8 in run_job (jcr=0x80fc570) at job.c:140 
> #6  0x0804c034 in main (argc=0, argv=0x8090b55) at dird.c:241
> 
> Thread 16 (Thread 19906575 (LWP 26434)): 
> #0  0x004c80d4 in __pthread_sigsuspend ()
> from /lib/i686/libpthread.so.0 
> #1  0x004c7708 in __pthread_wait_for_restart_signal ()
> from /lib/i686/libpthread.so.0 
> #2  0x004c3fab in [EMAIL PROTECTED] ()
> from /lib/i686/libpthread.so.0 
> #3  0x08087a7a in rwl_writelock (rwl=0x80b46c0) at rwlock.c:231 
> #4  0x0807fd00 in lock_jcr_chain () at jcr.c:544 
> #5  0x08080491 in new_jcr (size=-1254097504,
> daemon_free_jcr=0xfffffffc) at jcr.c:218 
> #6  0x0806d182 in new_control_jcr (base_name=0x809bf69 "*Console*",
> job_type=-4) at ua_server.c:90 
> #7  0x0806d38b in handle_UA_client_request (arg=0x8117e88) at
> ua_server.c:122 
> #8  0x0808ee1b in workq_server (arg=0x80b4480) at workq.c:347 
> #9  0x004c4ce1 in pthread_start_thread ()
> from /lib/i686/libpthread.so.0 
> #10 0x0043661a in clone () from /lib/i686/libc.so.6
> 
> Thread 15 (Thread 19890190 (LWP 26411)): 
> #0  0x004c80d4 in __pthread_sigsuspend ()
> from /lib/i686/libpthread.so.0 
> #1  0x004c7708 in __pthread_wait_for_restart_signal ()
> from /lib/i686/libpthread.so.0 
> #2  0x004c3fab in [EMAIL PROTECTED] ()
> from /lib/i686/libpthread.so.0 
> #3  0x08087a7a in rwl_writelock (rwl=0x80b46c0) at rwlock.c:231 
> #4  0x0807fd00 in lock_jcr_chain () at jcr.c:544 
> #5  0x08080491 in new_jcr (size=-1252000352,
> daemon_free_jcr=0xfffffffc) at jcr.c:218 
> #6  0x0806d182 in new_control_jcr (base_name=0x809bf69 "*Console*",
> job_type=-4) at ua_server.c:90 
> #7  0x0806d38b in handle_UA_client_request (arg=0x8116c68) at
> ua_server.c:122 
> #8  0x0808ee1b in workq_server (arg=0x80b4480) at workq.c:347 
> #9  0x004c4ce1 in pthread_start_thread ()
> from /lib/i686/libpthread.so.0 
> #10 0x0043661a in clone () from /lib/i686/libc.so.6
> 
> Thread 14 (Thread 19873805 (LWP 26381)): 
> #0  0x004c80d4 in __pthread_sigsuspend ()
> from /lib/i686/libpthread.so.0 
> #1  0x004c7708 in __pthread_wait_for_restart_signal ()
> from /lib/i686/libpthread.so.0 
> #2  0x004c3fab in [EMAIL PROTECTED] ()
> from /lib/i686/libpthread.so.0 
> #3  0x08087a7a in rwl_writelock (rwl=0x80b46c0) at rwlock.c:231 
> #4  0x0807fd00 in lock_jcr_chain () at jcr.c:544 
> #5  0x08080491 in new_jcr (size=-1249903200,
> daemon_free_jcr=0xfffffffc) at jcr.c:218 
> #6  0x0806d182 in new_control_jcr (base_name=0x809bf69 "*Console*",
> job_type=-4) at ua_server.c:90 
> #7  0x0806d38b in handle_UA_client_request (arg=0x8115a48) at
> ua_server.c:122 
> #8  0x0808ee1b in workq_server (arg=0x80b4480) at workq.c:347 
> #9  0x004c4ce1 in pthread_start_thread ()
> from /lib/i686/libpthread.so.0 
> #10 0x0043661a in clone () from /lib/i686/libc.so.6
> 
> Thread 13 (Thread 19857420 (LWP 26320)): 
> #0  0x004c80d4 in __pthread_sigsuspend ()
> from /lib/i686/libpthread.so.0 
> #1  0x004c7708 in __pthread_wait_for_restart_signal ()
> from /lib/i686/libpthread.so.0 
> #2  0x004c3fab in [EMAIL PROTECTED] ()
> from /lib/i686/libpthread.so.0 
> #3  0x08087a7a in rwl_writelock (rwl=0x80b46c0) at rwlock.c:231 
> #4  0x0807fd00 in lock_jcr_chain () at jcr.c:544 
> #5  0x08080491 in new_jcr (size=-1247806048,
> daemon_free_jcr=0xfffffffc) at jcr.c:218 
> #6  0x0806d182 in new_control_jcr (base_name=0x809bf69 "*Console*",
> job_type=-4) at ua_server.c:90 
> #7  0x0806d38b in handle_UA_client_request (arg=0x80fdf48) at
> ua_server.c:122 
> #8  0x0808ee1b in workq_server (arg=0x80b4480) at workq.c:347 
> #9  0x004c4ce1 in pthread_start_thread ()
> from /lib/i686/libpthread.so.0 
> #10 0x0043661a in clone () from /lib/i686/libc.so.6
> 
> Thread 12 (Thread 19841035 (LWP 26294)): 
> #0  0x004c80d4 in __pthread_sigsuspend ()
> from /lib/i686/libpthread.so.0 
> #1  0x004c7708 in __pthread_wait_for_restart_signal ()
> from /lib/i686/libpthread.so.0 
> #2  0x004c3fab in [EMAIL PROTECTED] ()
> from /lib/i686/libpthread.so.0 
> #3  0x08087a7a in rwl_writelock (rwl=0x80b46c0) at rwlock.c:231 
> #4  0x0807fd00 in lock_jcr_chain () at jcr.c:544 
> #5  0x08080491 in new_jcr (size=-1245708896,
> daemon_free_jcr=0xfffffffc) at jcr.c:218 
> #6  0x0806d182 in new_control_jcr (base_name=0x809bf69 "*Console*",
> job_type=-4) at ua_server.c:90 
> #7  0x0806d38b in handle_UA_client_request (arg=0x80fde78) at
> ua_server.c:122 
> #8  0x0808ee1b in workq_server (arg=0x80b4480) at workq.c:347 
> #9  0x004c4ce1 in pthread_start_thread ()
> from /lib/i686/libpthread.so.0 
> #10 0x0043661a in clone () from /lib/i686/libc.so.6
> 
> Thread 11 (Thread 19824650 (LWP 26267)): 
> #0  0x004c80d4 in __pthread_sigsuspend ()
> from /lib/i686/libpthread.so.0 
> #1  0x004c7708 in __pthread_wait_for_restart_signal ()
> from /lib/i686/libpthread.so.0 
> #2  0x004c3fab in [EMAIL PROTECTED] ()
> from /lib/i686/libpthread.so.0 
> #3  0x08087a7a in rwl_writelock (rwl=0x80b46c0) at rwlock.c:231 
> #4  0x0807fd00 in lock_jcr_chain () at jcr.c:544 
> #5  0x08080491 in new_jcr (size=-1243611744,
> daemon_free_jcr=0xfffffffc) at jcr.c:218 
> #6  0x0806d182 in new_control_jcr (base_name=0x809bf69 "*Console*",
> job_type=-4) at ua_server.c:90 
> #7  0x0806d38b in handle_UA_client_request (arg=0x80fddf8) at
> ua_server.c:122 
> #8  0x0808ee1b in workq_server (arg=0x80b4480) at workq.c:347 
> #9  0x004c4ce1 in pthread_start_thread ()
> from /lib/i686/libpthread.so.0 
> #10 0x0043661a in clone () from /lib/i686/libc.so.6
> 
> Thread 10 (Thread 19808265 (LWP 26203)): 
> #0  0x004c80d4 in __pthread_sigsuspend ()
> from /lib/i686/libpthread.so.0 
> #1  0x004c7708 in __pthread_wait_for_restart_signal ()
> from /lib/i686/libpthread.so.0 
> #2  0x004c3fab in [EMAIL PROTECTED] ()
> from /lib/i686/libpthread.so.0 
> #3  0x08087a7a in rwl_writelock (rwl=0x80b46c0) at rwlock.c:231 
> #4  0x0807fd00 in lock_jcr_chain () at jcr.c:544 
> #5  0x08080491 in new_jcr (size=-1241514592,
> daemon_free_jcr=0xfffffffc) at jcr.c:218 
> #6  0x0806d182 in new_control_jcr (base_name=0x809bf69 "*Console*",
> job_type=-4) at ua_server.c:90 
> #7  0x0806d38b in handle_UA_client_request (arg=0x80fdd78) at
> ua_server.c:122 
> #8  0x0808ee1b in workq_server (arg=0x80b4480) at workq.c:347 
> #9  0x004c4ce1 in pthread_start_thread ()
> from /lib/i686/libpthread.so.0 
> #10 0x0043661a in clone () from /lib/i686/libc.so.6
> 
> Thread 9 (Thread 19791880 (LWP 26180)): 
> #0  0x004c80d4 in __pthread_sigsuspend ()
> from /lib/i686/libpthread.so.0 
> #1  0x004c7708 in __pthread_wait_for_restart_signal ()
> from /lib/i686/libpthread.so.0 
> #2  0x004c3fab in [EMAIL PROTECTED] ()
> from /lib/i686/libpthread.so.0 
> #3  0x08087a7a in rwl_writelock (rwl=0x80b46c0) at rwlock.c:231 
> #4  0x0807fd00 in lock_jcr_chain () at jcr.c:544 
> #5  0x08080491 in new_jcr (size=-1239417440,
> daemon_free_jcr=0xfffffffc) at jcr.c:218 
> #6  0x0806d182 in new_control_jcr (base_name=0x809bf69 "*Console*",
> job_type=-4) at ua_server.c:90 
> #7  0x0806d38b in handle_UA_client_request (arg=0x80fdcc8) at
> ua_server.c:122 
> #8  0x0808ee1b in workq_server (arg=0x80b4480) at workq.c:347 
> #9  0x004c4ce1 in pthread_start_thread ()
> from /lib/i686/libpthread.so.0 
> #10 0x0043661a in clone () from /lib/i686/libc.so.6
> 
> Thread 8 (Thread 19775495 (LWP 26172)): 
> #0  0x004c80d4 in __pthread_sigsuspend ()
> from /lib/i686/libpthread.so.0 
> #1  0x004c7708 in __pthread_wait_for_restart_signal ()
> from /lib/i686/libpthread.so.0 
> #2  0x004c3fab in [EMAIL PROTECTED] ()
> from /lib/i686/libpthread.so.0 
> #3  0x08087a7a in rwl_writelock (rwl=0x80b46c0) at rwlock.c:231 
> #4  0x0807fd00 in lock_jcr_chain () at jcr.c:544 
> #5  0x08080491 in new_jcr (size=-1237320288,
> daemon_free_jcr=0xfffffffc) at jcr.c:218 
> #6  0x0806d182 in new_control_jcr (base_name=0x809bf69 "*Console*",
> job_type=-4) at ua_server.c:90 
> #7  0x0806d38b in handle_UA_client_request (arg=0x810b308) at
> ua_server.c:122 
> #8  0x0808ee1b in workq_server (arg=0x80b4480) at workq.c:347 
> #9  0x004c4ce1 in pthread_start_thread ()
> from /lib/i686/libpthread.so.0 
> #10 0x0043661a in clone () from /lib/i686/libc.so.6
> 
> Thread 7 (Thread 19759110 (LWP 26164)): 
> #0  0x004c80d4 in __pthread_sigsuspend ()
> from /lib/i686/libpthread.so.0 
> #1  0x004c7708 in __pthread_wait_for_restart_signal ()
> from /lib/i686/libpthread.so.0 
> #2  0x004c3fab in [EMAIL PROTECTED] ()
> from /lib/i686/libpthread.so.0 
> #3  0x08087a7a in rwl_writelock (rwl=0x80b46c0) at rwlock.c:231 
> #4  0x0807fd00 in lock_jcr_chain () at jcr.c:544 
> #5  0x08080491 in new_jcr (size=-1235223136,
> daemon_free_jcr=0xfffffffc) at jcr.c:218 
> #6  0x0806d182 in new_control_jcr (base_name=0x809bf69 "*Console*",
> job_type=-4) at ua_server.c:90 
> #7  0x0806d38b in handle_UA_client_request (arg=0x810b288) at
> ua_server.c:122 
> #8  0x0808ee1b in workq_server (arg=0x80b4480) at workq.c:347 
> #9  0x004c4ce1 in pthread_start_thread ()
> from /lib/i686/libpthread.so.0 
> #10 0x0043661a in clone () from /lib/i686/libc.so.6
> 
> Thread 6 (Thread 19742725 (LWP 26152)): 
> #0  0x004c80d4 in __pthread_sigsuspend ()
> from /lib/i686/libpthread.so.0 
> #1  0x004c7708 in __pthread_wait_for_restart_signal ()
> from /lib/i686/libpthread.so.0 
> #2  0x004c3fab in [EMAIL PROTECTED] ()
> from /lib/i686/libpthread.so.0 
> #3  0x08087a7a in rwl_writelock (rwl=0x80b4ee0) at rwlock.c:231 
> #4  0x0808e304 in wd_lock () at watchdog.c:305 
> #5  0x0808e5e4 in unregister_watchdog (wd=0x80da6b0) at
> watchdog.c:200 
> #6  0x0808f33d in stop_btimer (wid=0x80e7470) at btimers.c:246 
> #7  0x0804c63b in authenticate_storage_daemon (jcr=0x80ef9f0,
> store=0x80b85d8) at authenticate.c:103 
> #8  0x08059dc5 in connect_to_storage_daemon (jcr=0x80ef9f0,
> retry_interval=10, max_retry_time=1800, verbose=1) 
>     at msgchan.c:89 
> #9  0x0804da74 in do_backup (jcr=0x80ef9f0) at backup.c:145 
> #10 0x08056204 in job_thread (arg=0x80ef9f0) at job.c:215 
> #11 0x080583bd in jobq_server (arg=0x80b4300) at jobq.c:444 
> #12 0x004c4ce1 in pthread_start_thread ()
> from /lib/i686/libpthread.so.0 
> #13 0x0043661a in clone () from /lib/i686/libc.so.6
> 
> Thread 5 (Thread 19726340 (LWP 26151)): 
> #0  0x004c80d4 in __pthread_sigsuspend ()
> from /lib/i686/libpthread.so.0 
> #1  0x004c7708 in __pthread_wait_for_restart_signal ()
> from /lib/i686/libpthread.so.0 
> #2  0x004c3fab in [EMAIL PROTECTED] ()
> from /lib/i686/libpthread.so.0 
> #3  0x08087a7a in rwl_writelock (rwl=0x80b46c0) at rwlock.c:231 
> #4  0x0807fd00 in lock_jcr_chain () at jcr.c:544 
> #5  0x080588e1 in jobq_server (arg=0x80b4300) at jobq.c:582 
> #6  0x004c4ce1 in pthread_start_thread ()
> from /lib/i686/libpthread.so.0 
> #7  0x0043661a in clone () from /lib/i686/libc.so.6
> 
> Thread 4 (Thread 32771 (LWP 3353)): 
> #0  0x004c80d4 in __pthread_sigsuspend ()
> from /lib/i686/libpthread.so.0 
> #1  0x004c7708 in __pthread_wait_for_restart_signal ()
> from /lib/i686/libpthread.so.0 
> #2  0x004c9720 in __pthread_alt_lock ()
> from /lib/i686/libpthread.so.0 
> #3  0x004c614e in pthread_mutex_lock ()
> from /lib/i686/libpthread.so.0 
> #4  0x080804f6 in get_next_jcr (prev_jcr=0xfffffffc) at jcr.c:581 
> #5  0x08080619 in jcr_timeout_check (self=0x80c3360) at jcr.c:615 
> #6  0x0808e533 in watchdog_thread (arg=0x0) at watchdog.c:257 
> #7  0x004c4ce1 in pthread_start_thread ()
> from /lib/i686/libpthread.so.0 
> #8  0x0043661a in clone () from /lib/i686/libc.so.6
> 
> Thread 3 (Thread 16386 (LWP 3352)): 
> #0  0x0042f251 in select () from /lib/i686/libc.so.6 
> #1  0x00000006 in ?? () 
> #2  0x080cf47c in ?? () 
> #3  0xb7f572f0 in ?? () 
> #4  0x00000000 in ?? ()
> 
> Thread 2 (Thread 32769 (LWP 3351)): 
> #0  0x0042cf7a in poll () from /lib/i686/libc.so.6 
> #1  0x004c54c0 in __pthread_manager () from /lib/i686/libpthread.so.0 
> #2  0x0043661a in clone () from /lib/i686/libc.so.6
> 
> Thread 1 (Thread 16384 (LWP 3346)): 
> #0  0x004c80d4 in __pthread_sigsuspend ()
> from /lib/i686/libpthread.so.0 
> #1  0x004c7708 in __pthread_wait_for_restart_signal ()
> from /lib/i686/libpthread.so.0 
> #2  0x004c9720 in __pthread_alt_lock ()
> from /lib/i686/libpthread.so.0 
> #3  0x004c614e in pthread_mutex_lock ()
> from /lib/i686/libpthread.so.0 
> #4  0x08057dab in jobq_add (jq=0x80b4300, jcr=0x80fc570) at
> jobq.c:240 
> #5  0x080566d8 in run_job (jcr=0x80fc570) at job.c:140 
> #6  0x0804c034 in main (argc=0, argv=0x8090b55) at dird.c:241 
> #0  0x004c80d4 in __pthread_sigsuspend ()
> from /lib/i686/libpthread.so.0 
> No symbol table info available. 
> #1  0x004c7708 in __pthread_wait_for_restart_signal ()
> from /lib/i686/libpthread.so.0 
> No symbol table info available. 
> #2  0x004c9720 in __pthread_alt_lock ()
> from /lib/i686/libpthread.so.0 
> No symbol table info available. 
> #3  0x004c614e in pthread_mutex_lock ()
> from /lib/i686/libpthread.so.0 
> No symbol table info available. 
> #4  0x08057dab in jobq_add (jq=0x80b4300, jcr=0x80fc570) at
> jobq.c:240 
> 240        if ((stat = pthread_mutex_lock(&jq->mutex)) != 0) { 
> Current language:  auto; currently c++ 
> stat = 135251312 
> sched_pkt = (wait_pkt *) 0xfffffffc 
> item = (jobq_item_t *) 0x80fc570 
> li = (jobq_item_t *) 0x7f 
> wtime = -1 
> id = 135251948 
> #5  0x080566d8 in run_job (jcr=0x80fc570) at job.c:140 
> 140        if ((stat = jobq_add(&job_queue, jcr)) != 0) { 
> be = {<SMARTALLOC> = {<No data fields>}, buf_ = 0x80fb448 "ðù\016\bpÅ
> \017\b\001", berrno_ = 1} 
> stat = 134822556 
> errstat = 134822556 
> JobId = 346 
> #6  0x0804c034 in main (argc=0, argv=0x8090b55) at dird.c:241 
> 241           run_job(jcr);                   /* run job */ 
> jcr = (JCR *) 0x80fc570 
> test_config = 0 
> ch = 135251312 
> no_signals = 0 
> uid = 0x0 
> gid = 0x0 
> #0  0x00000000 in ?? () 
> No symbol table info available.
> 
> 
> any idea??
> 
> what's your practice? do you restart bacula every day? should i?
> 
> Thanks a lot, 
> Christian
> 
> 



-------------------------------------------------------
This SF.Net email is sponsored by Yahoo.
Introducing Yahoo! Search Developer Network - Create apps using Yahoo!
Search APIs Find out how you can build Yahoo! directly into your own
Applications - visit http://developer.yahoo.net/?fr=offad-ysdn-ostg-q22005
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

Reply via email to