Title: AW: [Bacula-users] bacula-dir hangs....

Hello Kern, hello Michael,

Yes, my tls directory is still renamed; no patches have recreated it.

I ran my director under gdb last weekend, and it was kind enough to stop
working on Saturday morning ;-))

When backups are scheduled, the director seems to create too many processes/threads.
I have 8 backup jobs configured to start at 1:05am, and the following processes/threads
are created:

root      9927  9844  0 Apr01 pts/1    00:00:11 gdb ./bacula-dir
root      9929  9927  0 Apr01 pts/1    00:00:00 /opt/bacula/sbin/bacula-dir -s -f -c /opt/bacula/etc/bacula-dir.conf
root      9938  9929  0 Apr01 pts/1    00:00:00 /opt/bacula/sbin/bacula-dir -s -f -c /opt/bacula/etc/bacula-dir.conf
root      9939  9938  0 Apr01 pts/1    00:00:00 /opt/bacula/sbin/bacula-dir -s -f -c /opt/bacula/etc/bacula-dir.conf
root      9940  9938  0 Apr01 pts/1    00:00:00 /opt/bacula/sbin/bacula-dir -s -f -c /opt/bacula/etc/bacula-dir.conf
root     14234  9938  0 Apr02 pts/1    00:00:00 /opt/bacula/sbin/bacula-dir -s -f -c /opt/bacula/etc/bacula-dir.conf
root     14235  9938  0 Apr02 pts/1    00:00:00 /opt/bacula/sbin/bacula-dir -s -f -c /opt/bacula/etc/bacula-dir.conf
root     14238  9938  0 Apr02 pts/1    00:00:00 /opt/bacula/sbin/bacula-dir -s -f -c /opt/bacula/etc/bacula-dir.conf
root     14239  9938  0 Apr02 pts/1    00:00:00 /opt/bacula/sbin/bacula-dir -s -f -c /opt/bacula/etc/bacula-dir.conf
root     14243  9938  0 Apr02 pts/1    00:00:00 /opt/bacula/sbin/bacula-dir -s -f -c /opt/bacula/etc/bacula-dir.conf
root     14244  9938  0 Apr02 pts/1    00:00:00 /opt/bacula/sbin/bacula-dir -s -f -c /opt/bacula/etc/bacula-dir.conf
root     14246  9938  0 Apr02 pts/1    00:00:00 /opt/bacula/sbin/bacula-dir -s -f -c /opt/bacula/etc/bacula-dir.conf
root     14248  9938  0 Apr02 pts/1    00:00:00 /opt/bacula/sbin/bacula-dir -s -f -c /opt/bacula/etc/bacula-dir.conf
root     14250  9938  0 Apr02 pts/1    00:00:00 /opt/bacula/sbin/bacula-dir -s -f -c /opt/bacula/etc/bacula-dir.conf
root     14253  9938  0 Apr02 pts/1    00:00:00 /opt/bacula/sbin/bacula-dir -s -f -c /opt/bacula/etc/bacula-dir.conf
root     14254  9938  0 Apr02 pts/1    00:00:00 /opt/bacula/sbin/bacula-dir -s -f -c /opt/bacula/etc/bacula-dir.conf
root     14263  9938  0 Apr02 pts/1    00:00:00 /opt/bacula/sbin/bacula-dir -s -f -c /opt/bacula/etc/bacula-dir.conf

April 1st (sorry for this date) was when I started the director under gdb, and April 2nd
was the first time jobs were due to run.
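
To make the listing above easier to compare against the 8 configured jobs, here is a minimal sketch (not part of Bacula) that tallies `ps -ef` style rows by parent PID and start date. It assumes the usual eight-column `ps -ef` layout shown above; the function name `tally_ps_rows` and the shortened `SAMPLE` are only for illustration.

```python
from collections import Counter

# A shortened copy of the listing above, used only as sample input.
SAMPLE = """\
root      9927  9844  0 Apr01 pts/1    00:00:11 gdb ./bacula-dir
root      9929  9927  0 Apr01 pts/1    00:00:00 /opt/bacula/sbin/bacula-dir -s -f -c /opt/bacula/etc/bacula-dir.conf
root      9938  9929  0 Apr01 pts/1    00:00:00 /opt/bacula/sbin/bacula-dir -s -f -c /opt/bacula/etc/bacula-dir.conf
root      9939  9938  0 Apr01 pts/1    00:00:00 /opt/bacula/sbin/bacula-dir -s -f -c /opt/bacula/etc/bacula-dir.conf
root     14234  9938  0 Apr02 pts/1    00:00:00 /opt/bacula/sbin/bacula-dir -s -f -c /opt/bacula/etc/bacula-dir.conf
root     14235  9938  0 Apr02 pts/1    00:00:00 /opt/bacula/sbin/bacula-dir -s -f -c /opt/bacula/etc/bacula-dir.conf
"""


def tally_ps_rows(lines, program="bacula-dir"):
    """Count rows whose command is `program`, keyed by (PPID, STIME).

    Assumes the column order UID PID PPID C STIME TTY TIME CMD.
    """
    counts = Counter()
    for line in lines:
        fields = line.split(None, 7)  # keep the whole command in fields[7]
        if len(fields) < 8:
            continue  # not a complete ps row
        executable = fields[7].split()[0]
        if not executable.endswith(program):
            continue  # skips e.g. the "gdb ./bacula-dir" wrapper row
        counts[(fields[2], fields[4])] += 1
    return counts


if __name__ == "__main__":
    for (ppid, started), n in sorted(tally_ps_rows(SAMPLE.splitlines()).items()):
        print(f"parent {ppid}, started {started}: {n} row(s)")
```

Fed the full listing instead of the shortened sample, the twelve Apr02 rows all fall under parent 9938.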

And here is the output of gdb:

[EMAIL PROTECTED] sbin]# gdb ./bacula-dir
GNU gdb Red Hat Linux (6.1post-1.20040607.52rh)
Copyright 2004 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for details.
This GDB was configured as "i386-redhat-linux-gnu"...Using host libthread_db library "/lib/libthread_db.so.1".

(gdb) run -s -f -c /opt/bacula/etc/bacula-dir.conf
Starting program: /opt/bacula/sbin/bacula-dir -s -f -c /opt/bacula/etc/bacula-dir.conf
[Thread debugging using libthread_db enabled]
[New Thread 16384 (LWP 9929)]
[New Thread 32769 (LWP 9938)]
[New Thread 16386 (LWP 9939)]
[New Thread 32771 (LWP 9940)]
[New Thread 49156 (LWP 9954)]
[Thread 49156 (LWP 9954) exited]
[New Thread 65540 (LWP 9964)]
[Thread 65540 (LWP 9964) exited]
[New Thread 81924 (LWP 9988)]
[Thread 81924 (LWP 9988) exited]
[New Thread 98308 (LWP 10014)]
[Thread 98308 (LWP 10014) exited]
[New Thread 114692 (LWP 10038)]
[Thread 114692 (LWP 10038) exited]
[New Thread 131076 (LWP 10064)]
[Thread 131076 (LWP 10064) exited]
[New Thread 147460 (LWP 10087)]
....
...
[New Thread 2752516 (LWP 14206)]
[Thread 2752516 (LWP 14206) exited]
[New Thread 2768900 (LWP 14234)]
[New Thread 2785285 (LWP 14235)]
[New Thread 2801670 (LWP 14237)]
[New Thread 2818055 (LWP 14238)]
[New Thread 2834440 (LWP 14239)]
[New Thread 2850825 (LWP 14243)]
[New Thread 2867210 (LWP 14244)]
[New Thread 2883595 (LWP 14246)]
[New Thread 2899980 (LWP 14248)]
[New Thread 2916365 (LWP 14250)]
[New Thread 2932750 (LWP 14253)]
[New Thread 2949135 (LWP 14254)]
[Thread 2801670 (LWP 14237) exited]
Detaching after fork from child process 14255.
ptrace: No such process.
[Switching to Thread 2801670 (LWP 14237)]
Cannot remove breakpoints because program is no longer writable.
It might be running in another process.
Further execution is probably impossible.
0x00000000 in ?? ()
(gdb) thread apply all bt
Cannot find new threads: generic error
(gdb) q
The program is running.  Exit anyway? (y or n) n
Not confirmed.



What more can I do to help analyse this problem?

Thanks a lot,
Christian




-----Original Message-----
From: Kern Sibbald [mailto:[EMAIL PROTECTED]]
Sent: Friday, 01 April 2005 12:17
To: Masopust Christian
Cc: 'Michael Joyner'; bacula-users@lists.sourceforge.net
Subject: Re: AW: AW: [Bacula-users] bacula-dir hangs....

Hello Christian,

Have you checked that the library is still renamed? All it takes is one system
upgrade and the file can come back!  As double insurance, you might also use
the environment variable -- though I find it hard to feel comfortable that it
actually works ...

On Friday 01 April 2005 11:12, Masopust Christian wrote:
> Hello Kern,
>
> I renamed /lib/tls directly after I installed Bacula (last December!). I
> didn't have this problem in the beginning; it came up some weeks ago...
>
> But I'll try to run the director in debug mode and will let you know what
> I find out (if I'm able to find out anything ;-))
>
> Thanks,
> Christian
>
>
> -----Original Message-----
> From: [EMAIL PROTECTED]
> [mailto:[EMAIL PROTECTED]] On behalf of Michael Joyner
> Sent: Friday, 01 April 2005 00:42
> To: Kern Sibbald; bacula-users@lists.sourceforge.net
> Subject: Re: AW: [Bacula-users] bacula-dir hangs....
>
> Kern Sibbald wrote:
> > Hello,
> >
> > I've noticed a number of people reporting that the Director hangs, though I
> > haven't followed this thread in detail.  I've never seen this except if you
> > are running /lib/tls, so you might:
> >
> > 1. Remove or rename /lib/tls if it exists on your system.
> > 2. Restart all the bacula daemons.
>
> Doing:
>
> cd /lib
> tar cvfz tls.tgz tls
> mv tls tls.CAUSES.BACULA-DIR.HANG
> /etc/init.d/bacula-dir restart
> /etc/init.d/bacula-fd restart
> /etc/init.d/bacula-sd restart
> /etc/init.d/mysql restart # "just in case"
>
> I also changed max storage jobs to 10.
> Running about 100+ jobs tonight; we'll see in the morning. :)
>
> (System: SuSE 9.1, bacula & mysql from apt-get pre-built binaries)

--
Best regards,

Kern
