Oh, just one last note:
Bacula does work perfectly well on
RedHat 7.1 (Seawolf) kernel version 2.4.20-28.7
but on that machine, the HAVE_POSIX_FADVISE is turned off automatically by
the ./configure process.
Perhaps you did not start with a clean build directory (i.e. you built it
first on some other system and later copied the directory). ./configure has
the habit of caching the config process, which could explain why your
HAVE_POSIX_FDADVISE is improperly set for your system.
One must be careful to either start from a fresh tar file or do a:
make distclean
before trying to re-run a ./configure on a different architecture OS version.
Regards,
Kern
On Thursday 11 October 2007 23:39, [EMAIL PROTECTED] wrote:
> In the message dated: Tue, 09 Oct 2007 15:55:58 +0200,
> The pithy ruminations from Kern Sibbald on
> <Re: [Bacula-devel] bacula-sd 2.2.4 goes kaboom! (segfault on despooling
> data)> were:
> => Hello,
>
>
> Since 2.2.5 has been released, I've built that version, and I'm seeing the
> same behavior.
>
> =>
> => Most likely you forgot to put in an Autochanger resource for controlling
> your
>
> Nope. The same configuration works fine with the autochanger under 1.38.11.
> Here are excerpts from the config files:
>
> -------------- bacula-dir.conf ----------------------
> # Definition of file storage device
> Storage {
> Name = pv132t
> # Do not use "localhost" here
> Address = parthenon # N.B. Use a fully qualified name here
> SDPort = 9103
> Password = "IeRUDo7djGxxxxxxxxxxxxxxxxxxxxxxxxxlM1xUkbrEJnq4K"
> Device = pv132t
> Media Type = LTO2
> Autochanger = yes
> Maximum Concurrent Jobs = 25
> }
> -----------------------------------------------------
>
>
>
> --------------- bacula-sd.conf ----------------------
> #
> # An autochanger device with two drives
> #
> Autochanger {
> Name = pv132t
> Device = Drive-0
> Device = Drive-1
> Changer Command = "/usr/local/bacula-2.2.5/bin/mtx-changer %c %o %S %a
> %d" Changer Device = /dev/changer
> }
>
> Device {
> Name = Drive-0 #
> LabelMedia = yes; # lets Bacula label unlabeled media
> Drive Index = 0
> Media Type = LTO2
> Archive Device = /dev/tape0
> AutomaticMount = yes; # when device opened, read it
> AlwaysOpen = yes;
> RemovableMedia = yes;
> RandomAccess = no;
> AutoChanger = yes
> # Enable the Alert command only if you have the mtx package loaded
> Alert Command = "sh -c 'tapeinfo -f %c |grep TapeAlert|cat'"
> Maximum Spool Size = 60G # For network client
> Maximum Network Buffer Size = 65536
> Spool Directory = /san3/var/spool/bacula
> Autoselect = yes
> }
>
> Device {
> Name = Drive-1 #
> LabelMedia = yes; # lets Bacula label unlabeled media
> Drive Index = 1
> Media Type = LTO2
> Archive Device = /dev/tape1
> AutomaticMount = yes; # when device opened, read it
> AlwaysOpen = yes;
> RemovableMedia = yes;
> RandomAccess = no;
> AutoChanger = yes
> # Enable the Alert command only if you have the mtx package loaded
> Alert Command = "sh -c 'tapeinfo -f %c |grep TapeAlert|cat'"
> Maximum Spool Size = 60G # For network client
> Maximum Network Buffer Size = 65536
> Spool Directory = /san2/var/spool/bacula
> Autoselect = yes
> }
> -----------------------------------------------------
> => autochanger. Barring that, you will need to run the SD under the
> debugger as => defined in the Kaboom chapter of the manual and obtain a
> good traceback.
>
>
> Done. Here's a script(1) of the gdb output. Please let me know if there's
> any more information that I can provide, or any alternative way of testing
> this.
>
> --------------------------------------------------------------------
> Script started on Thu 11 Oct 2007 05:25:41 PM EDT
> [EMAIL PROTECTED] bacula-2.2.5]$ ./test_bacula.rc start
> Starting the Storage daemon
> Starting the File daemon
> Starting the Director daemon
> [EMAIL PROTECTED] bacula-2.2.5]$ ps -ef | grep bacula
> root 18416 18385 0 17:25 pts/24 00:00:00 script bacula-debugging
> root 18417 18416 0 17:25 pts/24 00:00:00 script bacula-debugging
> root 18449 1 0 17:25 ? 00:00:00
> /usr/local/bacula-2.2.5/sbin/bacula-fd -u root -g root -v -c
> /usr/local/bacula-2.2.5/etc/bacula-fd.conf root 18451 18449 0 17:25 ?
> 00:00:00 /usr/local/bacula-2.2.5/sbin/bacula-fd -u root -g root -v -c
> /usr/local/bacula-2.2.5/etc/bacula-fd.conf root 18452 18451 0 17:25 ?
> 00:00:00 /usr/local/bacula-2.2.5/sbin/bacula-fd -u root -g root -v -c
> /usr/local/bacula-2.2.5/etc/bacula-fd.conf root 18454 1 0 17:25 ?
> 00:00:00 [bacula-dir]
> root 18456 18454 0 17:25 ? 00:00:00 [bacula-dir]
> root 18457 18456 0 17:25 ? 00:00:00 [bacula-dir]
> root 18458 18456 0 17:25 ? 00:00:00 [bacula-dir]
> root 18460 18418 0 17:25 pts/2 00:00:00 grep bacula
> [EMAIL PROTECTED] bacula-2.2.5]$ gdb sbin/bacula-sd
> GNU gdb Red Hat Linux (5.3.90-0.20030710.41rh)
> Copyright 2003 Free Software Foundation, Inc.
> GDB is free software, covered by the GNU General Public License, and you
> are welcome to change it and/or distribute copies of it under certain
> conditions. Type "show copying" to see the conditions.
> There is absolutely no warranty for GDB. Type "show warranty" for details.
> This GDB was configured as "i386-redhat-linux-gnu"...Using host
> libthread_db library "/lib/libthread_db.so.1".
>
> (gdb) run -s -f -c etc/bacula-sd.conf
> Starting program: /usr/local/bacula-2.2.5/sbin/bacula-sd -s -f -c
> etc/bacula-sd.conf [Thread debugging using libthread_db enabled]
> [New Thread 16384 (LWP 18507)]
> [New Thread 32769 (LWP 18537)]
> [New Thread 16386 (LWP 18538)]
> [New Thread 32771 (LWP 18540)]
> [New Thread 49156 (LWP 18832)]
> [New Thread 65541 (LWP 18841)]
> [New Thread 81926 (LWP 18854)]
> [New Thread 98311 (LWP 18863)]
> [New Thread 114696 (LWP 18865)]
> [New Thread 131081 (LWP 18874)]
> [New Thread 147466 (LWP 18930)]
> [New Thread 163851 (LWP 18933)]
> [New Thread 180236 (LWP 18961)]
> [New Thread 196621 (LWP 18970)]
>
> Program received signal SIGSEGV, Segmentation fault.
> [Switching to Thread 147466 (LWP 18930)]
> 0x08137148 in ?? ()
> (gdb) thread apply all bt
>
> Thread 13 (Thread 180236 (LWP 18961)):
> #0 0x4003415b in read () from /lib/i686/libpthread.so.0
> #1 0x00000004 in ?? ()
> #2 0x00000001 in ?? ()
> #3 0x08072e40 in read_nbytes(BSOCK*, char*, int) (bsock=0x8150160,
> ptr=0x423e1924 "\002", nbytes=4) at bnet.c:82
> #4 0x08074ab2 in BSOCK::recv() (this=0x8150160) at bsock.c:381
> #5 0x08072c65 in bget_msg(BSOCK*) (sock=0x8150160) at bget_msg.c:60
> #6 0x0804f9c5 in do_append_data(JCR*) (jcr=0x813ef08) at append.c:202
> #7 0x0805f5b1 in append_data_cmd (jcr=0x813ef08) at fd_cmds.c:194
> #8 0x0805f50f in do_fd_commands(JCR*) (jcr=0x813ef08) at fd_cmds.c:165
> #9 0x0805f385 in run_job(JCR*) (jcr=0x813ef08) at fd_cmds.c:128
> #10 0x080604f5 in run_cmd(JCR*) (jcr=0x813ef08) at job.c:195
> #11 0x0805b1b0 in handle_connection_request(void*) (arg=0x813e498) at
> dircmd.c:229 #12 0x0808788c in workq_server (arg=0x80a3900) at workq.c:357
> #13 0x4002edb2 in pthread_start_thread () from /lib/i686/libpthread.so.0
> #14 0x4002ef45 in pthread_start_thread_event () from
> /lib/i686/libpthread.so.0 #15 0x401797fa in clone () from
> /lib/i686/libc.so.6
>
> Thread 11 (Thread 147466 (LWP 18930)):
> #0 0x08137148 in ?? ()
> #1 0x08114570 in ?? ()
> #2 0x081156e0 in ?? ()
> #3 0x0806da33 in despool_data (dcr=0x8115260, commit=true) at spool.c:273
> #4 0x0806d596 in commit_data_spool(DCR*) (dcr=0x8115260) at spool.c:139
> #5 0x0804fd34 in do_append_data(JCR*) (jcr=0x8114570) at append.c:318
> #6 0x0805f5b1 in append_data_cmd (jcr=0x8114570) at fd_cmds.c:194
> #7 0x0805f50f in do_fd_commands(JCR*) (jcr=0x8114570) at fd_cmds.c:165
> ---Type <return> to continue, or q <return> to quit---
> #8 0x0805f385 in run_job(JCR*) (jcr=0x8114570) at fd_cmds.c:128
> #9 0x080604f5 in run_cmd(JCR*) (jcr=0x8114570) at job.c:195
> #10 0x0805b1b0 in handle_connection_request(void*) (arg=0x80c2958) at
> dircmd.c:229 #11 0x0808788c in workq_server (arg=0x80a3900) at workq.c:357
> #12 0x4002edb2 in pthread_start_thread () from /lib/i686/libpthread.so.0
> #13 0x4002ef45 in pthread_start_thread_event () from
> /lib/i686/libpthread.so.0 #14 0x401797fa in clone () from
> /lib/i686/libc.so.6
>
> Thread 9 (Thread 114696 (LWP 18865)):
> #0 0x400340db in write () from /lib/i686/libpthread.so.0
>
> Thread 7 (Thread 81926 (LWP 18854)):
> #0 0x080805c7 in sm_sizeof_pool_memory(char const*, int, char*)
> (fname=0x80c0328 "", lineno=1098783052, obuf=0x80c0328 "") at
> mem_pool.c:166
> #1 0x08072c65 in bget_msg(BSOCK*) (sock=0x80c0328) at bget_msg.c:60
> #2 0x0804f892 in do_append_data(JCR*) (jcr=0x80be998) at append.c:154
> #3 0x0805f5b1 in append_data_cmd (jcr=0x80be998) at fd_cmds.c:194
> #4 0x0805f50f in do_fd_commands(JCR*) (jcr=0x80be998) at fd_cmds.c:165
> #5 0x0805f385 in run_job(JCR*) (jcr=0x80be998) at fd_cmds.c:128
> #6 0x080604f5 in run_cmd(JCR*) (jcr=0x80be998) at job.c:195
> #7 0x0805b1b0 in handle_connection_request(void*) (arg=0x80be3b0) at
> dircmd.c:229 #8 0x0808788c in workq_server (arg=0x80a3900) at workq.c:357
> #9 0x4002edb2 in pthread_start_thread () from /lib/i686/libpthread.so.0
> #10 0x4002ef45 in pthread_start_thread_event () from
> /lib/i686/libpthread.so.0 #11 0x401797fa in clone () from
> /lib/i686/libc.so.6
> Current language: auto; currently c++
>
> Thread 5 (Thread 49156 (LWP 18832)):
> #0 0x4003415b in read () from /lib/i686/libpthread.so.0
> #1 0x00000004 in ?? ()
> ---Type <return> to continue, or q <return> to quit---
> #2 0x00000001 in ?? ()
> #3 0x08072e40 in read_nbytes(BSOCK*, char*, int) (bsock=0x80b9e78,
> ptr=0x413d6924 "", nbytes=4) at bnet.c:82
> #4 0x08074ab2 in BSOCK::recv() (this=0x80b9e78) at bsock.c:381
> #5 0x08072c65 in bget_msg(BSOCK*) (sock=0x80b9e78) at bget_msg.c:60
> #6 0x0804f9c5 in do_append_data(JCR*) (jcr=0x80a9380) at append.c:202
> #7 0x0805f5b1 in append_data_cmd (jcr=0x80a9380) at fd_cmds.c:194
> #8 0x0805f50f in do_fd_commands(JCR*) (jcr=0x80a9380) at fd_cmds.c:165
> #9 0x0805f385 in run_job(JCR*) (jcr=0x80a9380) at fd_cmds.c:128
> #10 0x080604f5 in run_cmd(JCR*) (jcr=0x80a9380) at job.c:195
> #11 0x0805b1b0 in handle_connection_request(void*) (arg=0x80a56a0) at
> dircmd.c:229 #12 0x0808788c in workq_server (arg=0x80a3900) at workq.c:357
> #13 0x4002edb2 in pthread_start_thread () from /lib/i686/libpthread.so.0
> #14 0x4002ef45 in pthread_start_thread_event () from
> /lib/i686/libpthread.so.0 #15 0x401797fa in clone () from
> /lib/i686/libc.so.6
>
> Thread 4 (Thread 32771 (LWP 18540)):
> #0 0x40034936 in nanosleep () from /lib/i686/libpthread.so.0
> #1 0x00000001 in ?? ()
> #2 0x40030b5a in __pthread_timedsuspend_new () from
> /lib/i686/libpthread.so.0
>
> Thread 2 (Thread 32769 (LWP 18537)):
> #0 0x4017075a in poll () from /lib/i686/libc.so.6
> #1 0x4002dd1a in __pthread_manager () from /lib/i686/libpthread.so.0
> #2 0x4002dfea in __pthread_manager_event () from /lib/i686/libpthread.so.0
> #3 0x401797fa in clone () from /lib/i686/libc.so.6
>
> Thread 1 (Thread 16384 (LWP 18507)):
> #0 0x401728d1 in select () from /lib/i686/libc.so.6
> ---Type <return> to continue, or q <return> to quit---
> #1 0x00000009 in ?? ()
> #2 0x401d3d20 in buffer () from /lib/i686/libc.so.6
> #0 0x08137148 in ?? ()
> (gdb)
> (gdb) quit
> The program is running. Exit anyway? (y or n) y
> [EMAIL PROTECTED] bacula-2.2.5]$ exit
> exit
>
> Script done on Thu 11 Oct 2007 05:31:46 PM EDT
> --------------------------------------------------------------------
>
> Thanks,
>
> Mark
>
> =>
> => Regards,
> =>
> => Kern
> =>
> => On Thursday 04 October 2007 17:11, [EMAIL PROTECTED] wrote:
> => > I'm testing bacula 2.2.4 and the bacula-sd daemon repeatedly exits
> with a => > segmentation fault. Bacula 1.38.11 works reliably on the same
> machine with => > the same hardware and configuration
> => > files.
> => >
> => > Environment:
> => > Fedora Core 1
> => > Kernel 2.4.26
> => > gcc 3.3.2
> => > MySQL 5.0.22
> => > Dell PV 132T autochanger
> => >
> => > Build configuration script:
> => > ./configure \
> => > --prefix=/usr/local/bacula-2.2.4 \
> => > --disable-nls \
> => > --disable-ipv6 \
> => > --enable-batch-insert \
> => > [EMAIL PROTECTED] \
> => > [EMAIL PROTECTED] \
> => > --with-db-name=bacula2 \
> => > --with-mysql \
> => > --mandir=/usr/local/bacula-2.2.4/man \
> => > --with-pid-dir=/usr/local/bacula-2.2.4/var/run \
> => > --with-subsys-dir=/usr/local/bacula-2.2.4/var/subsys \
> => > --with-dir-user=bacula \
> => > --with-dir-group=bacula \
> => > --with-sd-user=bacula \
> => > --with-sd-group=bacula \
> => > --with-fd-user=root \
> => > --with-fd-group=root && make
> => >
> => >
> => > I'm using the same configuration files for the director, sd, and fd
> that => > work with the 1.38.11 installation (after removing the "Accept
> Any Volume" => > directive, changing the paths for 2.2.4, and adding the
> directive => > "RecyclePool = Scratch").
> => >
> => > The software compiles without error. There were no errors from "btape
> test" => > or "btape autochanger".
> => >
> => > The bacula-sd daemon crashes repeatedly whether or not the variable
> => > LD_ASSUME_KERNEL was set to "2.4.19" before compiling bacula.
> => >
> => > The bacula-sd daemon is running as root (while I sort out an issue
> that's => > not present in 1.38.11 with permissions on the tape device).
> The bacula-dir => > normally runs as user "bacula".
> => >
> => > Even if I modify the bacula-dir options to run it as root, no
> traceback => > file is generated. The message (received via email) from
> Bacula is: => >
> => > Using host libthread_db library "/lib/libthread_db.so.1".
> => > 0x401728d1 in ?? ()
> => > /usr/local/bacula-2.2.4/etc/btraceback.gdb:1: Error in sourced
> command => > file: Cannot access memory at address 0x80a2e34
> => >
> => > The fault seems to occur right after the SD begins despooling data.
> I've => > got 4 log files from running the SD with debugging on (set to 200
> or => > higher), and the error always happens after the first instance of
> => > despooling data. In each case, the log file shows "stored.c:582 In =>
> > terminate_stored() sig=11".
> => >
> => > I've attached an excerpt from the SD debugging output.
> => >
> => >
> => > Thanks,
> => >
> => > Mark
> => >
> => >
> => > ----
> => > Mark Bergman [EMAIL PROTECTED]
> => > System Administrator
> => > Section of Biomedical Image Analysis 215-662-7310
> => > Department of Radiology, University of Pennsylvania
> => >
> => >
> http://pgpkeys.pca.dfn.de:11371/pks/lookup?search=mark.bergman%40.uphs.upen
> => >n.edu
> =>
-------------------------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems? Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/
_______________________________________________
Bacula-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/bacula-users