Hi,

16.07.2007 11:21,, Alfredo Marchini wrote::
> Hi,
> bacula-dir is blocked again:
> I've just made this tests:
> 
> - time command from bconsole: works

Good, so the DIR is basically up and running.

> - mysqladmin processlist:
> there are 2 processes:
> +-----+--------+-----------+--------+---------+------+-------+------------------+
>  
> 
> | Id  | User   | Host      | db     | Command | Time | State | 
> Info             |
> +-----+--------+-----------+--------+---------+------+-------+------------------+
>  
> 
> | 275 | bacula | localhost | bacula | Sleep   | 793  |       
> |                  |
> | 276 | root   | localhost |        | Query   | 0    |       | show 
> processlist |
> +-----+--------+-----------+--------+---------+------+-------+------------------+
>   

The DIR is not waiting for the catalog.

> 
> i've also setted yesterday debug at 400, strace on and launched strace 
> program.
> I attach my log files, but I cannot attach strace log files, because 
> they are too large (7.5Mb).
> 
> I've launched strace in background with these options:
> 
> strace -ff -T -o /root/strace-dir/bacula-dir.log -p 5540
> 
> Also I attach bacula-dir log.
> Reading director.log, I think that there is a bug.
> So strace can be very useful, but it's also very large.
> I can publish strace log files, if needed, on my private web-server.
> 
> If bacula developers or somelse needs to make tests on director or other 
> info,
> I can keep bacula-dir blocked without restarting for some days again.
> 
> Thanks
> Alfredo

The DIR trace file:

> 14-Jul 09:18 oracolo-director: Fatal Error at bnet_server.c:172 because:
> Error in select: Unknown error 514
> 14-Jul 09:18 oracolo-director: ABORTING due to ERROR in smartall.c:193
> qp->qnext->qprev != qp called from dlist.c:341
> 14-Jul 09:18 oracolo-director: Fatal Error because: Bacula interrupted by 
> signal 11: Segmentation violation
> 14-Jul 09:18 oracolo-director: Fatal Error at bnet_server.c:172 because:
> Error in select: Unknown error 514
> 14-Jul 09:18 oracolo-director: ABORTING due to ERROR in smartall.c:193
> qp->qnext->qprev != qp called from dlist.c:341
> 14-Jul 09:18 oracolo-director: Fatal Error because: Bacula interrupted by 
> signal 11: Segmentation violation

This looks bad. I suppose it's worth a bug report on bugs.bacula.org, 
and/or an email to the developers list. (I cc it there...)

Error 514 in select does not sound like a problem in the DIR code, but 
the error handling could perhaps catch this sort or problem.

A very quick search for error code 514 revealed this:

> /* Should never be seen by user programs */
> #define ERESTARTSYS     512
> #define ERESTARTNOINTR  513
> #define ERESTARTNOHAND  514     /* restart if no handler.. */
> #define ENOIOCTLCMD     515     /* No ioctl command */
> #define ERESTART_RESTARTBLOCK 516 /* restart by calling sys_restart_syscall */

which might indicate a kernel problem (if you encounter this on linux 
2.6...)

Arno

-- 
Arno Lehmann
IT-Service Lehmann
www.its-lehmann.de

-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

Reply via email to