Did you also get an email about this crash (see the postfix lines in the log)?
If not, check /usr/sbin/btraceback for the email address.  It should contain
the gdb information needed to diagnose this.  If so, please post it here.

__Martin


>>>>> On Sun, 1 Mar 2020 23:22:59 +0000, Chaz Vidal said:
> 
> Greetings all,
> Our Bacula system crashed on Friday with a segmentation violation. 
> 
> The system has been attempting to do a full backup of over 130TB of data over 
> the past few weeks which we've appeared to have lost because of the crash.
> 
> Feb 28 09:56:31 <<servername>> bacula-dir[4211]: Bacula interrupted by signal 
> 11: Segmentation violation
> Feb 28 09:56:31 <<servername>> bacula-dir[4211]: Kaboom! bacula-dir, 
> bacula-dir got signal 11 - Segmentation violation at 28-Feb-2020 09:56:31. 
> Attempting traceback.
> Feb 28 09:56:31 <<servername>> bacula-dir[4211]: Kaboom! exepath=/usr/sbin/
> Feb 28 09:56:31 <<servername>> bacula-dir: Bacula interrupted by signal 11: 
> Segmentation violation
> Feb 28 09:56:31 <<servername>> bacula-dir[4211]: Calling: 
> /usr/sbin/btraceback /usr/sbin/bacula-dir 4211 /var/lib/bacula
> Feb 28 09:56:31 <<servername>> postfix/smtpd[59719]: connect from 
> localhost[127.0.0.1]
> Feb 28 09:56:31 <<servername>> postfix/smtpd[59719]: 71CC36008A: 
> client=localhost[127.0.0.1]
> Feb 28 09:56:31 <<servername>> postfix/cleanup[59722]: 71CC36008A: 
> message-id=<20200227232631.71CC36008A@<<servername>>.company.com>
> Feb 28 09:56:31 <<servername>> postfix/qmgr[14399]: 71CC36008A: 
> from=<root@<<servername>>.company.com>, size=593, nrcpt=1 (queue active)
> Feb 28 09:56:31 <<servername>> postfix/smtpd[59719]: disconnect from 
> localhost[127.0.0.1] helo=1 mail=1 rcpt=1 data=1 quit=1 commands=5
> Feb 28 09:56:31 <<servername>> bacula-dir[4211]: It looks like the traceback 
> worked...
> Feb 28 09:56:31 <<servername>> bacula-dir[4211]: LockDump: 
> /var/lib/bacula/bacula.4211.traceback
> Feb 28 09:56:31 <<servername>> bacula-dir[4211]: bacula-dir: lockmgr.c:1221-0 
> lockmgr disabled
> 
> I do not know how to read a traceback file to understand what may have been 
> going on.  We are attempting to restart the backup again but unless we 
> understand what happened the crash may appear again.  
> 
> We are running Bacula Version: 9.4.2
> 
> Appreciate if anyone can share any insight? 
> 
> Attempt to dump current JCRs. njcrs=7
> threadid=0x7fb497491f40 JobId=0 JobStatus=R jcr=0x55980a04a4f8 
> name=*JobMonitor*.2020-02-11_15.29.48_01
>         use_count=1 killable=0
>         JobType=I JobLevel=
>         sched_time=11-Feb-2020 15:29 start_time=11-Feb-2020 15:29
>         end_time=01-Jan-1970 09:30 wait_time=01-Jan-1970 09:30
>         db=(nil) db_batch=(nil) batch_started=0
>         wstore=0x55980a01ff28 rstore=0x55980a01ff28 wjcr=(nil) 
> client=0x55980a026128 reschedule_count=0 SD_msg_chan_started=0
> threadid=0x7fb495897700 JobId=104686 JobStatus=R jcr=0x7fb48806aea8 
> name=job1.2020-02-11_17.38.56_13
>         use_count=2 killable=1
>         JobType=B JobLevel=F
>         sched_time=11-Feb-2020 17:38 start_time=11-Feb-2020 17:38
>         end_time=01-Jan-1970 09:30 wait_time=21-Feb-2020 16:50
>         db=0x7fb4880059a8 db_batch=(nil) batch_started=0
>         wstore=0x7fb48803fc18 rstore=(nil) wjcr=(nil) client=0x7fb4880481a8 
> reschedule_count=0 SD_msg_chan_started=1
> BDB=0x7fb4880059a8 db_name=bacula db_user=bacula connected=true
>         cmd="UPDATE Media SET InChanger=0, Slot=0 WHERE Slot=25 AND StorageId 
> IN (10) AND MediaId!=794" changes=1814
>         RWLOCK=0x7fb4880059c0 w_active=0 w_wait=0
> threadid=0x7fb47e7fc700 JobId=104687 JobStatus=R jcr=0x7fb488068978 
> name=job2.2020-02-11_17.40.43_14
>         use_count=2 killable=1
>         JobType=B JobLevel=F
>         sched_time=11-Feb-2020 17:40 start_time=11-Feb-2020 17:40
>         end_time=01-Jan-1970 09:30 wait_time=01-Jan-1970 09:30
>         db=0x7fb4880059a8 db_batch=(nil) batch_started=0
>         wstore=0x7fb48803fc18 rstore=(nil) wjcr=(nil) client=0x7fb4880481a8 
> reschedule_count=0 SD_msg_chan_started=1
> BDB=0x7fb4880059a8 db_name=bacula db_user=bacula connected=true
>         cmd="UPDATE Media SET InChanger=0, Slot=0 WHERE Slot=25 AND StorageId 
> IN (10) AND MediaId!=794" changes=1814
>         RWLOCK=0x7fb4880059c0 w_active=0 w_wait=0
> threadid=0x7fb43f7fe700 JobId=104928 JobStatus=R jcr=0x7fb44805fa88 
> name=job3.2020-02-14_15.47.06_47
>         use_count=2 killable=1
>         JobType=B JobLevel=F
>         sched_time=14-Feb-2020 15:46 start_time=14-Feb-2020 15:47
>         end_time=01-Jan-1970 09:30 wait_time=27-Feb-2020 22:21
>         db=0x7fb4880059a8 db_batch=(nil) batch_started=0
>         wstore=0x7fb448034678 rstore=(nil) wjcr=(nil) client=0x7fb44803c148 
> reschedule_count=0 SD_msg_chan_started=1
> BDB=0x7fb4880059a8 db_name=bacula db_user=bacula connected=true
>         cmd="UPDATE Media SET InChanger=0, Slot=0 WHERE Slot=25 AND StorageId 
> IN (10) AND MediaId!=794" changes=1814
>         RWLOCK=0x7fb4880059c0 w_active=0 w_wait=0
> threadid=0x7fb43e7fc700 JobId=105616 JobStatus=R jcr=0x55980a005fe8 
> name=job4.2020-02-21_21.30.01_16
>         use_count=2 killable=1
>         JobType=B JobLevel=F
>         sched_time=21-Feb-2020 21:30 start_time=24-Feb-2020 23:36
>         end_time=01-Jan-1970 09:30 wait_time=01-Jan-1970 09:30
>         db=0x7fb4880059a8 db_batch=(nil) batch_started=0
>         wstore=0x7fb448033b78 rstore=(nil) wjcr=(nil) client=0x7fb44803e9e8 
> reschedule_count=0 SD_msg_chan_started=1
> BDB=0x7fb4880059a8 db_name=bacula db_user=bacula connected=true
>         cmd="UPDATE Media SET InChanger=0, Slot=0 WHERE Slot=25 AND StorageId 
> IN (10) AND MediaId!=794" changes=1814
>         RWLOCK=0x7fb4880059c0 w_active=0 w_wait=0
> threadid=0x7fb43effd700 JobId=0 JobStatus=C jcr=0x7fb47800b2e8 
> name=-Console-.2020-02-27_08.39.19_09
>         use_count=1 killable=0
>         JobType=U JobLevel=F
>         sched_time=27-Feb-2020 08:39 start_time=27-Feb-2020 08:39
>         end_time=01-Jan-1970 09:30 wait_time=01-Jan-1970 09:30
>         db=0x7fb4880059a8 db_batch=(nil) batch_started=0
>         wstore=0x7fb448035cd8 rstore=0x7fb448034c18 wjcr=(nil) 
> client=0x7fb44803ae18 reschedule_count=0 SD_msg_chan_started=0
> BDB=0x7fb4880059a8 db_name=bacula db_user=bacula connected=true
>         cmd="UPDATE Media SET InChanger=0, Slot=0 WHERE Slot=25 AND StorageId 
> IN (10) AND MediaId!=794" changes=1814
>         RWLOCK=0x7fb4880059c0 w_active=0 w_wait=0
> threadid=0x7fb45f7fe700 JobId=0 JobStatus=C jcr=0x7fb3f400e148 
> name=-Console-.2020-02-28_09.00.13_35
>         use_count=1 killable=0
>         JobType=U JobLevel=F
>         sched_time=28-Feb-2020 09:00 start_time=28-Feb-2020 09:00
>         end_time=01-Jan-1970 09:30 wait_time=01-Jan-1970 09:30
>         db=(nil) db_batch=(nil) batch_started=0
>         wstore=0x7fb448035cd8 rstore=0x7fb448034c18 wjcr=(nil) 
> client=0x7fb44803ae18 reschedule_count=0 SD_msg_chan_started=0
> List plugins. Hook count=0
> 
> 
> 
> _______________________________________________
> Bacula-users mailing list
> Bacula-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/bacula-users
> 


_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

Reply via email to