Hi Olivier, I did some tests and it appears that the system crashes after I enable more than one neighbour with big amount of prefixes. I have enabled the dump but it seems that the crash freezes the system before it dumps. Now after I restart there is nothing in /data/crash (I use the same setup for dumps as it is described on bsdrp.net). I tried without enabling the dump to a device and still the crash freezes the system.
Any ideas? Regards, Lyubo On Thu, 11 Apr 2019 at 12:37, Lyubomir Yotov <l.yo...@gmail.com> wrote: > Hi Olivier, > Sorry for the late reply but I was busy these days. > I went into several issues when disabling the sessions: > --- > router.bsdrp.net(config-router)# neighbor x.x.x.1 shutdown > router.bsdrp.net(config-router)# neighbor x.x.x.2 shutdown > router.bsdrp.net(config-router)# neighbor x.x.x.3 shutdown > router.bsdrp.net(config-router)# neighbor x.x.x.4 shutdown > router.bsdrp.net(config-router)# neighbor x.x.x.5 shutdown > router.bsdrp.net(config-router)# > router.bsdrp.net(config-router)# > router.bsdrp.net(config-router)# ^ZWarning: closing connection to zebra > because of an I/O error! > Warning: connecting to zebra...failed! > > user@router/#service frr status > zebra is not running. > ospfd is running as pid 1586. > ospf6d is running as pid 3340. > bgpd is running as pid 6100. > --- > Here are the logs (please disregard the date and time). > - from /var/log/messages: > --- > Aug 16 18:07:25 router kernel: pid 44117 (zebra), jid 0, uid 168: exited > on signal 6 > --- > - from /var/log/bgpd.log > --- > 2008/08/16 18:07:25 ZEBRA: Kernel: message seq 1632715 > 2008/08/16 18:07:25 ZEBRA: Kernel: pid 44117, rtm_addrs 0x7 > 2008/08/16 18:07:25 ZEBRA: rtm_read: got rtm of type 2 (RTM_DELETE) > 2008/08/16 18:07:25 ZEBRA: Kernel: Len: 200 Type: RTM_DELETE > 2008/08/16 18:07:25 ZEBRA: Kernel: GATEWAY DONE PROTO1 > 2008/08/16 18:07:25 ZEBRA: Kernel: message seq 1634039 > 2008/08/16 18:07:25 ZEBRA: Kernel: pid 44117, rtm_addrs 0x7 > 2008/08/16 18:07:25 ZEBRA: rtm_read: got rtm of type 2 (RTM_DELETE) > 2008/08/16 18:07:25 BGP: buffer_write: write error on fd 10: Broken pipe > 2008/08/16 18:07:25 BGP: zclient_send_message: buffer_write failed to > zclient fd 10, closing > --- > I decided to change the date and check again and then I went into > something else. After enabling the sessions and checking for neighbor > statistics I decided to clear one of the sessions that was not starting: > --- > router.bsdrp.net# clear ip bgp x.xSegmentation fault > user has logged on pts/0 from y.y.y.y > --- > I used the "tab" key to complete the ip address in the above command. > In /var/log/messages I get: > --- > Apr 9 10:49:56 router kernel: pid 14177 (zebra), jid 0, uid 168: exited > on signal 6 > Apr 9 10:50:35 router kernel: pid 54325 (vtysh), jid 0, uid 0: exited on > signal 11 > --- > This time there is nothing in /var/log/bgpd.log. > --- > user@router/# service frr statusservice frr status > zebra is not running. > ospfd is running as pid 1586. > ospf6d is running as pid 3340. > bgpd is running as pid 6100. > --- > It could be a coincidence that vtysh has failed together with zebrad > (could be that the session has established at the moment I have pressed the > 'tab' key). > > I will try once again but this time will enabled some more debuging: > --- > router.bsdrp.net# show debugging > Zebra debugging status: > Zebra event debugging is on > Zebra packet debugging is on > Zebra kernel debugging is on > Zebra RIB debugging is on > > OSPF debugging status: > > > OSPF6 debugging status: > > BGP debugging status: > BGP zebra debugging is on > --- > > I will write hopefully tomorrow. > > Regards, > > Lyubo > > On Mon, 8 Apr 2019 at 20:39, Lyubomir Yotov <l.yo...@gmail.com> wrote: > >> Hi Olivier, >> I just couldn't think of anything else (except changing frr with bird). >> You are absolutely right about the panic (as well as for the zebra daemon >> crash). This should not happen, regardles the wrong configuration. >> The system is still working as expected (no traffic going in or out, as >> it is a test system, but zebrad has not crashed so far). Tomorrow I will >> check again with the original image and report back >> . >> Regards, >> >> Lyubomir >> >> On Mon, 8 Apr 2019 at 17:52, Olivier Cochard-Labbé <oliv...@cochard.me> >> wrote: >> >>> On Mon, Apr 8, 2019 at 3:37 PM Lyubomir Yotov <l.yo...@gmail.com> wrote: >>> >>>> I actually found an error in the AS number (last digit was missing). >>>> So far (more than an hour) it seems fine. As it has happened before >>>> that it could crash after several hours I will wait until tomorrow. >>>> If everything is fine I will try again with a fresh install (on another >>>> flash drive) to check if the wrong config was the problem. >>>> Just for the record here is the "show version" from the installed frr6: >>>> router.bsdrp.net#show version >>>> FRRouting 6.0.2 (router.bsdrp.net). >>>> >>>> >>> Hi, >>> Like you've seen, the binary version is stil 6.0.2, because the port >>> revision "2" didn't modified the binary: >>> - frr6.0.2_1 : It was just an update of an RC script, so no change into >>> FRR binaries >>> - frr6.0.2_2 : It was just a typo in RC script's comment >>> So I don't think the resolution came from this upgrade. >>> >>> But in any case, even if the panic came from the wrong AS number: it >>> should have not triggered a panic. >>> >>> Regards, >>> >>> Olivier >>> _______________________________________________ >>> Bsdrp-users mailing list >>> Bsdrp-users@lists.sourceforge.net >>> https://lists.sourceforge.net/lists/listinfo/bsdrp-users >>> >>
_______________________________________________ Bsdrp-users mailing list Bsdrp-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bsdrp-users