I missed the point about the CI in my other reply. If we can somehow integrate some container based tests into the “make test” infra, I wouldn’t mind at all! :-)
Regards, Florin > On Jul 22, 2020, at 4:17 AM, Ivan Shvedunov <ivan...@gmail.com> wrote: > > Hi, > sadly the patch apparently didn't work. It should have worked but for some > reason it didn't ... > > On the bright side, I've made a test case [1] using fresh upstream VPP code > with no UPF that reproduces the issues I mentioned, including both timer and > TCP retransmit one along with some other possible problems using http_static > plugin and the proxy example, along with nginx (with proxy) and wrk. > > It is docker-based, but the main scripts (start.sh and test.sh) can be used > without Docker, too. > I've used our own Dockerfiles to build the images, but I'm not sure if that > makes any difference. > I've added some log files resulting from the runs that crashed in different > places. For me, the tests crash on each run, but in different places. > > The TCP retransmit problem happens with http_static when using the debug > build. When using release build, some unrelated crash in timer_remove() > happens instead. > The SVM FIFO crash happens when using the proxy. It can happen with both > release and debug builds. > > Please see the repo [1] for details and crash logs. > > [1] https://github.com/ivan4th/vpp-tcp-test > <https://github.com/ivan4th/vpp-tcp-test> > > P.S. As the tests do expose some problems with VPP host stack and some of the > VPP plugins/examples, maybe we should consider adding them to the VPP CI, too? > > On Thu, Jul 16, 2020 at 8:33 PM Florin Coras <fcoras.li...@gmail.com > <mailto:fcoras.li...@gmail.com>> wrote: > Hi Ivan, > > Thanks for the detailed report! > > I assume this is a situation where most of the connections time out and the > rate limiting we apply on the pending timer queue delays handling for long > enough to be in a situation like the one you described. Here’s a draft patch > that starts tracking pending timers [1]. Let me know if it solves the first > problem. > > Regarding the second, it looks like the first chunk in the fifo is not > properly initialized/corrupted. It’s hard to tell what leads to that given > that I haven’t seen this sort of issues even with larger number of > connections. You could maybe try calling svm_fifo_is_sane() in the > enqueue/dequeue functions, or after the proxy allocates/shares the fifos to > catch the issue as early as possible. > > Regards, > Florin > > [1] https://gerrit.fd.io/r/c/vpp/+/27952 > <https://gerrit.fd.io/r/c/vpp/+/27952> > >> On Jul 16, 2020, at 2:03 AM, ivan...@gmail.com <mailto:ivan...@gmail.com> >> wrote: >> >> Hi, >> I'm working on the Travelping UPF project >> https://github.com/travelping/vpp <https://github.com/travelping/vpp>For >> variety of reasons, it's presently maintained as a fork of UPF that's >> rebased on top of upstream master from time to time, but really it's just a >> plugin. During 40K TCP connection test with netem, I found an issue with TCP >> timer race (timers firing after tcp_timer_reset() was called for them) which >> I tried to work around only to stumble into another crash, which I'm >> presently debugging (an SVM FIFO bug, possibly) but maybe some of you folks >> have some ideas what it could be. >> I've described my findings in this JIRA ticket: >> https://jira.fd.io/browse/VPP-1923 <https://jira.fd.io/browse/VPP-1923> >> Although the last upstream commit UPF is presently based on >> (afc233aa93c3f23b30b756cb4ae2967f968bbbb1) was some time ago, I believe the >> problems are still relevant as there were no changes in these parts of code >> in master since that commit. > > > > -- > Ivan Shvedunov <ivan...@gmail.com <mailto:ivan...@gmail.com>> > ;; My GPG fingerprint is: 2E61 0748 8E12 BB1A 5AB9 F7D0 613E C0F8 0BC5 2807
-=-=-=-=-=-=-=-=-=-=-=- Links: You receive all messages sent to this group. View/Reply Online (#17044): https://lists.fd.io/g/vpp-dev/message/17044 Mute This Topic: https://lists.fd.io/mt/75537746/21656 Group Owner: vpp-dev+ow...@lists.fd.io Unsubscribe: https://lists.fd.io/g/vpp-dev/unsub [arch...@mail-archive.com] -=-=-=-=-=-=-=-=-=-=-=-