At the first, Numan, Dumitru, Han, I really appreciate your replies, will do my 
best to describe the case in details

NS> Does your deployment create/delete logical switches/routers frequently

It is a separate service, i.e.: Open Stack Nova that is creating a Port every 
time when user creates new VM. It does not happen too frequently, yet, if it 
happens, we have a huge transaction flow at NorthD then this transaction going 
from SBDB to NBDB could be delayed, i.e. this means not every transaction takes 
52 secs in delay, I think it is 25%, that is considerable, of cause.

HZ> Yes, it is better to understand why in this deployment the recompute took 
so long (52s). Is it simply too large scale, or is it because of some uncommon 
configuration that we don't handle efficiently and can be optimized to improve 
recompute performance

A Very good question.
Yes, there is a peculiarity with our custom installation, i.e.: we have 
implemented L3 connectivity to External Provider Network, this customization 
was done at ovn-controller level. That was a biz demand and it is working fine, 
but as a drawback we have got a problem with huge Logical_DP_Groups I described 
earlier: Link1

DC> But the best way to tell is to have a way to reproduce this, e.g., NB/SB
databases and the NB/SB jsonrpc update that caused the recompute

Yes, we are working on this and at this moment we have an ovn-heater 
installation, that is using Vanilla (non-custom) OVN installation, simulating 
the case we have at prod
Could you please tell me the bast way in submitting the dumps for both 
(NB|SB)DBs
I can do an Issue at ovn github: https://github.com/ovn-org/ovn/issues

Thank you
Yours truly, George


From: Numan Siddique <num...@ovn.org>
Date: Wednesday, 8 May 2024, 19:01
To: Шагов Георгий <gmsha...@cloud.ru>, Dumitru Ceara <dce...@redhat.com>, Han 
Zhou <hz...@ovn.org>
Cc: "ovs-discuss@openvswitch.org" <ovs-discuss@openvswitch.org>
Subject: Re: [ovs-discuss] NorthD inc-engine Handlers; OVN 24.03

ВНИМАНИЕ! ВНЕШНИЙ ОТПРАВИТЕЛЬ
Если отправитель почты неизвестен, не переходите по ссылкам, не сообщайте 
пароль,
не запускайте вложения и сообщите коллегам из ЦКЗ на 
secur...@cloud.ru<mailto:secur...@cloud.ru>


On Wed, May 8, 2024 at 8:42 AM Шагов Георгий via discuss 
<ovs-discuss@openvswitch.org<mailto:ovs-discuss@openvswitch.org>> wrote:
Hello everyone

In some aspect it might be considered as a continuation of this thread: 
(link1), yet it is different
After we have upgrade from OVN 22.03 to OVN 24.03, we have indeed found 
increase in performance in 3-4 times
And yet still we do observe high CPU load for NorthD process; taking deeper 
into the logs we have found:


Thanks for reporting this issue.


2024-05-07T08:36:46.505Z|18503|poll_loop|INFO|wakeup due to [POLLIN] on fd 15 
(10.34.22.66:60716<http://10.34.22.66:60716><->10.34.22.66:6642<http://10.34.22.66:6642>)
 at lib/stream-fd.c:157 (94% CPU usage)
2024-05-07T08:37:38.857Z|18504|inc_proc_eng|INFO|node: northd, recompute 
(missing handler for input SB_datapath_binding) took 52313ms
2024-05-07T08:37:48.335Z|18505|inc_proc_eng|INFO|node: lflow, recompute (failed 
handler for input northd) took 7759ms
2024-05-07T08:37:48.718Z|18506|timeval|WARN|Unreasonably long 62213ms poll 
interval (56201ms user, 2900ms system)

As you can see there is a significant delay in 52 secs
Correct me please, if I am in the wrong, but IMU: ‘missing handler for’ – 
practically means absence of the inc-engine handler from some node (in this 
sample: SB_datapath_binding)

That's correct.

Before plunging into Development it would be great to clarify/adjust with 
Community’s position

  *   Why there is not handler for this node?

Our approach has been to add a handler  for any input change only if it is 
frequent or if it can be easily handled.
We also have skipped adding handlers if it increases the code complexity.  
Having said that I think we are open
to adding more handlers if it makes sense or if it results in scale 
improvements.

Right now we fall back to a full recompute of northd engine for any changes to 
a logical switch or logical router.
Does your deployment create/delete logical switches/routers frequently ?  Is it 
possible to enable ovn debug logs
and share them ?  I'm curious to know what are the changes to SB datapath 
binding.

Feel free to share your OVN NB and SB DBs if you're ok with it.  I can deploy 
those DBs and see why recompute is so expensive.



  *   Any particular reason for this or just the peculiarity of our 
installation highlighted this issue?

My guess is that your installation is frequently creating , deleting or 
modifying logical switches or routers.


  *
  *   Do you think there is a reason in implementing that handler? 
(SB_datapath_binding)

I'm fine adding a handler if it helps in the scale.   In our use cases, we 
don't frequently create/delete the logical switches and routers
and hence it is ok to fall back to full recomputes for such changes.


  *

Any ideas are highly appreciated.

You're welcome to work on it and submit patches to add a handler for 
SB_datapath_binding.

@Dumitru Ceara<mailto:dce...@redhat.com> @Han Zhou<mailto:hz...@ovn.org> if 
you've any reservations on adding more handlers please do comment here.

Thanks
Numan




Yours truly, George



  *   Link1: 
https://mail.openvswitch.org/pipermail/ovs-discuss/2024-March/053035.html
УВЕДОМЛЕНИЕ О КОНФИДЕНЦИАЛЬНОСТИ: Это электронное сообщение и любые документы, 
приложенные к нему, содержат конфиденциальную информацию. Настоящим уведомляем 
Вас о том, что если это сообщение не предназначено Вам, использование, 
копирование, распространение информации, содержащейся в настоящем сообщении, а 
также осуществление любых действий на основе этой информации, строго запрещено. 
Если Вы получили это сообщение по ошибке, пожалуйста, сообщите об этом 
отправителю по электронной почте и удалите это сообщение.
CONFIDENTIALITY NOTICE: This email and any files attached to it are 
confidential. If you are not the intended recipient you are notified that 
using, copying, distributing or taking any action in reliance on the contents 
of this information is strictly prohibited. If you have received this email in 
error please notify the sender and delete this email.

_______________________________________________
discuss mailing list
disc...@openvswitch.org<mailto:disc...@openvswitch.org>
https://mail.openvswitch.org/mailman/listinfo/ovs-discuss
УВЕДОМЛЕНИЕ О КОНФИДЕНЦИАЛЬНОСТИ: Это электронное сообщение и любые документы, 
приложенные к нему, содержат конфиденциальную информацию. Настоящим уведомляем 
Вас о том, что если это сообщение не предназначено Вам, использование, 
копирование, распространение информации, содержащейся в настоящем сообщении, а 
также осуществление любых действий на основе этой информации, строго запрещено. 
Если Вы получили это сообщение по ошибке, пожалуйста, сообщите об этом 
отправителю по электронной почте и удалите это сообщение.
CONFIDENTIALITY NOTICE: This email and any files attached to it are 
confidential. If you are not the intended recipient you are notified that 
using, copying, distributing or taking any action in reliance on the contents 
of this information is strictly prohibited. If you have received this email in 
error please notify the sender and delete this email.
_______________________________________________
discuss mailing list
disc...@openvswitch.org
https://mail.openvswitch.org/mailman/listinfo/ovs-discuss

Reply via email to