Hi Robin,
Apologies for the delayed response
On 19/03/2024 20:38, Robin Jarry wrote:
Hi Vladimir,
Medvedkin, Vladimir, Mar 19, 2024 at 18:16:
> 2) Is it OK/safe to modify a fib from a control thread (read/write)
> while it is used by data path threads (read only)?
This part is a bit more complicated. In practice, I would say yes,
however, there is a possibility that if the lookup thread is
preempted in the middle of the lookup process, and at the same time
the control thread deletes the corresponding route, then the lookup
result may return outdated data. This problem is solved in LPM with
RCU enabled. I have plans to implement it in the near future in the FIB.
OK that's good to know, thanks.
> 3) There is no public API to list/walk all configured routes in
> a fib. Would that be possible/easy to implement?
Yes, it already there. FIB under the hood uses rte_rib to hold
existing routes. So walking through can be implemented like:
I had tried it and got confusing results out of this. This must have
been before I had realized that all addresses needed to be in host
order...
I tried again and it works as advertised with a small missing detail:
after configuring a default route, e.g.:
rte_fib_add(fib, RTE_IPV4(2, 2, 0, 0), 16, RTE_IPV4(1, 2, 3, 4));
rte_fib_add(fib, RTE_IPV4(3, 3, 3, 0), 24, RTE_IPV4(4, 3, 2, 1));
rte_fib_add(fib, RTE_IPV4(0, 0, 0, 0), 0, RTE_IPV4(9, 9, 9, 9));
It is not returned by rte_rib_get_nxt() successive calls. I only see
the other two routes:
2.2.0.0/16 via 1.2.3.4
3.3.3.0/24 via 4.3.2.1
Is this expected?
Yes, it is expected. It is also reflected in API: "Retrieve next more
specific prefix ...". So, in your case you should explicitly lookup 0/0
route.
IfindthismoreconvenientfordataplanestructureslikeDIR24-8,whereIneedto
findgaps forsomegivensuperprefix.
> 4) In rte_fib, every IPv4 address (route *and* next hop) needs to
be > in host order. This is not consistent with fib6 where
addresses > are stored in network order. It took me quite a while
to figure > out what was wrong with my code.
This API behavior was created in such a way that it is the same as LPM.
As for LPM, I think it was done this way for performance reasons
because in some scenarios you only working with the host order ipv4
addresses.
This should really be advertised in strong capital letters in the API
docs. Or (preferably) hidden to the user. I don't see any valid
scenario where you would work with host order IPv4 addresses.
I just implemented lookup the same way as LPM. As for valid scenario,
years ago I used an LPM/FIB lookup on a huge text log file(it was nginx
logs if I remember correctly) with hundreds of million lines with IP
addresses to resolve corresponding AS numbers for some statistics. The
macro I used converted substrings with IPv4 into unsigned integers in
host byte order. So, it is not always true that IPv4 are in network byte
order.
Do you think we could change that API or at least add a flag at
FIB/RIB creation to make it transparent to the user and consistent
between IPv4 and IPv6?
Yes, I will add FIB configuration option to allow BE IPv4 as an input
for lookup function.
Thanks!
--
Regards,
Vladimir