On Sun, Sep 1, 2024 at 10:59 PM Indivar Nair <[email protected]> wrote: > > Hello All, > > I have a 2-node cluster on which I am trying to load the > drdb_transport_rdma.ko modules. > > The nodes have - > - Rocky Linux 9.1 (Kernel 5.14.0-162.23.1) > - NVIDIA/Mellanox ConnectX-5 EN 100GB NIC > - MLNX_OFED_LINUX-23.10-3.2.2.0-rhel9.1-x86_64 drivers > - DRBD 9.2.3 (compiled on the same machine) > > I have connected the 100G Ethernet (RoCE) ports back-to-back with a > short DAC cable. > Tests with perftest tools (ib_send_bw and ib_read_bw) show proper > connectivity. RoCE is working properly. > > But, I get the following error when I try to load the > drdb_transport_rdma.ko module > --------------------------------------------------------------------------------------- > drbd_transport_rdma: disagrees about version of symbol __ib_alloc_pd > drbd_transport_rdma: Unknown symbol __ib_alloc_pd (err -22) > drbd_transport_rdma: disagrees about version of symbol rdma_resolve_addr (snip) > --------------------------------------------------------------------------------------- > What could be the issue? > Thanks > > Regards, > Indivar Nair
Looks like the kernel modules you built do not match the running kernel. Rocky Linux 9.1 is obsolete and it has many security vulnerabilities. Can you update it to the current 9.4? If you can, then I suggest you use ELRepo's kmod-drbd9x package. It is currently at version 9.2.11 and is available from the elrepo-testing repository. If for some reason you cannot update the OS, make sure you build your modules against the kernel in use. Akemi
