Hi Kuba. I'm looking into the cls_bpf code and the nfp_net_bpf_offload function in your driver. Why do you need TC_CLSBPF_ADD? It seems like TC_CLSBPF_REPLACE should be enough. Dropping ADD would make the cls_bpf code simpler.
Note that the other classifiers just have replace/destroy (u32 effectively too, as drivers handle NEW/REPLACE in one switch case; I will patch this). Thanks! Jiri