Re: Output packet processing (was stretch ACKs, etc.)

Mark Butler Sat, 25 Mar 2006 14:32:55 -0800

David S. Miller wrote:

From: Mark Butler <[EMAIL PROTECTED]>
Date: Fri, 24 Mar 2006 22:37:26 -0700
On a more general note, I find the idea that a current dst entry doesn'tactually reflect the interface (even a logical interface) and nexthopthat will be used to deliver a packet a little disturbing. It wouldseem to me that any filter that is going to re-route a packet to adifferent address or a different interface should be a logical device(with its own IP address) or logical interface, respectively.Otherwise what is going on is completely invisible to the transportprotocol, as well as users of tools like traceroute.
Welcome to firewalls and NAT.

A true firewall should never need to do anything but drop packets andreset connections. Changes to the way packets are routed should be doneat the routing layer, using the flow information from the transportlayer. Simple firewall rules should be implemented the same way. Bythe time a dst entry is returned, the need for NF output chainprocessing should be minimal to non-existent.Serialized processing of every IP packet, whether it needs it or not isridiculously inefficient. No high capacity router would operate thatway. A route decision for a flow would be made once, and data in mostflows would use a fast (generally hardware) path without furtherconsideration.

Of course NAT processing only needs to be done on the NF forward chain,not the input or output chains. No need to affect local transportprotocols at all. The need for any kind of NF processing should bereflected in the routing tables, and echoed in the dst entry (or dstentry stack).There has been discussion of Van Jacobson style optimization of theinput chain. Well the quickest way to optimize the output chain would beto return filtered routing information to the transport layer so that atransport protocol could run its own output processing. For example,why should IPSEC encryption be delayed to the moment of transmission?Why should a re-transmitted packet be re-encrypted? Performance wouldbe improved significantly if a transport protocol could arrange forIPSEC transformations to be done in advance, so that when a congestionwindow opening ACK arrived, data could be transmitted without furtherdelay. Same deal for retransmissions. IPSEC encryption would thengenerally occur in the process context of the sender, rather thansoftirq context at the last possible moment.

Same thing for Neighbor discovery delays and IP fragmentation. Insteadof holding a packet somewhere in the IP layer waiting for an ARP reply,the transport driver should just get an appropriate notification. Thenit could (for example) bundle additional data into the same packet inthe meantime.

Transports could easily hold IP fragments for further processing aswell. Some of them (notably DCCP) can profitably make use of IPdatagrams with missing segments. Other transports could use theinformation to make better determinations about congestion and packetloss. In any case IP segmentation and reassembly at the transport layerwould be more efficient and would be a straight forward extension ofwhat is already present for anything more sophisticated than UDP.

You don't know anything until the packet is examined by the filter,
because it's impossible to know what rule would be matched until the
packet is actually built, since the rule matching is on packet
contents (such as the source and destination IP addresses, and source
and destination ports, but more obscure mathing is also possible, like
matching by TOS or other IP header flags).

The flowi structure already contains all that information for routingpurposes. No reason why it could not be used to do early netfilterreduction as well. Right?


- Mark B.


-
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: Output packet processing (was stretch ACKs, etc.)

Reply via email to