Re: Linux BNG

Raymond Burkholder Sat, 14 Jul 2018 10:10:34 -0700

interspersed comments ....

On 07/14/2018 06:13 AM, Baldur Norddahl wrote:

I am investigating Linux as a BNG. The BNG (Broadband Network Gateway) is the thing that acts as default gateway for our customers.
The setup is one VLAN per customer. Because 4095 VLANs is not enough, we have QinQ with double VLAN tagging on the customers. The customers can use DHCP or static configuration. DHCP packets need to be option82 tagged and forwarded to a DHCP server. Every customer has one or more static IP addresses.

Where do you have this happening? Do you have aggregation switches doing this? Are those already in place, or being planned? Because I would make a suggestion for how to do the aggregation.

IPv4 subnets need to be shared among multiple customers to conserve address space. We are currently using /26 IPv4 subnets with 60 customers sharing the same default gateway and netmask. In Linux terms this means 60 VLAN interfaces per bridge interface.

I suppose it could be made to work, but forcing a layer 3 boundary over a bunch of layer 2 boundaries seems like a lot of work. I suppose that would be the brute force and ignorance approach, given the mechanisms you would be using.
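As a quick check on the /26 figure quoted above, here is a minimal sketch using Python's standard `ipaddress` module. The prefix 192.0.2.0/26 is a documentation range picked for illustration, and reserving exactly one gateway address per subnet is an assumption, not something stated in the thread.

```python
import ipaddress


def customer_slots(prefix: str, gateways: int = 1) -> int:
    """Addresses left for customers in a shared subnet.

    Subtracts the network and broadcast addresses plus the gateway
    address(es). The number of gateways is an assumption.
    """
    net = ipaddress.ip_network(prefix)
    usable = net.num_addresses - 2  # drop network and broadcast
    return usable - gateways


# Illustrative prefix only (RFC 5737 documentation range).
print(customer_slots("192.0.2.0/26"))
```

Under those assumptions a /26 leaves at most 61 customer addresses, so the 60 customers per gateway mentioned above fit with one address to spare.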

However Linux is not quite ready for the task. The primary problem being that the system does not scale to thousands of VLAN interfaces.

It probably depends upon which Linux based tooling you wish to use. There are some different ways of looking at this which scale better.

We do not want customers to be able to send non routed packets directly to each other (needs proxy arp). Also customers should not be able to steal another customer's IP address. We want to hard code the relation between IP address and VLAN tagging. This can be implemented using ebtables, but we are unsure that it could scale to thousands of customers.

I would suggest considering the concepts of VxLAN (kernel plus FRR and/or openvswitch) or OpenFlow (kernel plus openvswitch).

VxLAN scales to 16 million vlan equivalents. Which is why I ask about your aggregation layers. Rather than trying to do all the addressing across all the QinQ vlans in the core boxes, the vlans/vxlans and addressing are best dealt with at the edge. Then, rather than running a bunch of vlans through your aggregation/distribution links, you can keep those resilient with a layer 3 only based strategy.
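The 16 million figure comes from the 24-bit VXLAN network identifier (VNI). Two 12-bit VLAN IDs happen to fit into that exactly, so one possible edge mapping is sketched below. The packing scheme and the function names are purely illustrative assumptions; nothing in VXLAN requires VNIs to be derived this way.

```python
VLAN_BITS = 12   # width of an 802.1Q VLAN ID
VNI_BITS = 24    # width of a VXLAN network identifier


def qinq_to_vni(outer: int, inner: int) -> int:
    """Pack an (outer, inner) VLAN pair into one 24-bit VNI.

    Hypothetical mapping for illustration: outer tag in the high
    12 bits, inner tag in the low 12 bits.
    """
    for tag in (outer, inner):
        if not 1 <= tag <= 4094:
            raise ValueError(f"VLAN ID out of range: {tag}")
    return (outer << VLAN_BITS) | inner


def vni_to_qinq(vni: int) -> tuple[int, int]:
    """Inverse of qinq_to_vni."""
    return vni >> VLAN_BITS, vni & ((1 << VLAN_BITS) - 1)


print(2 ** VNI_BITS)            # size of the VNI space
print(qinq_to_vni(100, 200))    # one customer's VNI under this scheme
```

With a mapping like this, each customer's double tag collapses to a single identifier at the edge, and the links behind it only need to carry routed VXLAN traffic.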

I am considering writing a small program or kernel module. This would create two TAP devices (tap0 and tap1). Traffic received on tap0 with VLAN tagging, will be stripped of VLAN tagging and delivered on tap1. Traffic received on tap1 without VLAN tagging, will be tagged according to a lookup table using the destination IP address and then delivered on tap0. ARP and DHCP would need some special handling.

I don't think this would be needed. I think all the tools are already available and are robust from daily use. Free Range Routing with EVPN/(VxLAN|MPLS) for a traditional routing mix, or use OpenFlow tooling in Open vSwitch to handle the layer 2 and layer 3 rule definitions you have in mind.

Open vSwitch can be programmed via command line rules or can be hooked up to a controller of some sort. So rather than writing your own kernel program, you would write rules for a controller or script which drives the already kernel resident engines.
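As a rough idea of what such a script might look like, the sketch below generates `ovs-ofctl add-flow` commands from a table binding each customer IP to a VLAN. It is a minimal sketch under several assumptions: the bridge name `br0`, the port numbers, and the customer table are hypothetical, and it handles a single VLAN tag only. QinQ, ARP, and DHCP would each need further rules that are not shown.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Customer:
    ip: str     # static IPv4 address assigned to the customer
    vlan: int   # VLAN ID the address is bound to


def flows_for(c: Customer, customer_port: int = 1, uplink_port: int = 2) -> list[str]:
    """Two OpenFlow rules tying one IP address to one VLAN.

    Port numbers are assumptions for illustration.
    """
    return [
        # From the customer: accept only the bound source IP on its VLAN.
        f"priority=100,in_port={customer_port},dl_vlan={c.vlan},ip,"
        f"nw_src={c.ip},actions=strip_vlan,output:{uplink_port}",
        # To the customer: tag by destination IP.
        f"priority=100,in_port={uplink_port},ip,nw_dst={c.ip},"
        f"actions=mod_vlan_vid:{c.vlan},output:{customer_port}",
    ]


def default_drop(customer_port: int = 1) -> str:
    """Lower-priority rule: drop customer traffic that matched no binding."""
    return f"priority=10,in_port={customer_port},actions=drop"


# Hypothetical customer table; in practice it would come from provisioning.
customers = [Customer("192.0.2.10", 100), Customer("192.0.2.11", 101)]

for c in customers:
    for flow in flows_for(c):
        print(f'ovs-ofctl add-flow br0 "{flow}"')
print(f'ovs-ofctl add-flow br0 "{default_drop()}"')
```

The point of the sketch is that the per-customer state lives in flow rules rather than in thousands of VLAN interfaces, and the same table could equally be pushed by a controller instead of a script.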

This would be completely stateless for the IPv4 implementation. The IPv6 implementation would be harder, because Link Local addressing needs to be supported and that cannot be stateless. The customer CPE will make up its own Link Local address based on its MAC address and we do not know what that is in advance.

FRR and OVS are IPv4 and IPv6 aware. The dynamics of the CPE MAC would be handled in various ways, depending upon what tooling you decide upon.
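For reference, the MAC-derived Link Local address mentioned above is typically the modified EUI-64 form, sketched below. The MAC address in the example is made up, and this is only one way a CPE may form its address: some devices use randomized or otherwise opaque interface identifiers, which is another reason the address cannot be known in advance.

```python
import ipaddress


def eui64_link_local(mac: str) -> ipaddress.IPv6Address:
    """Derive the modified EUI-64 link-local address for a MAC address."""
    octets = bytearray(int(part, 16) for part in mac.split(":"))
    if len(octets) != 6:
        raise ValueError(f"not a 48-bit MAC address: {mac}")
    octets[0] ^= 0x02  # flip the universal/local bit
    # Insert ff:fe between the two halves of the MAC.
    iid = bytes(octets[:3]) + b"\xff\xfe" + bytes(octets[3:])
    # fe80::/64 prefix followed by the 64-bit interface identifier.
    return ipaddress.IPv6Address(b"\xfe\x80" + bytes(6) + iid)


# Made-up MAC address for illustration.
print(eui64_link_local("00:11:22:33:44:55"))
```

Whatever tooling is chosen would have to learn this address from the CPE's own traffic (neighbor discovery, for instance) rather than from static configuration.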

The goal is to support a minimum of 10 Gbit/s of traffic per server. Ideally I would have a server with 4x 10 Gbit/s interfaces combined into two 20 Gbit/s channels using bonding (LACP). One channel each for upstream and downstream (customer facing). The upstream would be layer 3 untagged and routed traffic to our transit routers.

As mentioned earlier, why make the core boxes do all of the work? Why not distribute the functionality out to the edge? Rather than using traditional switch gear at the edge, use smaller Linux boxes to handle all that complicated edge manipulation, and then keep your high bandwidth core boxes pushing packets only.
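To put a number on what pushing packets at 10 Gbit/s can mean for a single box, here is a back-of-the-envelope calculation of the worst-case packet rate. It assumes untagged Ethernet frames and counts the 20 bytes of preamble, start-of-frame delimiter, and inter-frame gap that accompany every frame on the wire; real traffic mixes will sit somewhere between the two extremes shown.

```python
# Per-frame wire overhead: 7 B preamble + 1 B start-of-frame
# delimiter + 12 B inter-frame gap.
WIRE_OVERHEAD_BYTES = 20


def max_pps(link_bps: int, frame_bytes: int) -> int:
    """Upper bound on frames per second for a given frame size.

    frame_bytes is the Ethernet frame including the FCS
    (64 minimum, 1518 maximum for untagged frames).
    """
    bits_on_wire = (frame_bytes + WIRE_OVERHEAD_BYTES) * 8
    return link_bps // bits_on_wire


TEN_GBIT = 10_000_000_000
print(max_pps(TEN_GBIT, 64))     # smallest frames: worst case for the CPU
print(max_pps(TEN_GBIT, 1518))   # full-size frames
```

At minimum frame size that is close to 15 million packets per second per 10 Gbit/s port, which is why keeping the per-packet work on the high bandwidth boxes as small as possible matters.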

I am looking for comments, ideas or alternatives. Right now I am considering what kind of CPU would be best for this. Unless I take steps to mitigate, the workload would probably go to one CPU core only and be limited to things like CPU cache and PCI bus bandwidth.

There is much more to write about, but those writings would depend upon what you already have in place, what you would like to put in place, and how you wish to segment your network.


Hope this helps.

Baldur


--
Raymond Burkholder
r...@oneunified.net
https://blog.raymond.burkholder.net
