Get Flat
This is where flat networks come in. They can lower latency, provide access to more bandwidth, and increase the return on investment of a data center's network infrastructure.
There are several ways to design a flat network or fabric. You can use TRILL, SPB, or several proprietary approaches from infrastructure vendors. I separate these approaches into two groups.
Group one, including TRILL (from the IETF), SPB (from the IEEE), Cisco's FabricPath, and Brocade's Virtual Chassis Switching (VCS), aims to make LAN switches smarter. At a high level, these technologies all address the learning and distribution of forwarding paths using a link state routing protocol (IS-IS, which plots the best path through the network) and the forwarding of traffic from Host A to Host B across multiple paths. However, they differ significantly in their execution.
TRILL adds a new frame type (a TRILL frame) to normal Ethernet frames and forwards them between ingress and egress RBridges. Think of RBridges as ZIP codes. The final street address, say, the destination MAC of Host B, doesn't matter until you are in the right ZIP code (egress RBridge). The ingress RBridge maintains a mapping of final MAC addresses to ZIP codes (RBridges) and then all intermediate switches move traffic from one ZIP code to the other. This additional encapsulation header also includes a TTL field to help stop flooding produced by physical loops.
Unlike TRILL, SPB doesn't encapsulate traffic in a new frame type. Instead, it uses either Q-in-Q or MAC-in-MAC encapsulation, which is available on many switch models. Once it learns its mapping of destination address-to-egress switch through IS-IS, it encapsulates the original frame in a new Q-in-Q or MAC-in-MAC frame and readdresses the new frame to the egress switch.
SPB and TRILL both support multiple paths, faster failover than Spanning Tree, and increased reachability. SPB proponents point to its support of legacy ASICs because no new frame format has to be added to the frame. In other words, you may only need a software upgrade to support SPB, while TRILL will likely require new hardware to handle the TRILL frame format.
Note that while TRILL and SPB solve some of the limitations that are introduced by STP, these technologies also have potential downsides. For example, their topologies may require significant security configuration to ensure that switches are communicating only with other authorized switches.
Cisco FabricPath and Brocade VCS are similar to TRILL in that they encapsulate traffic for transport across a fabric backbone. However, these are proprietary technologies that don't interoperate with TRILL. In fact, they both work differently with existing Spanning Tree networks. VCS either passes your Spanning Tree info through its fabric or drops it entirely. FabricPath terminates Spanning Tree domains at the edge of its fabric.
The second group of technologies tries to eliminate Spanning Tree's inefficiencies by eliminating the tiered design altogether. This approach treats all switches as one giant switch or fabric. Juniper's QFabric is an example. Essentially, the physical switches act as blades within a giant chassis. In Juniper's model, the QF/Director handles the control plane, coordinating the individual switches. The entire fabric not only looks like one giant switch to connected hosts, as with TRILL or SPB, but is also managed as one giant switch.
Another alternative is Multi-Chassis Link Aggregation, variations of which are offered by most major switch vendors. MLAG switches act as a single switch for downstream STP bridges to eliminate Spanning Tree blocking of redundant paths. This allows for better utilization of links and simplifies management. Most MLAG implementations are limited to two aggregated switches.
A flat LAN architecture holds a lot of promise, but we don't recommend blind adoption. If your engineers don't have time to set up a Spanning Tree root bridge priority in order for the Spanning Tree algorithm to choose the best possible tree, can you really expect that TRILL or SPB or any other more complicated setup will be tuned properly?
The LAN stability provided by Spanning Tree shouldn't be taken for granted; you must clearly understand the potential ramifications of new flattening technologies. There's a reason tiered network architectures are so prevalent: They work. If you decide to go flat, apply these new approaches judiciously.
Jeremy Littlejohn is president of consulting firm RISC Networks. Write to us at [email protected].