I'm still a bit uncertain about the way that I've set up my Linux box and
the switches correctly in my quest for Ethernet channel bonding.
Goal: to bond eth0 and eth1 on each blade and thus attain close to 2Gbps
transmit and receive. i.e. desire Load balancing / bandwidth
aggregation. Do *not* care at all about fault-tolerance.
Equipment: 1 server (3 eth ports), 23 blades, 2 Dell 6248 switches. Each
switch has 48 Gbit ports.
I setup bond0 on each blade. Used mode=6. Adaptive load balancing. From
what I read this seems most suitable (correct me if I am wrong please!)
since supports both transmit and receive side balancing.
Now comes the confusing parts:
1. Do I need Ling Aggregation Groups (LAGs) on the switch or not for my
switch-to-port connections? I receive multiple conflicting views on this
online.
http://www.linuxfoundation.org/en/Net:Bonding says "does not
require any special switch support...does not require any special switch
support..." So do many other tutorials that do not mention anything about
any switch side configs being required at all!
Others say it still needs LAGs. My "common-sense" says I ought to tell
the switch that two of my ports are going to the same blade somehow.
2. Will the switch see two MAC ids or just a single one for the bond0
device if I examined its address tables? For alb it ought to be both
right? But I tried examining the ARP tables on the server and there for
each blade IP only the bond0 MAC is listed. Is that a sign something is
wrong or just my misinformed-ignorant paranoia! I read the specs on all
the 6 bonding algorithms (some load balancing and others for fault
tolerance) and see that some seem to transmit both MAC ids and other just
a single one? True?
3. How about the switch-to-switch connections? If I want to connect 8 eth
cables switch-to-switch (to aggregate bandwidth again) do I need a LAG
here or not? (8 is the magic number because thats the max number of ports
my switch will allow me to aggregate).
4. Each LAG group has a LACP option. Enable or not? The core Linux specs.
seem to have no mention of LACP; only company specific info seems to
exist! (Cisco, Dell etc.) I guess LACP is related to IEEE 802.3ad? Is
that only a workaround to prevent having to manually aggregate ports into
LAG groups? Or does it have an advantage as a load-balancing protocol to
my chose "Adaptive load balancing"
I guess it boils down to two questions: (1) To LAG or not-to-LAG (same
for LACP) (2) Is my mode=6 (Adaptive load balancing) the appropriate
mode?
--
Rahul