Couple questions.
Should you always completely separate the storage network switches from production switches or are VLANs fine to segment this traffic? Is there a golden rule here?
How do you properly size a switch for your environment based on the specifications the manufacturer provide (Throughput, Forwarding Throughput, Stacking Throughput, Max Mac)?
If you have two switch options and one has a maximum Mac address of 8,000 vs. another with 16,0000. What does this really mean to me? How do make sure one vs. another is sized properly for me?
Besides VLAN and Jumbo Frame support, is there any other "Must" haves for a virtual environments production or storage networks?
There is a wealth of knowledge on sizing SANs and such, but this seems equally important and it's quite challenging to find as much information.
--
Just to add some tidbits of information for the environment. This setup above is referring to the data centers which supports two different locations which have about 100 users between the two in total. The storage traffic will be iSCSI and will be 3 ESXi Hosts and one SAN housing about 2.7TB of data.
Since there is currently no storage network in place (no SAN), I'm having a hard time regarding #2 to really determine what backplane throughput and switch specifications will be sufficient.
There is no golden rule here. Physical switch segregation happens when the storage network demands more features/reliability than the data-network environment can provide. But if the data network stack is robust enough, a VLAN is all that's needed to segregate.
If you're dropping in a new storage network, you go with physical separation for two major reasons:
You determine the requirements of your network, and match those to the specifications. Kinda broad, but that's what you do. Things to pay attention to in your modeling:
How big are your L2 networks? You need one MAC address for each network interface (physical or virtual) on a network, and the switch needs to track all of those so it knows how to forward packets through its switching infrastructure. An 8K MAC table means that it can accommodate L2 networks with up to about 8K clients on them. That's all.
Storage networking protocols are less tolerant of out-of-order arrival than other TCP protocols, so engineering to prevent such is much more important than it is for general data networks. There is no bullet-point on the feature list to look for, though.
Storage stacks are very sensitive to latency, so minimize that. If you're in a large datacenter with a high port-density, you'll probably want to go with a chassis+blade style switch over stackables as they tend to be lower latency between peers (sub-milisecond latencies do add up).
Large virtualization hosts generally can take advantage of trunking (LACP, 802.11ad, EtherChannel), which allows higher per-host bandwidth utilization and multiple VLAN support inside the VM environment.