Thursday, December 20, 2018

Multicast drops caused by unicast microburst

Hello /r/networking

I'm currently arguing a bit with a netadmin about a packet drop issue and I need some sanity check and help from you guys.

We have around 1Gbps of multicast video traffic (constant traffic) coming into a pair of Cisco Nexus 5548P and egressing on a 10G port (it should grow to 2.5G in the future)

We also have an average of 500Mbps of HTTP traffic coming from multiple servers into the same pair of Cisco Nexus 5548P and egressing on the same 10G port, but in a not so distant future the HTTP traffic could fill up the 10G at peak time If we block this HTTP traffic, multicast is fine, and when the HTTP traffic is present we experience some multicast thus impacting the video.

The theory is that the issue is caused by microburst, and this is a good explanation for me, but I'm convinced this can and should be fixed on the switch side, he is convinced everything was tried so the problem need to be solved server side, and the final client just want everything to work (3 companies)

Reading some Cisco Qos documentation, for example https://www.cisco.com/c/en/us/td/docs/switches/datacenter/nexus5600/sw/qos/7x/b_5600_QoS_Config_7x/b_6k_QoS_Config_7x_chapter_01001.pdf my understanding is that disabling "Priority flow control", enabling "Link-Level Flow Control", configuring "pause no-drop" on a Qos class map should do the trick

Link-Level Flow Control was enabled on friday (it was already enabled on the servers NICs) but i'm not receiving any pause frame ("ethtool -S ethX | grep flow_control" gives only 0)

My questions are: - do you think we can make "Link-Level Flow Control" work and fix this issue ? (applying no-drop policy to either the multicast or the HTTP or both) - any other knob to ensure that we never drop multicast ?

Thanks



No comments:

Post a Comment