Tuesday, September 22, 2020

Excessive flapping with object tracking applied to HSRP?

Hi All,

I'm currently facing an issue with excessive HSRP flaps, where the tracked object is repeatedly detected as down. In the IP SLA statistics, however, I can see only 2-3 failures per 1-hour interval.

Device: ISR4331 v16.6.7

Configuration:

ip sla 1
 icmp-echo 192.168.1.1 source-interface GigabitEthernet0/0/0   <--- GATEWAY
 frequency 5
ip sla schedule 1 life forever start-time now
track 1 ip sla 1 reachability
interface Port-channel1.100
 encapsulation dot1Q 100
 ip address 10.1.1.2 255.255.255.224
 no ip redirects
 no ip unreachables
 no ip proxy-arp
 standby 1 ip 10.1.1.1
 standby 1 priority 115
 standby 1 preempt
 standby 1 track 1 decrement 20
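To make the tracking arithmetic in this configuration concrete, here is a minimal Python sketch. The priority 115 and decrement 20 come from the configuration above. The peer priority of 100 is an assumption (the HSRP default), because the peer router's configuration is not shown, and the comparison ignores tie-breaking.

```python
# Sketch of how `standby 1 track 1 decrement 20` changes the HSRP priority.
CONFIGURED_PRIORITY = 115    # standby 1 priority 115
TRACK_DECREMENT = 20         # standby 1 track 1 decrement 20
ASSUMED_PEER_PRIORITY = 100  # ASSUMPTION: peer uses the HSRP default priority


def effective_priority(track_up: bool) -> int:
    """Priority this router advertises for group 1."""
    if track_up:
        return CONFIGURED_PRIORITY
    return CONFIGURED_PRIORITY - TRACK_DECREMENT


def wins_election(track_up: bool, peer_priority: int = ASSUMED_PEER_PRIORITY) -> bool:
    """True if this router has the strictly higher priority (ties ignored)."""
    return effective_priority(track_up) > peer_priority


if __name__ == "__main__":
    for state in (True, False):
        print(f"track up={state}: priority={effective_priority(state)}, "
              f"wins={wins_election(state)}")
```

Under these assumptions the priority falls from 115 to 95 whenever track 1 is Down, so a peer with preempt enabled would take the active role each time the tracked object goes down, however briefly.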

LOG-1:

<cut>
Sep 22 11:51:52.427 GMT: %HSRP-5-STATECHANGE: Port-channel1.100 Grp 2 state Speak -> Standby
Sep 22 11:57:30.471 GMT: %TRACK-6-STATE: 1 ip sla 1 reachability Up -> Down
Sep 22 11:57:30.727 GMT: %HSRP-5-STATECHANGE: Port-channel1.100 Grp 1 state Active -> Speak
Sep 22 11:57:35.472 GMT: %TRACK-6-STATE: 1 ip sla 1 reachability Down -> Up
Sep 22 11:57:36.443 GMT: %HSRP-5-STATECHANGE: Port-channel1.100 Grp 1 state Speak -> Active
Sep 22 12:17:15.552 GMT: %TRACK-6-STATE: 1 ip sla 1 reachability Up -> Down
Sep 22 12:17:17.859 GMT: %HSRP-5-STATECHANGE: Port-channel1.100 Grp 1 state Active -> Speak
Sep 22 12:17:20.553 GMT: %TRACK-6-STATE: 1 ip sla 1 reachability Down -> Up
Sep 22 12:17:23.038 GMT: %HSRP-5-STATECHANGE: Port-channel1.100 Grp 1 state Speak -> Active
Sep 22 12:19:40.564 GMT: %TRACK-6-STATE: 1 ip sla 1 reachability Up -> Down
Sep 22 12:19:41.539 GMT: %HSRP-5-STATECHANGE: Port-channel1.100 Grp 1 state Active -> Speak
Sep 22 12:19:45.564 GMT: %TRACK-6-STATE: 1 ip sla 1 reachability Down -> Up
Sep 22 12:19:47.168 GMT: %HSRP-5-STATECHANGE: Port-channel1.100 Grp 1 state Speak -> Active
Sep 22 12:20:50.569 GMT: %TRACK-6-STATE: 1 ip sla 1 reachability Up -> Down
Sep 22 12:20:52.181 GMT: %HSRP-5-STATECHANGE: Port-channel1.100 Grp 1 state Active -> Speak
Sep 22 12:20:55.569 GMT: %TRACK-6-STATE: 1 ip sla 1 reachability Down -> Up
Sep 22 12:20:57.146 GMT: %HSRP-5-STATECHANGE: Port-channel1.100 Grp 1 state Speak -> Active
Sep 22 12:31:26.357 GMT: %HSRP-5-STATECHANGE: Port-channel1.100 Grp 2 state Standby -> Active
Sep 22 12:31:31.629 GMT: %HSRP-5-STATECHANGE: Port-channel1.100 Grp 2 state Active -> Speak
Sep 22 12:31:40.617 GMT: %TRACK-6-STATE: 1 ip sla 1 reachability Up -> Down
Sep 22 12:31:42.365 GMT: %HSRP-5-STATECHANGE: Port-channel1.100 Grp 2 state Speak -> Standby
Sep 22 12:31:42.914 GMT: %HSRP-5-STATECHANGE: Port-channel1.100 Grp 1 state Active -> Speak
Sep 22 12:31:48.005 GMT: %HSRP-5-STATECHANGE: Port-channel1.100 Grp 1 state Speak -> Active
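As a rough way to quantify how long each track outage in LOG-1 lasts, the following Python sketch parses the %TRACK-6-STATE messages and measures the time between each "Up -> Down" and the next "Down -> Up". The log lines are copied from LOG-1; the function name, the regular expression, and the year 2020 are illustrative choices, not part of any Cisco tooling.

```python
import re
from datetime import datetime

# %TRACK-6-STATE lines copied from LOG-1 (HSRP lines omitted for brevity).
LOG = """\
Sep 22 11:57:30.471 GMT: %TRACK-6-STATE: 1 ip sla 1 reachability Up -> Down
Sep 22 11:57:35.472 GMT: %TRACK-6-STATE: 1 ip sla 1 reachability Down -> Up
Sep 22 12:17:15.552 GMT: %TRACK-6-STATE: 1 ip sla 1 reachability Up -> Down
Sep 22 12:17:20.553 GMT: %TRACK-6-STATE: 1 ip sla 1 reachability Down -> Up
Sep 22 12:19:40.564 GMT: %TRACK-6-STATE: 1 ip sla 1 reachability Up -> Down
Sep 22 12:19:45.564 GMT: %TRACK-6-STATE: 1 ip sla 1 reachability Down -> Up
Sep 22 12:20:50.569 GMT: %TRACK-6-STATE: 1 ip sla 1 reachability Up -> Down
Sep 22 12:20:55.569 GMT: %TRACK-6-STATE: 1 ip sla 1 reachability Down -> Up
Sep 22 12:31:40.617 GMT: %TRACK-6-STATE: 1 ip sla 1 reachability Up -> Down
"""

PATTERN = re.compile(
    r"^(\w{3} +\d+ \d{2}:\d{2}:\d{2}\.\d{3}) GMT: %TRACK-6-STATE: "
    r"\d+ ip sla \d+ reachability (Up|Down) -> (Up|Down)$"
)


def down_durations(log_text: str, year: int = 2020) -> list[float]:
    """Seconds between each 'Up -> Down' and the following 'Down -> Up'."""
    durations = []
    went_down = None
    for line in log_text.splitlines():
        match = PATTERN.match(line.strip())
        if not match:
            continue  # skip anything that is not a track state message
        stamp = datetime.strptime(f"{year} {match.group(1)}", "%Y %b %d %H:%M:%S.%f")
        if match.group(3) == "Down":
            went_down = stamp
        elif went_down is not None:
            durations.append(round((stamp - went_down).total_seconds(), 3))
            went_down = None
    return durations


if __name__ == "__main__":
    print(down_durations(LOG))
```

Every completed outage in this excerpt lasts about 5 seconds, which matches the `frequency 5` probe interval. That is consistent with the tracked object going down on one failed probe and recovering on the next successful one.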

The track is detected as Up, but the HSRP state still changes:

LOG-2:

Sep 22 11:03:30.277 GMT: %TRACK-6-STATE: 1 ip sla 1 reachability Down -> Up
Sep 22 11:03:31.391 GMT: %HSRP-5-STATECHANGE: Port-channel1.100 Grp 1 state Speak -> Active
Sep 22 11:25:51.346 GMT: %HSRP-5-STATECHANGE: Port-channel1.100 Grp 2 state Standby -> Active
Sep 22 11:25:56.597 GMT: %HSRP-5-STATECHANGE: Port-channel1.100 Grp 2 state Active -> Speak
Sep 22 11:26:06.966 GMT: %HSRP-5-STATECHANGE: Port-channel1.100 Grp 2 state Speak -> Standby
Sep 22 11:27:15.240 GMT: %HSRP-5-STATECHANGE: Port-channel1.100 Grp 2 state Standby -> Active

IP SLA STATS:

IPSLA operation id: 1
        Type of operation: icmp-echo

Start Time Index: 12:12:46 GMT Tue Sep 22 2020
RTT Values
        Number Of RTT: 326
        RTT Min/Avg/Max: 7/19/65 milliseconds
Number of successes: 326
Number of failures: 4   <-------------

Start Time Index: 11:12:46 GMT Tue Sep 22 2020
RTT Values
        Number Of RTT: 718
        RTT Min/Avg/Max: 7/13/110 milliseconds
Number of successes: 718
Number of failures: 1   <-------------
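To put these counters next to the syslog messages, here is a small Python sketch that computes the failure rate for each statistics interval and counts how many "reachability Up -> Down" messages from LOG-1 fall inside it. The success and failure counts and the timestamps are copied from this post. Treating each interval as exactly one hour from its Start Time Index is an assumption, and the function names are illustrative.

```python
from datetime import datetime, timedelta

# (interval start, successes, failures) copied from the IP SLA statistics.
BUCKETS = [
    (datetime(2020, 9, 22, 12, 12, 46), 326, 4),
    (datetime(2020, 9, 22, 11, 12, 46), 718, 1),
]

# Times of "reachability Up -> Down" messages copied from LOG-1 (ms dropped).
TRACK_DOWN = [
    datetime(2020, 9, 22, 11, 57, 30),
    datetime(2020, 9, 22, 12, 17, 15),
    datetime(2020, 9, 22, 12, 19, 40),
    datetime(2020, 9, 22, 12, 20, 50),
    datetime(2020, 9, 22, 12, 31, 40),
]

# ASSUMPTION: each statistics interval covers one hour from its start time.
BUCKET_LENGTH = timedelta(hours=1)


def failure_rate(successes: int, failures: int) -> float:
    """Fraction of probes that failed in one interval."""
    return failures / (successes + failures)


def downs_in_bucket(start: datetime, events: list[datetime]) -> int:
    """Number of track Down events inside [start, start + BUCKET_LENGTH)."""
    return sum(1 for t in events if start <= t < start + BUCKET_LENGTH)


if __name__ == "__main__":
    for start, ok, failed in BUCKETS:
        print(f"{start:%H:%M:%S}: failures={failed}, "
              f"rate={failure_rate(ok, failed):.2%}, "
              f"track downs={downs_in_bucket(start, TRACK_DOWN)}")
```

With the data above, the number of track Down events in each interval equals the number of IP SLA failures in that interval (4 and 1). This suggests that each individual failed probe, not the overall failure rate, is what drives the track state change.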

As shown above, the HSRP state changes every 10-20 minutes, and this affects network connectivity because the active router changes each time.

Note that the transport has been verified, and several circuit tests have been completed to fully verify the circuit.

Question:

  1. The IP SLA stats show 1 failure against 718 successes and 4 failures against 326 successes. Since the failure rate is very low, why is the router still detecting the track as down and continuing to generate logs?
  2. In "LOG-2" you can see that "1 ip sla 1 reachability Down -> Up" shows the track is up again, but the HSRP states continue to change.
  3. What other verification should be conducted?

Thanks


