周末的电话故障排除会议——这已经有一段时间没发生了

啊,记忆……

今天下午,我和家人在数控国家公平,正准备开始我的MBA课程作业当我的即时消息弹出时。“Mike,两天前IOS升级后,总部的广域网出现了问题,他们正在考虑退出升级。”学习吗?故障排除?学习吗?故障排除?好吧,好奇心太大了。另外,我真的很想完成这次IOS升级。所以,我接了电话,我们开始交谈,我们六个人。这是一个很好的例子。 The NetOps team had worked for a bit and determined that内核路由器IOS升级12小时后, OSPF开始运行不稳定。当前与防火墙的OSPF协议状态为down,导致核心路由器到Internet路由器之间无法建立BGP协议。这阻止了缺省路由从Internet路由器通过FW传播到核心路由器.所以总部是在路由到另一个网络中心。它(大部分情况下)对用户是有效的,但我们需要纠正这种情况。奇怪的是,在IOS升级12小时后,OSPF问题就开始了。如果IOS有漏洞,为什么OSPF花了12个小时才失败?您可以预期它在升级后不久就会失败。因此,在核心路由器上降级IOS似乎不是解决办法。经过进一步检查,我们发现广域网交换机(Cat3750堆栈)上有四个设备出现了问题。一个路由器被完全隔离。第二个问题是有问题的防火墙。 One of the core routers was the third device. And fourth was a small voice gatekeeper router. All had problems communicating over the VLANs on this WAN switch, but the WAN switch looked fine. Plus there was nothing in the log that showed an issue 12 hours after the IOS upgrade completed. First, we tackled the isolated router. We could see the router in CDP on the WAN switch, but no CDP on the router. It's was like a unidirectional link, but with copper. After looking for a while to no avail we power cycled the router. It came right back up, all problems solved, OSPF and BGP working. OK....??? Next the firewalls. This one proved trickier. In this case OPSF was showing FULL on the core routers, but was stuck in LOADING on the firewall. While the firewall was stuck in LOADING, no routes from the firewall would show up in OSPF on the core. This was breaking the BGP to the Internet routers. After a while (like 30 minutes), OSPF would reset and go back to stuck in LOADING on the firewall. We bounced interfaces to the firewalls and even rebooted both firewalls and were left with the same problem. Given the firewalls were在加载,我们开始讨论OSPF MTU问题.是的,通常OSPF设备在EXCHANGE/EXSTART中存在MTU问题,但我们注意到这些vlan上的Cisco设备配置了MTU 9198,但防火墙是1500。奇怪的是,这以前从来都不是问题。是的,有一个MTU不匹配,但防火墙被配置为忽略OSPF MTU和一切工作良好的-好-年。在IOS升级12小时后,MTU是否就成了一个问题?好吧,显然是这样。当我们进一步观察时,在核心路由器的IOS升级期间,小语音把关路由器已经成为一个VLAN上的OSPF DR和另一个WAN VLAN上的OSPF BDR。这个小路由器-运行12.4T代码(呃,“朋友不让朋友在他们的网络中运行t代码”),(1)首先不应该是DR/BDR,(2)是MTU问题的根源。当我们在该路由器的GigabitEthernet接口上配置了“ip ospf mtu-ignore”后,防火墙进入ospf FULL状态,到Internet路由器的BGP启动。配置“ip ospf mtu-ignore”命令,强制在两个vlan上进行ospf选举,并允许核心路由器DR/BDR当选,因为它们具有更高的ospf优先级。 The core routers were correctly configured with higher OSPF priority to make them the DR/BDR, but this little router was not configured with "ip ospf priority 0" so it could从来没有成为一个博士/ BDR。无论IOS升级12小时后发生了什么,导致两个核心与VLAN隔离,没有更高优先级的核心路由器的OSPF选举发生了,这个小语音把关人赢得了选举。如果OSPF MTU错误,则该路由器会中断与防火墙的OSPF关系。这就把我们带到了问题的根源,或者说触发点。在ios升级后12小时内发生了什么?让我。我猜是四个设备连接的广域网交换机出了问题,把连接的设备弄乱了。如果有一份记录信息能显示确凿的证据,那将会很有帮助。也许明天就能找到。至少我们知道问题所在,并有了解决方案。 As I write this, BGP from the core routers to the Internet routers has been up 5 1/2 hours. Good! What a "network geek" thrilling 3 hours this afternoon. I haven't done that in years! (and now I get to do myMBA课程作业

更多来自Field博客的>:

Facebook-Skype联盟可能会推动一些严重的视频带宽使用

我们也喜欢隧道- EoMPLS连接两个数据中心雷竞技电脑网站

积极的ROI是广域网转换成为可能的原因

思科宣布股息和少量公司财务显示了思科的变化

广域网路转换是一个巨大的工程

广域网改造是一个去!

思科子网浏览更多思科新闻、博客、论坛、安全警报、图书赠品等。

加入网络世界社区有个足球雷竞技app脸谱网LinkedIn对自己最关心的话题发表评论。

版权©2010Raybet2

工资调查:结果在