在过年前几天,一台GSR的1块3GE-GBIC-SC板卡出现问题,该板卡3个GE口连接的设备无法PING通,通过CDP无法看到,但是显示interface is up, line protocol is up,而且该板卡是被正常认出来,因为通过show diags命令是可以看到的,系统日志也无任何有用的信息。
由于是核心设备,影响用户比较多,我记得以前也出现过相同问题,不过当时设备在检修中,我怀疑是相同问题。临时处理办法是将3GE板卡拔出来,将板卡上的内存重新拔插后就恢复工作了(注:仅拔插板卡是没用的),但出现内在错误信息如下:
SLOT 1:Jan 19 17:21:03 GMT+8: %LC-6-PSAECC: An TLU SDRAM ECC correctable error occured address 1807C45
SLOT 1:Jan 19 17:30:18 GMT+8: %LC-6-PSAECC: An TLU SDRAM ECC correctable error occured address 181174D
SLOT 1:Jan 19 18:42:09 GMT+8: %LC-6-PSAECC: An TLU SDRAM ECC correctable error occured address 180FCA6
SLOT 1:Jan 20 08:35:55 GMT+8: %LC-6-PSAECC: An TLU SDRAM ECC correctable error occured address 18189E4
后向Cisco开Case重新更换一块板卡解决内存报错问题。