Hi Gents,
For a few days now I have been battling an issue with a GuardLogix system at one of my customer's sites. The system has the CPUs and three 1756-ENBT/A cards, f/w 6.6 each.
One of the 1756-ENBTs is plugged into port 1 of a Stratix 5700 managed switch. The system has three POINT I/O racks with 1734-AENT/A adapters (f/w 3.12) plugged into ports 2, 3 & 4 respectively. All ports on the switch are configured with Smartports and set as "AUTOMATION DEVICE".
Occasionally the system crashes on a comms fault. By comms fault I mean that faults 516 & 515 come up either on the AENTs or on one or more of the cards attached to the AENTs, cards like the 1734-IB8S/A or 1734-OB8S/A. The fault moves around: some days AENT adapter 1 faults, other days AENT adapter 2 faults. The system recovers in about 11 seconds and everything resumes, but it trips the whole system in the process. This is bad, as the system is linked to another system which is running a coal mine.
The cabling is good: we replaced the CAT5 with new CAT6 cables, routed basically on the ground as a temporary measure until we find the problem. The longest CAT6 run is probably 12 m. Everything is physically in the same room.
We have also increased the RPIs and the timeout multipliers for the cards.
We have monitored the traffic between port 4 and port 1, and what I captured there using Wireshark is absolutely puzzling.
When the system is working OK, the protocol shown between the 1756-ENBT & the 1734-AENT is ENIP. When the system trips on a comms fault, Wireshark shows the protocol as CIP SAFETY for a few transactions, then ENIP again for a few packets, then CIP SAFETY again, and the whole thing carries on like this.
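One way to look at a capture like this independently of the protocol label is to check the packet arrival times for gaps. Below is a minimal sketch in Python, assuming the timestamps for one direction of one connection have been exported from Wireshark as a plain list of seconds. The function name `find_gaps`, the threshold value and the sample numbers are all hypothetical and not taken from the real capture; the threshold would need to be chosen from the RPI and timeout settings actually configured for that connection.

```python
from typing import List, Tuple


def find_gaps(timestamps: List[float], threshold_s: float) -> List[Tuple[float, float]]:
    """Return (start_time, gap_length) for every inter-packet gap longer than threshold_s."""
    ordered = sorted(timestamps)
    gaps = []
    # Compare each packet time with the one that follows it
    for prev, cur in zip(ordered, ordered[1:]):
        delta = cur - prev
        if delta > threshold_s:
            gaps.append((prev, delta))
    return gaps


if __name__ == "__main__":
    # Hypothetical arrival times in seconds (illustration only)
    sample = [0.00, 0.02, 0.04, 0.06, 0.30, 0.32]
    for start, length in find_gaps(sample, threshold_s=0.08):
        print(f"gap of {length:.3f} s after t={start:.3f} s")
```

If gaps show up shortly before each fault, that would point towards packets being delayed or dropped somewhere on the path rather than towards the protocol label itself.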
As far as I understood KB article 573950, the protocol should always be CIP SAFETY, as the task itself is a safety task?
Rockwell support has so far failed to explain why I see this change in protocol from ENIP to CIP SAFETY, and whether it is the actual cause of the random faults or whether the faults come first and the system then switches to CIP SAFETY (perhaps to ensure that the safety I/O comms is OK?).
Could you please help with some ideas on how we could troubleshoot this? And most importantly, does someone know why I have this ENIP to CIP SAFETY protocol change?