Use these sections to identify and correct Ethernet packet loss:
See "Ethernet Packet Loss Reference" for additional conceptual and problem analysis detail.
If your Ethernet network shows signs of congestion, it may be experiencing packet loss. When your network is congested, utilization is usually high, packets are discarded because buffers are full, and collision rates are up. Problems related to "Collisions" are often at the heart of packet loss.
Collisions are normal in Ethernet networks. In many cases, Collision rates of 50 percent do not cause a large decrease in throughput. The Collision rate helps mark the upper limit on your network (the maximum percentage of collisions that your network can bear), which is usually around 70 percent. If Collisions increase above this upper limit, your network can become unreliable.
When the Collision rates increase, so do "Excessive Collisions", which causes a delay in transmitting data. An increase in Collisions also indicates that network utilization and network errors, such as "FCS Errors", are probably increasing.
The real packet problems to watch for, however, are undetected collisions that show up as "Late Collisions".
If small packets are colliding, you do not necessarily see a rise in utilization, but you may still have a problem. Capture packets to determine their size.
To identify that your network's problem is related to packet loss, verify that frames are being dropped on your network by examining this packet loss data:
The process of identifying the problem is discussed in "Searching for Packet Loss".
If you notice that packet loss data is consistently high, then your network is too congested. In this case, segment your network with the appropriate network device (such as a switch or router). If Collision data shows increases but your network's utilization is the same, then your network may have a physical problem, such as cabling that is too long. Other problems that packet loss data can indicate include:
Possible solutions to these problems are explained in the procedures in "Searching for Packet Loss".
When you look for packet loss, use the following applications:
Status Watch monitors:
Follow these steps:
1 . Determine if the thresholds for the Alignment Errors tool and FCS Errors tool are being exceeded.
2 . Determine if the Excessive Collisions tool threshold is being exceeded.
3 . Determine if the Receive Discards and Transmit Discards tools thresholds are being exceeded.
Use the LANsentry Manager Network Statistics graph to view data for:
Follow these steps:
1 . Display a Network Statistics graph for the local Ethernet segment on which users have reported poor performance.
2 . Analyze Utilization and Collision rates to determine whether collisions are caused by an overloaded segment or a faulty component.
3 . Examine the CRC Errors and Late Collisions, which often indicate cabling or component problems.
4 . Trace Too Short Errors and Too Long Errors to the sender.
Device View allows you to display a variety of port and device-level statistics relevant to Ethernet packet loss. Table 20 describes these statistics and their use in troubleshooting.
To display Activity and Errors statistics for a device or port, follow these steps:
1 . Select the required port or device.
2 . From the shortcut menu, select Activity or Errors.
You may not be able to access these statistics on some devices using Device View. See the Device View documentation for additional information.
This section explains terms that are relevant to Ethernet packet loss and provides additional conceptual and problem analysis detail.
An Alignment Error indicates a received frame in which both are true:
Alignment Errors often result from MAC layer packet formation problems, cabling problems that cause corrupted or lost data, and packets that pass through more than two cascaded multiport transceivers. See "FCS Errors" for more information about interpreting Alignment Errors.
Collisions indicate that two devices detect that the network is idle and try to send packets at exactly the same time (within one round-trip delay). Because only one device can transmit at a time, both devices must stop sending and attempt to retransmit. Collisions are detected by the transmitting stations.
The retransmission algorithm helps to ensure that the packets do not retransmit at the same time. However, if the two devices retry at nearly the same time, packets can collide again; the process repeats until either the packets finally pass onto the network without collisions, or 16 consecutive collisions occur and the packets are discarded.
A Cyclic Redundancy Check (CRC) Error is an RMON statistic that combines "FCS Errors" and "Alignment Errors". These errors indicate that packets were received with:
CRC Errors can cause an end station to freeze. If a large number of CRC Errors are attributed to a single station on the network, replace the station's network interface board. Typically, a CRC Error rate of more than 1 percent of network traffic is considered excessive.
Excessive Collisions indicate that 16 consecutive collisions have occurred, usually a sign that the network is becoming congested. For each excessive collision count (or after 16 consecutive collisions), a packet is dropped. If you know the normal rate of excessive collisions, then you can determine when the rate of packet loss is affecting your network's performance. See "Knowing Your Network's Configuration" for more information.
Frame Check Sequence (FCS) Errors, a type of CRC, indicate that frames received by an interface are an integral number of octets long but do not pass the FCS check. The FCS is a mathematical way to ensure that all the frame's bits are correct without having the system examine each bit and compare it to the original. Packets with Alignment Errors also generate FCS Errors.
Both Alignment Errors and FCS Errors can be caused by equipment powering up or down or by interference (noise) on unshielded twisted-pair (10BASE-T) segments. In a network that complies with the Ethernet standard, FCS or Alignment Errors indicate bit errors during a transmission or reception. A very low rate is acceptable. Although Ethernet allows a 1 in 108 bit error rate, typical Ethernet performance is 1 in 1012 or better.
Late Collisions indicate that two devices have transmitted at the same time, but cabling errors (most commonly, excessive network segment length or repeaters between devices) prevent either transmitting device from detecting a collision. Neither device detects a collision because the time to propagate the signal from one end of the network to the other is longer than the time to put the entire packet on the network. As a result, neither of the devices that cause the late collision senses the other's transmission until the entire packet is on the network.
Although late collisions occur for small packets, the transmitter cannot detect them. As a result, a network suffering measurable Late Collisions for large packets is losing small packets as well.
Table 21 lists the symptoms that typically occur if a system violates the Ethernet standard.
Receive Discards indicate that received packets could not be delivered to a high-layer protocol because of congestion or packet errors.
A Too Long Error indicates that a packet is longer than 1518 octets (including FCS octets) but otherwise well formed. Too Long Errors are often caused by a bad transceiver, a malfunction of the jabber protection mechanism on a transceiver, or excessive noise on the cable.
A Too Short Error, also called a runt, indicates that a packet is fewer than 64 octets long (including FCS octets) but otherwise well formed.
Transmit Discards indicate that packets were not transmitted because of network congestion.