Lines Matching full:errors
40 hardware errors, and, when possible, correcting them at runtime. It should
47 * CPU – detect errors at instruction execution and at L1/L2/L3 caches;
48 * Memory – add error correction logic (ECC) to detect and correct errors;
54 to identify if the probability of hardware errors is increasing, and, on such
56 those errors are correctable.
58 Types of errors
62 Codes that allow error correction when the number of errors on a bit packet
63 is below a threshold. If the number of errors is above, those mechanisms
70 That defines some categories of errors:
73 corrected the error. Such errors are usually not fatal, although some
76 * **Uncorrected Error (UE)** - the number of errors exceeded the error
91 The mechanism for handling non-fatal errors is usually complex and may
177 The word with *data width* is sent to the CPU, even when errors happen.
184 The information about the CE/UE errors is stored on some special registers
187 bit CPUs, such errors can also be retrieved via the Machine Check
224 The ``edac`` kernel module's goal is to detect and report hardware errors
230 Memory Correctable Errors (CE) and Uncorrectable Errors (UE) are the
231 primary errors being harvested. These types of errors are harvested by
263 In addition, PCI devices are scanned for PCI Bus Parity and SERR Errors
264 in order to determine if errors are occurring during data transfers.
266 The presence of PCI Parity errors must be examined with a grain of salt.
503 - ``dimm_ue_count`` - Uncorrectable Errors count attribute file
506 errors that have occurred on this DIMM. If panic_on_ue is set
510 - ``dimm_ce_count`` - Correctable Errors count attribute file
513 errors that have occurred on this DIMM. This count is very
538 to it. With this label in the module, when errors occur
592 - ``ue_count`` - Total Uncorrectable Errors count attribute file
595 errors that have occurred on this csrow. If panic_on_ue is set
600 - ``ce_count`` - Total Correctable Errors count attribute file
603 errors that have occurred on this csrow. This count is very
660 to it. With this label in the module, when errors occur
689 to it. With this label in the module, when errors occur
705 information indicating that errors have been detected::
779 This attribute file will display the number of parity errors that
807 Generate kernel messages describing uncorrectable errors. These errors
823 Generate kernel messages describing correctable errors. These
824 errors are reported through the system message log system.
841 necessary handling of errors and might lose valuable information for
1037 2) The MC has the ability to inject errors to test drivers. The drivers
1114 Those newer MCs have some registers to count memory errors. The driver
1115 uses those registers to report Corrected Errors on devices with Registered
1133 What happens here is that errors on different csrows, but at the same
1155 possible that some errors could be lost. With RDIMMs, they display the