xref: /linux/Documentation/ABI/testing/sysfs-bus-pci-devices-aer_stats (revision 95298d63c67673c654c08952672d016212b26054)
1==========================
2PCIe Device AER statistics
3==========================
4These attributes show up under all the devices that are AER capable. These
5statistical counters indicate the errors "as seen/reported by the device".
6Note that this may mean that if an endpoint is causing problems, the AER
7counters may increment at its link partner (e.g. root port) because the
8errors may be "seen" / reported by the link partner and not the
9problematic endpoint itself (which may report all counters as 0 as it never
10saw any problems).
11
12What:		/sys/bus/pci/devices/<dev>/aer_dev_correctable
13Date:		July 2018
14KernelVersion: 4.19.0
15Contact:	linux-pci@vger.kernel.org, rajatja@google.com
16Description:	List of correctable errors seen and reported by this
17		PCI device using ERR_COR. Note that since multiple errors may
18		be reported using a single ERR_COR message, thus
19		TOTAL_ERR_COR at the end of the file may not match the actual
20		total of all the errors in the file. Sample output:
21-------------------------------------------------------------------------
22localhost /sys/devices/pci0000:00/0000:00:1c.0 # cat aer_dev_correctable
23Receiver Error 2
24Bad TLP 0
25Bad DLLP 0
26RELAY_NUM Rollover 0
27Replay Timer Timeout 0
28Advisory Non-Fatal 0
29Corrected Internal Error 0
30Header Log Overflow 0
31TOTAL_ERR_COR 2
32-------------------------------------------------------------------------
33
34What:		/sys/bus/pci/devices/<dev>/aer_dev_fatal
35Date:		July 2018
36KernelVersion: 4.19.0
37Contact:	linux-pci@vger.kernel.org, rajatja@google.com
38Description:	List of uncorrectable fatal errors seen and reported by this
39		PCI device using ERR_FATAL. Note that since multiple errors may
40		be reported using a single ERR_FATAL message, thus
41		TOTAL_ERR_FATAL at the end of the file may not match the actual
42		total of all the errors in the file. Sample output:
43-------------------------------------------------------------------------
44localhost /sys/devices/pci0000:00/0000:00:1c.0 # cat aer_dev_fatal
45Undefined 0
46Data Link Protocol 0
47Surprise Down Error 0
48Poisoned TLP 0
49Flow Control Protocol 0
50Completion Timeout 0
51Completer Abort 0
52Unexpected Completion 0
53Receiver Overflow 0
54Malformed TLP 0
55ECRC 0
56Unsupported Request 0
57ACS Violation 0
58Uncorrectable Internal Error 0
59MC Blocked TLP 0
60AtomicOp Egress Blocked 0
61TLP Prefix Blocked Error 0
62TOTAL_ERR_FATAL 0
63-------------------------------------------------------------------------
64
65What:		/sys/bus/pci/devices/<dev>/aer_dev_nonfatal
66Date:		July 2018
67KernelVersion: 4.19.0
68Contact:	linux-pci@vger.kernel.org, rajatja@google.com
69Description:	List of uncorrectable nonfatal errors seen and reported by this
70		PCI device using ERR_NONFATAL. Note that since multiple errors
71		may be reported using a single ERR_FATAL message, thus
72		TOTAL_ERR_NONFATAL at the end of the file may not match the
73		actual total of all the errors in the file. Sample output:
74-------------------------------------------------------------------------
75localhost /sys/devices/pci0000:00/0000:00:1c.0 # cat aer_dev_nonfatal
76Undefined 0
77Data Link Protocol 0
78Surprise Down Error 0
79Poisoned TLP 0
80Flow Control Protocol 0
81Completion Timeout 0
82Completer Abort 0
83Unexpected Completion 0
84Receiver Overflow 0
85Malformed TLP 0
86ECRC 0
87Unsupported Request 0
88ACS Violation 0
89Uncorrectable Internal Error 0
90MC Blocked TLP 0
91AtomicOp Egress Blocked 0
92TLP Prefix Blocked Error 0
93TOTAL_ERR_NONFATAL 0
94-------------------------------------------------------------------------
95
96============================
97PCIe Rootport AER statistics
98============================
99These attributes show up under only the rootports (or root complex event
100collectors) that are AER capable. These indicate the number of error messages as
101"reported to" the rootport. Please note that the rootports also transmit
102(internally) the ERR_* messages for errors seen by the internal rootport PCI
103device, so these counters include them and are thus cumulative of all the error
104messages on the PCI hierarchy originating at that root port.
105
106What:		/sys/bus/pci/devices/<dev>/aer_stats/aer_rootport_total_err_cor
107Date:		July 2018
108KernelVersion: 4.19.0
109Contact:	linux-pci@vger.kernel.org, rajatja@google.com
110Description:	Total number of ERR_COR messages reported to rootport.
111
112What:	    /sys/bus/pci/devices/<dev>/aer_stats/aer_rootport_total_err_fatal
113Date:		July 2018
114KernelVersion: 4.19.0
115Contact:	linux-pci@vger.kernel.org, rajatja@google.com
116Description:	Total number of ERR_FATAL messages reported to rootport.
117
118What:	    /sys/bus/pci/devices/<dev>/aer_stats/aer_rootport_total_err_nonfatal
119Date:		July 2018
120KernelVersion: 4.19.0
121Contact:	linux-pci@vger.kernel.org, rajatja@google.com
122Description:	Total number of ERR_NONFATAL messages reported to rootport.
123