120 lines
		
	
	
		
			4.4 KiB
		
	
	
	
		
			Plaintext
		
	
	
	
	
	
			
		
		
	
	
			120 lines
		
	
	
		
			4.4 KiB
		
	
	
	
		
			Plaintext
		
	
	
	
	
	
| PCIe Device AER statistics
 | |
| --------------------------
 | |
| 
 | |
| These attributes show up under all the devices that are AER capable. These
 | |
| statistical counters indicate the errors "as seen/reported by the device".
 | |
| Note that this may mean that if an endpoint is causing problems, the AER
 | |
| counters may increment at its link partner (e.g. root port) because the
 | |
| errors may be "seen" / reported by the link partner and not the
 | |
| problematic endpoint itself (which may report all counters as 0 as it never
 | |
| saw any problems).
 | |
| 
 | |
| What:		/sys/bus/pci/devices/<dev>/aer_dev_correctable
 | |
| Date:		July 2018
 | |
| KernelVersion:	4.19.0
 | |
| Contact:	linux-pci@vger.kernel.org, rajatja@google.com
 | |
| Description:	List of correctable errors seen and reported by this
 | |
| 		PCI device using ERR_COR. Note that since multiple errors may
 | |
| 		be reported using a single ERR_COR message, thus
 | |
| 		TOTAL_ERR_COR at the end of the file may not match the actual
 | |
| 		total of all the errors in the file. Sample output::
 | |
| 
 | |
| 		    localhost /sys/devices/pci0000:00/0000:00:1c.0 # cat aer_dev_correctable
 | |
| 		    Receiver Error 2
 | |
| 		    Bad TLP 0
 | |
| 		    Bad DLLP 0
 | |
| 		    RELAY_NUM Rollover 0
 | |
| 		    Replay Timer Timeout 0
 | |
| 		    Advisory Non-Fatal 0
 | |
| 		    Corrected Internal Error 0
 | |
| 		    Header Log Overflow 0
 | |
| 		    TOTAL_ERR_COR 2
 | |
| 
 | |
| What:		/sys/bus/pci/devices/<dev>/aer_dev_fatal
 | |
| Date:		July 2018
 | |
| KernelVersion:	4.19.0
 | |
| Contact:	linux-pci@vger.kernel.org, rajatja@google.com
 | |
| Description:	List of uncorrectable fatal errors seen and reported by this
 | |
| 		PCI device using ERR_FATAL. Note that since multiple errors may
 | |
| 		be reported using a single ERR_FATAL message, thus
 | |
| 		TOTAL_ERR_FATAL at the end of the file may not match the actual
 | |
| 		total of all the errors in the file. Sample output::
 | |
| 
 | |
| 		    localhost /sys/devices/pci0000:00/0000:00:1c.0 # cat aer_dev_fatal
 | |
| 		    Undefined 0
 | |
| 		    Data Link Protocol 0
 | |
| 		    Surprise Down Error 0
 | |
| 		    Poisoned TLP 0
 | |
| 		    Flow Control Protocol 0
 | |
| 		    Completion Timeout 0
 | |
| 		    Completer Abort 0
 | |
| 		    Unexpected Completion 0
 | |
| 		    Receiver Overflow 0
 | |
| 		    Malformed TLP 0
 | |
| 		    ECRC 0
 | |
| 		    Unsupported Request 0
 | |
| 		    ACS Violation 0
 | |
| 		    Uncorrectable Internal Error 0
 | |
| 		    MC Blocked TLP 0
 | |
| 		    AtomicOp Egress Blocked 0
 | |
| 		    TLP Prefix Blocked Error 0
 | |
| 		    TOTAL_ERR_FATAL 0
 | |
| 
 | |
| What:		/sys/bus/pci/devices/<dev>/aer_dev_nonfatal
 | |
| Date:		July 2018
 | |
| KernelVersion:	4.19.0
 | |
| Contact:	linux-pci@vger.kernel.org, rajatja@google.com
 | |
| Description:	List of uncorrectable nonfatal errors seen and reported by this
 | |
| 		PCI device using ERR_NONFATAL. Note that since multiple errors
 | |
| 		may be reported using a single ERR_FATAL message, thus
 | |
| 		TOTAL_ERR_NONFATAL at the end of the file may not match the
 | |
| 		actual total of all the errors in the file. Sample output::
 | |
| 
 | |
| 		    localhost /sys/devices/pci0000:00/0000:00:1c.0 # cat aer_dev_nonfatal
 | |
| 		    Undefined 0
 | |
| 		    Data Link Protocol 0
 | |
| 		    Surprise Down Error 0
 | |
| 		    Poisoned TLP 0
 | |
| 		    Flow Control Protocol 0
 | |
| 		    Completion Timeout 0
 | |
| 		    Completer Abort 0
 | |
| 		    Unexpected Completion 0
 | |
| 		    Receiver Overflow 0
 | |
| 		    Malformed TLP 0
 | |
| 		    ECRC 0
 | |
| 		    Unsupported Request 0
 | |
| 		    ACS Violation 0
 | |
| 		    Uncorrectable Internal Error 0
 | |
| 		    MC Blocked TLP 0
 | |
| 		    AtomicOp Egress Blocked 0
 | |
| 		    TLP Prefix Blocked Error 0
 | |
| 		    TOTAL_ERR_NONFATAL 0
 | |
| 
 | |
| PCIe Rootport AER statistics
 | |
| ----------------------------
 | |
| 
 | |
| These attributes show up under only the rootports (or root complex event
 | |
| collectors) that are AER capable. These indicate the number of error messages as
 | |
| "reported to" the rootport. Please note that the rootports also transmit
 | |
| (internally) the ERR_* messages for errors seen by the internal rootport PCI
 | |
| device, so these counters include them and are thus cumulative of all the error
 | |
| messages on the PCI hierarchy originating at that root port.
 | |
| 
 | |
| What:		/sys/bus/pci/devices/<dev>/aer_rootport_total_err_cor
 | |
| Date:		July 2018
 | |
| KernelVersion:	4.19.0
 | |
| Contact:	linux-pci@vger.kernel.org, rajatja@google.com
 | |
| Description:	Total number of ERR_COR messages reported to rootport.
 | |
| 
 | |
| What:		/sys/bus/pci/devices/<dev>/aer_rootport_total_err_fatal
 | |
| Date:		July 2018
 | |
| KernelVersion:	4.19.0
 | |
| Contact:	linux-pci@vger.kernel.org, rajatja@google.com
 | |
| Description:	Total number of ERR_FATAL messages reported to rootport.
 | |
| 
 | |
| What:		/sys/bus/pci/devices/<dev>/aer_rootport_total_err_nonfatal
 | |
| Date:		July 2018
 | |
| KernelVersion:	4.19.0
 | |
| Contact:	linux-pci@vger.kernel.org, rajatja@google.com
 | |
| Description:	Total number of ERR_NONFATAL messages reported to rootport.
 |