I've seen some worrying messages in dmesg
lately.
Specifically bunch of:
[ 19.367114] pcieport 0000:00:1c.5: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
[ 19.367148] pcieport 0000:00:1c.5: device [8086:9d15] error status/mask=00000081/00002000
[ 19.367172] pcieport 0000:00:1c.5: [ 0] Receiver Error (First)
[ 19.367192] pcieport 0000:00:1c.5: [ 7] Bad DLLP
And:
[ 20.121489] ath10k_pci 0000:03:00.0: Unknown eventid: 118809
[ 20.124485] ath10k_pci 0000:03:00.0: Unknown eventid: 90118
Or:
[ 19.367213] pcieport 0000:00:1c.5: AER: Multiple Corrected error received: 0000:00:1c.5
[ 19.367218] pcieport 0000:00:1c.5: can't find device of ID00e5
And most worryingly:
Nov 06 19:03:16 3c86-notebook kernel: ath10k_pci 0000:03:00.0: firmware crashed! (guid a62c787e-4709-4d94-a1a7-4e9357c2555a)
Nov 06 19:03:16 3c86-notebook kernel: ath10k_pci 0000:03:00.0: failed to get memcpy hi address for firmware address 4: -16
Nov 06 19:03:16 3c86-notebook kernel: ath10k_pci 0000:03:00.0: failed to read firmware dump area: -16
(This one happens roughly 50% of the time on boot)
All of these started appearing in a fairly short time (~2 weeks) And since all of them could be caused by a hardware failure, I am most worried, is there a software way to test all or most of the hardware?
(Apart from the firmware crash, which causes the wifi to stop working, I did not see any impact of the previous errors)