Hi,
I’m having serious issues with 3 Samsung 980 PRO M.2 NVMe SSDs on a Tyan S8030 with an AMD EPYC 7443P. I moved the SSD to a laptop and I’m seeing some of the same messages there. I’m a bit surprised it happens with all 3 of them:
[ 4392.683810] nvme 0000:03:00.0: AER: aer_status: 0x000000c1, aer_mask: 0x00000000
[ 4392.683820] nvme 0000:03:00.0: AER: [ 0] RxErr (First)
[ 4392.683826] nvme 0000:03:00.0: AER: [ 6] BadTLP
[ 4392.683831] nvme 0000:03:00.0: AER: [ 7] BadDLLP
[ 4392.683835] nvme 0000:03:00.0: AER: aer_layer=Physical Layer, aer_agent=Receiver ID
[ 4392.683860] nvme 0000:03:00.0: AER: aer_status: 0x00000001, aer_mask: 0x00000000
[ 4392.683861] nvme 0000:03:00.0: AER: [ 0] RxErr (First)
[ 4392.683863] nvme 0000:03:00.0: AER: aer_layer=Physical Layer, aer_agent=Receiver ID
[ 4392.683878] nvme 0000:03:00.0: AER: aer_status: 0x00000081, aer_mask: 0x00000000
[ 4392.683879] nvme 0000:03:00.0: AER: [ 0] RxErr (First)
[ 4392.683881] nvme 0000:03:00.0: AER: [ 7] BadDLLP
[ 4392.683882] nvme 0000:03:00.0: AER: aer_layer=Physical Layer, aer_agent=Receiver ID
[ 4392.683897] nvme 0000:03:00.0: AER: aer_status: 0x00000001, aer_mask: 0x00000000
[ 4392.683898] nvme 0000:03:00.0: AER: [ 0] RxErr (First)
[ 4392.683900] nvme 0000:03:00.0: AER: aer_layer=Physical Layer, aer_agent=Receiver ID
[ 4392.683914] nvme 0000:03:00.0: AER: aer_status: 0x00000081, aer_mask: 0x00000000
[ 4392.683916] nvme 0000:03:00.0: AER: [ 0] RxErr (First)
[ 4392.683917] nvme 0000:03:00.0: AER: [ 7] BadDLLP
[ 4392.683918] nvme 0000:03:00.0: AER: aer_layer=Physical Layer, aer_agent=Receiver ID
[ 4392.683933] nvme 0000:03:00.0: AER: aer_status: 0x000000c1, aer_mask: 0x00000000
[ 4392.683934] nvme 0000:03:00.0: AER: [ 0] RxErr (First)
[ 4392.683936] nvme 0000:03:00.0: AER: [ 6] BadTLP
[ 4392.683937] nvme 0000:03:00.0: AER: [ 7] BadDLLP
[ 4392.683938] nvme 0000:03:00.0: AER: aer_layer=Physical Layer, aer_agent=Receiver ID
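For anyone reading the log: the aer_status values are bitmasks, and the numbered lines under each one are just the set bits spelled out. Here is a minimal Python sketch of that decoding, assuming the standard PCIe AER Correctable Error Status bit layout and the short names the Linux kernel prints. The function name decode_aer_status and the table name are only illustrative, not part of any tool.

```python
# Bit positions of the PCIe AER Correctable Error Status register,
# labelled with the short names seen in Linux kernel AER messages.
# (Assumed layout; only bits 0, 6 and 7 actually appear in the log above.)
CORRECTABLE_BITS = {
    0: "RxErr",         # Receiver Error
    6: "BadTLP",        # Bad TLP
    7: "BadDLLP",       # Bad DLLP
    8: "Rollover",      # REPLAY_NUM Rollover
    12: "Timeout",      # Replay Timer Timeout
    13: "NonFatalErr",  # Advisory Non-Fatal Error
    14: "CorrIntErr",   # Corrected Internal Error
    15: "HeaderOF",     # Header Log Overflow
}


def decode_aer_status(status: int) -> list[str]:
    """Return the names of the correctable-error bits set in `status`."""
    return [
        name
        for bit, name in sorted(CORRECTABLE_BITS.items())
        if (status >> bit) & 1
    ]


if __name__ == "__main__":
    # The three distinct aer_status values from the log above.
    for value in (0x000000C1, 0x00000081, 0x00000001):
        print(f"{value:#010x}: {', '.join(decode_aer_status(value))}")
```

With the values from the log, 0x000000c1 decodes to RxErr, BadTLP and BadDLLP, 0x00000081 to RxErr and BadDLLP, and 0x00000001 to RxErr alone, which matches the lines the kernel printed.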
They constantly throw errors in the IPMI interface and the system is unstable.
error.txt (6.4 KB)