I was wondering if there is any fix for this issue. It looks like it’s a problem with intel 10g SFP+. I have found a previous post and have used the ethtool work around because like original poster my logs were filling up and bringing down the server. For reference the machine and network adapter I have is a HPE DL385 gen10 with a HP Ethernet 10Gb 2-port 560FLR-SFP+ Adapter, running Proxmox VE 7.1-7. I have also posted this on the promox forums.
Now this is a bit of stretch, I could be wildly misinterpreting the documentation, but when describing IRQs, the following page mentions,
When an interrupt goes unhandled over time, they are tracked by the Linux kernel as Spurious Interrupts. The IRQ will be disabled by the Linux kernel after it reaches a specific count with the error “nobody cared”
and in reference to PCIe,
On PCI Express, interrupts are represented with either MSI or inbound interrupt messages (Assert_INTx/Deassert_INTx)
and
Starting with ICX there are no longer any IO-APICs in the Core IO’s devices. IO-APIC is only in the PCH. Devices connected to the Core IO’s PCIe Root Ports will use native MSI/MSI-X mechanisms.
So maybe this is an unhandled MSI-type interrupt? I do not really know what that means, but hopefully it may be useful to you; or I could be sending you down the wrong rabbit hole, in which case, my apologies.
I appreciate the response. To be honest this will take some time for me to digest, I’m still a novice at best lol! I’ve been in contact with HPE support to get the latest ISO for a full firmware update on everything. Since I’m running Proxmox they make it difficult to upgrade individual firmware. Also their website wouldn’t let me download either so they sent me info to a SFTP server which I thought was pretty neato. Anyway after I get everything updated I will report back.