I have an interesting Proxmox problem that I haven’t found a solution for. I have 4 home built servers in my home lab, 2 of which are very much based on desktop hardware. One is an X370 motherboard with a Ryzen 1700X processor and the other is a Ryzen 1600 running on a B450 motherboard. The four servers were running VMware until I upgraded to ESXi 7 and ran into hw compatibility issues. When running ESXi 6.7 these “servers” were rock solid and never gave me trouble.
In any case, figured it would be a great time to test out and learn more about Proxmox so I downloaded latest (7.1), did my reading and had fun setting them up on those two. Except I started having severe issues on both servers in that they would simply hang once to twice a day and need a reboot to get back online. The error I would get on them is:
watchdog: BUG: soft lockup - CPU#XX stuck for XXXs!
I have tried finding something on this and I have done a lot of things, BIOS resets/configs etc but nothing works. I can even find very old posts with this error for Proxmox so not even sure that this is tied to Ryzen specifically. To note, the 1700x had one VM running on it which is pretty much idling and the 1600 one has no workload.
I finally decided to take the 1600 instance as this was the one that was giving me more problems, and I decided to try out XCP-ng and try that out as well (again - to learn and get a feel for it). That was 8 days ago and the machine has been rock solid under XCP-ng.
Not sure what I am missing - anybody out there seen this type of issue with Proxmox before? I am just a step away from converting the 1700X over to XCP-ng as well but I was having fun with Proxmox and would like to mess around with it some more, but only if I can get it stable.