ASRock B650D4U code 00 server motherboard failure

Hello
Same scenario here: we have ~20 boards in production, and 5 of them were rebooting randomly when used in a Proxmox cluster.
Of these 5 boards, 2 ended up showing a 00 boot code at some point.
We contacted ASRock Rack support. They exchanged the 5 boards without any problem, but they did not tell us what was wrong…

So we are worried about putting these new boards into production: they are “revision 4.01”, the same as the faulty ones.
We had also narrowed the issue down to certain serial numbers, but today another board failed, and its serial number is very different from those of the 5 previous boards.
This board has 281 power-on days.
We have some other B650D4U boards that seem to be immune to the problem, with 613 power-on days without issue, so there was certainly a hardware fault introduced at some point.

We replaced them with Supermicro H13SAE boards and can confirm that this fixes all our issues with all other hardware kept the same, so the problem is definitely linked to the B650D4U.

Now that ASRock Rack is sending us boards in exchange, we wonder what to do:
- Do we try these and wait 3 to 6 months to see if they fail?
- Do we sell them at a loss and replace them with Supermicro ones?

This has definitely caused us to lose trust in ASRock Rack. We have had more than 70 X470D4U / X570D4U boards in production for years without issue, so this is a shame for them.

Update: I found a Discord thread (Asrock rack B650D4U died again) where someone mentions that boards with serial numbers beginning with H5-xxxxxxx or H6-xxxxxxx should be fixed.

quote : "Boards with SN starting with H5 should be OK. I got this answer from ASRockRack EU support. 3 H1s have failed in my case, warranty replacement is H6. So I asked them because I didn’t want to put soon-to-be-faulty-again server back into production. I guess H6 is newer than H5, I hope they didn’t introduce new flaws. My trust is weak so I am burning-in the H6 in testing environment. "
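
For anyone with a larger fleet, here is a rough sketch of how the serial prefixes could be checked in bulk. It is only based on what has been reported in this thread (H1 and H4 boards failing, H5/H6 said by support to be OK); it is not an official ASRock Rack list, and the function names and labels are made up for illustration.

```python
import re

# Assumption: prefix status comes only from user reports in this thread,
# not from any official ASRock Rack documentation.
REPORTED_FAILING = {"H1", "H4"}
REPORTED_FIXED = {"H5", "H6"}


def serial_prefix(serial):
    """Return the leading letter+digit of a serial (e.g. 'H4'), or None."""
    match = re.match(r"\s*([A-Za-z]\d)", serial)
    return match.group(1).upper() if match else None


def classify(serial):
    """Map a board serial to a status label (hypothetical labels)."""
    prefix = serial_prefix(serial)
    if prefix in REPORTED_FAILING:
        return "reported failing"
    if prefix in REPORTED_FIXED:
        return "reported fixed (unconfirmed)"
    return "unknown"


if __name__ == "__main__":
    # Serial formats as they appear in this thread (digits partly masked).
    for sn in ["H4-S0R60000xx", "H1S0XE0016xx", "H6-xxxxxxx"]:
        print(sn, "->", classify(sn))
```

On Linux the board serial can usually be read with `dmidecode -s baseboard-serial-number` (run as root), assuming the firmware populates that field. Anything the script labels "unknown" or "reported fixed" still deserves a burn-in before going back into production.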

Thank you @netswitch for your helpful post! Glad you posted on the proxmox forum.

Exactly my problem too: my B650D4U (serial number H4-S0R60000xx, BIOS version 20.05, BMC version 5.03.00) is stuck on Dr. Debug code 00 after 6 months of random reboots in Proxmox.

I got this board as a replacement for another failed board with serial number H1S0XE0016xx. I am working with ASRock to replace this board.

I would like to avoid ASRock Rack boards completely. I am looking into the Supermicro H13SAE and at consumer boards with iKVM.

I am wondering what consumer board options are out there that would let me keep the other hardware the same. Does anyone know?

Hi. Has anyone tested boards with serial numbers beginning with H5-xxxxxxx or H6-xxxxxxx to confirm that they are fixed?

How do I get access to that specific Discord thread (Asrock rack B650D4U died again)? After logging in to the level1techs Discord server, I only see announcements, welcome and rules.