AMD Instinct MI50 beeping constantly

I have a server with a pair of AMD Instinct MI50s that I use for local LLMs. Recently, however, one of the GPUs started beeping, and I can find absolutely no reference to this behaviour online (only gaming cards that do it via software).
As soon as the server POSTs, one of the GPUs beeps endlessly. It keeps beeping while the OS loads and during use. I can still see the GPU in rocm-smi and it seems to run models just fine, but the beeping is very annoying.
It happens as soon as I plug it in, no matter the PCIe slot. I’ve reseated it, swapped slots, and swapped power cables.

As far as I can tell, there is no error in either dmesg or the BIOS/IPMI.

Both cards run the exact same vbios, 113-D1631400-X11, and thermally they are fine as well: well below 40 °C when idle.
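For anyone who wants to compare the beeping card against the quiet one side by side, here is a minimal Python sketch that reads temperature and power straight from the amdgpu hwmon files in sysfs. It assumes a Linux host with the amdgpu driver loaded, and that the card exposes `temp1_input` (millidegrees Celsius) and `power1_average` (microwatts); the function name `read_amdgpu_sensors` is made up for illustration, and some cards or driver versions may not expose every file.

```python
from pathlib import Path


def _read_scaled(path, divisor):
    """Read an integer sysfs value and scale it; return None if unavailable."""
    try:
        return int(path.read_text().strip()) / divisor
    except (OSError, ValueError):
        return None


def read_amdgpu_sensors(drm_root="/sys/class/drm"):
    """Return {card_name: {"temp_c": ..., "power_w": ...}} for cards with hwmon files.

    Assumption: the amdgpu driver exposes hwmon under card*/device/hwmon/hwmon*.
    """
    readings = {}
    for hwmon in sorted(Path(drm_root).glob("card*/device/hwmon/hwmon*")):
        card = hwmon.parts[-4]  # .../cardN/device/hwmon/hwmonM -> "cardN"
        readings[card] = {
            # temp1_input is reported in millidegrees Celsius
            "temp_c": _read_scaled(hwmon / "temp1_input", 1000),
            # power1_average is reported in microwatts
            "power_w": _read_scaled(hwmon / "power1_average", 1_000_000),
        }
    return readings


if __name__ == "__main__":
    for card, values in read_amdgpu_sensors().items():
        print(card, values)
```

If the two cards report similar idle temperatures and power draw, that would at least suggest the beeping is not driven by anything the driver can see.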

I’m contemplating just desoldering the beeper at this point.

Additionally, the card has a GPIO injection switch, a little hardware switch on the side, for which I can’t find much information either.

Hi, @Sidiox!
I also have 3 of those GPUs running in my server and, on the first installation, they also beeped like hell.
It’s related to insufficient power being supplied to the card.
Check your PSU with a multimeter.
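As a rough guide for interpreting the multimeter reading, here is a small Python sketch that checks a measured voltage against a tolerance band. It assumes the commonly cited ±5% tolerance for a 12 V rail on ATX-style supplies (11.4 V to 12.6 V); server PSUs may specify something different, so treat the default as an assumption rather than a spec for your hardware. The function name `rail_within_tolerance` is hypothetical.

```python
def rail_within_tolerance(measured_v, nominal_v=12.0, tolerance=0.05):
    """Return True if measured_v lies within nominal_v +/- tolerance (as a fraction).

    Assumption: 5% is the tolerance for the 12 V rail; check your PSU's datasheet.
    """
    low = nominal_v * (1 - tolerance)
    high = nominal_v * (1 + tolerance)
    return low <= measured_v <= high


if __name__ == "__main__":
    # Example readings taken at the GPU power connector, ideally under load
    for reading in (12.1, 11.2):
        status = "OK" if rail_within_tolerance(reading) else "out of range"
        print(f"{reading:.2f} V: {status}")
```

A reading taken at idle can look fine while the rail sags under load, so it is worth measuring while the GPU is busy.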

I don’t think it is the PSUs. I’ve been running the two GPUs in the server for a few months and it never beeped. Additionally, only one of the two GPUs beeps, even if it’s the only one in the system.
Since it works fine temperature-wise, I’ve opted to just remove the beeper from the GPU. The cards are second-hand already, so I think it’s fine. The beeper fell apart as soon as I heated its pads a bit, so maybe it was already bad itself? Unsure.

[Image: AMD Instinct MI50 with the beeper removed]

Well, there’s no putting it back on, now :grimacing: :grin:

Hi,

I have a similar issue: I am trying to get a Radeon Pro V340L to work, but it also beeps constantly, as soon as I turn on my server (the PCI bus is not even initialized by then).
I even purchased an x1-to-x16 GPU adapter (the kind typically used in mining) with external power, to see if it was related to the PCI slot power.

There is a single configuration where it does not beep:
If I plug the x1 adapter (with external power) into an x1 slot in my server, the card gets warm and does not beep, but it is not detected by the OS. In every other slot (e.g. the x1 adapter in the x16 slot) it instantly beeps. So I am not sure if this is indeed related to power.

Does anyone else have the same problem and know how to solve it?
Thanks!

I’ve run into the constant beeping issue only when I had forgotten to plug the card into power or when it overheated from a fan failure. From your description, it sounds like your card was unhappy until you changed the power source, so I would make sure your connection to the power supply is 100% validated.

Thanks for the hint.

Actually, the card is still unhappy even in the external riser. E.g. the card beeps if it is plugged into the external riser and the riser is connected to the x16 slot on the motherboard.

On my server (Dell R730) there is a small x1 slot for a dual SD card module. If I plug the external riser into this slot, the card does not beep. It even gets warm, to the point where it gets really hot if not appropriately cooled, which makes no sense at all to me. However, in that slot the server does not seem to recognize the card (maybe a Dell restriction for that port).

Did a bit more testing and found some interesting stuff:

Looks like the X1 slot isn’t really usable for PCI cards (Reddit: “Can the Dell IDSDM PCIe slot be use with a PCIe x1 to M.2 SSD instead?”, I cannot post links unfortunately).

What’s odd is that the card still heats up when sitting in the external riser, even if it’s not plugged into any PCI slot. That probably explains why it seemed to “work” in the X1 slot.

I also noticed that it starts beeping after a while if it’s not cooled properly, which lines up with the explanation of overheating.

But as soon as I connect it to a PCI slot, the behavior is the same. For example:

  • If I power on the system with the card in the riser but not connected to the motherboard, no beeps, card gets warm.
  • The moment I plug it in afterwards (while the card is already powered on and quiet, i.e. hot-swap), it immediately starts beeping.

Could it be that the card is “seeing” some odd voltage on the PCI bus and that’s what makes it complain?

Hey, can you help me, friend? I have a 16 GB MI50 that I bought from AliExpress that is behaving similarly to yours; in my case I only have one card. Initially I thought the beeping could be related to overheating, so I got myself a custom cooling solution, also from AliExpress. Since the BIOS that came with the card did not show the temperatures, I had to flash a custom one. Everything went well, and after flashing I could see that the temperatures were just fine while idling, but after a while it would still start to beep the same way. While gaming, the maximum hotspot temperature was around 85 degrees, so I crossed out the possibility of overheating. After that I thought the power supply might not be handling the demand from the GPU, so I got myself an 850 W Gold-rated power supply from XPG, and the card still does the same thing. I am close to doing the same thing you did with the beeper, but I am afraid of running the card in this state in my system and damaging another component. How was your build after you did this? Did it all survive?

Yeah, everything was absolutely fine after removing the beeper. It’s fairly easy to get to on the board, and the card behaves just fine with it removed.