Some thoughts observations and questions concerning prime95

traceh38 · July 13, 2023, 1:25am

I am running a Ryzen 7900x with a custom loop which consists of a 120mm and a 240mm rad.
I have been experimenting with a manual overclock and testing with prime95.
thoughts

A lot of misinformation about prime95
not a lot of documentation on the behavior of CPU/ cache while running prime95.

Questions

So while running prime 95 small FFTs with FFT in place selected, I have noticed that once the FFT gets larger than 200k that my core utilization will drop to about 99.7 percent utilization on all threads. Is this because the FFT is too large to fit in the CPU cache and that it is leaking out into memory?

TObject · July 13, 2023, 4:11am

Really, you want to know where exactly the 0.3% went?

This level of curiosity is certainly admirable.

You can start by disabling hyperthreading; it does not do anything useful for LL tests.

Also, if you are running into thermal throttling anyway (I do on a 7950x all the time, with a 420 AIO), then anything else does not really matter, does it?

Third, the time per iteration should be a more usefull metric than the CPU utilization percentage; me thinks…

Good luck with your investigation. It surely is interesting.

traceh38 · July 13, 2023, 4:38am

Hello, thanks for the response, Yes a very curious individual indeed. I want to understand, I have to understand < Oo > But yes I have a hypotheses that FFT over the length of 200k are too large to fit in the cache of the processor including L1/L2/ and L3 therefore the processor utilization will drop slightly because the FFT will spill over to memory then the cpu first has to retrieve the data from memory then complete the calculation which will be slightly slower than storing the data in cache and completing the calculation. which would result in a core and thread percentage usage drop and slightly lower effective clocks.

vic · July 13, 2023, 4:44am

Depends on the versions of Prime95. The range of FFT considered as ‘small fft’ differs. Your claim (above 200k not fitting in ‘cache’) could be right.

traceh38 · July 13, 2023, 4:55am

Hi vic! The version I am using the small FFTS range between 146k and 545k.

vic · July 13, 2023, 4:59am

Currently I have two versions in my repository:

Mersenne Prime Test Program: Mac OS X 64-bit (AMD Ryzen),Untrusted Prime95,v30.5,build 2

small fft: 36k-227k

I patched this version myself (not related to fft length) to better support MacOS.

Mersenne Prime Test Program: Linux64,Prime95,v30.8,build 18

small fft: 73k-497k

Vanilla Linux verison

traceh38 · July 19, 2023, 9:40pm

@TObject and @vic so looks like I was a bit off on the size of the FFT looks any FFT over 240k and processor utilization drops off a fraction. I am making some inquires at the Mersenne forums to do some additional research I will get back with my findings.

vic · July 20, 2023, 3:53am

Prime95 forum is perhaps a much better place to seek feedback on P95. The author or current maintainer is often there last time I checked.

I didn’t pay much attention to CPU utilisation. I did see power consumption is lower in VM when running the same test. AMD Ryzen seems to have quite a few weird things that AMD perhaps will never tell the public. And I’m zen2. I believe later gens are likely improved. AMD internal validation should have started including Prime95, either since zen3 or if not surely zen4.

I’m not into prime numbers but Prime95 is a terrific tool for testing CPU & memory stability. Keep us posted.

traceh38 · July 20, 2023, 4:18am

I will.

twin_savage · July 20, 2023, 4:30am

Grab a copy of AMDuProf and let it analyze a run, it should be able to definitively answer your cache hypothesis.

system · April 18, 2024, 10:30pm

This topic was automatically closed 273 days after the last reply. New replies are no longer allowed.