I hate synthetic benchmarks so I started inventing my own ridiculous and useful benchmarks.
So in the past I have posted about hardware that I have gotten. This is mostly just to show what projects I might do or to pose questions on the thread later on if I am really not getting something. Some of those threads have very weird benchmarks posted with them. These all to test different aspects of what the machine is supposed to do and are rated to different machines. Hi-Calibur machines get one kind, low-calibur another, and mid-range kind of get both but also have their own. At that some are designed to show what I will be doing with the machine and how fast I can expect to be able to do that from boot, closing an app, opening another app, or after setting idle for a minimum of 30 minutes (while the machine gets hot). We’ll start from low to high and go through what they do.
The AAAAAAAAAAAA Test:
The A test is applicable to any OS with an office suite. This is generally set with linux as I use that the most, but after startup and basic system checks, I generally open an office suite (or install libreoffice) and just hold down A. Could be any key, but A is just there. I keep a stop watch open on my phone and the CPU graph in the task manager. If the machine spikes to 100 I count the seconds between the spikes and count how many times it lags the entrance of text to the document. If it is a high lag time and the CPU graph constantly spikes, I generally go back to an older OS or trime the DE down to the basic essentials. In some cases trimming helps, like a lot.
Example: IBM 600X with a Pentium 3 MMX @ 650 MHZ, 386 MB ram, 1997 IDE Bus. The average time for startup on a 7200 RPM drive was around 56 seconds (Xubuntu). Opening the Office Suite (libre office) took 25-27 seconds and while it loaded I held the A key. As soon as the AAAAA started I got 4 in, CPU spiked, then 17, spike, 15, spike, and after another 3 spikes (around 35-45 seconds) it smoothed out and I could type as fast as I wanted.
This test is based on the CPU, the FSB, RAM timings, and over all system performance. A pentium M laptop (like my HP NW8000 for example) can be as fast as possible bit doesn’t like office suites. It likes rendering graphics. So we do…..
The Youtube Test:
Yes I use youtube as a benchmark. I have a CPU graph open while I load the YT web page. If it slams the ceiling then I know web browsing at all will be a problem. When a video loads I watch how long it takes for the entire video to buffer and test the different resolutions. I find that 360p is probably the best to do on any laptop up to 2008, then 720p is more possible and 1080p is the gold spot, obviously. Then after seeing how long it takes to load, I flip between the resolutions as fast as possible to see if the system lags (most of the time it does), and in the event that it does how long it takes to recover. If its a newer system (Core 2 Dua P8*** and on) this test can be avoided as it doesn’t do anything.
The Load Test:
This one is simple: bog the system down with as many apps as possible. If you have very little ram, under 512, this test helps show if you need to manage what apps you have open and what you do with the system. If you have a system with 1-2 GB ram then it will be more a test to see what all happens when you lag the system out. Some systems do different things if you believe me or not. Past 2 GB of ram this test is kind of pointless.
Example: IBM 600X, same specs as above. Boot, start fucking with the DE, just as much as possible. Hit every single button you can, jump around in work spaces, start opening terminals, then web browser windows, then office suites. This system specifically lags out with web browsers and more than 3 libreoffice windows. Terminals don’t matter. Opening games up on top of all this (such as mine sweeper) will totally crash the machine and force a reboot. Fixing this just takes a bigger swap file.
HP NW8000, 1.67 GHZ Pentium M, 2 GB ram. Again, boot, hit everything then open stuff. Can handle 5 firefox windows and 5-7 libreoffice windows. Lags more from heat generation than anything else, but still becomes slow after not opening that much.
Another good test on top of this is….
The Pony Test:
This sounds stupid, it really does, but the linux package qt-ponies and the windows app Desktop Ponies is an amazing CPU benchmark. Essentially each pony you spawn has its own log of what its done on your screen, a rather large list of what it can do, and a bit of AI behind it to interact with other ponies on your screen. This test mostly shows how intense your CPU can get in the number of database pulls it can perform.
Example: Lenovo Think Center TS140, 3.6 GHZ Xeon 1225v3 4c4t, 8 GB ram. Can only handle 45 ponies on the screen before the system slows down. Adding more will incremently kill the system till it crashes.
This test is good to show what your best hardware can do maxed out as well as show the limits of your lowest hardware.
What I like to call “The 3d Pile”.
Both CPU and GPU intensive , the 3D pile is mostly for linux. I don’t really use this anymore as it doesn’t apply to many, if any, of my machines. But the gist of it is to open a bunch of openGL games, GLTron in my case, and open as many of them as possible and run the demo in a loop. The last time I tested this was on a pentium 4 (Eve actually) and I got to 14 instances I think? I may have ran it on my Phenom 2 machine but it didn’t go much higher than that (9600PRO vs 250X).
Kdenlive Run:
Just like the benchmarks on any of the adobe suites, this is meant to show what the machine can do video editing wise. Generally I take a video with a camera as high def as possible, or a desktop recording actually, and roll that for 30 minutes. When I have that file, I’ll start adding as many effects as the system can handle and render the file out multiple times. The goal time for me is about 15 minutes at 45 filters and effects. For my hardware, this is a lot, especially at 1080p or 720p.
The VM Test:
I haven’t done this one because I can’t figure out GPU passthrough, but the test is the same as the tests above, just done through a passthrough VM of windows. This shows how well the system handles everything going on in the VM as well as how well the VM software runs. Again as I have not run this I cannot give an example, but imagine a can that you can add infinite preassure to inside of a bucket and if it touches the sides of the bucket then you really need to stop putting things in the can.
The VM Pile Test:
Basically VM-Ception, this test is to see how many VM’s you can run inside of each other. My record is 6 or 7 on my current Xeon machine. The goal is to just run as many VM’s as possible. The higher the better.
The Generic Tests:
Basically running the highest game possible on the system. Most of the time I use rocket league and CS go to see the FPS, but sometimes I don’t have that option (older systems) so I’ll run something like Wizard101 or second life where there is always shit everywhere and there could be 500 people on the screen.
I’ll list one more because I am getting bored.
The Minecraft Pile Test:
Download tekkit and start wiring computers up to each other. Then, start stuffing the giant system you made with commands. If java crashes, you know how many java requests can be made based on the amound of computers you built. I would recommend using IndustialCraft2 for this as Computercraft tends to be a bit difficult. With this, you can normally stack 8 computers on each other and it will register as one system. When you add more to each other it will just mirror your request. When you get to about 20 it starts to put some stress on the JVM. The more of those computers you can tack on, the more ridiculous stuff you can do with Java on your system.
I’ll have more later I’m just bored now so expect an edit.