Hi all,
I build a PC similar to @wendell DevOPS Machine.
On a Gigabyte x570 Aorus Master i have a Gigabyte AORUS NVMe Gen4 SSD 1TB in first and second slot.
In the third slot a have an Intel Optane 16GB ssd.
I installed Fedora 31 on the system.
For doing so i created the File systems as follows:
Before starting the installation i created a BTRFS raid 1 with the two Gigabyte SSDs.
sudo wipefs -a /dev/nvme0n1
sudo wipefs -a /dev/nvme1n1
sudo wipefs -a /dev/nvme2n1
sudo mkfs.btrfs -d raid1 -m raid1 -L gigabyte-nvme-raid /dev/nvme0n1 /dev/nvme1n1
Also I formated the Intel SSD with the disk utility of Fedora’s live system with gpt,
this was needed as the installer complained when it tried to delete all partitions on that disk without gpt.
The mountpoints / and /home go as btrfs subvolumes to the btrfs raid1.
/boot, /boot/efi and swapt to the Intel SSD.
I updated the system. The kernel now is 5.4.17-200.fc31.x86_64.
Now comes the problem.
After the system is running for a while the system suddenly gets stuck for arround 10 seconds. Then it recovers.
When looking into dmesg i can see issues that it can not write to nvme0. Something about APST, and then a lot of btrfs checksum errors.
To test a bit i reinstalled having set nvme_core.default_ps_max_latency_us=55000.
I got this number from sudo nvme id-ctrl /dev/nvme0 and inspecting ps 4, by adding up enlat + exlat plus some time extra. Description from https://wiki.archlinux.org/index.php/Solid_state_drive/NVMe. Remark this is not the value 5500 as described in section Samsung drive error.
With this kernel command line set the Fedora the system freezes during installation in step post install something.
I reinstalled with nvme_core.default_ps_max_latency_us=0 which disables APST for all of the drives.
Currently the system is stable.
Does anyone observe the same issues with the drives and btrfs raid 1?
Does it harm the drives disabling APST and always have them running at full operational state?
Could it be a hardware issue instead?
Is there some geat fix comming in kernel 5.5?
Can firmware updates be installed without Windows 10, i tried to install Windows but it did not see any of the drives?
By the way the issue with fstrim -v /
as described in Devops Workstation: Fixing NVMe Trim on Linux
is not present.
Thanks a lot,
André