100 Gbe & TrueNAS Scale 23.10 iSCSI - Performance Unleashed

Hi All,

I thought I would post a quick “how-to” for those interested in getting better performance out of TrueNAS Scale for iSCSI workloads.

TL;DR

I was able to get almost 4GB/s throughput to a single VMware VM using Mellanox ConnectX-5 cards and TrueNAS Scale 23.10. That's more than double what I was getting previously with 2x10Gbe connections. This is quick and dirty tuning on TrueNAS and VMware to get this performance. If there's interest, I can go into some of the deeper "how" of extracting more performance for those who are curious.

Requisite FIO information (12 threads): fio --bs=128k --direct=1 --directory=$(pwd) --gtod_reduce=1 --ioengine=posixaio --iodepth=32 --group_reporting --name=randrw --numjobs=12 --ramp_time=10 --runtime=60 --rw=randrw --size=256M --time_based

randrw: (g=0): rw=randrw, bs=(R) 128KiB-128KiB, (W) 128KiB-128KiB, (T) 128KiB-128KiB, ioengine=posixaio, iodepth=32

fio-3.33
Starting 12 processes
Jobs: 12 (f=12): [m(12)][100.0%][r=5929MiB/s,w=5978MiB/s][r=47.4k,w=47.8k IOPS][eta 00m:00s]
randrw: (groupid=0, jobs=12): err= 0: pid=384493: Tue Jan 30 16:20:53 2024
read: IOPS=49.1k, BW=6144MiB/s (6443MB/s)(360GiB/60010msec)
bw ( MiB/s): min= 3214, max= 9988, per=100.00%, avg=6155.88, stdev=106.94, samples=1416
iops : min=25714, max=79906, avg=49245.40, stdev=855.47, samples=1416
write: IOPS=49.2k, BW=6153MiB/s (6452MB/s)(361GiB/60010msec); 0 zone resets
bw ( MiB/s): min= 3277, max= 9977, per=100.00%, avg=6165.42, stdev=106.67, samples=1416
iops : min=26214, max=79816, avg=49320.25, stdev=853.40, samples=1416
cpu : usr=3.40%, sys=0.72%, ctx=1975083, majf=0, minf=439
IO depths : 1=0.1%, 2=0.3%, 4=0.8%, 8=15.3%, 16=61.1%, 32=22.3%, >=64=0.0%
submit : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0%
complete : 0=0.0%, 4=96.3%, 8=0.7%, 16=0.9%, 32=2.1%, 64=0.0%, >=64=0.0%
issued rwts: total=2949494,2953888,0,0 short=0,0,0,0 dropped=0,0,0,0
latency : target=0, window=0, percentile=100.00%, depth=32

Run status group 0 (all jobs):
READ: bw=6144MiB/s (6443MB/s), 6144MiB/s-6144MiB/s (6443MB/s-6443MB/s), io=360GiB (387GB), run=60010-60010msec
WRITE: bw=6153MiB/s (6452MB/s), 6153MiB/s-6153MiB/s (6452MB/s-6452MB/s), io=361GiB (387GB), run=60010-60010msec

System Specs

TrueNAS Host:

  • 128GB RAM
  • Ryzen Threadripper Pro 3955WX (12C/24T)
  • 2xASUS PCIe 4.0 Quad Hyper Cards
  • 8x WD850X 2TB (512b mode) in 2x RAIDZ1 VDEVs
  • Mellanox ConnectX-5 MCX515A-CCAT Single Port 100GBe
  • TrueNAS 23.10.0.1
  • Extra Kernel Params: nvme_core.default_ps_max_latency_us=0 pcie_aspm=off

VMWare Host:

  • 256GB RAM
  • Ryzen Threadripper Pro 3955WX (12C/24T)
  • Mellanox ConnectX-5 MCX515A-CCAT Single Port 100GBe
  • ESXi 8.0 U2

INTRODUCTION

I've been on this quest for "performance." Much like muscle cars and trying to get into the 9's at the track, I'm the type that just wants it to work "fast". Being a tinkerer, I've spent many hours (and too much budget!) in pursuit of performance perfection. Over the years I've tried Fibre Channel, many types of storage arrays, rolling my own kernels, and tuning code in SCST. All because I've wanted fast performance.

My latest endeavour has been building the fastest-possible, cheap-as-reasonably-possible lab setup. One that delivers performance, but also balances power draw and general affordability. I know fast and cheap typically equals poor quality; this has been the quest to avoid that.

HOW-TO

NETWORKING CARD SUPPORT

For the performance to work, you will need to be picky about which 100GBe (50, 40, 25 or even 10) cards you buy. In my case, I had loads of ConnectX-3 cards, but, VMware being VMware, ESXi 8 does not support them. ESXi 7 and below should work fine with ConnectX-3.

I ended up going with ConnectX-5 as I did not want to be saddled with another forced upgrade due to VMWare changes.

I know many in the community avoid combining ConnectX cards with TrueNAS. They typically work just fine, but you have to know what to do with them. I normally configure my ConnectX cards on a Windows host to apply the latest firmware and settings, then move the cards into the TrueNAS system, which generally preserves those settings. On TrueNAS Scale you will not have the ability to use mst or any of the other Mellanox tools to modify the card, so it's important to do any configuration of the cards, such as setting Ethernet mode, before installing them in a TrueNAS Scale system. You can avoid this entirely by looking for Ethernet-only cards.
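For reference, flipping a VPI card from InfiniBand to Ethernet mode with NVIDIA's MFT tools (mst/mlxconfig) looks roughly like the sketch below. Run it on the staging host, not on TrueNAS Scale, and note the device path is just an example and will differ on your system,

# Sketch only: run on the host used for card prep, with MFT installed
mst start
mst status                                                   # lists devices, e.g. /dev/mst/mt4119_pciconf0
mlxconfig -d /dev/mst/mt4119_pciconf0 query | grep -i link_type
mlxconfig -d /dev/mst/mt4119_pciconf0 set LINK_TYPE_P1=2     # 2 = Ethernet (dual-port cards also need LINK_TYPE_P2=2)
# Power-cycle the host for the new link type to take effect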

You can find out more about what card to order here, nVidia ConnectX-5 Specs.

If you want to use another 100GBe card from another manufacturer, go ahead, but the important part to understand is that the card MUST support RDMA, and preferably RoCE v2. Half of the sauce to getting this performance is making sure you have working RDMA support on both ends. In the case of the MCX515A-CCAT, it does. RDMA allows the systems to talk directly over the wire to memory on either side of the connection, avoiding protocol overhead and other traditional performance limitations. Your choice of OS must also support RDMA on both ends; VMware and TrueNAS Scale do. If you want to use Linux KVM, you need to enable RDMA initiator support on the host side, and RDMA target support on the other. That's outside the scope of this quick "how-to."
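As a quick sanity check that TrueNAS Scale actually sees an RDMA-capable device, something like this should work from a root shell (the rdma utility ships with iproute2; device names are examples),

rdma link show                  # should list an mlx5 link in ACTIVE state once the port is up
ls /sys/class/infiniband        # e.g. mlx5_0 for a ConnectX-5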

I also chose this card based on cost. It was $90ea for a single-port 100GBe card, whereas 50GBe cards were double that price. While I know I cannot fully stress 100GBe, it made no sense to pay more for 50GBe cards.

SYSTEM CONSIDERATION

You will need to ensure that the machines you use have plenty of PCIe lanes. The MCX515A-CCAT is a PCIe 3.0 x16 card, and it needs all of that to support a single 100GBe link without losses due to bandwidth restrictions. You also need to ensure that whatever storage backend you are using has enough bandwidth to support your activities. A basic LSI 9300 controller with 8 SSDs tops out around 3GB/s no matter what you attach to it, which will of course not max out 100GBe. The point here is that getting good 10GBe performance (aka 2GB/s with 2x10Gbe) needs to follow these principles as well.
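A quick way to confirm the card actually negotiated a full x16 link at PCIe 3.0 speed is to check the link status with lspci (the PCI address below is just an example; find yours with the first command),

lspci | grep -i mellanox                     # note the card's address, e.g. 21:00.0
lspci -s 21:00.0 -vv | grep -i lnksta        # expect "Speed 8GT/s, Width x16"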

For this, I chose the Lenovo P620. I found some very good deals on them and managed to pick up working machines for less than $800. Deals come up; you need to look for them. These are upgradable to the 5000WX series, which makes them a great investment. The vendor-locked CPU is the downside for resale, but Lenovo-locked Threadripper CPUs are fairly cheap.

It's also worth mentioning that the more memory you have, the better. 100GBe is roughly 12GB/s, which under load can exhaust small-memory systems quickly. In my case, I chose 128GB to avoid these issues. Smaller amounts of RAM will limit the performance you can achieve: the system has to be able to move memory quickly, and low-memory systems may not be able to keep up, hurting your performance.

THE RECIPE

Once you have the network cards connected at 100GBe (50, 40, 25, etc.) on each end, the magic can begin. The rest of this configuration focuses on two things: getting RDMA to work (without hacking TrueNAS Scale or VMware), and a few custom ZFS and network tuning steps.

On the TrueNAS Scale side, configure the iSCSI target like you always would. There's nothing special to do there. I suggest following the iXsystems guide on how to configure it here,

Adding iSCSI Shares on Scale.

Once you have it working, go to a root shell on TrueNAS Scale and check whether iSER is loaded. iSER is required to make this work. Fortunately, TrueNAS Scale includes iSER with the appropriate RDMA support out of the box, so there is no need to install OFED or any other tools (if you followed my advice above).

Run this command as root to check on iSER on your TrueNAS Scale host,

dmesg | grep -i scst | grep -i iser

And you should get something similar to,

[ 54.789249] [5677]: iscsi-scst: Registered iSCSI transport: iSER
[ 54.806430] [6157]: iscsi-scst: iser portal with cm_id 000000006e105ebf listens on 0.0.0.0:3260
[ 54.806584] [6157]: iscsi-scst: Created iser portal cm_id:000000006e105ebf
[ 54.806762] [6157]: iscsi-scst: iser portal with cm_id 00000000aeeea253 listens on [::]:3260
[ 54.806926] [6157]: iscsi-scst: Created iser portal cm_id:00000000aeeea253

If you see this, you are in great shape!

On the VMWare side, you need to enable RDMA, and enable RDMA over iSCSI. Many are familiar with regular iSCSI support, however, RDMA is “hidden” and needs to be enabled from the command line.

You should follow the steps located here,

VMware RDMA over iSCSI Setup

This will enable RDMA on your system, if your network card supports it. The MCX515A-CCAT does out of the box.
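For reference, the command-line side of that guide boils down to something like this on the ESXi host (a rough sketch; adapter numbering will differ on your system),

esxcli rdma device list         # the ConnectX-5 should show up as an RDMA device, e.g. vmrdma0
esxcli rdma iser add            # creates the iSER initiator vmhba
esxcli iscsi adapter list       # the new iSER adapter appears here, e.g. vmhba65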

You then need to configure the RDMA adapter like you would any other VMWare iSCSI target.

Once you’ve enabled the target, regardless of the client OS, you can run the command on TrueNAS Scale,

dmesg | grep -i "iser accepted"

and get something similar to,

[47005.736315] [275975]: iscsi-scst: iser accepted connection cm_id:00000000e9f837d8 10.100.200.11:42694->10.100.200.225:3260

This means that the connection has been made to the target in RDMA mode.

That’s it, it works! … Kinda

TUNING TRUENAS SCALE

There have been many guides posted over the years on tuning FreeNAS/TrueNAS. Some are helpful, others are not. I've found over the years that a few tweaks are needed to get better performance out of a TrueNAS Scale system. Some tuning is also needed for stability with NVMe devices; by default I've found TrueNAS Scale to be VERY unstable without tuning for NVMe.

I would also mention that it's VERY easy to overrun a TrueNAS storage server with multi-GB/s performance. If you have spinning drives and limited caching, you may see performance tank as the machine copes with a massive inrush of I/O that must be cleared. I've seen this make a machine perform so poorly that it stops responding and locks up.

TRUENAS SCALE LINUX KERNEL TUNING

I suggest adding,

nvme_core.default_ps_max_latency_us=0 pcie_aspm=off

to the kernel boot options. These parameters disable NVMe deep power-save states and PCIe Active State Power Management (ASPM). Without them, I've found it only takes a couple of days for my TrueNAS Scale machine to lock hard with PCIe hardware errors. The downside is higher idle power draw, but for 2W in my case, and system stability, it was the best option for me.
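After a reboot you can confirm the options took effect with,

cat /proc/cmdline               # should include nvme_core.default_ps_max_latency_us=0 pcie_aspm=off
dmesg | grep -i aspm            # typically reports that ASPM is disabled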

Via sysctl and TrueNAS Scale's kernel parameter section, I suggest adding the following values,

net.core.netdev_max_backlog = 8192
net.ipv4.tcp_max_syn_backlog = 8192
net.core.rmem_max = 16777216
net.core.wmem_max = 16777216
net.ipv4.tcp_congestion_control = dctcp
net.ipv4.tcp_ecn_fallback = 0

These changes allow more memory to be allocated to the networking stack. In particular, we enable Data Center TCP (DCTCP), which is better suited to the low-latency links we want our storage server to provide.
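You can verify DCTCP is actually in use (the tcp_dctcp module has to be loaded, which the boot script further down takes care of) with,

sysctl net.ipv4.tcp_congestion_control                      # should report dctcp
cat /proc/sys/net/ipv4/tcp_available_congestion_control     # dctcp must appear in this list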

ZFS TUNING

Improving ZFS performance is the next step. These settings have worked to improve performance for me and my workloads; they may not for yours. My settings are generally tuned towards a big storage server with plenty of CPU and memory. Smaller systems may not run as well.

echo 128 > /sys/module/zfs/parameters/zfs_vdev_def_queue_depth
echo 0 > /sys/module/zfs/parameters/zfs_dmu_offset_next_sync
echo 12 > /sys/module/zfs/parameters/zfs_vdev_async_read_max_active
echo 4096 > /sys/module/zfs/parameters/zfs_vdev_max_active

These commands increase some of the defaults that ZFS ships with, which are rather conservative. As I'm working with NVMe drives, which can sustain far more IOPS, I increase values like queue depth and active commands.

I also set 75% of memory for the ZFS ARC cache. By default TrueNAS Scale uses only 50% of available RAM. This is apparently being fixed in TrueNAS Scale 24.04 to match the TrueNAS Core behaviour.
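To see what the ARC ceiling is actually set to, and how much of it is currently in use, you can read the OpenZFS kstats directly,

awk '/^c_max/ {printf "%.1f GiB max\n", $3/1024/1024/1024}' /proc/spl/kstat/zfs/arcstats
awk '/^size/ {printf "%.1f GiB in use\n", $3/1024/1024/1024}' /proc/spl/kstat/zfs/arcstats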

LINUX IO TUNING

Finally, I do some tuning of the Linux IO subsystem. Again, these changes work for me, and may not for everyone. They are designed to unlock the bigger queue depths that NVMe drives can sustain.

I set two parameters: the I/O scheduler to mq-deadline for each drive, and read_ahead_kb to 512. I do this because I want consistent performance per drive. Linux defaults NVMe drives to the "none" scheduler (the successor to noop), but I've found that for my storage workloads, deadline scheduling works best.
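To check what a given drive is currently using (nvme0n1 is just an example device name),

cat /sys/block/nvme0n1/queue/scheduler          # the active scheduler is shown in [brackets]
cat /sys/block/nvme0n1/queue/read_ahead_kb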

TRUENAS TUNING SCRIPT

If you've read this far, bonus. Here's a script you can call from TrueNAS on boot which will apply these settings automatically!

#!/bin/sh

PATH="/bin:/sbin:/usr/bin:/usr/sbin:${PATH}"
export PATH

# Set the ZFS ARC maximum to 75% of installed RAM
ARC_PCT="75"
ARC_BYTES=$(grep '^MemTotal' /proc/meminfo | awk -v pct=${ARC_PCT} '{printf "%d", $2 * 1024 * (pct / 100.0)}')
echo ${ARC_BYTES} > /sys/module/zfs/parameters/zfs_arc_max

# Keep 8GiB free for the rest of the system, and use the same value as the ARC minimum
SYS_FREE_BYTES=$((8*1024*1024*1024))
echo ${SYS_FREE_BYTES} > /sys/module/zfs/parameters/zfs_arc_sys_free
echo ${SYS_FREE_BYTES} > /sys/module/zfs/parameters/zfs_arc_min

# Raise the conservative ZFS vdev queue defaults for NVMe
echo 128 > /sys/module/zfs/parameters/zfs_vdev_def_queue_depth
echo 0 > /sys/module/zfs/parameters/zfs_dmu_offset_next_sync
echo 12 > /sys/module/zfs/parameters/zfs_vdev_async_read_max_active
echo 4096 > /sys/module/zfs/parameters/zfs_vdev_max_active

# Per-NVMe-drive block layer tuning: larger read-ahead and mq-deadline scheduler
for i in $(ls /sys/block | grep -i nvme)
do
  echo 512 > /sys/block/$i/queue/read_ahead_kb
  echo mq-deadline > /sys/block/$i/queue/scheduler
done

# Network stack tuning: DCTCP congestion control and larger buffers/backlogs
/usr/sbin/modprobe tcp_dctcp
/usr/sbin/sysctl -w net.ipv4.tcp_congestion_control=dctcp
/usr/sbin/sysctl -w net.ipv4.tcp_ecn_fallback=0
/usr/sbin/sysctl -w net.core.netdev_max_backlog=8192
/usr/sbin/sysctl -w net.ipv4.tcp_max_syn_backlog=8192
/usr/sbin/sysctl -w net.core.rmem_max=16777216
/usr/sbin/sysctl -w net.core.wmem_max=16777216
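To have this run at boot, save it somewhere persistent (the path below is just an example), make it executable, and add it as a Post Init script under System Settings > Advanced > Init/Shutdown Scripts in the TrueNAS Scale UI,

chmod +x /root/tune.sh
# Then in the UI: System Settings > Advanced > Init/Shutdown Scripts > Add,
# Type "Script", point it at /root/tune.sh, When = "Post Init"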

SUMMARY

With this simple configuration, multi-GB/s throughput is possible with TrueNAS Scale. No additional drivers or configuration are needed. This quick how-to did not apply any tuning on the VMware side; you can increase performance further by using paravirtualized controllers and increasing VM, ESXi and TrueNAS Scale queue depths.

I am hoping that future versions of TrueNAS Scale will add RDMA support for NFS and SMB. RDMA via iSER offers significant performance advantages over regular iSCSI. I'm also eagerly waiting for TrueNAS Scale to support NVMe over TCP/RDMA. Those improvements will increase the performance here further.

I hope some found this article helpful. I’ll post more results over time as I tune this configuration.


Edit:
Apparently the link doesn't work properly; it's referring to the "FreeBSD, ZFS and iSCSI, or one year of TrueNAS development" paper on that page.

Looks like you should be able to get better performance in general?


My one question is why Threadripper vs. say an Epyc CPU?

Cost. I originally investigated using Epyc for this due to the PCIe requirements. It was ultimately cheaper to buy a fully configured P620 than to assemble an Epyc machine from parts. There are some good deals on Epyc CPU + board combos on eBay, but getting Rome compatibility and PCIe 4.0 is more expensive than just buying a loaded P620 used, which already has both, at a lower price.


This tuning is quite sufficient for generalized storage, but I'd expect Linux handling of very small, highly random IO to be reduced a bit by this, along with less-than-optimal storage latency. Generally I'd only recommend someone set a 512 read-ahead on Linux (BSD doesn't seem to care as much, but I never looked into why) when the box is primarily acting as a file share (or serving otherwise "non-sparsified" file workloads), and that 128 vdev queue depth can also be problematic for anything with heavy sync writes.

Overall, I feel like this is a great starting point for folks, but I'd just caution them that if they don't have enough memory to just "brute force it" (as I think is likely happening here), they'll definitely want to fully evaluate the Linux and ZFS tuning parameters for themselves - if there was heavy memory usage elsewhere, or any need for ZFS to relinquish an allocation, I'd expect to see lower numbers, even with the reduced overhead thanks to iSER.

I'd be curious to see the same fio test run with a 16-64k block size. I suspect that with these tuning settings in place, the interrupt count would get high enough over time that it'd start to show what I'm talking about a bit more concretely in the numbers.

And just to be clear here - I think this post is still of significant use to folks, and it's awesome you pulled this together. I just hope to dissuade anyone who comes across this down the line from blindly putting these values in place for some database workload and then getting frustrated when things tank, that's all :+1:
