A few weeks ago I combined a 9500-16i with two Adaptec AEC-82885T 36Port SAS Expander.
I took the AEC-82885T because it doesn’t require a PCIe slot. In retrospect, a SAS expander from Broadcomms SAS3xNN series would have been a better choice, maybe…
The plan was to use the 9500 for two PM9A3 U2 drives and 20 HDDs (TOSHIBA MG08ACA16TE) via SAS Expander, 12 internal and 8 external.
I connected all drives to test the combination, but waited to migrate the metadata to the two U2 drives, so the U2 drives had no load, only the disks were configured as productive and backup pool.
I had it running for 3 weeks with TrueNas Scale, everything was normal, the performance of the productive pool was good and the backup to my DIY SAS Enclosure also worked.
The trouble started with the transfer of the metadata Special vdev from the previous NVME mirror to the two U2 drives.
As soon as there was load on the pool, one or more disks failed and almost all of them now show UDMA CRC errors.
I first updated the HBA firmware, then improved the cooling of the HBA/Expander and replaced the cables, but nothing really helped, although I had the impression that the better cooling had helped a bit, the errors came much later.
But the cooling is now as good as it can be, I have a 40mm fan directly on the HBA and a 22" fan in front of it and the HBA shows 50 degrees Celsius, which seems high to me, because only the two U2 drives are connected, the other disks are now connected to a 9300 with no SAS expander in between.
Another problem is the PCIe link speed of the 9500, lspci shows “LnkSta: Speed 16GT/s, Width x8” but storcli shows “Device Interface = PCIe-8GT/s” who is right?
The PCIe slot is currently set to Auto, as soon as I can shut down the server again I’ll try to force PCIe Gen4, but a second backup runs at the moment, so that have to wait.
I think the idea with the SAS enclosure would be nice but unfortunately too much trouble.
I would like to keep the 9500, if I can even get a suitable SlimSAS SFF-8654 8i Straight to 8x SATA cable, because no dealer offers such cables in Germany and I don’t want to test the eBay/China cables after the drama of the last few days.
And what discourages me, is that Broadcom doesn’t have a SATA Breakout cable for the 9500 in its cable guide, however that would be optimal for my case.
I could get that, but my case isn’t ideal for the angled plugs
Does anyone know a dealer who delivers suitable and high-quality SFF-8654 8i Straight to 8x SATA cable to Germany?
Or has someone already gone through this and come to the conclusion that the 9500 needs a backplane?
I just found a firmware from 2021 for the AEC-82885T, maybe…
#./storcli64 /c0 show all
CLI Version = 007.2807.0000.0000 Dec 22, 2023
Operating system = Linux 6.5.0-15-generic
Controller = 0
Status = Success
Description = None
Basics :
======
Controller = 0
Adapter Type = SAS3816(A0)
Model = HBA 9500-16i
Serial Number = SKB5091830
Current System Date/time = 02/04/2024 23:43:24
Concurrent commands supported = 4352
SAS Address = 500062b20918c500
PCI Address = 00:42:00:00
Version :
=======
Firmware Package Build = 29.00.00.00
Firmware Version = 29.00.00.00
Bios Version = 09.57.00.00_29.00.00.00
NVDATA Version = 29.02.00.11
PSOC FW Version = 0x0064
PSOC Part Number = 14790
Driver Name = mpt3sas
Driver Version = 43.100.00.00
PCI Version :
===========
Vendor Id = 0x1000
Device Id = 0xE6
SubVendor Id = 0x1000
SubDevice Id = 0x4050
Host Interface = PCIE
Device Interface = PCIe-8GT/s
Bus Number = 66
Device Number = 0
Function Number = 0
Domain ID = 0
Pending Images in Flash :
=======================
Image name = No pending images
Status :
======
Controller Status = OK
Memory Correctable Errors = 0
Memory Uncorrectable Errors = 0
Bios was not detected during boot = No
Controller has booted into safe mode = No
Controller has booted into certificate provision mode = No
Package Stamp Mismatch = No
Supported Adapter Operations :
============================
Alarm Control = No
Cluster Support = No
Self Diagnostic = No
Deny SCSI Passthrough = No
Deny SMP Passthrough = No
Deny STP Passthrough = No
Support more than 8 Phys = Yes
FW and Event Time in GMT = No
Support Enclosure Enumeration = Yes
Support Allowed Operations = Yes
Support Multipath = Yes
Support Security = Yes
Support Config Page Model = No
Support the OCE without adding drives = No
support EKM = No
Snapshot Enabled = No
Support PFK = No
Support PI = No
Support Shield State = No
Support Set Link Speed = No
Support JBOD = No
Disable Online PFK Change = No
Real Time Scheduler = No
Support Reset Now = No
Support Emulated Drives = No
Support Secure Boot = Yes
Support Platform Security = No
Support Package Stamp Mismatch Reporting = Yes
Support PSOC Update = Yes
Support PSOC Part Information = Yes
Support PSOC Version Information = Yes
HwCfg :
=====
ChipRevision = A0
BatteryFRU = N/A
Front End Port Count = 1
Backend Port Count = 21
Serial Debugger = Absent
NVRAM Size = 0KB
Flash Size = 16MB
On Board Memory Size = 0MB
On Board Expander = Absent
Temperature Sensor for ROC = Present
Temperature Sensor for Controller = Absent
Current Size of CacheCade (GB) = 0
Current Size of FW Cache (MB) = 0
ROC temperature(Degree Celsius) = 47
Policies :
========
Policies Table :
==============
------------------------------------------------
Policy Current Default
------------------------------------------------
Predictive Fail Poll Interval 0 sec
Interrupt Throttle Active Count 0
Interrupt Throttle Completion 0 us
Rebuild Rate 0 % 30%
PR Rate 0 % 30%
BGI Rate 0 % 30%
Check Consistency Rate 0 % 30%
Reconstruction Rate 0 % 30%
Cache Flush Interval 0s
------------------------------------------------
Flush Time(Default) = 4s
Drive Coercion Mode = none
Auto Rebuild = Off
Battery Warning = Off
ECC Bucket Size = 0
ECC Bucket Leak Rate (hrs) = 0
Restore HotSpare on Insertion = Off
Expose Enclosure Devices = Off
Maintain PD Fail History = Off
Reorder Host Requests = On
Auto detect BackPlane = SGPIO/i2c SEP
Load Balance Mode = None
Security Key Assigned = Off
Disable Online Controller Reset = Off
Use drive activity for locate = Off
Boot :
====
Max Drives to Spinup at One Time = 2
Maximum number of direct attached drives to spin up in 1 min = 60
Delay Among Spinup Groups (sec) = 2
Allow Boot with Preserved Cache = On
Defaults :
========
Phy Polarity = 0
Phy PolaritySplit = 0
Cached IO = Off
Default spin down time (mins) = 0
Coercion Mode = None
ZCR Config = Unknown
Max Chained Enclosures = 0
Direct PD Mapping = No
Restore Hot Spare on Insertion = No
Expose Enclosure Devices = No
Maintain PD Fail History = No
Zero Based Enclosure Enumeration = No
Disable Puncturing = No
Un-Certified Hard Disk Drives = Block
SMART Mode = Mode 6
Enable LED Header = No
LED Show Drive Activity = No
Dirty LED Shows Drive Activity = No
EnableCrashDump = No
Disable Online Controller Reset = No
Treat Single span R1E as R10 = No
Power Saving option = Enable
TTY Log In Flash = No
Auto Enhanced Import = No
Enable Shield State = No
Time taken to detect CME = 60 sec
Capabilities :
============
Supported Drives = SAS, SATA, NVMe
Enable JBOD = Yes
Max Parallel Commands = 4352
Max SGE Count = 128
Max Data Transfer Size = 32 sectors
Max Strips PerIO = 0
Max Configurable CacheCade Size = 0
Min Strip Size = 512Bytes
Max Strip Size = 512Bytes
Scheduled Tasks = NA
Secure Boot :
===========
Secure Boot Enabled = Yes
Controller in Soft Secure Mode = No
Controller in Hard Secure Mode = Yes
Key Update Pending = No
Remaining Secure Boot Key Slots = 7
Security Protocol properties :
============================
Security Protocol = None
Enclosure Information :
=====================
------------------------------------------------------------------
EID State Slots PD PS Fans TSs Alms SIM ProdID VendorSpecific
------------------------------------------------------------------
0 OK 10 2 0 0 0 0 0 VirtualSES
------------------------------------------------------------------
Physical Device Information :
===========================
Drive /c0/e0/s4 :
===============
-----------------------------------------------------------------------------------------------
EID:Slt DID State DG Size Intf Med SED PI SeSz Model Sp
-----------------------------------------------------------------------------------------------
0:4 1 JBOD - 3.492 TB NVMe SSD - - 512B SAMSUNG MZQL23T8HCLS-00B7C -
-----------------------------------------------------------------------------------------------
EID-Enclosure Device ID|Slt-Slot No|DID-Device ID|DG-DriveGroup
UGood-Unconfigured Good|UBad-Unconfigured Bad|Intf-Interface
Med-Media Type|SED-Self Encryptive Drive|PI-Protection Info
SeSz-Sector Size|Sp-Spun|U-Up|D-Down|T-Transition
Drive /c0/e0/s4 - Detailed Information :
======================================
Drive /c0/e0/s4 State :
=====================
Shield Counter = N/A
Media Error Count = N/A
Other Error Count = N/A
Predictive Failure Count = N/A
S.M.A.R.T alert flagged by drive = N/A
Drive /c0/e0/s4 Device attributes :
=================================
Manufacturer Id = NVMe
Model Number = SAMSUNG MZQL23T8HCLS-00B7C
NAND Vendor = NA
SN = S63UNE0R409085
WWN = 3D5C80C7D8B7C8E6
Firmware Revision = GDC51C2Q
Raw size = 3.492 TB [0x1bf1f72af Sectors]
Coerced size = 3.492 TB [0x1bf1f72af Sectors]
Non Coerced size = 3.492 TB [0x1bf1f72af Sectors]
Device Speed = 16.0GT/s
Link Speed = 16.0GT/s
Sector Size = 512B
Config ID = NA
Number of Blocks = 7501476527
Connector Name = C0.0 x4
Drive /c0/e0/s4 Policies/Settings :
=================================
Enclosure position = 0
Connected Port Number = 0(path0)
Sequence Number = 0
Commissioned Spare = No
Emergency Spare = No
Last Predictive Failure Event Sequence Number = N/A
Successful diagnostics completion on = N/A
SED Capable = N/A
SED Enabled = N/A
Secured = N/A
Needs EKM Attention = N/A
PI Eligible = N/A
Certified = N/A
Wide Port Capable = N/A
Multipath = No
Port Information :
================
------------------------------------------
Port Status Link Speed SAS address
------------------------------------------
0 Active 16.0GT/s 0xe6c8b7d8c7805c4d
------------------------------------------
Inquiry Data =
00 00 07 12 45 00 00 02 4e 56 4d 65 20 20 20 20
53 41 4d 53 55 4e 47 20 4d 5a 51 4c 32 33 54 38
31 43 32 51 53 36 33 55 4e 45 30 52 34 30 39 30
38 35 20 20 20 20 20 20 00 00 00 c0 05 c0 06 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
Drive /c0/e0/s6 :
===============
-----------------------------------------------------------------------------------------------
EID:Slt DID State DG Size Intf Med SED PI SeSz Model Sp
-----------------------------------------------------------------------------------------------
0:6 2 JBOD - 3.492 TB NVMe SSD - - 512B SAMSUNG MZQL23T8HCLS-00B7C -
-----------------------------------------------------------------------------------------------
EID-Enclosure Device ID|Slt-Slot No|DID-Device ID|DG-DriveGroup
UGood-Unconfigured Good|UBad-Unconfigured Bad|Intf-Interface
Med-Media Type|SED-Self Encryptive Drive|PI-Protection Info
SeSz-Sector Size|Sp-Spun|U-Up|D-Down|T-Transition
Drive /c0/e0/s6 - Detailed Information :
======================================
Drive /c0/e0/s6 State :
=====================
Shield Counter = N/A
Media Error Count = N/A
Other Error Count = N/A
Predictive Failure Count = N/A
S.M.A.R.T alert flagged by drive = N/A
Drive /c0/e0/s6 Device attributes :
=================================
Manufacturer Id = NVMe
Model Number = SAMSUNG MZQL23T8HCLS-00B7C
NAND Vendor = NA
SN = S63UNE0R409724
WWN = 3D5C7FC1DFB7C8E6
Firmware Revision = GDC51C2Q
Raw size = 3.492 TB [0x1bf1f72af Sectors]
Coerced size = 3.492 TB [0x1bf1f72af Sectors]
Non Coerced size = 3.492 TB [0x1bf1f72af Sectors]
Device Speed = 16.0GT/s
Link Speed = 16.0GT/s
Sector Size = 512B
Config ID = NA
Number of Blocks = 7501476527
Connector Name = C0.1 x4
Drive /c0/e0/s6 Policies/Settings :
=================================
Enclosure position = 0
Connected Port Number = 1(path0)
Sequence Number = 0
Commissioned Spare = No
Emergency Spare = No
Last Predictive Failure Event Sequence Number = N/A
Successful diagnostics completion on = N/A
SED Capable = N/A
SED Enabled = N/A
Secured = N/A
Needs EKM Attention = N/A
PI Eligible = N/A
Certified = N/A
Wide Port Capable = N/A
Multipath = No
Port Information :
================
------------------------------------------
Port Status Link Speed SAS address
------------------------------------------
0 Active 16.0GT/s 0xe6c8b7dfc17f5c4d
------------------------------------------
Inquiry Data =
00 00 07 12 45 00 00 02 4e 56 4d 65 20 20 20 20
53 41 4d 53 55 4e 47 20 4d 5a 51 4c 32 33 54 38
31 43 32 51 53 36 33 55 4e 45 30 52 34 30 39 37
32 34 20 20 20 20 20 20 00 00 00 c0 05 c0 06 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00