I’m attempting to upgrade my NAS hard drives from an 8x2TB raidz2 to a 4x6TB raidz1. The system is a Supermicro CSE-745 chassis with a Supermicro X8DTH-6 motherboard and a pair of Xeon X5687s with 48 GB of ECC RAM. I’m running CentOS 7 with ZFS on Linux.
I’ve got the 4 new drives connected to a spare LSI 9211-8i powered off a spare EVGA PSU.
I keep having one or more of the new drives become “Degraded” with “too many errors”. Currently 3 of the drives are showing “too many errors” when running zpool status, and ZFS has already tried to resilver the pool. I’ve also noticed that the “UDMA_CRC_Error_Count” on the 3 errored drives was increasing as it tried to resilver (went from 88 to 617 on
  pool: new6TBpool
 state: DEGRADED
status: One or more devices has experienced an unrecoverable error. An
        attempt was made to correct the error. Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
        using 'zpool clear' or replace the device with 'zpool replace'.
   see: http://zfsonlinux.org/msg/ZFS-8000-9P
  scan: resilvered 75.9G in 0h25m with 0 errors on Tue Apr 2 01:56:21 2019
config:

        NAME            STATE     READ WRITE CKSUM
        new6TBpool      DEGRADED     0     0     0
          raidz1-0      DEGRADED     0     0 1.89K
            sdb         DEGRADED     0     0     0  too many errors
            sdc         ONLINE       0     0     0
            sdd         DEGRADED     0     0     0  too many errors
            sde         DEGRADED     0     0     0  too many errors

errors: No known data errors
Every 2.0s: smartctl -a /dev/sdb | grep UDMA_CRC_Error_Count && sm...    Tue Apr 2 01:57:43 2019

199 UDMA_CRC_Error_Count    0x000a   200   200   000    Old_age   Always       -       617
199 UDMA_CRC_Error_Count    0x000a   200   200   000    Old_age   Always       -       74
199 UDMA_CRC_Error_Count    0x000a   200   200   000    Old_age   Always       -       659
199 UDMA_CRC_Error_Count    0x000a   200   200   000    Old_age   Always       -       650
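For anyone who wants to track how fast the counter climbs rather than eyeballing it, here is a minimal Python sketch. It assumes the standard ten-column smartctl attribute row shown above, with the raw value in the last column. The function names `crc_error_count` and `crc_deltas` and the key `"drive"` are made up for illustration; the 88 and 617 readings are the ones quoted in this post.

```python
def crc_error_count(smart_output):
    """Return the raw UDMA_CRC_Error_Count from smartctl text, or None if absent."""
    for line in smart_output.splitlines():
        fields = line.split()
        # Attribute rows have 10 columns; the raw value is the last one.
        if len(fields) >= 10 and fields[1] == "UDMA_CRC_Error_Count":
            return int(fields[9])
    return None


def crc_deltas(before, after):
    """Map each drive to how much its CRC counter grew between two readings."""
    return {dev: after[dev] - before[dev] for dev in after if dev in before}


# Sample row copied from the smartctl output in this post.
sample = "199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age Always - 617"

if __name__ == "__main__":
    print(crc_error_count(sample))
    # "drive" is a placeholder key, not a real device name.
    print(crc_deltas({"drive": 88}, {"drive": 617}))
```

A counter that keeps growing during the resilver points at the link between the drive and the controller (cabling, backplane, HBA port, or power) rather than at the platters, since this attribute counts transfer errors on the interface.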
I’ve tried using different Mini-SAS->SATA breakout cables, different ports on the card, and different PCIe slots. I’m beginning to think maybe the HBA is bad. I’m going to see if there’s a spare one at work I can borrow tomorrow to get this data transfer done.
Anything else I can try?