I have a very old physical server using many disks in a number of RAID6 group. Those disks are not in the same chassis, instead is a JBOD system connected through the RAID controller on this server. In one of the RAID group, I had a disk failed. After replacing the failed disk, below is the storcli
output:
$ storcli /c0 show
…
---------------------------------------------
DG Arr Row EID:Slot DID Type State BT Size PDC PI SED DS3 FSpace
---------------------------------------------
1 - - - - RAID6 Pdgd N 27.285 TB dflt N N none N
1 0 - - - RAID6 Dgrd N 27.285 TB dflt N N none N
1 0 0 34:0 48 DRIVE Onln N 2.728 TB dflt N N none -
1 0 1 34:1 49 DRIVE Onln N 2.728 TB dflt N N none -
1 0 2 34:2 50 DRIVE Onln N 2.728 TB dflt N N none -
1 0 3 34:3 51 DRIVE Onln N 2.728 TB dflt N N none -
1 0 4 - - DRIVE Msng - 2.728 TB - - - - -
1 0 5 34:5 53 DRIVE Onln N 2.728 TB dflt N N none -
1 0 6 34:6 55 DRIVE Onln N 2.728 TB dflt N N none -
1 0 7 34:7 54 DRIVE Onln N 2.728 TB dflt N N none -
1 0 8 34:8 56 DRIVE Onln N 2.728 TB dflt N N none -
1 0 9 34:9 57 DRIVE Onln N 2.728 TB dflt N N none -
1 0 10 34:10 58 DRIVE Onln N 2.728 TB dflt N N none -
1 0 11 34:11 59 DRIVE Onln N 2.728 TB dflt N N none -
From above, we can see slot 34:4 is missing, as confirmed by the following command:
$ MegaCli -PdGetMissing -a0
Adapter 0 - Missing Physical drives
No. Array Row Size Expected
0 1 4 2861056 MB
When I tried to manually add this disk to its disk group, it threw the following error:
$ MegaCli -PdReplaceMissing -PhysDrv [34:4] -Array1 -Row4 -a0
Adapter: 0: Failed to replace Missing PD at Array 1, Row 4.
FW error description:
The specified physical disk doesn't have enough capacity to complete the requested command.
Exit Code: 0x0d
As per this old megaCLI guide, the above error code is due to "Drive is too small for requested operation".
If I compare storcli /c0/e34/s3,4 show all
output, the replacement disk has identical capacity and sector size:
Drive /c0/e34/s3 Device attributes : <<<< working disk
==================================
WWN = 5000c50090fb1146
Firmware Revision = SN06
Raw size = 2.728 TB [0x15d50a3b0 Sectors]
Coerced size = 2.728 TB [0x15d400000 Sectors]
Non Coerced size = 2.728 TB [0x15d40a3b0 Sectors]
Device Speed = 6.0Gb/s
Link Speed = 6.0Gb/s
Logical Sector Size = 512B
Drive /c0/e34/s4 Device attributes : <<<< new disk to add
==================================
WWN = 5000c50074a7f9cf
Firmware Revision = SN04
Raw size = 2.728 TB [0x15d50a3b0 Sectors]
Coerced size = 2.728 TB [0x15d400000 Sectors]
Non Coerced size = 2.728 TB [0x15d40a3b0 Sectors]
Device Speed = 6.0Gb/s
Link Speed = 6.0Gb/s
Logical Sector Size = 512B
Physical Sector Size = 512B
Any idea why it is complaining about capacity when they appear to be identical?
EDITED (based on suggestions of @djdomi and @U880D)
To see the firmware revision of all the disks, filtered the output of MegaCli -PDList -a0
as below:
$ MegaCli -PDList -a0 | awk '/^Enclosure Device ID/ {printf "%d", $4; next} /^Slot Number:/ {printf ":%d\t", $3; next} /Device Firmware Level/ {print $4}'
34:0 SN06
34:1 TN02
34:2 TN02
34:3 SN06
34:4 SN04
34:5 TN02
34:6 TN02
34:7 TN02
34:8 TN02
34:9 TN02
34:10 TN02
34:11 SN04
There is a mix of firmware. Disk 34:11 has the same firmware SN04 as 34:4. Wondering why its only affecting disk 34:4?