Hello,
We have an MFSYS25 chassis (details below) which has a strange but rather critical problem.
Whenever a drive fails on the RAID setup we have the entire system locks up. The fans speed up; all of the compute modules restart but do not go anywhere, and we can barely manage the system as the management console hangs up. What we find that we have to do is completely unplug and plug back in the power to the system and then start the rebuild on the failed drive. Then, we can start the compute modules and get the various servers going again, etc.
Has anyone else experienced this? What can we do to add some resiliency to the system?
Thank you for your time.
Details of our MFSYS25 System are as follows:
- Chassis Management Module: Part Number: D70735-403
- Server Storage Module: Part Number: D70737-404
- Gigabit Ethernet Switch: Part Number: D70739-404
- Six (6) Server Compute Modules: Part Number: D70726-404
- Firmware Versions:
Server 1 | BMC Firmware | ok | 1.36.6 |
| BMC Boot | ok | 0.10 |
| BIOS | ok | SB5000.86B.10.00.0050.083120090939 |
Server 2 | BMC Firmware | ok | 1.36.6 |
| BMC Boot | ok | 0.10 |
| BIOS | ok | SB5000.86B.10.00.0050.083120090939 |
Server 3 | BMC Firmware | ok | 1.36.6 |
| BMC Boot | ok | 0.10 |
| BIOS | ok | SB5000.86B.10.00.0050.083120090939 |
Server 4 | BMC Firmware | ok | 1.36.6 |
| BMC Boot | ok | 0.10 |
| BIOS | ok | SB5000.86B.10.00.0050.083120090939 |
Server 5 | BMC Firmware | ok | 1.36.6 |
| BMC Boot | ok | 0.10 |
| BIOS | ok | SB5000.86B.10.00.0050.083120090939 |
Server 6 | BMC Firmware | ok | 1.36.6 |
| BMC Boot | ok | 0.10 |
| BIOS | ok | SB5000.86B.10.00.0050.083120090939 |
Switch 1 | Firmware | ok | 1.0.0.27 |
| Boot | ok | 1.0.0.6 |
Switch 2 | Firmware | not present | -- |
| Boot | not present | -- |
Storage Control Module 1 | Firmware | ok | 3.10.140.2 |
Storage Control Module 2 | Firmware | not present | -- |
System Fan 1 | Firmware | ok | 1.2 |
| Boot | ok | 1.2 |
System Fan 2 | Firmware | ok | 1.2 |
| Boot | ok | 1.2 |
I/O Fan | Firmware | ok | 1.2 |
| Boot | ok | 1.2 |
Power Supply 1 | Firmware | not applicable ![]() | -- |
| Boot | not applicable ![]() | -- |
Power Supply 2 | Firmware | not applicable ![]() | -- |
| Boot | not applicable ![]() | -- |
Power Supply 3 | Firmware | not applicable ![]() | -- |
| Boot | not applicable ![]() | -- |
Power Supply Blank 4 | Firmware | ok | 1.2 |
| Boot | ok | 1.2 |