ECS - ECS Replacement Procedures-Replacing Read Cache SSDs
ECS - ECS Replacement Procedures-Replacing Read Cache SSDs
Topic
ECS Replacement Procedures
Selections
Select ECS Hardware: ECS EX-Series (Gen3) Hardware Procedures
Select EX-Series Platform or Switch: ECS EX500
Select ECS EX500 Hardware Replacement Procedures: Read Cache SSD Replacement
REPORT PROBLEMS
If you find any errors in this procedure or have comments regarding this application, send email to
[email protected]
Copyright © 2023 Dell Inc. or its subsidiaries. All Rights Reserved. Dell Technologies, Dell, EMC, Dell
EMC and other trademarks are trademarks of Dell Inc. or its subsidiaries. Other trademarks may be
trademarks of their respective owners.
The information in this publication is provided “as is.” Dell Inc. makes no representations or warranties of
any kind with respect to the information in this publication, and specifically disclaims implied warranties of
merchantability or fitness for a particular purpose.
Use, copying, and distribution of any software described in this publication requires an applicable
software license.
This document may contain certain words that are not consistent with Dell's current language guidelines.
Dell plans to update the document over subsequent future releases to revise these words accordingly.
This document may contain language from third party content that is not under Dell's control and is not
consistent with Dell's current guidelines for Dell's own content. When such third party content is updated
by the relevant third parties, this document will be revised accordingly.
Page 1 of 50
Contents
Preliminary Activity Tasks .......................................................................................................3
Read, understand, and perform these tasks.................................................................................................3
Page 2 of 50
Preliminary Activity Tasks
This section may contain tasks that you must complete before performing this procedure.
Table 1 List of cautions, warnings, notes, and/or KB solutions related to this activity
2. This is a link to the top trending service topics. These topics may or not be related to this activity.
This is merely a proactive attempt to make you aware of any KB articles that may be associated with
this product.
Note: There may not be any top trending service topics for this product at any given time.
Page 3 of 50
Dell Technologies Confidential Information version: 2.3.6.198
Page 4 of 50
Read Cache SSD Replacement
Page 5 of 50
ECS Appliance Read Cache SSD Replacement
Guide
October 2022
Rev. 1.3
Page 6 of 50
Notes, cautions, and warnings
NOTE: A NOTE indicates important information that helps you make better use of your product.
CAUTION: A CAUTION indicates either potential damage to hardware or loss of data and tells you how to avoid
the problem.
WARNING: A WARNING indicates a potential for property damage, personal injury, or death.
© 2022 Dell Inc. or its subsidiaries. All rights reserved. Dell, EMC, and other trademarks are trademarks of Dell Inc. or its subsidiaries. Other
trademarks may be trademarks of their respective owners.
Page 7 of 50
Contents
Figures..........................................................................................................................................5
Tables........................................................................................................................................... 6
Revision history.......................................................................................................................................................................... 7
Overview......................................................................................................................................................................................8
Determine the Version of EX Series Hardware .................................................................................................................9
Replacement Disk Prerequisites for ECS 3.5.................................................................................................................... 13
Tools............................................................................................................................................................................................14
Following ECS UI for SSD status and replacement information.................................................................................. 15
Contents 3
Page 8 of 50
Installing an SSD................................................................................................................................................................ 43
Post—Disk Replacement Checks ................................................................................................................................ 44
4 Contents
Page 9 of 50
Figures
Figures 5
Page 10 of 50
Tables
1 Revision history...........................................................................................................................................................7
2 EX3000 Server indicators, buttons, or connectors......................................................................................... 12
6 Tables
Page 11 of 50
Revision history
This section provides a description of document changes.
Revision history 7
Page 12 of 50
Overview
This guide provides information for ECS Gen3 EX Series and Gen2 U Series disk replacement.
For all other ECS hardware platforms, contact Dell support to perform disk replacement.
ECS supports disk replacement using the process that is described in this document only if all nodes in the Virtual Data Center
(VDC) are running ECS version 3.5 or later.
ECS supports NVMe SSDs on the ECS EXF900 Appliance on ECS version 3.6 and later.
ECS supports HDDs on the ECS EX5000, EX500, EX300, EX3000, and Gen2 U Series Appliances on version 3.5 and later.
ECS blocks disk replacement by the user in the ECS UI until upgrade to version 3.5 or later is completed on all nodes. For
assistance carrying out a disk replacement while upgrade is in progress if for some reason deemed absolutely required, contact
Dell Support.
Throughout this document, in the context of these procedures, the following terms are used interchangeably:
● disk
● storage disk
● drive
● hard drive
8 Overview
Page 13 of 50
Determine the Version of EX Series Hardware
Learn how to determine which version of EX Series hardware you have.
EX5000
The following figures illustrate the front of the various versions of the EX5000 server.
EXF900
The following figures illustrate the front of the EXF900 server.
Page 14 of 50
Figure 3. EXF900 Server Front View 24 Drives
EX500
The following figure illustrates the front of the EX500 server:
Page 15 of 50
EX300
The following figure illustrates the front of the EX300 server:
EX3000
The following figure illustrates the front of the EX3000 server:
Page 16 of 50
Table 2. EX3000 Server indicators, buttons, or connectors
Item Indicator, Button, or Connector Description
1 Power indicator The power indicator glows when the system is turned on.
2 ID indicator When a system identification button is pressed, the ID indicator blinks blue
to help locate a particular system within a rack.
3 Sled An HDD fault status indicator. The indicator blinks amber if an HDD
experiences an issue.
4 System board status indicator If the system is on, and in good health, the indicator glows solid blue. The
indicator blinks amber if the system is in standby, and if any issue exists
(for example, a failed fan or HDD).
5 Power button ● The power button controls the PSU output to the system.
● NOTE: On ACPI-compliant operating systems (OSs), turning off
the system using the power button causes the system to perform a
graceful shutdown before power to the system is turned off.
6 System identification button ● The identification button can be used to locate a particular system
within a rack.
● Press to switch the system ID on and off.
● If the system stops responding during POST, press and hold the system
ID button for more than five seconds to enter BIOS progress mode.
● To reset iDRAC (if not disabled in F2 iDRAC setup) press and hold the
button for more than 15 seconds.
7 Power indicator The power indicator glows when the system is turned on.
8 ID indicator When a system identification button is pressed, the ID indicator blinks blue
to help locate a particular system within a rack.
9 Sled B HDD fault status indicator ● The indicator blinks amber if an HDD experiences an issue.
● NOTE: Features of Sled B are for dual-node systems only.
10 System board status indicator If the system is on, and in good health, the indicator glows solid blue. The
indicator blinks amber if the system is in standby, and if any issue exists
(for example, a failed fan or HDD).
11 Power button ● The power button controls the PSU output to the system.
● NOTE: On ACPI-compliant operating systems (OSs), turning off
the system using the power button causes the system to perform a
graceful shutdown before power to the system is turned off.
12 System identification button ● The identification button can be used to locate a particular system
within a rack.
● Press to switch the system ID on and off.
● If the system stops responding during POST, press and hold the system
ID button for more than five seconds to enter BIOS progress mode.
● To reset iDRAC (if not disabled in F2 iDRAC setup) press and hold the
button for more than 15 seconds.
Gen2
The figure illustrates the front of the ECS Gen2 server:
Page 17 of 50
Replacement Disk Prerequisites for ECS 3.5
Ensure that you have Secure Remote Services (SRS) configured, or contact Dell Support to order replacement disks.
Prerequisites
If SRS is configured, the disks for replacement are automatically dispatched to a customer site. If Secure Remote Services is not
configured, follow the procedure below to order replacement disks.
Steps
1. In the ECS UI, go to Manage > Events > Alerts.
2. Filter the Alerts by:
a. Date: Time Range
Select the time range according to the approximate date that you believe to have received the alert to replace the
faulted disk.
b. Severity: Select Info
3. Export the filter results.
The Export function exports only what is displayed on the ECS UI screen at the moment of exporting. You may have to carry
out the Export function several times to yield all the results in your export output.
4. Take note of the following information ready to place the disk order:
● Node Serial Number
● Disk Serial Number
● Disk Type
● Model
● Size
Look for alerts such as the following with the pertinent information:
Severity: Info
Decription: Node SN=<service tag> Disk SN=<disk serial number> in rack=<name of rack>,
node=<fqdn>, slot=<slot number> is ready for replacement. Disk Details: Type=<disk
type>,
Model=<vendor model>, Size=<disk size> GB, Firmware = <firmware version>.
5. Contact Dell EMC at https://www.dell.com/support or your support provider to order replacement parts.
Page 18 of 50
Tools
Learn about the tools required to complete the disk replacements.
● Phillips #1 screwdriver
● T6 torx bit, for torx screw removal of 2.5" drives
● T8 torx bit, for torx screw removal of 3.5" drives
● ESD gloves or ESD wristband
● Ladder: Required to access to the disk bays, if you are replacing a disk in an EX3000 mounted in the rack at 30U or higher.
● Ladder: Required to access to the disk bays, if you are replacing a disk in an EX5000 mounted in the rack at 30U or higher.
14 Tools
Page 19 of 50
Following ECS UI for SSD status and
replacement information
Follow the ECS UI for information as to when you should replace an SSD.
Prerequisites
Ensure that replacement disks are onsite.
Steps
1. In the ECS UI, go to Manage > Maintenance.
This section represents all the racks that exist in a VDC and all nodes.
The SSD Cache Disks column provides SSD status. The Data Disks column provides HDD status.
Green means that disks are operating properly.
Yellow means that disks are undergoing recovery or initializing after replacement. No action is required.
Red means that disks require attention; for example, they have failed recovery.
Blue means that disks require your action; for example, you must replace the disks.
When the disk status turns to yellow, it means that the disk status is Bad, and that ECS automatically has begun the
recovery process.
2. Click the node for additional information.
The ECS Maintenance page for the rack appears and shows the status for each node.
Page 20 of 50
Figure 9. ECS UI Maintenance Data Disks Blue Status
The ECS UI Maintenance drill-down for that node appears. The Replace option becomes available.
When the Replace option becomes available, a Secure Remote Services Dial Home event automatically instructs Dell EMC to
send a new disk to your site.
NOTE:
Page 21 of 50
The SSD disk undergoes a replacement process only if its health is FAILED. The read cache SSD replacement is not triggered
based on a "SSD Life Remaining" threshold level.
3. Proceed with physical disk replacement step only when you have the replacement disk available for physical replacement.
● If you do not have the replacement disks at hand, contact Dell Support and ensure that the replacement disk has been
ordered. Continue when you have the replacement disk. Obtain disk information as follows:
○ In ECS UI Maintenance, select the disk for which you want information, and then select left-most arrow in disk row
to expand the disk information.
○ In ECS UI Manage > Events > Alerts which is posted at the time when disk is ready for replacement.
The Alert details are:
Look for alerts such as the following for the pertinent information:
Severity: Info
Decription: Node SN=<service tag> Disk SN=<disk serial number> in rack=<name of rack>,
node=<fqdn>, slot=<slot number> is ready for replacement. Disk Details: Type=<disk
type>, Model=<vendor model>, Size=<disk size> GB, Firmware = <firmware version>.
● If you have the replacement disk at hand, navigate to the node that requires the replacement disk, and that is in blue
status.
4. Click Replace, and then click OK to confirm.
ECS allows replacing one disk at a time per node. After you click Replace for a given drive, you must place a new disk into
the node before proceeding with next disk. ECS does not allow you to select Replace for another drive until you insert the
drive for which Replace has already been clicked.
5. Click OK.
Next steps
Go to the section specific to the hardware type of the target node for the steps to physically replace the target SSD on the ECS
appliance.
Do not replace the disk physically unless corresponding Disk Status in the ECS UI shows Replace Disk and Description states:
Replace the disk according to LED identity and Slot/Enclosure location. Ensure that you verify the serial number of
the disk that you remove from the system against the serial number that the UI displays.
Page 22 of 50
1
Replace Read Cache SSDs on ECS EX5000
appliance
Topics:
• Replace the Failed Drives Overview
• Uninstalling an SSD
• Removing a drive carrier
• Removing the SSD from the drive carrier
• Installing the SSD into the drive carrier
• Installing a drive carrier
• Installing an SSD
• Post—Disk Replacement Checks
Uninstalling an SSD
Steps
1. Locate the faulted disk that you are going to replace.
The faulted disk should have a blinking LED. If the disk is not blinking, ensure that you are using correct node and confirm
disk location.
The target node should have blue node LED blinking in the front and back of the node.
The read cache SSD for the EX5000 models are located in Drive Slot 1. The graphics below show the location of the read
cache SSD slots.
Page 23 of 50
Figure 11. EX5000D SSD Slots
Page 24 of 50
Figure 12. EX5000S SSD Slots
2. Press the release button to open the drive carrier release handle.
3. Holding the handle, slide the hard drive out of the hard drive slot.
Page 25 of 50
4. Go to ECS UI Manage > Maintenance and verify that the disk serial number matches the number reported for the disk you
removed. If the disk serial number does not match, immediately reinsert the disk into its original slot.
Steps
1. Locate the hard drive to be removed and press the release button to open the drive carrier release handle.
2. Holding the handle, slide the drive carrier out of the drive slot.
Page 26 of 50
Figure 15. Removing an SSD carrier from expander module
Page 27 of 50
Figure 17. Installing the SSD into the drive carrier
CAUTION: When a replacement hot swappable drive is installed and the system is powered on, the drive
automatically begins to rebuilt. Ensure that the replacement drive is blank or contains data that you wish to
overwrite. Any data on the replacement drive is immediately lost after the drive is installed.
Steps
1. Press the release button on the front of the drive carrier to open the release handle.
2. Insert and slide the drive carrier into the drive slot.
3. Close the drive carrier release handle until it clicks in place.
Figure 18. Installing an SSD drive carrier into the expander module
Installing an SSD
Steps
1. Press the release button on the front of the hard drive to open the release handle.
Page 28 of 50
2. Insert the hard drive into the rear of the node into specified slot and slide until the hard drive connects with the backplane.
3. Close the hard drive release handle to lock the hard drive in place.
The read cache SSD for the EX5000 models are located in Drive Slot 1. The graphics below show the location of the read
cache SSD slots.
Page 29 of 50
Figure 20. EX5000D SSD Slots
Page 30 of 50
Figure 21. EX5000S SSD Slots
4. If there are any additional failed drives showing as Replace in the ECS UI Manage > Maintenance, perform the
replacement of the next disk drive. Step 3 of Following ECS UI for SSD status and replacement information provides
information.
Do not replace the next disk physically unless the corresponding Disk Status in the ECS UI shows Replace Disk and the
Description states: Replace the disk according to LED identity and Slot/Enclosure location. Ensure that you verify
the serial number of the disk that you remove from the system against the serial number that the UI displays.
Steps
In the ECS UI, go to Manage > Maintenance.
Page 31 of 50
The disk status should be Initializing and eventually turn to Healthy.
The disk may be blinking showing normal activity, but identification LED should no longer be blinking.
You can proceed with the next disk replacement while the previous disk is still Initializing. Ensure that you check on the previous
drives afterward.
Page 32 of 50
2
Replace Read Cache SSDs on ECS EX500
appliance
Topics:
• Replace the Failed Drive Overview
• Uninstalling an SSD
• Uninstalling the Drive from the Drive Carrier
• Installing a Drive into the Drive Carrier
• Installing an SSD
• Post—Disk Replacement Checks
Uninstalling an SSD
Steps
1. Locate the faulted disk that you are going to replace.
The faulted disk should have a blinking LED. If the disk is not blinking, ensure that you are using correct node and confirm
disk location.
The target node should have blue node LED blinking in the front and back of the node.
The graphic below shows the location of the SSD slots in the back of the EX500 server. The yellow-highlighted SSD slot is
the read cache SSD slot. The red-highlighted SSD slot is blank.
Page 33 of 50
Figure 23. EX500 LED indicators
2. Press the release button to open the drive carrier release handle.
3. Holding the handle, slide the hard drive out of the hard drive slot.
4. Go to ECS UI Manage > Maintenance and verify that the disk serial number matches the number reported for the disk you
removed. If the disk serial number does not match, immediately reinsert the disk into its original slot.
Page 34 of 50
Uninstalling the Drive from the Drive Carrier
Steps
1. Using a Phillips #1 screwdriver, uninstall the screws from the slide rails on the drive carrier.
2. Lift the drive out of the drive carrier.
Steps
1. Insert the replacement drive into the drive carrier with the connector end of the drive towards the back of the carrier.
2. Align the screw holes on the drive with the screws holes on the drive carrier.
3. Using a Phillips #1 screwdriver, replace the screws to secure the drive to the drive carrier.
Page 35 of 50
Figure 26. Installing a drive into the drive carrier
Installing an SSD
Steps
1. Press the release button on the front of the hard drive to open the release handle.
2. Insert the hard drive into the rear of the node into specified slot and slide until the hard drive connects with the backplane.
3. Close the hard drive release handle to lock the hard drive in place.
Page 36 of 50
Figure 27. Installing an SSD
The graphic below shows the location of the SSD slots in the back of the EX500 server. The yellow-highlighted SSD slot is
the read cache SSD slot. The red-highlighted SSD slot is blank.
4. If there are any additional failed drives showing as Replace in the ECS UI Manage > Maintenance, perform the
replacement of the next disk drive. Step 3 of Following ECS UI for SSD status and replacement information provides
information.
Do not replace the next disk physically unless the corresponding Disk Status in the ECS UI shows Replace Disk and the
Description states: Replace the disk according to LED identity and Slot/Enclosure location. Ensure that you verify
the serial number of the disk that you remove from the system against the serial number that the UI displays.
Steps
In the ECS UI, go to Manage > Maintenance.
The disk status should be Initializing and eventually turn to Healthy.
The disk may be blinking showing normal activity, but identification LED should no longer be blinking.
You can proceed with the next disk replacement while the previous disk is still Initializing. Ensure that you check on the previous
drives afterward.
Page 37 of 50
3
Replace Read Cache SSDs on ECS EX300
appliance
Topics:
• Replace the Failed Drives Overview
• Remove the SSD
• Remove the Drive from the Drive Carrier
• Install the Replacement Drive into the Drive Carrier
• Install the SSD
• Post—Disk Replacement Checks
Steps
1. Locate the faulted disk that you are going to replace.
The target node should have blue node LED blinking in the front and back of the node to help locate correct node.
The graphic below shows the location of the SSD slots in the back of the EX300 server. The yellow-highlighted slot is the
read cache SSD slot. The red-highlighted slot is blank.
Page 38 of 50
Figure 30. EX300 LED indicators
4. Go to ECS UI Manage > Maintenance and verify that the disk serial number matches the number reported for the disk that
you removed. If the disk serial number does not match, immediately reinsert the disk into its original slot.
5. If you are not replacing the SSD immediately, insert an SSD blank in the empty SSD slot to maintain proper system cooling.
Page 39 of 50
Remove the Drive from the Drive Carrier
Steps
1. Using a Phillips #1 screwdriver, remove the screws from the slide rails on the drive carrier.
2. Lift the drive out of the drive carrier.
Page 40 of 50
Figure 33. Install drive into drive carrier
Page 41 of 50
Figure 34. Installing an SSD
The graphic below shows the location of the SSD slots in the back of the EX300 server. The yellow-highlighted slot is the
read cache SSD slot. The red-highlighted slot is blank.
4. If there are any additional failed drives showing as Replace in the ECS UI Manage > Maintenance, perform the
replacement of the next disk drive. Step 3 of Following ECS UI for SSD status and replacement information provides
information.
Do not replace the next disk physically unless the corresponding Disk Status in the ECS UI shows Replace Disk and the
Description states: Replace the disk according to LED identity and Slot/Enclosure location. Ensure that you verify
the serial number of the disk that you remove from the system against the serial number that the UI displays.
Steps
In the ECS UI, go to Manage > Maintenance.
The disk status should be Initializing and eventually turn to Healthy.
The disk may be blinking showing normal activity, but identification LED should no longer be blinking.
You can proceed with the next disk replacement while the previous disk is still Initializing. Ensure that you check on the previous
drives afterward.
Page 42 of 50
4
Replace Read Cache SSDs on ECS EX3000
appliance
Topics:
• Replace the Failed Drives Overview
• Remove the SSD
• Install the SSD
• Post—Disk Replacement Checks
Page 43 of 50
The LEDs on SSDs do not light up to identify the disk for replacement. Ensure that you use the blinking blue node light and
visual identification for SSD read cache disks location from the figure above.
Ensure that you replace BOTTOM of the two drives in the node.
The top drives are operating system disks. The operating system disk may show activity, and look like they are blinking. Do
not touch the operating system disks. If you remove the operating system disk, the node becomes inaccessible.
2. Press the release button to open the SSD carrier release handle.
3. Slide the SSD carrier out until it is free of the SSD slot.
CAUTION: To maintain proper system cooling, all empty SSD slots must have SSD blanks installed.
4. Remove the screws from the slide rails on the SSD carrier.
5. Lift the SSD out of the SSD carrier.
Item Description
1 Release Button
2 3.5” SSD
3 SSD Carrier Handle
Page 44 of 50
Item Description
1 Screw (4)
2 3.5” SSD
3 SSD Carrier
6. Go to ECS UI Manage > Maintenance and verify that the disk serial number matches the number reported for the disk you
replaced. If the disk serial number does not match, immediately reinsert the disk into its original slot.
CAUTION: Use only SSDs that have been tested and approved for use with the SSD backplane.
Steps
1. Insert the SSD into the SSD carrier with the connector-end of the SSD toward the back.
2. Align the screw holes on the SSD with the set of screw holes on the SSD carrier.
When aligned correctly, the back of the SSD is flush with the back of the SSD carrier.
3. Attach the screws to secure the SSD to the SSD carrier.
4. If an SSD blank is installed in the SSD slot, remove it.
5. Insert the SSD carrier into the back of the node into specified slot until the carrier connects with the backplane.
CAUTION: Do not force or drop the SSD into the slot and backplane connectors. The backplane can be
permanently damaged. Slowly and carefully lower the drive into the slot until the cam lever engages. Ensure
that the cam lever on the carrier engages properly.
The graphic below shows the location of the SSD slots on the back of the EX3000 server. The red-highlighed boxes are
operating system drives. The yellow-highlighted boxes are SSD read cache drives.
Page 45 of 50
Figure 37. EX3000 SSD slot locations
Steps
In the ECS UI, go to Manage > Maintenance.
The disk status should be Initializing and eventually turn to Healthy.
The disk may be blinking showing normal activity, but identification LED should no longer be blinking.
You can proceed with the next disk replacement while the previous disk is still Initializing. Ensure that you check on the previous
drives afterward.
Page 46 of 50
5
Replace Read Cache SSDs on ECS Gen 2 U
Series appliance
Topics:
• Replace the Failed Drives Overview
• Removing a Failed SSD
• Installing an SSD
• Post—Disk Replacement Checks
Steps
1. Locate the faulted SSD.
The graphic below shows the location of the SSD slots in the front of the Gen2 server. The yellow-highlighted SSD slots are
the read cache SSD slots. The red-highlighted SSD slots are operating system disks.
Page 47 of 50
Figure 38. Gen2 SSD Slot Location
Installing an SSD
Prerequisites
Steps
1. With the SSD carrier latch fully open, align the module with the guides and gently lower the SSD into the front of the node
of the specified slot.
The graphic below shows the location of the SSD slots in the front of the Gen2 server. The yellow-highlighted SSD slots are
the read cache SSD slots. The red-highlighted SSD slots are operating system disks.
The latch begins to rotate downward when its tabs meet the enclosure.
2. Push the latch tab to engage the latch.
3. When the latch is engaged, push firmly on the module to verify that the SSD is properly seated.
The SSD Active light flashes to reflect the SSD activity.
4. If there are any additional failed drives showing as Replace in the ECS UI Manage > Maintenance, perform the
replacement of the next SSD drive on this node. Step 3 of Following ECS UI for SSD status and replacement information
provides information.
Page 48 of 50
Do not replace the next SSD physically unless the corresponding SSD Status in the ECS UI shows Replace Disk and the
Description states: Replace the disk according to LED identity and Slot/Enclosure location. Ensure that you verify
the serial number of the disk that you remove from the system against the serial number that the UI displays.
Steps
In the ECS UI, go to Manage > Maintenance.
The disk status should be Initializing and eventually turn to Healthy.
The disk may be blinking showing normal activity, but identification LED should no longer be blinking.
You can proceed with the next disk replacement while the previous disk is still Initializing. Ensure that you check on the previous
drives afterward.
Page 49 of 50
Dell Technologies Confidential Information version: 2.3.6.198
Page 50 of 50