From: Linda Bebernes Date: Wed, 9 Oct 2013 17:48:21 +0000 (-0700) Subject: LUDOC-152 Bug: Added missing links to chapter sections in Chapter 14. X-Git-Tag: 2.5.0~13 X-Git-Url: https://git.whamcloud.com/?a=commitdiff_plain;h=refs%2Fchanges%2F06%2F7906%2F3;p=doc%2Fmanual.git LUDOC-152 Bug: Added missing links to chapter sections in Chapter 14. Added links to TOC section at head of chapter for Sections 14.8.3 - 14.8.6. This also fixed missing entries in TOC in PDF version of manual. Signed-off-by: Linda Bebernes Change-Id: Ic7e0c1e76ca8500d3b757bedc8bbddc097fa341f Reviewed-on: http://review.whamcloud.com/7906 Tested-by: Hudson Reviewed-by: Richard Henwood --- diff --git a/LustreMaintenance.xml b/LustreMaintenance.xml index fc43567..39ce921 100644 --- a/LustreMaintenance.xml +++ b/LustreMaintenance.xml @@ -31,7 +31,19 @@ - + \ + + + + + + + + + + + + @@ -340,137 +352,169 @@ client$ lfs getstripe -M /mnt/lustre/local_dir0 maintenance maintenanceinactive MDTsWorking with Inactive MDTs Files located on or below an inactive MDT are inaccessible until the MDT is activated again. Clients accessing an inactive MDT will receive an EIO error. -
- <indexterm><primary>maintenance</primary><secondary>removing a OST</secondary></indexterm> - Removing an OST from the File System - When removing an OST, remember that the MDT does not communicate directly with OSTs. Rather, each OST has a corresponding OSC which communicates with the MDT. It is necessary to determine the device number of the OSC that corresponds to the OST. Then, you use this device number to deactivate the OSC on the MDT. - To remove an OST from the file system: - - - For the OST to be removed, determine the device number of the corresponding OSC on the MDT. - - - List all OSCs on the node, along with their device numbers. Run: - lctl dl | grep osc - For example: lctl dl | grep - 11 UP osc lustre-OST-0000-osc-cac94211 4ea5b30f-6a8e-55a0-7519-2f20318ebdb4 5 +
+ <indexterm> + <primary>maintenance</primary> + <secondary>removing a OST</secondary> + </indexterm> Removing an OST from the File System + When removing an OST, remember that the MDT does not communicate directly with OSTs. + Rather, each OST has a corresponding OSC which communicates with the MDT. It is necessary to + determine the device number of the OSC that corresponds to the OST. Then, you use this + device number to deactivate the OSC on the MDT. + To remove an OST from the file system: + + + For the OST to be removed, determine the device number of the corresponding OSC on + the MDT. + + + List all OSCs on the node, along with their device numbers. Run: + lctl dl | grep osc + For example: lctl dl | grep + 11 UP osc lustre-OST-0000-osc-cac94211 4ea5b30f-6a8e-55a0-7519-2f20318ebdb4 5 12 UP osc lustre-OST-0001-osc-cac94211 4ea5b30f-6a8e-55a0-7519-2f20318ebdb4 5 13 IN osc lustre-OST-0000-osc lustre-MDT0000-mdtlov_UUID 5 14 UP osc lustre-OST-0001-osc lustre-MDT0000-mdtlov_UUID 5 - - - Determine the device number of the OSC that corresponds to the OST to be removed. - - - - - Temporarily deactivate the OSC on the MDT. On the MDT, run: - mds# lctl --device lustre_devno deactivate - For example, based on the command output in Step 1, to deactivate device 13 (the MDT’s OSC for OST-0000), the command would be: - - mds# lctl --device 13 deactivate - This marks the OST as inactive on the MDS, so no new objects are assigned to the OST. This does not prevent use of existing objects for reads or writes. - - - Do not deactivate the OST on the clients. Do so causes errors (EIOs), and the copy out to fail. - - - Do not use lctl conf_param to deactivate the OST. It permanently sets a parameter in the file system configuration. - - - - Discover all files that have objects residing on the deactivated OST. - Depending on whether the deactivated OST is available or not, the data from that OST may be migrated to other OSTs, or may need to be restored from backup. - - - - If the OST is still online and available, find all files with objects on the deactivated OST, and copy them to other OSTs in the file system to: - - client# lfs find --obd ost_name /mount/point | lfs_migrate -y - - - If the OST is no longer available, delete the files on that OST and restore them from backup: - client# lfs find --obd ost_uuid -print0 /mount/point | \ + + + Determine the device number of the OSC that corresponds to the OST to be + removed. + + + + + Temporarily deactivate the OSC on the MDT. On the MDT, run: + mds# lctl --device lustre_devno deactivate + For example, based on the command output in Step 1, to deactivate device 13 (the + MDT’s OSC for OST-0000), the command would be: + mds# lctl --device 13 deactivate + This marks the OST as inactive on the MDS, so no new objects are assigned to the + OST. This does not prevent use of existing objects for reads or writes. + + Do not deactivate the OST on the clients. Do so causes errors (EIOs), and the copy + out to fail. + + + Do not use lctl conf_param to deactivate the OST. It + permanently sets a parameter in the file system configuration. + + + + Discover all files that have objects residing on the deactivated OST. + Depending on whether the deactivated OST is available or not, the data from that OST + may be migrated to other OSTs, or may need to be restored from backup. + + + If the OST is still online and available, find all files with objects on the + deactivated OST, and copy them to other OSTs in the file system to: + client# lfs find --obd ost_name /mount/point | lfs_migrate -y + + + If the OST is no longer available, delete the files on that OST and restore them + from backup: + client# lfs find --obd ost_uuid -print0 /mount/point | \ tee /tmp/files_to_restore | xargs -0 -n 1 unlink - The list of files that need to be restored from backup is stored in /tmp/files_to_restore. Restoring these files is beyond the scope of this document. - - - - - Deactivate the OST. - - - To temporarily disable the deactivated OST, enter: [client]# lctl set_param osc.fsname-OSTnumber-*.active=0If there is expected to be a replacement OST in some short time (a few days), the OST can temporarily be deactivated on the clients: - This setting is only temporary and will be reset if the clients or MDS are rebooted. It needs to be run on all clients. - - - - If there is not expected to be a replacement for this OST in the near future, permanently deactivate the OST on all clients and the MDS: [mgs]# lctl conf_param ost_name.osc.active=0 - - A removed OST still appears in the file system; do not create a new OST with the same name. - - - -
-
- <indexterm><primary>maintenance</primary><secondary>backing up OST config</secondary></indexterm> -<indexterm><primary>backup</primary><secondary>OST config</secondary></indexterm> -Backing Up OST Configuration Files - If the OST device is still accessible, then the Lustre configuration files on the OST should be backed up and saved for future use in order to avoid difficulties when a replacement OST is returned to service. These files rarely change, so they can and should be backed up while the OST is functional and accessible. If the deactivated OST is still available to mount (i.e. has not permanently failed or is unmountable due to severe corruption), an effort should be made to preserve these files. - - - Mount the OST filesystem. - oss# mkdir -p /mnt/ost + The list of files that need to be restored from backup is stored in + /tmp/files_to_restore. Restoring these files is beyond the + scope of this document. + + + + + Deactivate the OST. + + + To temporarily disable the deactivated OST, enter: + [client]# lctl set_param osc.fsname-OSTnumber-*.active=0If + there is expected to be a replacement OST in some short time (a few days), the OST + can temporarily be deactivated on the clients: + This setting is only temporary and will be reset if the clients or MDS are + rebooted. It needs to be run on all clients. + + + + If there is not expected to be a replacement for this OST in the near future, + permanently deactivate the OST on all clients and the MDS: + [mgs]# lctl conf_param ost_name.osc.active=0 + + A removed OST still appears in the file system; do not create a new OST with the + same name. + + + +
+
+ <indexterm> + <primary>maintenance</primary> + <secondary>backing up OST config</secondary> + </indexterm> + <indexterm> + <primary>backup</primary> + <secondary>OST config</secondary> + </indexterm> Backing Up OST Configuration Files + If the OST device is still accessible, then the Lustre configuration files on the OST + should be backed up and saved for future use in order to avoid difficulties when a + replacement OST is returned to service. These files rarely change, so they can and should be + backed up while the OST is functional and accessible. If the deactivated OST is still + available to mount (i.e. has not permanently failed or is unmountable due to severe + corruption), an effort should be made to preserve these files. + + + Mount the OST filesystem. + oss# mkdir -p /mnt/ost [oss]# mount -t ldiskfs /dev/ost_device /mnt/ost - - - Back up the OST configuration files. - oss# tar cvf ost_name.tar -C /mnt/ost last_rcvd \ + + + Back up the OST configuration files. + oss# tar cvf ost_name.tar -C /mnt/ost last_rcvd \ CONFIGS/ O/0/LAST_ID - - - - - Unmount the OST filesystem. - oss# umount /mnt/ost - - - -
-
- <indexterm><primary>maintenance</primary><secondary>restoring OST config</secondary></indexterm> -<indexterm><primary>backup</primary><secondary>restoring OST config</secondary></indexterm> -Restoring OST Configuration Files - If the original OST is still available, it is best to follow the OST backup and restore procedure given in either , or and . - - To replace an OST that was removed from service due to corruption or hardware failure, the file system needs to be formatted for Lustre, and the Lustre configuration should be restored, if available. - - - If the OST configuration files were not backed up, due to the OST file system being completely inaccessible, it is still possible to replace the failed OST with a new one at the same OST index. - - - - Format the OST file system. + + + + Unmount the OST filesystem. oss# umount /mnt/ost + + + +
+
+ <indexterm> + <primary>maintenance</primary> + <secondary>restoring OST config</secondary> + </indexterm> + <indexterm> + <primary>backup</primary> + <secondary>restoring OST config</secondary> + </indexterm> Restoring OST Configuration Files + If the original OST is still available, it is best to follow the OST backup and restore + procedure given in either , or and . + To replace an OST that was removed from service due to corruption or hardware failure, + the file system needs to be formatted for Lustre, and the Lustre configuration should be + restored, if available. + If the OST configuration files were not backed up, due to the OST file system being + completely inaccessible, it is still possible to replace the failed OST with a new one at + the same OST index. + + + Format the OST file system. oss# mkfs.lustre --ost --index=old_ost_index other_options \ /dev/new_ost_dev - - - - - Mount the OST filesystem. + + + + Mount the OST filesystem. oss# mkdir /mnt/ost oss# mount -t ldiskfs /dev/new_ost_dev /mnt/ost - - - - Restore the OST configuration files, if available. + + + + Restore the OST configuration files, if available. oss# tar xvf ost_name.tar -C /mnt/ost - - - Recreate the OST configuration files, if unavailable. - - Follow the procedure in to recreate the + + + Recreate the OST configuration files, if unavailable. + Follow the procedure in to recreate the LAST_ID file for this OST index. The last_rcvd file will be recreated when the OST is first mounted using the default parameters, which are normally correct for all file systems. The CONFIGS/mountdata file is created by @@ -480,21 +524,25 @@ oss# mount -t ldiskfs /dev/new_ost_dev / oss1# debugfs -c -R "dump CONFIGS/mountdata /tmp/ldd" /dev/other_osdev oss1# scp /tmp/ldd oss0:/tmp/ldd oss0# dd if=/tmp/ldd of=/mnt/ost/CONFIGS/mountdata bs=4 count=1 seek=5 skip=5 conv=notrunc - - - - Unmount the OST filesystem. - oss# umount /mnt/ost - - - -
-
- <indexterm><primary>maintenance</primary><secondary>reintroducing an OSTs</secondary></indexterm> -Returning a Deactivated OST to Service - If the OST was permanently deactivated, it needs to be reactivated in the MGS configuration. mgs# lctl conf_param ost_name.osc.active=1 If the OST was temporarily deactivated, it needs to be reactivated on the MDS and clients. mds# lctl --device lustre_devno activate + + + Unmount the OST filesystem. oss# umount /mnt/ost + + + +
+
+ <indexterm> + <primary>maintenance</primary> + <secondary>reintroducing an OSTs</secondary> + </indexterm> Returning a Deactivated OST to Service + If the OST was permanently deactivated, it needs to be reactivated in the MGS + configuration. + mgs# lctl conf_param ost_name.osc.active=1 If + the OST was temporarily deactivated, it needs to be reactivated on the MDS and clients. + mds# lctl --device lustre_devno activate client# lctl set_param osc.fsname-OSTnumber-*.active=0 -
+
<indexterm><primary>maintenance</primary><secondary>aborting recovery</secondary></indexterm>