LU-17809 osp: make disconnect asynchronous 95/54995/7
author Alexander Boyko <alexander.boyko@hpe.com>
Sat, 20 Apr 2024 22:02:54 +0000 (18:02 -0400)
committer Oleg Drokin <green@whamcloud.com>
Wed, 19 Jun 2024 00:46:17 +0000 (00:46 +0000)
commit ffedcbae21f7aefe5c2258a94b36fe286f46182c
tree 96e93bef97ac8faf66c3339b9a06222299eaf6ba
parent 9c8f9b80f9d2f479221a38841dab6a2423edcbf3
LU-17809 osp: make disconnect asynchronous

An MDT can have many OSP devices. During umount, their disconnect
requests can time out one after another (cascading timeouts). This
can lead to unpredictably long umount times.

This patch adds the ability to disconnect OSP devices in parallel.
During LCFG_PRECLEANUP, osp_disconnect() sends the disconnect
requests, and osp_shutdown() then waits for them to complete. The
cascading timeouts are thereby reduced to a single request wait.
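
The difference between the two schemes can be illustrated with a toy
model. This is only a sketch, not Lustre code: the function names and
the 30-second timeout are hypothetical, and it assumes that every
disconnect request is sent at the same moment and runs fully in
parallel.

```python
def sequential_umount_wait(disconnect_delays):
    """Old scheme (as modelled here): each device sends its disconnect
    and waits for it before the next device starts, so the waits add up."""
    elapsed = 0
    for delay in disconnect_delays:
        elapsed += delay  # wait for this request before moving on
    return elapsed


def async_umount_wait(disconnect_delays):
    """New scheme (as modelled here): phase 1 sends every disconnect
    without waiting, phase 2 waits for all of them. All requests are
    assumed to start at time 0, so the total wait is the slowest one."""
    # Phase 1 (precleanup): record when each request would complete.
    completion_times = [delay for delay in disconnect_delays]
    # Phase 2 (shutdown): wait until the last request has completed.
    return max(completion_times, default=0)


# Hypothetical example: four OSP devices, each disconnect timing out
# after 30 seconds.
delays = [30, 30, 30, 30]
print(sequential_umount_wait(delays))  # 120
print(async_umount_wait(delays))       # 30
```

In this model the sequential wait grows with the number of devices,
while the asynchronous wait stays bounded by the slowest request.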

Don't drop the obd_force flag from upper layers.

Add replay-single test 201, which simulates delayed OSP disconnects.
Such delays add up to a high cumulative umount time when the
disconnects are waited on one after another.

HPE-bug-id: LUS-12251
Signed-off-by: Alexander Boyko <alexander.boyko@hpe.com>
Change-Id: Id788b22c494147bdc7f0d36968629e7b7f660e01
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/54995
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Alexey Lyashkov <alexey.lyashkov@hpe.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Alex Zhuravlev <bzzz@whamcloud.com>
lustre/include/lustre_net.h
lustre/osp/osp_dev.c
lustre/osp/osp_internal.h
lustre/ptlrpc/import.c
lustre/tests/replay-single.sh