LU-13569 lnet: Recover local NI w/exponential backoff interval
author Chris Horn <chris.horn@hpe.com>
Fri, 21 Aug 2020 20:16:43 +0000 (15:16 -0500)
committer Andreas Dilger <adilger@whamcloud.com>
Sat, 23 Mar 2024 20:31:04 +0000 (20:31 +0000)
commit 2e6ae1f46dc6aed4d67895d9e68bb36351f8ccf5
tree de8dea3c3f83bdf40321297d1ce10a1c409e76b7
parent c82bd3b3d13b7618a3185a6c76127b4de92fc02d

Use an exponential backoff algorithm to determine the interval at
which unhealthy local NIs are pinged.
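
As a rough illustration of the backoff idea, the interval can double with
each recovery ping sent and then clamp at a ceiling. This is only a sketch
under stated assumptions, not the code from this change: the names
sketch_recovery_interval() and sketch_next_recovery_ping(), the 900-second
cap, and the use of seconds as the unit are all hypothetical.

```c
#include <assert.h>
#include <stdint.h>

/* Assumed ceiling for the backoff interval, in seconds (illustrative). */
#define SKETCH_RECOVERY_INTERVAL_MAX 900u

/*
 * Hypothetical helper: seconds to wait before the next recovery ping,
 * given how many recovery pings have already been sent to this NI.
 */
static unsigned int sketch_recovery_interval(unsigned int ping_count)
{
	/* 2^9 = 512 is the last power of two below the cap; clamp beyond it. */
	if (ping_count > 9)
		return SKETCH_RECOVERY_INTERVAL_MAX;

	return 1u << ping_count;
}

/* Hypothetical helper: absolute time (in seconds) of the next ping. */
static int64_t sketch_next_recovery_ping(unsigned int ping_count, int64_t now)
{
	assert(now >= 0);
	return now + (int64_t)sketch_recovery_interval(ping_count);
}
```

With these assumptions, an NI that keeps failing is pinged after 1, 2, 4,
8, ... seconds, and never less often than once per cap interval.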

Introduce lnet_ni_add_to_recoveryq_locked() which handles checking
pre-conditions for whether the NI should be added to the recovery
queue, and takes a ref on the NI as appropriate.
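
The shape of such a helper might look like the sketch below: check the
pre-conditions first, and only take the reference once the NI is really
going onto the queue. Everything here is an assumption for illustration.
The struct layouts, the sketch_ prefixed names, the specific pre-conditions
(not already queued, not fully healthy, still active), and the
SKETCH_MAX_HEALTH value are hypothetical and are not taken from the LNet
sources.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Assumed "fully healthy" value (illustrative). */
#define SKETCH_MAX_HEALTH 1000

/* Hypothetical, simplified stand-in for a local NI. */
struct sketch_ni {
	int health;             /* current health value */
	bool active;            /* NI is up and usable */
	bool on_recoveryq;      /* already queued for recovery */
	int refcount;           /* references held on this NI */
	struct sketch_ni *next; /* recovery queue linkage */
};

/* Hypothetical recovery queue: a singly linked list of NIs. */
struct sketch_recoveryq {
	struct sketch_ni *head;
};

/*
 * Queue the NI for recovery if the assumed pre-conditions hold.
 * The caller is assumed to hold the lock protecting the queue.
 * Returns true if the NI was queued and a reference was taken.
 */
static bool sketch_ni_add_to_recoveryq_locked(struct sketch_ni *ni,
					      struct sketch_recoveryq *q)
{
	assert(ni != NULL && q != NULL);

	if (ni->on_recoveryq)                /* already queued */
		return false;
	if (ni->health >= SKETCH_MAX_HEALTH) /* nothing to recover */
		return false;
	if (!ni->active)                     /* NI is not in use */
		return false;

	ni->refcount++;                      /* the queue holds its own ref */
	ni->on_recoveryq = true;
	ni->next = q->head;
	q->head = ni;
	return true;
}
```

Keeping the checks and the reference in one function means each caller
cannot forget a pre-condition or leak a reference on an early return.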

Lustre-change: https://review.whamcloud.com/39721
Lustre-commit: 8fdf2bc62ac9c418bd8e326112da9a2835f09ccb

Test-Parameters: trivial
HPE-bug-id: LUS-9109
Signed-off-by: Chris Horn <chris.horn@hpe.com>
Change-Id: Idb3789366b2e450837989f9a12eb2d598f80081c
Reviewed-on: https://review.whamcloud.com/c/ex/lustre-release/+/54403
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Frank Sehr <fsehr@whamcloud.com>
Reviewed-by: Cyril Bordage <cbordage@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
lnet/include/lnet/lib-lnet.h
lnet/include/lnet/lib-types.h
lnet/lnet/lib-move.c
lnet/lnet/lib-msg.c