Whamcloud - gitweb
LU-13569 lnet: Recover local NI w/exponential backoff interval 21/39721/15
authorChris Horn <chris.horn@hpe.com>
Fri, 21 Aug 2020 20:16:43 +0000 (15:16 -0500)
committerOleg Drokin <green@whamcloud.com>
Wed, 28 Apr 2021 02:10:34 +0000 (02:10 +0000)
commit8fdf2bc62ac9c418bd8e326112da9a2835f09ccb
tree65e16960ee942b69ee87404001d7790bb9e57551
parent3e0d994fbf4c4f2e5c51b2be5669ad97aa02f840
LU-13569 lnet: Recover local NI w/exponential backoff interval

Use an exponential backoff algorithm to determine the interval at
which unhealthy local NIs are ping'd

Introduce lnet_ni_add_to_recoveryq_locked() which handles checking
pre-conditions for whether the NI should be added to the recovery
queue, and takes a ref on the NI as appropriate.

Test-Parameters: trivial
HPE-bug-id: LUS-9109
Signed-off-by: Chris Horn <chris.horn@hpe.com>
Change-Id: Idb3789366b2e450837989f9a12eb2d598f80081c
Reviewed-on: https://review.whamcloud.com/39721
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Alexander Boyko <alexander.boyko@hpe.com>
Reviewed-by: James Simmons <jsimmons@infradead.org>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
lnet/include/lnet/lib-lnet.h
lnet/include/lnet/lib-types.h
lnet/lnet/lib-move.c
lnet/lnet/lib-msg.c