LU-14654 lnet: Correct peer NI recovery age out calculation 01/43501/3
author Chris Horn <chris.horn@hpe.com>
Thu, 29 Apr 2021 18:14:34 +0000 (13:14 -0500)
committer Oleg Drokin <green@whamcloud.com>
Thu, 8 Jul 2021 02:05:25 +0000 (02:05 +0000)
The calculation to age a peer NI out of recovery is only valid if
lnet_recovery_limit is non-zero. When set to zero, we allow peer NIs
to be in recovery indefinitely.

Test-Parameters: trivial
HPE-bug-id: LUS-9953
Fixes: cc27201a76 ("LU-13569 lnet: Age peer NI out of recovery")
Signed-off-by: Chris Horn <chris.horn@hpe.com>
Change-Id: I6bb40ca3a9affa0eaaae9deb1cecdb03e4bb42c5
Reviewed-on: https://review.whamcloud.com/43501
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Alexander Boyko <alexander.boyko@hpe.com>
Reviewed-by: Cyril Bordage <cbordage@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
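
The effect of the fix can be sketched in isolation. With the old check, a limit of zero reduces the comparison to `now > last_alive`, so a peer NI is aged out as soon as any time has passed since it was last alive, which is the opposite of "indefinitely". The standalone helper below is only an illustration: `sketch_time_t`, `sketch_lpni_aged_out`, and the `unsigned int` type of the limit are assumptions made for this example, not names or types taken from the LNet source.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-in for the kernel's seconds-based timestamp type. */
typedef int64_t sketch_time_t;

/*
 * Illustrative predicate (not the LNet function itself): returns true
 * when a peer NI should be aged out of recovery.
 *
 * A recovery_limit of 0 is treated as "no limit", so the age
 * comparison is skipped and the NI is never aged out.
 */
static bool
sketch_lpni_aged_out(sketch_time_t now, sketch_time_t last_alive,
		     unsigned int recovery_limit)
{
	return recovery_limit != 0 &&
	       now > last_alive + (sketch_time_t)recovery_limit;
}
```

For example, with `last_alive = 100` and a limit of 60, the helper reports an aged-out NI at `now = 1000` but not at `now = 160`, since the comparison is strict. With a limit of 0 it returns false for any `now`.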
lnet/lnet/peer.c

index 709fe24..9087d99 100644
@@ -4065,7 +4065,8 @@ lnet_peer_ni_add_to_recoveryq_locked(struct lnet_peer_ni *lpni,
                return;
        }
 
-       if (now > lpni->lpni_last_alive + lnet_recovery_limit) {
+       if (lnet_recovery_limit &&
+           now > lpni->lpni_last_alive + lnet_recovery_limit) {
                CDEBUG(D_NET, "lpni %s aged out last alive %lld\n",
                       libcfs_nid2str(lpni->lpni_nid),
                       lpni->lpni_last_alive);