Whamcloud - gitweb
LU-13477 lnet: Force full discovery cycle 22/38322/5
authorAmir Shehata <ashehata@whamcloud.com>
Thu, 23 Apr 2020 00:47:05 +0000 (17:47 -0700)
committerOleg Drokin <green@whamcloud.com>
Wed, 27 May 2020 05:05:00 +0000 (05:05 +0000)
commit9bafd530d785896f4eff93f7ac2342a019bdbd02
treed0941771e09cc460142e7f09f9f9e5ee880bb344
parentadae4295b62b1074f5c3c45543c586282394b1be
LU-13477 lnet: Force full discovery cycle

There are scenarios where there could be a discrepancy between
cached peer information and reality. In these cases what could
end-up happening is incomplete interface information might be
cached because one side determined that the peer didn't require
a PUSH. This will lead to undesired MR behavior, where not all
the interfaces are used for a period of time.

Therefore, it is safer to always force a full discovery cycle:
GET/PUSH to ensure both sides are up-to-date.

In the NMR case, when discovery is turned off, make sure to flag
discovery as complete to avoid stalling the state machine.

Signed-off-by: Amir Shehata <ashehata@whamcloud.com>
Change-Id: Ie49ad11e8ff874206baa268a4ef2d58ebb536ed5
Reviewed-on: https://review.whamcloud.com/38322
Tested-by: jenkins <devops@whamcloud.com>
Reviewed-by: Chris Horn <chris.horn@hpe.com>
Reviewed-by: Serguei Smirnov <ssmirnov@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
lnet/lnet/peer.c
lnet/lnet/router.c