Whamcloud - gitweb
LU-12344 lnet: handle remote health error 30/36030/1
authorAmir Shehata <ashehata@whamcloud.com>
Mon, 27 May 2019 17:43:10 +0000 (10:43 -0700)
committerMinh Diep <mdiep@whamcloud.com>
Tue, 3 Sep 2019 14:12:36 +0000 (07:12 -0700)
commit415b9c8ca07a9d6b85aeb909c207b85e021c673a
treedb8ec489da23b5aa91859cb3a2f39426d3394b5a
parentacdd3b9bed4068b993ac20daaf724bfc4979e9d7
LU-12344 lnet: handle remote health error

When a peer is dead set the health status to REMOTE_DROPPED
in order to handle health properly for the peer.
When dropping a routed message set REMOTE_ERROR. Routed messages
are dropped when the routing feature is turned off which could
be considered a configuration error if it happens in the middle
of traffic. Therefore, it's better to flag this issue at this
point without resending the message.

Lustre-change: https://review.whamcloud.com/34967
Lustre-commit: b45e3d96fc4d82ebf5b1bb3ef0b5a59e8ff86e75

Signed-off-by: Amir Shehata <ashehata@whamcloud.com>
Change-Id: I131263215a68fc8607582643a47007ce4d04abbc
Reviewed-by: Olaf Weber <olaf.weber@hpe.com>
Reviewed-by: Chris Horn <hornc@cray.com>
Signed-off-by: Minh Diep <mdiep@whamcloud.com>
lnet/lnet/lib-move.c