LU-12344 lnet: handle remote health error 30/36030/6
author Amir Shehata <ashehata@whamcloud.com>
Mon, 27 May 2019 17:43:10 +0000 (10:43 -0700)
committer Oleg Drokin <green@whamcloud.com>
Tue, 8 Oct 2019 13:25:29 +0000 (13:25 +0000)
commit b1eee019fa12e16c8abf6ae14bea45d9adbbad3d
tree 25f8b8408de89ee6592e4774e952d913abfbc090
parent da9998767a9093c088d28119179ee591f42910dc
LU-12344 lnet: handle remote health error

When a peer is dead, set the health status to REMOTE_DROPPED
so that health is handled properly for the peer.
When dropping a routed message, set REMOTE_ERROR. Routed messages
are dropped when the routing feature is turned off, which can
be considered a configuration error if it happens in the middle
of traffic. Therefore, it is better to flag the issue at that
point without resending the message.
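
The two cases described above can be sketched roughly as follows. This
is an illustrative sketch only, not the code from lib-move.c: the enum,
the struct, and the function classify_drop() are hypothetical stand-ins
invented for this example, and only the status names REMOTE_DROPPED and
REMOTE_ERROR come from the description.

```c
#include <stdbool.h>

/* Hypothetical stand-ins for illustration; not the LNet definitions. */
enum health_status {
	HEALTH_OK = 0,
	HEALTH_REMOTE_ERROR,	/* routed message dropped; flagged, not resent */
	HEALTH_REMOTE_DROPPED,	/* destination peer is dead */
};

struct message {
	bool peer_alive;	/* is the destination peer considered alive? */
	bool is_routed;		/* does this message need to be forwarded? */
	enum health_status health;
};

/*
 * Decide whether a message must be dropped and record why.
 * Returns 0 if the message may proceed, -1 if it is dropped.
 */
static int classify_drop(struct message *msg, bool routing_enabled)
{
	if (!msg->peer_alive) {
		/* Dead peer: record a remote drop so peer health is updated. */
		msg->health = HEALTH_REMOTE_DROPPED;
		return -1;
	}

	if (msg->is_routed && !routing_enabled) {
		/* Routing was turned off mid-traffic: treat as a
		 * configuration error and flag it rather than resend. */
		msg->health = HEALTH_REMOTE_ERROR;
		return -1;
	}

	msg->health = HEALTH_OK;
	return 0;
}
```

In this sketch the caller inspects msg->health after a drop; how the
real health code reacts to each status is not shown here.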

Lustre-change: https://review.whamcloud.com/34967
Lustre-commit: b45e3d96fc4d82ebf5b1bb3ef0b5a59e8ff86e75

Signed-off-by: Amir Shehata <ashehata@whamcloud.com>
Change-Id: I131263215a68fc8607582643a47007ce4d04abbc
Reviewed-by: Olaf Weber <olaf.weber@hpe.com>
Reviewed-by: Chris Horn <hornc@cray.com>
Signed-off-by: Minh Diep <mdiep@whamcloud.com>
Reviewed-on: https://review.whamcloud.com/36030
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
lnet/lnet/lib-move.c