Whamcloud - gitweb
LU-12344 lnet: handle remote health error 67/34967/2
authorAmir Shehata <ashehata@whamcloud.com>
Mon, 27 May 2019 17:43:10 +0000 (10:43 -0700)
committerAmir Shehata <ashehata@whamcloud.com>
Fri, 7 Jun 2019 18:04:09 +0000 (18:04 +0000)
commitb45e3d96fc4d82ebf5b1bb3ef0b5a59e8ff86e75
tree6595fa9c968b91369bacc45e0a25ef99440ac8f4
parent1065c8888e96fef9e98676bd3a71b46f7910b085
LU-12344 lnet: handle remote health error

When a peer is dead set the health status to REMOTE_DROPPED
in order to handle health properly for the peer.
When dropping a routed message set REMOTE_ERROR. Routed messages
are dropped when the routing feature is turned off which could
be considered a configuration error if it happens in the middle
of traffic. Therefore, it's better to flag this issue at this
point without resending the message.

Signed-off-by: Amir Shehata <ashehata@whamcloud.com>
Change-Id: I131263215a68fc8607582643a47007ce4d04abbc
Reviewed-on: https://review.whamcloud.com/34967
Reviewed-by: Olaf Weber <olaf.weber@hpe.com>
Tested-by: Jenkins
Reviewed-by: Chris Horn <hornc@cray.com>
Tested-by: Maloo <maloo@whamcloud.com>
lnet/lnet/lib-move.c