LU-11472 lnet: Decrement health on timeout 08/33308/4
author Amir Shehata <ashehata@whamcloud.com>
Thu, 4 Oct 2018 20:00:49 +0000 (13:00 -0700)
committer Oleg Drokin <green@whamcloud.com>
Fri, 2 Nov 2018 07:17:14 +0000 (07:17 +0000)
commit 139d69141b73d427490f39d3096b2187e979eaea
tree 1fa6db0b4cbb3e31b17fc7ce7f62b627fd634a34
parent ef6881b898a824a61556a761bafb313da5aad51d
LU-11472 lnet: Decrement health on timeout

When a response times out, we want to decrement the health of the
immediate next-hop peer NI so that we don't use that interface if
others are available.

When sending a message, if there is a response tracker associated
with the MD, store the next-hop NID there. If the response times
out, we can look up the peer_ni using the cached NID and
decrement its health value.
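
The flow described above can be illustrated with a minimal, standalone
sketch. This is not the patch itself: every type, function, and constant
below (the sketch_ prefix, the health step of 100, the maximum of 1000,
the flat peer table) is a hypothetical stand-in chosen for illustration,
and the real LNet structures and locking are omitted.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-ins; these are NOT the LNet definitions. */
typedef uint64_t sketch_nid_t;
#define SKETCH_NID_ANY     ((sketch_nid_t)~0ULL)
#define SKETCH_MAX_HEALTH  1000   /* assumed ceiling */
#define SKETCH_HEALTH_STEP 100    /* assumed decrement per timeout */

struct sketch_peer_ni {
	sketch_nid_t nid;
	int          health;
};

struct sketch_rsp_tracker {
	sketch_nid_t next_hop_nid;    /* cached on the send path */
};

struct sketch_md {
	struct sketch_rsp_tracker *rspt;  /* NULL when no response is tracked */
};

/* Send path: if the MD has a response tracker, cache the next-hop NID. */
static void sketch_note_next_hop(struct sketch_md *md, sketch_nid_t next_hop)
{
	if (md->rspt != NULL)
		md->rspt->next_hop_nid = next_hop;
}

/* Look up a peer NI by NID in a flat table; returns NULL if not found. */
static struct sketch_peer_ni *
sketch_find_peer_ni(struct sketch_peer_ni *table, size_t n, sketch_nid_t nid)
{
	for (size_t i = 0; i < n; i++)
		if (table[i].nid == nid)
			return &table[i];
	return NULL;
}

/* Timeout path: find the peer NI via the cached NID, lower its health. */
static void sketch_handle_rsp_timeout(const struct sketch_rsp_tracker *rspt,
				      struct sketch_peer_ni *table, size_t n)
{
	struct sketch_peer_ni *lpni;

	assert(rspt != NULL);
	if (rspt->next_hop_nid == SKETCH_NID_ANY)
		return;                   /* nothing was cached at send time */

	lpni = sketch_find_peer_ni(table, n, rspt->next_hop_nid);
	if (lpni == NULL)
		return;                   /* peer NI no longer known */

	lpni->health -= SKETCH_HEALTH_STEP;
	if (lpni->health < 0)
		lpni->health = 0;         /* clamp at zero */
}

/* Selection: prefer the peer NI with the highest health (first wins ties). */
static struct sketch_peer_ni *
sketch_pick_healthiest(struct sketch_peer_ni *table, size_t n)
{
	struct sketch_peer_ni *best = NULL;

	for (size_t i = 0; i < n; i++)
		if (best == NULL || table[i].health > best->health)
			best = &table[i];
	return best;
}
```

In this sketch, a timeout on a message sent through one next hop lowers
only that peer NI's health, so a later call to the selection helper
prefers any other interface that is still at a higher health value.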

Signed-off-by: Amir Shehata <ashehata@whamcloud.com>
Change-Id: I6c2f49a695f078ee50378c0a468c7ee058f7e712
Reviewed-on: https://review.whamcloud.com/33308
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Sonia Sharma <sharmaso@whamcloud.com>
Reviewed-by: Doug Oucharek <dougso@me.com>
Reviewed-by: Olaf Weber <olaf.weber@hpe.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
lnet/include/lnet/lib-lnet.h
lnet/include/lnet/lib-types.h
lnet/lnet/lib-move.c
lnet/lnet/lib-msg.c