From: Chris Horn Date: Fri, 11 Sep 2020 18:42:42 +0000 (-0500) Subject: LU-13571 lnd: Use NETWORK_TIMEOUT for txs on ibp_tx_queue X-Git-Tag: 2.13.57~32 X-Git-Url: https://git.whamcloud.com/?p=fs%2Flustre-release.git;a=commitdiff_plain;h=7af63191370fd2337d0bc9045d211b918c61fdd1 LU-13571 lnd: Use NETWORK_TIMEOUT for txs on ibp_tx_queue TXs on the ibp_tx_queue are waiting for a connection to be established. Failure to establish a connection could be due to a problem with either the local NI or the remote NI, and o2iblnd cannot currently distinguish between these failures. As such, it should return LNET_MSG_STATUS_NETWORK_TIMEOUT to LNet so that LNet will decrement the health value of both the local NI and the remote NI and future sends can take these health values into account. Test-Parameters: trivial HPE-bug-id: LUS-9342 Signed-off-by: Chris Horn Change-Id: Idbbbe95483d25ec48b83e33a00685f72fa5292e6 Reviewed-on: https://review.whamcloud.com/39899 Reviewed-by: Amir Shehata Reviewed-by: Serguei Smirnov Tested-by: jenkins Tested-by: Maloo Reviewed-by: Oleg Drokin --- diff --git a/lnet/klnds/o2iblnd/o2iblnd_cb.c b/lnet/klnds/o2iblnd/o2iblnd_cb.c index 7e078b2..70ec9e8 100644 --- a/lnet/klnds/o2iblnd/o2iblnd_cb.c +++ b/lnet/klnds/o2iblnd/o2iblnd_cb.c @@ -3374,7 +3374,7 @@ kiblnd_check_conns (int idx) if (!list_empty(&timedout_txs)) kiblnd_txlist_done(&timedout_txs, -ETIMEDOUT, - LNET_MSG_STATUS_LOCAL_TIMEOUT); + LNET_MSG_STATUS_NETWORK_TIMEOUT); /* Handle timeout by closing the whole * connection. We can only be sure RDMA activity