Whamcloud - gitweb
LU-16995 kfilnd: Handle TAG_RX_OK in TN_STATE_FAIL 87/51787/2
authorChris Horn <chris.horn@hpe.com>
Thu, 13 Apr 2023 15:36:37 +0000 (09:36 -0600)
committerOleg Drokin <green@whamcloud.com>
Tue, 22 Aug 2023 06:42:04 +0000 (06:42 +0000)
commit338801448049e002821f5935b40019e6a6addd3f
tree4047eb1326c444c50d14f4f45d921b6c85643c92
parent9fdbc6ac9f2d4999dbec98a2e1f61194255247d8
LU-16995 kfilnd: Handle TAG_RX_OK in TN_STATE_FAIL

It is possible for the fabric to delay packets such that the retry
handler cancels the message but it is still delivered to the target.
If the timing is right then the initiator may receive a TAG_RX_OK
event after the transaction has transitioned to TN_STATE_FAIL. This
currently trips an LBUG, but instead we can allow the transaction to
complete normally.

Test-Parameters: trivial
HPE-bug-id: LUS-11572
Signed-off-by: Chris Horn <chris.horn@hpe.com>
Change-Id: I381d64713a7942fed09d41b30f64be602193057f
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/51787
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Ron Gredvig <ron.gredvig@hpe.com>
Reviewed-by: Ian Ziemba <ian.ziemba@hpe.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
lnet/klnds/kfilnd/kfilnd.h
lnet/klnds/kfilnd/kfilnd_tn.c
lustre/tests/sanity-lnet.sh