Whamcloud - gitweb
LU-17325 o2iblnd: CM_EVENT_UNREACHABLE on established conn 98/53298/2
author Serguei Smirnov <ssmirnov@whamcloud.com>
Thu, 30 Nov 2023 18:55:11 +0000 (10:55 -0800)
committer Oleg Drokin <green@whamcloud.com>
Wed, 20 Dec 2023 01:57:51 +0000 (01:57 +0000)
commit f7051f0092b19416ed86d7f4bbfe1cba7bb74c02
tree fca8c165d839f8d7cbd8896d3a65e59bb7f7b94c
parent 0ccf1311382059d22cf4788136939647fba1317a
LU-17325 o2iblnd: CM_EVENT_UNREACHABLE on established conn

Field reports from RoCE setups show that CM_EVENT_UNREACHABLE may be
received when the connection is already in the ESTABLISHED state.
This causes an assert in kiblnd_cm_callback to fail.

Handle this more gracefully: report the event as unexpected and
allow the flow to continue. If there are indeed issues on the
connection, it is expected to report transaction errors later and
be cleaned up without crashing the whole system.
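
The idea can be illustrated with a minimal, self-contained sketch. This
is not the code from o2iblnd_cb.c: the types, the enum values and the
function handle_cm_event below are hypothetical stand-ins for a
simplified model, and the return convention (0 to leave the connection
alone, -1 to abandon a connection attempt) is an assumption made for
the example.

```c
#include <stdio.h>

/* Hypothetical, simplified connection states (not the real o2iblnd ones). */
enum conn_state { CONN_CONNECTING, CONN_ESTABLISHED };

/* Hypothetical, simplified connection-manager events. */
enum cm_event { EV_UNREACHABLE, EV_ESTABLISHED };

struct conn {
	enum conn_state state;
	int unexpected_events;	/* events logged and ignored as unexpected */
};

/*
 * Returns 0 when the connection is left as it is, and -1 when the
 * caller should abandon a connection attempt still in progress.
 */
int handle_cm_event(struct conn *c, enum cm_event ev)
{
	switch (ev) {
	case EV_UNREACHABLE:
		if (c->state == CONN_ESTABLISHED) {
			/*
			 * An assertion here would bring the whole system
			 * down. Log the event and carry on instead; a truly
			 * broken connection is expected to surface through
			 * later transaction errors.
			 */
			fprintf(stderr,
				"unexpected UNREACHABLE on established conn\n");
			c->unexpected_events++;
			return 0;
		}
		/* Still connecting: the attempt has failed. */
		return -1;
	case EV_ESTABLISHED:
		c->state = CONN_ESTABLISHED;
		return 0;
	}
	return 0;
}
```

In this model, an UNREACHABLE event on an established connection only
increments a counter and leaves the state unchanged, while the same
event during connection setup still fails the attempt.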

Test-Parameters: trivial testlist=sanity-lnet
Signed-off-by: Serguei Smirnov <ssmirnov@whamcloud.com>
Change-Id: If32166fe9fc59e025609c2035cb1c03d3bed22f2
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/53298
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Chris Horn <chris.horn@hpe.com>
Reviewed-by: Frank Sehr <fsehr@whamcloud.com>
Reviewed-by: Cyril Bordage <cbordage@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
lnet/klnds/o2iblnd/o2iblnd_cb.c