Whamcloud - gitweb
LU-14876 out: don't connect to busy MDS-MDS export
authorMikhail Pershin <mpershin@whamcloud.com>
Wed, 9 Mar 2022 08:45:38 +0000 (00:45 -0800)
committerAndreas Dilger <adilger@whamcloud.com>
Thu, 10 Mar 2022 04:33:35 +0000 (04:33 +0000)
commitd25a694e02a01bcd37d557806993b124a07fe3f4
treeb8a100a2a845880e9aa7879fb0d392ab30d68be8
parente4274f2f5d0b90d57df3affef8ac7bc2a8aedd30
LU-14876 out: don't connect to busy MDS-MDS export

MDS-MDS connection is missing check for busy requests upon
reconnect, so resent can be executed concurrently with
original request.

- in ptlrpc_server_check_resend_in_progress() remove exception
  for bulk requests, they can be compared by XID nowadays.
  This prevents OUT requests vs resent execution as well.
- fix messages in target_handle_connect() to report correct
  information about connection details
- in out_handle() check for last_xid only once per OUT_UPDATE
- test 110m is added to recovery-small to reproduce the issue

Lustre-change: https://review.whamcloud.com/44390
Lustre-commit: 301d76a71176c186129231ddd1323bae21100165

Signed-off-by: Mikhail Pershin <mpershin@whamcloud.com>
Change-Id: I2ad183674d59a2cdeab0037bd8551c607b10ffeb
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Alex Zhuravlev <bzzz@whamcloud.com>
Reviewed-by: Lai Siyao <lai.siyao@whamcloud.com>
Reviewed-on: https://review.whamcloud.com/46762
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
lustre/ldlm/ldlm_lib.c
lustre/ptlrpc/service.c
lustre/target/out_handler.c
lustre/tests/recovery-small.sh