Whamcloud - gitweb
LU-14876 out: don't connect to busy MDS-MDS export 62/44362/6
authorMikhail Pershin <mpershin@whamcloud.com>
Wed, 21 Jul 2021 15:14:01 +0000 (18:14 +0300)
committerOleg Drokin <green@whamcloud.com>
Mon, 13 Sep 2021 20:09:11 +0000 (20:09 +0000)
commit67a24ac97553b684195d210b0db1d5bfad0fa5d7
tree69a0cfe11751434a21d0b7d480b3c08c1f3e639f
parent8a5131183a2b353cb475ea5826623fe59013a5ac
LU-14876 out: don't connect to busy MDS-MDS export

MDS-MDS connection is missing check for busy requests upon
reconnect, so resent can be executed concurrently with
original request.

- in ptlrpc_server_check_resend_in_progress() remove exception
  for bulk requests, they can be compared by XID nowadays.
  This prevents OUT requests vs resent execution as well.
- fix messages in target_handle_connect() to report correct
  information about connection details
- in out_handle() check for last_xid only once per OUT_UPDATE
- test 110m is added to recovery-small to reproduce the issue

Lustre-change: https://review.whamcloud.com/44390
Lustre-commit: 301d76a71176c186129231ddd1323bae21100165

Signed-off-by: Mikhail Pershin <mpershin@whamcloud.com>
Change-Id: I2ad183674d59a2cdeab0037bd8551c607b10ffeb
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Alex Zhuravlev <bzzz@whamcloud.com>
Reviewed-by: Lai Siyao <lai.siyao@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-on: https://review.whamcloud.com/44362
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
lustre/include/obd_support.h
lustre/ldlm/ldlm_lib.c
lustre/ptlrpc/service.c
lustre/target/out_handler.c
lustre/tests/recovery-small.sh