LU-14876 out: don't connect to busy MDS-MDS export 90/44390/5
author Mikhail Pershin <mpershin@whamcloud.com>
Wed, 21 Jul 2021 15:14:01 +0000 (18:14 +0300)
committer Oleg Drokin <green@whamcloud.com>
Wed, 18 Aug 2021 01:57:42 +0000 (01:57 +0000)
commit 301d76a71176c186129231ddd1323bae21100165
tree 3ad1993def695546f3045e4c6f0907522cffbb0b
parent 29eabeb34c5ba2cffdb5353d108ea56e0549665b
LU-14876 out: don't connect to busy MDS-MDS export

The MDS-MDS connection is missing a check for busy requests upon
reconnect, so a resent request can be executed concurrently with
the original request.

- in ptlrpc_server_check_resend_in_progress() remove the exception
  for bulk requests, since they can now be compared by XID.
  This also prevents an OUT request from executing concurrently
  with its resend.
- fix messages in target_handle_connect() to report correct
  information about connection details
- in out_handle() check last_xid only once per OUT_UPDATE
- add test 110m to recovery-small to reproduce the issue
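
The idea behind the first item can be illustrated with a minimal,
standalone sketch. This is not the Lustre code: the sketch_* types and
functions below are hypothetical stand-ins, and the real export and
request structures are considerably more involved. It only shows the
principle of matching a resent request against in-flight requests by
XID, with no special case for bulk requests, and of treating an export
with in-flight requests as busy on reconnect.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical, simplified stand-in for a request tracked on an export. */
struct sketch_req {
	uint64_t xid;         /* transfer ID chosen by the sender */
	bool     in_progress; /* still being executed by a service thread */
};

/* Hypothetical stand-in for a per-connection export. */
struct sketch_export {
	const struct sketch_req *reqs;
	size_t                   nr_reqs;
};

/*
 * Return true if a request with the same XID is still executing,
 * i.e. the incoming request is a resend of an in-flight original.
 * All request types are compared the same way, bulk or not.
 */
bool sketch_resend_in_progress(const struct sketch_export *exp, uint64_t xid)
{
	assert(exp != NULL);
	for (size_t i = 0; i < exp->nr_reqs; i++) {
		if (exp->reqs[i].in_progress && exp->reqs[i].xid == xid)
			return true;
	}
	return false;
}

/*
 * Return true if any request is still executing on the export.
 * In this sketch a reconnect would be refused while this holds.
 */
bool sketch_export_busy(const struct sketch_export *exp)
{
	assert(exp != NULL);
	for (size_t i = 0; i < exp->nr_reqs; i++) {
		if (exp->reqs[i].in_progress)
			return true;
	}
	return false;
}
```

In this sketch a caller would drop or defer an incoming request when
sketch_resend_in_progress() returns true, and refuse a reconnect while
sketch_export_busy() returns true, so that the original and the resend
never run at the same time.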

Signed-off-by: Mikhail Pershin <mpershin@whamcloud.com>
Change-Id: I2ad183674d59a2cdeab0037bd8551c607b10ffeb
Reviewed-on: https://review.whamcloud.com/44390
Tested-by: jenkins <devops@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Alex Zhuravlev <bzzz@whamcloud.com>
Reviewed-by: Lai Siyao <lai.siyao@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
lustre/ldlm/ldlm_lib.c
lustre/ptlrpc/service.c
lustre/target/out_handler.c
lustre/tests/recovery-small.sh