Whamcloud - gitweb
LU-10729 tests: replay-dual/23d to wait 23/45623/17
authorAlex Zhuravlev <bzzz@whamcloud.com>
Fri, 19 Nov 2021 19:52:28 +0000 (22:52 +0300)
committerOleg Drokin <green@whamcloud.com>
Tue, 11 Apr 2023 20:00:27 +0000 (20:00 +0000)
replay-dual/23d simulates a dropped reply for the executed
update, but previous tests can break this:
 - the update modifies remote llog
 - there can be another uptdate to that remote log
   (from the previous tests)
 - fail_loc (OBD_FAIL_UPDATE_OBJ_NET) is applied to the
   old update
 - the 23d's update gets stuck

so the test has to ensure there is no pending/in-flight
updates.

Test-Parameters: trivial testlist=replay-dual mdscount=2 mdtcount=4
Test-Parameters: testlist=replay-dual fstype=zfs mdscount=2 mdtcount=4
Signed-off-by: Alex Zhuravlev <bzzz@whamcloud.com>
Change-Id: I3b60468d1f6f467006d5872ec62b81f57fa0423e
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/45623
Reviewed-by: Alex Deiter <alex.deiter@gmail.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
lustre/tests/replay-dual.sh

index 2a5ee54..4e7b38e 100755 (executable)
@@ -863,6 +863,12 @@ test_23d () {
        do_node $CLIENT1 $LFS mkdir -i $MDTIDX $MOUNT1/$remote_dir ||
                error "lfs mkdir -i $MDTIDX failed"
 
+       # let previous transactions to complete
+       # (distributed llog cancels, etc)
+       do_nodes $(comma_list $(mdts_nodes)) \
+               "$LCTL set_param -n osd*.*MDT*.force_sync=1"
+       sleep 2
+
        # OBD_FAIL_UPDATE_OBJ_NET    0x1701
        do_facet mds${MDTIDX} lctl set_param fail_loc=0x1701
        do_node $CLIENT1 rmdir $MOUNT1/$remote_dir &