Whamcloud - gitweb
fs/lustre-release.git
5 days agoLU-17527 tests: fix sanity/255a decimal comparison 36/56836/2
Andreas Dilger [Wed, 30 Oct 2024 22:10:56 +0000 (16:10 -0600)]
LU-17527 tests: fix sanity/255a decimal comparison

Don't use fractional percentages in the sanity test_255a performance
comparison, since bash (( ... )) cannot compare numbers with decimal
points properly.  Instead, just compute the percentage speedup with
whole numbers, since the test discards anything less than 20% speedup
and a fraction of a percent will not make much difference here.

Test-Parameters: trivial
Fixes: bdd470ff97 ("LU-9069 tests: improve output of sanity test_255a")
Fixes: 395f3e1a55 ("LU-15316 tests: use integers in sanity test_255a")
Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: Id3b37e07168ee2590e52d01f66336027254ced55
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56836
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Arshad Hussain <arshad.hussain@aeoncomputing.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
5 days agoLU-18286 tests: auster node.yml labels rocky9 as RHEL 80/56580/3
Charlie Olmstead [Tue, 1 Oct 2024 17:02:05 +0000 (11:02 -0600)]
LU-18286 tests: auster node.yml labels rocky9 as RHEL

release() assumes a node with /etc/redhat-release is RHEL.
This patch removes reading this file in favor of os-release.
Reading of centos-release (if present) still required as
os-release for centos distros doesn't include minor version.

Test-Parameters: trivial
Signed-off-by: Charlie Olmstead <charlie@whamcloud.com>
Change-Id: I888f8eeacaf843120b2beb134292047b3907a9a6
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56580
Tested-by: Maloo <maloo@whamcloud.com>
Tested-by: jenkins <devops@whamcloud.com>
Reviewed-by: Alex Deiter <adeiter@ddn.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
5 days agoLU-18310 tests: add debugging to test_metabench 82/56582/4
Andreas Dilger [Fri, 4 Oct 2024 07:15:03 +0000 (01:15 -0600)]
LU-18310 tests: add debugging to test_metabench

Both parallel-scale and parallel-scale-nfs are intermittently
failing test_metabench with "No space left on device" (ENOSPC)
or "Disk quota exceeded" (EDQUOT), even though this test is
creating only about 10-20k files.

Add some debugging to see where all of the space has gone, and
what quota limits are being set.  It may be that some earlier
test (e.g. compilebench) is leaving too much junk behind.

The failure rate is very low (only 2/637 runs in the past 4 weeks),
so it likely needs to be landed to catch a failure.

Test-Parameters: trivial testlist=parallel-scale
Test-Parameters: testlist=parallel-scale-nfsv4
Test-Parameters: testgroup=full-part-1
Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: Ie35ae677032ccc8113cbad5dc5a7b0504149717f
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56582
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Alex Deiter <adeiter@ddn.com>
Reviewed-by: Elena <elena.gryaznova@hpe.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
5 days agoLU-18298 tests: skip sanity-pcc/test_1{c,d} on SLES15 SP5 09/56809/2
Qian Yingjin [Tue, 29 Oct 2024 03:25:33 +0000 (11:25 +0800)]
LU-18298 tests: skip sanity-pcc/test_1{c,d} on SLES15 SP5

Skip the sanity-pcc/test_1{c, d} failure on SLES15 SP3 - SP5.
However, they passed on SLES15 SP6.

Test-Parameters: trivial
Signed-off-by: Qian Yingjin <qian@ddn.com>
Change-Id: I08fd0a192307f0072cc82033958dd8239ea507d5
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56809
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Tested-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
5 days agoLU-18393 tests: $num_files should be multiple of $num_entries 85/56785/3
Emoly Liu [Fri, 25 Oct 2024 07:05:33 +0000 (15:05 +0800)]
LU-18393 tests: $num_files should be multiple of $num_entries

According to performance-sanity.sh test_4 failure
"md_validate_tests, items must be a multiple of items per directory",
set $num_files to be a multiple of $num_entries.

Test-Parameters: trivial testlist=performance-sanity
Signed-off-by: Emoly Liu <emoly@whamcloud.com>
Change-Id: I8c635649ef016389d1bd22f8318a55f8d0f77962
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56785
Tested-by: Maloo <maloo@whamcloud.com>
Tested-by: jenkins <devops@whamcloud.com>
Reviewed-by: Alex Deiter <adeiter@ddn.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
5 days agoLU-18379 tests: wait for stats to expire in conf-sanity/127 26/56726/8
Alex Zhuravlev [Thu, 17 Oct 2024 18:22:11 +0000 (21:22 +0300)]
LU-18379 tests: wait for stats to expire in conf-sanity/127

fs stats are not updated immediately on the client,
thus we need to wait.

Test-Parameters: env=ONLY=127,ONLY_REPEAT=20 testlist=conf-sanity
Test-Parameters: env=ONLY=127,ONLY_REPEAT=20 testlist=conf-sanity
Signed-off-by: Alex Zhuravlev <bzzz@whamcloud.com>
Change-Id: I28cf407f9fe4df1f46af8cd88f50670bb8f0d93f
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56726
Tested-by: Maloo <maloo@whamcloud.com>
Tested-by: jenkins <devops@whamcloud.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
5 days agoLU-18402 tests: skip recovery-small/155 for older MDS 95/56795/2
Jian Yu [Sat, 26 Oct 2024 03:56:45 +0000 (20:56 -0700)]
LU-18402 tests: skip recovery-small/155 for older MDS

Skip recovery-small test 155 for MDS < 2.15.58.110 since
the test depends on changes made to the MDS code on the server.

Test-Parameters: trivial testlist=recovery-small env=ONLY="155 157" \
  serverversion=2.15.5

Fixes: 71f8e5d650 ("LU-14708 ptlrpc: skip unnecessary client eviction")
Change-Id: I44137ebbfb1ec0f9a6a1cf1b42cd211caa146009
Signed-off-by: Jian Yu <yujian@whamcloud.com>
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56795
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Mikhail Pershin <mpershin@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 weeks agoNew RC 2.16.0-RC5 2.16.0-RC5 v2_16_0-RC5
Oleg Drokin [Sat, 26 Oct 2024 23:58:55 +0000 (19:58 -0400)]
New RC 2.16.0-RC5

Change-Id: I677f9f232df161eb37fc917469a6c23f48716b55
Signed-off-by: Oleg Drokin <green@whamcloud.com>
2 weeks agoLU-17525 llite: soft fail unaligned dio 75/56775/6
Shaun Tancheff [Thu, 24 Oct 2024 06:58:41 +0000 (13:58 +0700)]
LU-17525 llite: soft fail unaligned dio

Skip unaligned DIO and pass io as buffered I/O rather than
hard fail unaligned DIO.

Test-Parameters: testlist=sanity serverversion=2.14 env=SANITY_EXCEPT="65n 211 413"
Fixes: ff018bb77a ("LU-18284 llite: disallow udio exceptions")
Signed-off-by: Shaun Tancheff <shaun.tancheff@hpe.com>
Change-Id: I638f62ec96abc3032da5fbcf895cd835022fd759
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56775
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Patrick Farrell <patrick.farrell@oracle.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 weeks agoLU-18389 tests: skip added sanity/65n checks 86/56786/4
Andreas Dilger [Fri, 25 Oct 2024 07:54:01 +0000 (01:54 -0600)]
LU-18389 tests: skip added sanity/65n checks

Skip checks added at the end of sanity test_65n for MDS versions
that do not have the new layout inheritance behavior.

Test-Parameters: trivial env=ONLY=65n testlist=sanity serverversion=2.14
Fixes: 6e59408f1a ("LU-12130 lod: make pool inheritance policy more consistent")
Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: Ic7c36801ec6e906d631bc4fc234f1f2b77e9f7dc
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56786
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Tested-by: jenkins <devops@whamcloud.com>
2 weeks agoLU-17893 tests: wait destroys before replay-dual:test_28 12/56712/4
Vladimir Saveliev [Wed, 16 Oct 2024 14:45:37 +0000 (17:45 +0300)]
LU-17893 tests: wait destroys before replay-dual:test_28

replay-dual.sh:test_28() should take care that it drops only own
blocking ast. If test_26() ran before, there may be pending destroys
when test_28() runs. Dropping of blocking asts for destroys makes
replay-dual.sh to get accompanied with:

  watchdog stack traces:
  [169376.453554] Lustre: ll_ost00_057: service thread pid 236757 was
  inactive for 40.816 seconds. Watchdog stack traces are limited to 3
  per 300 seconds, skipping this one.
  [169376.461659] [<0>] ldlm_completion_ast+0x99b/0xc00 [ptlrpc]
  [169376.461782] [<0>] ldlm_cli_enqueue_local+0x302/0x890 [ptlrpc]
  [169376.461888] [<0>] ofd_destroy_by_fid+0x29c/0x570 [ofd]
  [169376.461906] [<0>] ofd_destroy_hdl+0x22c/0x960 [ofd]

  lock timeouts:
  [169638.155933] LustreError:
  236757:0:(ldlm_request.c:104:ldlm_expired_completion_wait()) ###
  lock timed out (enqueued at 1729087746, 303s ago); not entering
  recovery in server code, just going back to sleep ns..

  and system overload indications:
  [169852.021044] Lustre: ll_ost00_052: service thread pid 236555
  completed after 516.964s. This likely indicates the system was
  overloaded (too many service threads, or not enough hardware
  resources).

Wait for completion of destroys before starting test_28().

Test-Parameters: trivial testlist=replay-dual
Signed-off-by: Vladimir Saveliev <vladimir.saveliev@hpe.com>
Change-Id: I837579a428d8c2383fe884961d356ff417fc3f2e
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56712
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Tested-by: jenkins <devops@whamcloud.com>
2 weeks agoLU-14330 tests: wait for orphan thread to exit 59/56559/3
Andreas Dilger [Tue, 1 Oct 2024 21:06:06 +0000 (15:06 -0600)]
LU-14330 tests: wait for orphan thread to exit

It may take a few seconds for the orphan cleanup thread to finish.
Wait for the thread to exit rather than failing the test.

Test-Parameters: trivial
Fixes: a1e6e75a82 ("LU-12846 tests: verify orphan upgrade compatibilty")
Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: I604be1b9f8f460d9183ba1aaddd3b77e153ebbe5
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56559
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Artem Blagodarenko <ablagodarenko@ddn.com>
Reviewed-by: Alex Deiter <adeiter@ddn.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 weeks agoLU-15553 test: mkdir_on_mdt0 in replay-dual 65/51665/6
Lai Siyao [Sat, 8 Jul 2023 22:32:29 +0000 (18:32 -0400)]
LU-15553 test: mkdir_on_mdt0 in replay-dual

Several subtests in replay-dual require test dir created on MDT0,
replace mkdir with mkdir_on_mdt0. These subtests are found by script:
grep -C 10 -n "do_facet.*SINGLEMDS" lustre/tests/*.sh | grep -w mkdir

Fixes: b9c4dc3c33 ("LU-14792 llite: enable filesystem-wide default LMV")
Test-Parameters: trivial mdscount=2 mdtcount=4 testlist=replay-dual,replay-dual,replay-dual
Signed-off-by: Lai Siyao <lai.siyao@whamcloud.com>
Change-Id: Ib28cf35575546c61bb7fa1b2c8a87ac31bd1ad4e
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/51665
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 weeks agoLU-15553 test: mkdir_on_mdt0 in conf-sanity.sh 39/56539/3
Lai Siyao [Mon, 23 Sep 2024 01:10:20 +0000 (21:10 -0400)]
LU-15553 test: mkdir_on_mdt0 in conf-sanity.sh

Change mkdir to mkdir_on_mdt0 in several conf-sanity.sh sub tests.

Fixes: b9c4dc3c33 ("LU-14792 llite: enable filesystem-wide default LMV")
Test-Parameters: trivial testlist=conf-sanity mdtcount=4
Signed-off-by: Lai Siyao <lai.siyao@whamcloud.com>
Change-Id: I5ace9df10e725802ba502ca20c60afc708b857cc
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56539
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 weeks agoLU-15553 test: mkdir_on_mdt0 in replay-vbr.sh 40/56540/5
Lai Siyao [Mon, 23 Sep 2024 01:17:40 +0000 (21:17 -0400)]
LU-15553 test: mkdir_on_mdt0 in replay-vbr.sh

Change mkdir to mkdir_on_mdt0 in several replay-vbr.sh sub tests.

Fixes: b9c4dc3c33 ("LU-14792 llite: enable filesystem-wide default LMV")
Test-Parameters: trivial testlist=replay-vbr mdtcount=4
Signed-off-by: Lai Siyao <lai.siyao@whamcloud.com>
Change-Id: I7457c155bbadb86adf8272113a4e4202b98c20a5
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56540
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 weeks agoLU-16870 tests: Make 413a/b work on server 2.14 with client master 68/56768/4
Arshad Hussain [Wed, 23 Oct 2024 11:08:52 +0000 (07:08 -0400)]
LU-16870 tests: Make 413a/b work on server 2.14 with client master

This patch makes 413a/b interop work on server 2.14
with client master.

First, this patch removes general redirect of cmd execution
(fallocate/dd) to dev/null from generate_uneven_mdts(). For
'dd' it might get little verbose however on failure it will
dump output to stdout giving more info.

Second, it adds check under check_fallocate_supported() to check
is fallocate is issued on MDS. If yes it check for version when
this feature was added. If not it falls back to 'dd' command

Third, under unload_modules() it check for version where
unload_modules_local() was added. If not it falls back to
non-unload_modules_local() version to unload module.

Test-Parameters: trivial testlist=sanity serverversion=2.14 env=ONLY=413a,413b,ONLY_REPEAT=10
Signed-off-by: Arshad Hussain <arshad.hussain@aeoncomputing.com>
Change-Id: I8a8843dd32f7e88d6d0938b67ce24353c9f9cb65
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56768
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Alex Deiter <adeiter@ddn.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 weeks agoLU-18024 quota: fix the order of freeing qmt_lvbo_free_wq 78/56778/2
Hongchao Zhang [Sun, 3 Mar 2024 02:11:36 +0000 (10:11 +0800)]
LU-18024 quota: fix the order of freeing qmt_lvbo_free_wq

In qmt_device_fini, put the freeing of the qmt_lvbo_free_wq after
all possible usage of it having finished cleanup.

Signed-off-by: Hongchao Zhang <hongchao@whamcloud.com>
Change-Id: Ia26f1bc490adff5ae7dc850e2c89baf5874f01c5
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56778
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Sergey Cheremencev <scherementsev@ddn.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 weeks agoLU-17670 tests: sanity-flr/33b allow bigger time difference 52/56752/2
Bobi Jam [Tue, 22 Oct 2024 02:15:12 +0000 (10:15 +0800)]
LU-17670 tests: sanity-flr/33b allow bigger time difference

Allow a bigger time difference between read that switching among
mirrors.

Test-Parameters: trivial testlist=sanity-flr env=ONLY=33b
Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Change-Id: I14482e290fe5a96d1d10b2d47c8b4cfcadec96ec
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56752
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Colin Faber <cfaber@ddn.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 weeks agoLU-18343 lod: old client mkdir on wrong MDT 31/56731/4
Lai Siyao [Sun, 29 Sep 2024 16:31:07 +0000 (12:31 -0400)]
LU-18343 lod: old client mkdir on wrong MDT

Old clients may not cache default LMV, and mkdir request may be sent
to wrong MDT, but MDT is too strict. Return -EREMOTE instead of
-EPROTO to let old clients retry.

Fixes: 6dbb4c6c8 ("LU-12998 lod: statfs upon nocreate check")
Test-Parameters: mdtcount=4 clientversion=2.15 testlist=sanity env=ONLY=160a
Signed-off-by: Lai Siyao <lai.siyao@whamcloud.com>
Change-Id: I2e64e2b3ccd3117de163f58d697191fab4d7be1c
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56731
Reviewed-by: Emoly Liu <emoly@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Tested-by: jenkins <devops@whamcloud.com>
2 weeks agoLU-18329 tests: check version for parallel-scale-nfs/test_1 74/56674/11
Feng Lei [Wed, 16 Oct 2024 08:21:22 +0000 (16:21 +0800)]
LU-18329 tests: check version for parallel-scale-nfs/test_1

Check the lustre client version of nfs server (MDS1 by default) in
parallel-scale-nfs/test_1 for interop test.

Fixes: 69dcd1b940 ("LU-18030 tests: Add a test to ensure permissions copy on nfs")
Signed-off-by: Feng Lei <flei@whamcloud.com>
Test-Parameters: trivial
Test-Parameters: testlist=parallel-scale-nfsv3 env=ONLY=1
Test-Parameters: serverversion=2.15.5 testlist=parallel-scale-nfsv3 env=ONLY=1
Change-Id: I76ecb3bc28f37ba7d0c24d18eead621d6b066800
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56674
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
3 weeks agoNew tag 2.16.0-RC4 2.16.0-RC4 v2_16_0-RC4
Oleg Drokin [Tue, 22 Oct 2024 00:27:19 +0000 (20:27 -0400)]
New tag 2.16.0-RC4

Change-Id: I9f87bbf152d51f50d1df6c9e75c0d1827a4e40a8
Signed-off-by: Oleg Drokin <green@whamcloud.com>
3 weeks agoLU-17906 pltrpc: Revert "LU-17906 pltrpc: don't use non-uptodate peer at connect" 36/56736/2
Jian Yu [Sat, 19 Oct 2024 00:11:09 +0000 (17:11 -0700)]
LU-17906 pltrpc: Revert "LU-17906 pltrpc: don't use non-uptodate peer at connect"

This reverts commit 6fe522d3d4f92aa2a48a573419f4590b10ef13d3.
The commit caused many regression failures: LU-18365, LU-18367,
LU-18368, LU-18366, etc.

Change-Id: Iaf7a0ec7606dab7a1b4d5b9f4fd1a24eb2c8d94d
Signed-off-by: Jian Yu <yujian@whamcloud.com>
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56736
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
3 weeks agoNew RC 2.16.0-RC3 2.16.0-RC3 v2_16_0-RC3
Oleg Drokin [Fri, 18 Oct 2024 18:53:04 +0000 (14:53 -0400)]
New RC 2.16.0-RC3

Change-Id: I89424c8fe80025746cc5baada8782ae2c77a2530
Signed-off-by: Oleg Drokin <green@whamcloud.com>
3 weeks agoLU-18024 quota: relate qmt_lvbo_free_wq and QMT 61/56661/3
Hongchao Zhang [Sun, 29 Sep 2024 14:16:42 +0000 (22:16 +0800)]
LU-18024 quota: relate qmt_lvbo_free_wq and QMT

The global variable "qmt_lvbo_free_wq" could be affected if there
are more than one Lustre mount (for instance, using snapshot),
this patch moves it into "struct qmt_device" and it will only be
used by this QMT(MDT0000) and avoid conflict.

Fixes: 2cc18ece1e ("LU-14535 quota: free lvbo in a wq")
Signed-off-by: Hongchao Zhang <hongchao@whamcloud.com>
Change-Id: I298382fa0eed6885e55991e26e4b36d435e99e26
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56661
Tested-by: Maloo <maloo@whamcloud.com>
Tested-by: jenkins <devops@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Sergey Cheremencev <scherementsev@ddn.com>
3 weeks agoLU-18324 tests: skip sanity/205k for older MDS 03/56703/2
Jian Yu [Tue, 15 Oct 2024 19:47:24 +0000 (12:47 -0700)]
LU-18324 tests: skip sanity/205k for older MDS

Skip sanity test 205k for MDS < 2.15.64.85 since the test
depends on changes made to the MDS code on the server.

Test-Parameters: trivial testlist=sanity env=ONLY=205k \
  serverversion=2.15.5

Change-Id: I23709c2414b16a3d3a1ec742e7213597183b8f2a
Fixes: ecdcaa3986 ("LU-17512 utils: new ? operator for jobid_name")
Signed-off-by: Jian Yu <yujian@whamcloud.com>
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56703
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Colin Faber <cfaber@ddn.com>
Reviewed-by: Sarah Liu <sarah@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
3 weeks agoLU-18322 tests: skip recovery-small/154b for older MDS 58/56658/2
Jian Yu [Fri, 11 Oct 2024 06:01:40 +0000 (23:01 -0700)]
LU-18322 tests: skip recovery-small/154b for older MDS

Skip recovery-small test 154b for MDS < 2.15.60.2 since
the test depends on changes made to the MDS code on the server.

Test-Parameters: trivial testlist=recovery-small env=ONLY=154b \
  serverversion=2.15.5

Change-Id: I540b943abae2c39cbb62d5403e37cd7768cb96b2
Fixes: e818052444 ("LU-17365 lod: handle llog errors gracefuly")
Signed-off-by: Jian Yu <yujian@whamcloud.com>
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56658
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Emoly Liu <emoly@whamcloud.com>
Reviewed-by: Mikhail Pershin <mpershin@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
3 weeks agoLU-18330 tests: add interop PCCRO check for sanity-pcc/test_35 75/56675/4
Qian Yingjin [Mon, 14 Oct 2024 02:43:31 +0000 (10:43 +0800)]
LU-18330 tests: add interop PCCRO check for sanity-pcc/test_35

Add interop PCCRO connection flag check for sanity-pcc/
test_{35,36}.
Fix the interop sanity-pcc tests with b2_15.

Test-Parameters: trivial serverversion=2.15.5 testlist=sanity-pcc
Test-Parameters: trivial testlist=sanity-pcc env=ONLY=36b,ONLY_REPEAT=100
Signed-off-by: Qian Yingjin <qian@ddn.com>
Change-Id: I9174752bf123abf608a2d3e1f72650cf08e7146e
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56675
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: James Simmons <jsimmons@infradead.org>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
3 weeks agoLU-18319 tests: skip sanity-flr/test_50a for old MDS 76/56676/3
Bobi Jam [Mon, 14 Oct 2024 04:05:13 +0000 (12:05 +0800)]
LU-18319 tests: skip sanity-flr/test_50a for old MDS

Skip sanity-flr test_50a for old server because of changes that
were made on master from LU-13805 to reject unaligned DIO.

Test-Parameters: trivial
Fixes: 88d324be08 ("LU-13805 osd: Implement unaligned DIO connect flag")
Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Change-Id: I1bbd7528de458b4cca48c466203fc5cdd0ca4e8e
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56676
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Colin Faber <cfaber@ddn.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
3 weeks agoLU-18325 tests: sanity 842: load_module ldlm_extent failed 30/56630/2
Yang Sheng [Wed, 9 Oct 2024 15:13:27 +0000 (23:13 +0800)]
LU-18325 tests: sanity 842: load_module ldlm_extent failed

Skip the MDS without ldlm_extent module.

Test-Parameters: trivial testlist=sanity
Signed-off-by: Yang Sheng <ys@whamcloud.com>
Change-Id: I9d1ec659088b7314ee3ae2fb09fc49ee3ef5a922
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56630
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
3 weeks agoLU-17047 tests: sanity/77c ensure ost debug_path is valid 43/56643/5
Ronnie Sahlberg [Thu, 10 Oct 2024 03:57:39 +0000 (23:57 -0400)]
LU-17047 tests: sanity/77c ensure ost debug_path is valid

test_77c has been failing intermittently for a while due
to either debug_param not being set properly before this test is
invoked or due to silent failure of getting the param from the
remote node.

Try to avoid these spurios test flakiness by forcing the debug_path
parameter to the expected setting during test startup.
A test shouldn't depend on no residuals being present from unrelated
prior tests anyway.

If need be we can add a dedicated test that only verifies that we
can set debug_path to a specific value and then read the value back.

Test-Parameters: trivial testgroup=rolling-upgrade-client1
Test-Parameters: testgroup=rolling-downgrade-mds
Test-Parameters: testgroup=rolling-downgrade-client2
Test-Parameters: testgroup=rolling-upgrade-client2
Signed-off-by: Ronnie Sahlberg <rsahlberg@whamcloud.com>
Change-Id: I4aae3e422d8f492a7877fbe20a6c553afc35e998
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56643
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
3 weeks agoLU-18335 tests: skip conf-sanity/136 in interop 38/56638/4
Andreas Dilger [Thu, 10 Oct 2024 00:22:37 +0000 (18:22 -0600)]
LU-18335 tests: skip conf-sanity/136 in interop

conf-sanity test_136 was added to test the fix for a bug
that crashes the MDS, so skip it during interop testing.

Test-Parameters: trivial testlist=conf-sanity env=ONLY=136
Test-Parameters: testlist=conf-sanity env=ONLY=136 serverversion=2.15
Fixes: 55c143a66d ("LU-15784 obdecho: don't panic with run on second mdt")
Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: Iec0f96387bf69218ee74b88cce5ab21e07004292
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56638
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Minh Diep <mdiep@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
3 weeks agoLU-18326 tests: skip sanity-hsm/410 for older MDS 23/56623/2
Jian Yu [Wed, 9 Oct 2024 05:47:22 +0000 (22:47 -0700)]
LU-18326 tests: skip sanity-hsm/410 for older MDS

Skip sanity-hsm test 410 for MDS < 2.15.90.10 since the test depends on
changes made to the MDS code on the server.

Test-Parameters: trivial testlist=sanity-hsm env=ONLY=410 serverversion=2.15.5

Change-Id: Ib4471b6b9e3370fe021cef1b4a37f3338bc118c1
Fixes: 80a961261a ("LU-17863 hsm: add ioctl to set data version in HSM xattr")
Signed-off-by: Jian Yu <yujian@whamcloud.com>
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56623
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Li Xi <lixi@ddn.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
3 weeks agoLU-18323 tests: skip sanity/27T for OSTs < 2.15.65.18 15/56615/2
Jian Yu [Tue, 8 Oct 2024 23:56:05 +0000 (16:56 -0700)]
LU-18323 tests: skip sanity/27T for OSTs < 2.15.65.18

Skip sanity test_27T for OSTs < 2.15.65.18 since the test depends on
changes made to the OST code on the server.

Test-Parameters: trivial testlist=sanity env=ONLY=27T serverversion=2.15.5

Change-Id: I96b134f3ade1d417adf751a69cdcfc71c15b4d75
Fixes: 8a865c65c6 ("LU-17567 tests: improve sanity.sh:test_27T")
Fixes: ccaff8d3c0 ("LU-17567 tests: skip sanity/27T for older OSTs")
Signed-off-by: Jian Yu <yujian@whamcloud.com>
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56615
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Sarah Liu <sarah@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
3 weeks agoLU-18321 tests: skip ost-pools/31 for older MDS 57/56657/3
Jian Yu [Sat, 12 Oct 2024 01:39:43 +0000 (18:39 -0700)]
LU-18321 tests: skip ost-pools/31 for older MDS

Skip the loop spilling testing in ost-pools test 31 for
MDS < 2.15.50.150 since the test depends on changes made
to the MDS code on the server.

Test-Parameters: trivial testlist=ost-pools env=ONLY=31 \
  serverversion=2.15.5

Change-Id: I4ad2ded8073eca55363fe60cc49c580ca7d983ae
Fixes: c9c842d678 ("LU-15043 lod: check for spilling loops")
Signed-off-by: Jian Yu <yujian@whamcloud.com>
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56657
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Sarah Liu <sarah@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
3 weeks agoLU-18226 tests: allow small margin for unevicted pages 37/56637/2
Andreas Dilger [Wed, 9 Oct 2024 23:33:16 +0000 (17:33 -0600)]
LU-18226 tests: allow small margin for unevicted pages

Allow a small margin for unevictable pages in case of memory pressure.

Test-Parameters: trivial
Fixes: 2a3ec2fea3 ("LU-17463 osc: add support for unevictable mlock()ed pages")
Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: I3f1cbacfdf53f217d9ebcbeb1b34693d27f70901
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56637
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Colin Faber <cfaber@ddn.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
3 weeks agoLU-18293 test: check to sync quota setting 28/56628/3
Hongchao Zhang [Sun, 29 Sep 2024 00:32:39 +0000 (08:32 +0800)]
LU-18293 test: check to sync quota setting

If some quota setting version is droped between QMT and QSD,
the quota setting could not be transfered to QSD and cause
the corresponding quota ID is not enforced properly.

Test-Parameters: trivial testlist=sanity-quota
Signed-off-by: Hongchao Zhang <hongchao@whamcloud.com>
Change-Id: I8c34f5f045637e2f44f8cd6806b848ed93d8e784
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56628
Tested-by: Maloo <maloo@whamcloud.com>
Tested-by: jenkins <devops@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Qian Yingjin <qian@ddn.com>
Reviewed-by: Sergey Cheremencev <scherementsev@ddn.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
4 weeks agoNew RC 2.16.0-RC2 2.16.0-RC2 v2_16_0-RC2
Oleg Drokin [Fri, 11 Oct 2024 20:40:43 +0000 (16:40 -0400)]
New RC 2.16.0-RC2

Change-Id: I27c5cb17b9a340af7e0cade62c3468f984e7a7e2
Signed-off-by: Oleg Drokin <green@whamcloud.com>
4 weeks agoLU-18328 tests: fix old MDS version for sanity-sec/51 24/56624/3
Jian Yu [Fri, 11 Oct 2024 03:00:51 +0000 (20:00 -0700)]
LU-18328 tests: fix old MDS version for sanity-sec/51

Fix the old MDS version for sanity-sec test 51 to skip the test
for symbolic capability names.

The patch also reverts the NODEMAP_TESTS change from commit
v2_16_0-RC1-11-g2a5e8e355498, which casued the following failure:
  EXCEPT_7=true: command not found

Test-Parameters: trivial testlist=sanity-sec env=ONLY=51 serverversion=2.15.5

Change-Id: Idf458ad5df7a7d6105b1f4da3e06280cf66919c1
Fixes: 0b0e9162e2 ("LU-13791 mdt: allow using symbolic capability names")
Fixes: 2a5e8e3554 ("LU-18247 nodemap: initialize unused fields on disk")
Signed-off-by: Jian Yu <yujian@whamcloud.com>
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56624
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Sebastien Buisson <sbuisson@ddn.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
4 weeks agoLU-18331 tests: skip sanityn/33e for older MDS 22/56622/2
Jian Yu [Wed, 9 Oct 2024 05:27:45 +0000 (22:27 -0700)]
LU-18331 tests: skip sanityn/33e for older MDS

Skip sanityn test 33e for MDS < 2.15.56.127 since the test depends on
changes made to the MDS code on the server.

Test-Parameters: trivial testlist=sanityn env=ONLY=33e serverversion=2.15.5

Change-Id: I278af93165e928c03436489d90d72a766f27bdd7
Fixes: 2a78a9e2cd ("LU-15527 dne: refactor commit-on-sharing for DNE")
Signed-off-by: Jian Yu <yujian@whamcloud.com>
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56622
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Li Xi <lixi@ddn.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
4 weeks agoLU-15840 tests: interop check for sanity-hsm 262 77/56577/3
Frederick Dilger [Thu, 3 Oct 2024 18:37:55 +0000 (12:37 -0600)]
LU-15840 tests: interop check for sanity-hsm 262

sanity-hsm.sh test 262 was failing in interop testing with an old
MDS.
Added an interop check to skip the test unless MDS version >=
v2_15_61-204-g5ee13823a4.

Test-Parameters: trivial testlist=sanity-hsm env=ONLY=262
Test-Parameters: testlist=sanity-hsm env=ONLY=262 serverversion=2.15
Signed-off-by: Frederick Dilger <fdilger@whamcloud.com>
Change-Id: I3fc912ad633f7450ae97d9be50627b462d015dea
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56577
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Li Xi <lixi@ddn.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
4 weeks agoLU-18070 sec: clear ACL caches if ACL empty 00/56600/5
Sebastien Buisson [Mon, 7 Oct 2024 16:13:41 +0000 (18:13 +0200)]
LU-18070 sec: clear ACL caches if ACL empty

When the lli_posix_acl field of struct ll_inode_info is updated,
check if new ACL is empty, and clear ACL caches for this inode
in this case.

Also fix sanity test_103a when it is run with multiple MDTs. The
test has several requirements regarding uids and gids, but in case
they are not met, missing ids are only configured on mds1. So make
sure the directory used for the test ($DIR/$tdir) is created on mds1.

Fixes: 13fd5ebef3 ("LU-18101 sec: fix ACL handling on recent kernels again")
Fixes: aa636f8ae6 ("LU-18095 sec: fix ACL handling on recent kernels")
Test-Parameters: testlist=sanity env=ONLY=103a,ONLY_REPEAT=10 mdscount=2 mdtcount=4 osscount=1 ostcount=8 clientcount=2 clientdistro=sles15sp6 serverdistro=el8.8
Test-Parameters: testlist=sanity env=ONLY=103a,ONLY_REPEAT=10 mdscount=2 mdtcount=4 osscount=1 ostcount=8 clientcount=2 clientdistro=ubuntu2404 serverdistro=el8.8
Test-Parameters: testlist=sanity env=ONLY=103a,ONLY_REPEAT=10 mdscount=2 mdtcount=4 osscount=1 ostcount=8 clientcount=2 clientdistro=el9.3 serverdistro=el8.8
Signed-off-by: Sebastien Buisson <sbuisson@ddn.com>
Change-Id: I91109bf98bc65dfb1fcefb2551be84d9c73f8ee2
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56600
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Shaun Tancheff <shaun.tancheff@hpe.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
4 weeks agoLU-18096 enc: ll_get_symlink overlay function 68/56568/5
Sebastien Buisson [Wed, 2 Oct 2024 16:33:20 +0000 (18:33 +0200)]
LU-18096 enc: ll_get_symlink overlay function

ll_get_symlink() overlay function is necessary to properly encode for
presentation the symlink target when the encryption key is not
available, in a way that is compatible with the overlay function
ll_setup_filename(), so that further readlink without the encryption
key works properly.

Test-Parameters: testlist=sanity-sec env=ONLY=47,ONLY_REPEAT=10 mdscount=2 mdtcount=4 osscount=1 ostcount=8 clientcount=2
Test-Parameters: testlist=sanity-sec env=ONLY=47,ONLY_REPEAT=10 mdscount=2 mdtcount=4 osscount=1 ostcount=8 clientcount=2 clientdistro=ubuntu2404 serverdistro=el8.8
Signed-off-by: Sebastien Buisson <sbuisson@ddn.com>
Change-Id: I141a89d43f392dd4782e148590bc324ea252c2af
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56568
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Lai Siyao <lai.siyao@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
5 weeks agoLU-18221 tests: check UBSAN and KASAN in check_dmesg_for_errors() 59/56359/5
Jian Yu [Sun, 15 Sep 2024 23:15:16 +0000 (16:15 -0700)]
LU-18221 tests: check UBSAN and KASAN in check_dmesg_for_errors()

This patch adds checks into check_dmesg_for_errors() for UBSAN
and KASAN messages in dmesg and fails the test if they are found.

Test-Parameters: trivial mdtcount=4 mdscount=2 \
  env=SANITY_EXCEPT="17e 103" \
  clientdistro=ubuntu2404 testlist=sanity

Test-Parameters: optional clientdistro=ubuntu2404 testgroup=full-part-1
Test-Parameters: optional clientdistro=ubuntu2404 testgroup=full-part-2
Test-Parameters: optional clientdistro=ubuntu2404 testgroup=full-part-3

Change-Id: I4156fb0192e3df152287daf101bc17c228e593b5
Signed-off-by: Jian Yu <yujian@whamcloud.com>
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56359
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Yang Sheng <ys@whamcloud.com>
Reviewed-by: James Simmons <jsimmons@infradead.org>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
5 weeks agoLU-18048 tests: skip sanityn test_16j unaligned DIO 69/55769/4
Frederick Dilger [Wed, 17 Jul 2024 00:20:30 +0000 (18:20 -0600)]
LU-18048 tests: skip sanityn test_16j unaligned DIO

sanityn.sh test_16j was failing often in cases where the client
was running an older version than the server because of
changes that were made on master from LU-13805. This started with
generic testing but added UDIO-specific options in later patches so
an interop check has been added to avoid failing the test on older
versions.

Test-Parameters: trivial testlist=sanityn env=ONLY=16 serverversion=EXA6
Signed-off-by: Frederick Dilger <fdilger@whamcloud.com>
Change-Id: If8d0ad6b22e701ca867fd91bb077e7e2ee6f370b
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/55769
Reviewed-by: Shaun Tancheff <shaun.tancheff@hpe.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
5 weeks agoLU-17922 tests: skip sanity-sec/27aa for old MDS 51/56551/3
Andreas Dilger [Tue, 1 Oct 2024 16:19:43 +0000 (10:19 -0600)]
LU-17922 tests: skip sanity-sec/27aa for old MDS

Skip sanity-sec test_27aa for old MDS without nodemap ID range.

Test-Parameters: trivial testlist=sanity-sec env=ONLY=27aa mdsversion=2.15
Fixes: 8445f7b92f ("LU-17922 utils: added idmap range functionality")
Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: I6c5383f146257830f4558c0472415deba7f5f988
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56551
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Maximilian Dilger <mdilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
5 weeks agoLU-18247 nodemap: initialize unused fields on disk 50/56450/5
Andreas Dilger [Sat, 21 Sep 2024 20:42:57 +0000 (14:42 -0600)]
LU-18247 nodemap: initialize unused fields on disk

Initialize the unused fields in the on-disk nodemap records to zero.
This will allow them to be used in the future for expanded nodemap
functionality.

It may be that initial usage will also depend on storing a magic
into one of the fields so that the rest of the fields can be trusted.

Test-Parameters: trivial testlist=sanity-sec
Test-Parameters: testlist=sanity-quota
Test-Parameters: testlist=sanity-selinux
Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: I58f46396911e28f91e03a287c5b308dcaf3ebbe5
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56450
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Sebastien Buisson <sbuisson@ddn.com>
Reviewed-by: Timothy Day <timday@amazon.com>
Reviewed-by: Maximilian Dilger <mdilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
5 weeks agoLU-17906 pltrpc: don't use non-uptodate peer at connect 86/54286/6
Mikhail Pershin [Sun, 8 Sep 2024 08:10:55 +0000 (11:10 +0300)]
LU-17906 pltrpc: don't use non-uptodate peer at connect

If peer is not yet discovered then LNET puts messages into
pending queue until discovery is done. That pins ptlrpc
request as well, thus a connect RPC to not alive peer is
stuck until peer discovery timed out despite RPC timeout.
Moreover that means no connect attempt to other peers are
made for that time:

nids_stats:
   "192.168.252.112@tcp": { connects: 1, ... sec_ago: 31 }
   "192.168.252.113@tcp": { connects: 0, ... sec_ago: never }
   "192.168.252.115@tcp": { connects: 0, ... sec_ago: never }

After 30s it is still stuck with first NID and never tried
any other, despite connect RPC timeout is about 5-10s in
ptlrpc.

Patch prevents RPC stuck on non-uptodate peer just by
dropping such request in ptl_send_rpc(). That lets ptlrpc
to keep control over connection request expiration and new
connect attempts, so all peers are tried one by one until
some is ready.

Results with patch:
nids_stats:
   "192.168.252.112@tcp": { connects: 4, ... sec_ago: 9 }
   "192.168.252.113@tcp": { connects: 4, ... sec_ago: 4 }
   "192.168.255.115@tcp": { connects: 3, ... sec_ago: 14 }

After the same 30s we had 11 connect attempts with all
failover NIDs tried

Patch modifies also LNetPeerDiscovered() to consider
a local peer as uptodate and return error code instead of
boolean.

Import uptodate state is also not boolen now but shows
discovery status

Test-Parameters: env=ONLY=153a,ONLY_REPEAT=10 testlist=conf-sanity
Signed-off-by: Mikhail Pershin <mpershin@whamcloud.com>
Change-Id: I51d8973aa8475ce1930f292c42aa22c70cfc13db
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/54286
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Serguei Smirnov <ssmirnov@whamcloud.com>
Reviewed-by: Frank Sehr <fsehr@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
5 weeks agoLU-18284 llite: disallow udio exceptions 71/56571/6
Shaun Tancheff [Fri, 4 Oct 2024 01:37:52 +0000 (08:37 +0700)]
LU-18284 llite: disallow udio exceptions

Require the unaligned_dio feature flag to be present and
disable limited udio support for 2.14 and 2.15 as these
exceptions cause failures with 2.12 servers.

Test-Parameters: trivial
Test-Parameters: testlist=sanity clientarch=x86_64 clientdistro=el9.4  serverversion=2.12.9 serverdistro=el7.9 env=ONLY="119e 119f 119g 119h 119i 119j",ONLY_REPEAT=10
Test-Parameters: testlist=sanity clientarch=x86_64 clientdistro=el9.2  serverversion=2.12.9 serverdistro=el7.9 env=ONLY="119e 119f 119g 119h 119i 119j",ONLY_REPEAT=10
Test-Parameters: testlist=sanity clientarch=x86_64 clientdistro=el8.10 serverversion=2.12.9 serverdistro=el7.9 env=ONLY="119e 119f 119g 119h 119i 119j",ONLY_REPEAT=10
Test-Parameters: testlist=sanity clientarch=x86_64 clientdistro=el8.10 serverversion=2.15.4 serverdistro=el8.9 env=ONLY="119e 119f 119g 119h 119i 119j",ONLY_REPEAT=10
Fixes: 07a1272233 ("LU-17525 llite: unaligned DIO interop page alignment")
Signed-off-by: Shaun Tancheff <shaun.tancheff@hpe.com>
Change-Id: If3d3d41f3f1f4f96279e9c3d7978f78dd7333259
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56571
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Timothy Day <timday@amazon.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
5 weeks agoLU-18101 sec: fix ACL handling on recent kernels again 52/56552/4
Sebastien Buisson [Tue, 1 Oct 2024 16:20:25 +0000 (18:20 +0200)]
LU-18101 sec: fix ACL handling on recent kernels again

On recent distributions like Ubuntu 24.04, the kernel offers the
.get_inode_acl op on struct inode_operations. This must be defined
and fetch ACLs, otherwise they can end up being incorrect on inodes.

Fixes: aa636f8ae6 ("LU-18095 sec: fix ACL handling on recent kernels")
Test-Parameters: testlist=sanityn env=ONLY=25a,ONLY_REPEAT=10 mdscount=2 mdtcount=4 osscount=1 ostcount=8 clientcount=2
Test-Parameters: testlist=sanityn env=ONLY=25a,ONLY_REPEAT=10 mdscount=2 mdtcount=4 osscount=1 ostcount=8 clientcount=2 clientdistro=ubuntu2404 serverdistro=el8.8
Test-Parameters: testlist=sanity-sec env=ONLY=23b,ONLY_REPEAT=50 mdscount=2 mdtcount=4 osscount=1 ostcount=8 clientcount=2 clientdistro=ubuntu2404 serverdistro=el8.8
Test-Parameters: testlist=sanity env=ONLY=103a,ONLY_REPEAT=50 mdscount=2 mdtcount=4 osscount=1 ostcount=8 clientcount=2 clientdistro=ubuntu2404 serverdistro=el8.8
Signed-off-by: Sebastien Buisson <sbuisson@ddn.com>
Change-Id: Idcc642a11f6f6198217e5eadb2a2c32e8117b8b7
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56552
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
5 weeks agoLU-17968 tests: sanity-hsm 35 slow VM timeout fix 16/55516/3
Frederick Dilger [Mon, 24 Jun 2024 21:10:12 +0000 (15:10 -0600)]
LU-17968 tests: sanity-hsm 35 slow VM timeout fix

Test 35 in sanity-hsm.sh has been failing intermittently since
2021-07-25, with a rate around 1/1300 runs. This is likely
caused by occasional slow VM behaviour so the timeout has been
increased from 1s to 2s to hopefully avoid future occurences
while maintaining test functionality.

Test-Patametets: trivial testlist=sanity-hsm
Test-Parameters: testlist=sanity-hsm env=ONLY=35,ONLY_REPEAT=100

Signed-off-by: Frederick Dilger <fdilger@whamcloud.com>
Change-Id: I4ed42e79d87cd8d73e373e074fa63cf6226d4471
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/55516
Tested-by: Maloo <maloo@whamcloud.com>
Tested-by: jenkins <devops@whamcloud.com>
Reviewed-by: Etienne AUJAMES <eaujames@ddn.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
5 weeks agoLU-16641 tests: fix sanity-quota_12b 69/53969/3
Sergey Cheremencev [Thu, 8 Feb 2024 12:37:29 +0000 (15:37 +0300)]
LU-16641 tests: fix sanity-quota_12b

Fix sanity-quota_12b to don't fail after
creating $ilimit files with the same inode
hardlmit. It is a legal case to fail create
2048 files when inode hard limit is also 2048.

Test-Parameters: trivial testlist=sanity-quota
Signed-off-by: Sergey Cheremencev <scherementsev@ddn.com>
Change-Id: Iea2e976ad1954dc2489ffa81e92e624364343069
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/53969
Tested-by: Maloo <maloo@whamcloud.com>
Tested-by: jenkins <devops@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Alex Zhuravlev <bzzz@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
5 weeks agoLU-18198 tests: lfsck-performance:test_5a availability check 38/56238/2
Vladimir Saveliev [Tue, 12 Mar 2024 17:52:06 +0000 (20:52 +0300)]
LU-18198 tests: lfsck-performance:test_5a availability check

lfsck-performance:test_5a is to check that OSTs have enough inodes.

HPE-bug-id: LUS-11205
Test-Parameters: mdtcount=4 testlist=lfsck-performance env=ONLY=5a,MINSUBDIR=4,MAXSUBDIR=4
Signed-off-by: Vladimir Saveliev <vladimir.saveliev@hpe.com>
Change-Id: Ifacb97eaebd2feebfafc2da7aeca4ec28ba978af
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56238
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
5 weeks agoLU-17251 test: improve parallel-scale rr_alloc test 40/52940/17
Alex Deiter [Fri, 27 Sep 2024 18:23:31 +0000 (11:23 -0700)]
LU-17251 test: improve parallel-scale rr_alloc test

Added checking for pre-created OST objects and waiting
(maximum 60 seconds) before executing the rr_alloc test.

Test-Parameters: trivial
Test-Parameters: mdtcount=4 mdscount=2 testlist=parallel-scale env=ONLY=rr_alloc,ONLY_REPEAT=8
Test-Parameters: mdtcount=4 mdscount=2 testlist=parallel-scale env=ONLY=rr_alloc,ONLY_REPEAT=8
Test-Parameters: mdtcount=4 mdscount=2 testlist=parallel-scale env=ONLY=rr_alloc,ONLY_REPEAT=8
Test-Parameters: testlist=parallel-scale env=ONLY=rr_alloc,ONLY_REPEAT=8
Test-Parameters: testlist=parallel-scale env=ONLY=rr_alloc,ONLY_REPEAT=8
Test-Parameters: testlist=parallel-scale env=ONLY=rr_alloc,ONLY_REPEAT=8

Signed-off-by: Alex Deiter <adeiter@tintri.com>
Change-Id: Ib604b99138ceccf384476ad2876d9df7cd7d524b
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/52940
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
5 weeks agoLU-18191 tests: sanity-quota 90b racer fix 32/56532/4
Frederick Dilger [Sat, 28 Sep 2024 21:01:30 +0000 (15:01 -0600)]
LU-18191 tests: sanity-quota 90b racer fix

It looks like there was a small race condition in the test. There was
one block of root user usage that would be freed since the start of
the test.

Now, if the "head" or "tail" does not match, it will be refetched and
tested to see if it is still not matching. This will handle the race
condition without adding overhead for the passing cases.

Test-Parameters: trivial testlist=sanity-quota env=ONLY=90b,ONLY_MINUTES=120
Signed-off-by: Frederick Dilger <fdilger@whamcloud.com>
Change-Id: Ica46a79320c64caec006287bc5657488d5d1956a
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56532
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Sergey Cheremencev <scherementsev@ddn.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
5 weeks agoLU-17525 test: re-enable sanity tests: 56x 56xa 56xb 47/55647/16
Shaun Tancheff [Thu, 19 Sep 2024 06:28:40 +0000 (13:28 +0700)]
LU-17525 test: re-enable sanity tests: 56x 56xa 56xb

The tests are expected to be working again.

Test-Parameters: trivial
Test-Parameters: testlist=sanity clientarch=aarch64 clientdistro=el9.3   serverdistro=el8.9 env=ONLY="56x 56xa 56xb",ONLY_REPEAT=10
Test-Parameters: testlist=sanity clientarch=ppc64le clientdistro=el8.9   serverdistro=el8.9 env=ONLY="56x 56xa 56xb",ONLY_REPEAT=10
Test-Parameters: testlist=sanity clientarch=aarch64 clientdistro=el9.3   serverversion=2.15.4 serverdistro=el8.9   env=ONLY="56x 56xa 56xb",ONLY_REPEAT=10
Test-Parameters: testlist=sanity clientarch=aarch64 clientdistro=el9.3   env=ONLY="56x 56xa 56xb",ONLY_REPEAT=10
Test-Parameters: testlist=sanity clientarch=x86_64 clientdistro=el8.10   env=ONLY="56x 56xa 56xb",ONLY_REPEAT=10
Test-Parameters: testlist=sanity clientarch=x86_64 clientdistro=el9.4   env=ONLY="56x 56xa 56xb",ONLY_REPEAT=10
Test-Parameters: testlist=sanity serverversion=2.15.4 serverdistro=el8.9   serverdistro=el8.9 env=ONLY="56x 56xa 56xb",ONLY_REPEAT=10
Signed-off-by: Shaun Tancheff <shaun.tancheff@hpe.com>
Change-Id: I497bcca269b0c69411e41ddeeb5d8f37a9346002
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/55647
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: James Simmons <jsimmons@infradead.org>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
5 weeks agoLU-4315 doc: remove usage of lgroff-macros 39/56439/2
Andreas Dilger [Fri, 20 Sep 2024 08:57:43 +0000 (02:57 -0600)]
LU-4315 doc: remove usage of lgroff-macros

A few updated man pages were incorrectly including the non-existent
lgroff-macros file.  That is not actually needed, and breaks them.

Fix lfs-setquota.1 man page to use --projid instead of --project.

Test-Parameters: trivial
Build-Parameters: distro=el9.3 arch=x86_64
Fixes: 5bb91eaff0 ("LU-4315 doc: updating lfs-[c-g] man page style")
Fixes: d2e3bff585 ("LU-7544 doc: improve setquota man page")
Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: I2bc485d2b1b90878ee8dbad978e05626ef3ebbe5
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56439
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Timothy Day <timday@amazon.com>
Reviewed-by: Frederick Dilger <fdilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
6 weeks agoNew RC 2.16.0-RC1 2.16.0-RC1 v2_16_0-RC1
Oleg Drokin [Mon, 30 Sep 2024 15:38:35 +0000 (11:38 -0400)]
New RC 2.16.0-RC1

Change-Id: I7405a94fb63a0a876112962b440d4313def1ccab
Signed-off-by: Oleg Drokin <green@whamcloud.com>
6 weeks agoLU-18049 mgc: fix memory corruption 00/56500/2
Sergey Cheremencev [Wed, 25 Sep 2024 16:27:49 +0000 (19:27 +0300)]
LU-18049 mgc: fix memory corruption

Fix memory corruption in mgc_apply_recovery_logs
caused by type address mistake of struct lnet_nid.
When mne_nid_count was > 1, at 2nd iteration it
stored nid at addr+400(sizeof(lnet_nid)*sizeof(lnet_nid))
instead of next array element, i.e. addr+20.
This caused a lot of memory corruptions with different
back traces, depending on the owner of memory located
near nid array. Corruptions usually happened in kmalloc-64.
It might corrupt the data inside slab objects or slub
service structres(freepointer).

Test-Parameters: trivial testlist=sanity-sec env=ONLY=31,ONLY_REPEAT=10 serverversion=2.15
Test-Parameters: trivial testlist=sanity-sec env=ONLY=31,ONLY_REPEAT=10 serverversion=2.15
Test-Parameters: trivial testlist=sanity-sec env=ONLY=31,ONLY_REPEAT=10 serverversion=EXA6

Fixes: e4d2d4ff74 ("LU-13306 mgc: handle large NID formats")
Signed-off-by: Sergey Cheremencev <scherementsev@ddn.com>
Change-Id: I3719a09a3814f24ef26c2b118de629b42d13313c
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56500
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Shaun Tancheff <shaun.tancheff@hpe.com>
Reviewed-by: James Simmons <jsimmons@infradead.org>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
6 weeks agoLU-18217 build: Ensure LINUX_RELEASE is defined 33/56333/3
Shaun Tancheff [Wed, 11 Sep 2024 16:07:56 +0000 (23:07 +0700)]
LU-18217 build: Ensure LINUX_RELEASE is defined

Ensure LINUX_RELEASE is defined to something consumable by buildrpm

Ex:
  sh ./autogen.sh
  ./configure --enable-dist
  make srpm

Test-Parameters: trivial
Fixes: 0c46ba62efb ("LU-17850 build: prefer LINUXRELEASE over uname -r")
Signed-off-by: Shaun Tancheff <shaun.tancheff@hpe.com>
Change-Id: I647a27729c4d14a3983edc74f19a420bb6968631
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56333
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Petros Koutoupis <petros.koutoupis@hpe.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
6 weeks agoLU-17993 llite: add further check for user buffer 04/56404/8
Shaun Tancheff [Tue, 24 Sep 2024 17:27:01 +0000 (00:27 +0700)]
LU-17993 llite: add further check for user buffer

The alignment check in ll_iov_iter_alignment for the user buffer
could be wrong if the IO count is less than PAGE_SIZE, add more
checks for it.

Linux commit v5.19-rc4-8-gcfa320f72882
   iov: introduce iov_iter_aligned

Added iov_iter_is_aligned which can do the additional checking,
provide an equivalent for older kernels.

Test-Parameters: testlist=sanity clientarch=aarch64 clientdistro=el9.3   serverdistro=el8.9 env=ONLY="119",SANITY_EXCEPT="119i",ONLY_REPEAT=50
Test-Parameters: testlist=sanity clientarch=ppc64le clientdistro=el8.9   serverdistro=el8.9 env=ONLY="119",SANITY_EXCEPT="119i",ONLY_REPEAT=50
Test-Parameters: testlist=sanity clientarch=aarch64 clientdistro=el9.3   serverversion=2.15.4 serverdistro=el8.9   env=ONLY="119",SANITY_EXCEPT="119i",ONLY_REPEAT=50
Test-Parameters: testlist=sanity clientarch=aarch64 clientdistro=el9.3   env=ONLY="119",SANITY_EXCEPT="119i",ONLY_REPEAT=50
Test-Parameters: testlist=sanity clientarch=x86_64 clientdistro=el8.10   env=ONLY="119",SANITY_EXCEPT="119i",ONLY_REPEAT=50
Test-Parameters: testlist=sanity clientarch=x86_64 clientdistro=el9.4   env=ONLY="119",SANITY_EXCEPT="119i",ONLY_REPEAT=50
Test-Parameters: testlist=sanity serverversion=2.15.4 serverdistro=el8.9   serverdistro=el8.9 env=ONLY="119",SANITY_EXCEPT="119i",ONLY_REPEAT=50
Signed-off-by: Hongchao Zhang <hongchao@whamcloud.com>
Signed-off-by: Shaun Tancheff <shaun.tancheff@hpe.com>
Change-Id: I640dc417e52cc5497e5dd898ea3018f990da3fda
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56404
Reviewed-by: Patrick Farrell <patrick.farrell@oracle.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
6 weeks agoLU-18049 obdclass: fix class_add_nids_to_uuid 99/56499/2
Sergey Cheremencev [Tue, 24 Sep 2024 23:23:21 +0000 (02:23 +0300)]
LU-18049 obdclass: fix class_add_nids_to_uuid

Store nid in un_nids array beginning from the
0-element. Without the patch it starts from
the 1st element because a counter is incremented
before copying.

Fixes: e4d2d4ff74 ("LU-13306 mgc: handle large NID formats")
Signed-off-by: Sergey Cheremencev <scherementsev@ddn.com>
Change-Id: I257825de515b083df6af8b8f3ab784b48074a1fe
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56499
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: James Simmons <jsimmons@infradead.org>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
6 weeks agoLU-18233 build: check %kernel_module_package_buildreqs 03/56403/2
Jian Yu [Wed, 18 Sep 2024 23:57:50 +0000 (16:57 -0700)]
LU-18233 build: check %kernel_module_package_buildreqs

If rpm macro %kernel_module_package_buildreqs is undefined,
building Lustre codes will report the following error:

  Dependency tokens must begin with alpha-numeric, '_' or '/':
  BuildRequires: %kernel_module_package_buildreqs

However, the real issue is that kernel-rpm-macros or
rpm-build rpm is not installed. And the error should be like:

  error: Failed build dependencies:
    kernel-rpm-macros is needed by ...

This patch fixes the above issue by checking if the macro
%kernel_module_package_buildreqs is defined before adding it
to BuildRequires so as to avoid the misleading error.

The patch also replaces suse_version with sle_version to get
the correct version value for SLES15 SP3.

Test-Parameters: trivial clientdistro=el9.4 testlist=sanity
Test-Parameters: trivial clientdistro=el8.10 testlist=sanity
Test-Parameters: trivial clientdistro=sles15sp6 \
  env=SANITY_EXCEPT="103a 120e" testlist=sanity

Fixes: 037840fb6b ("LU-12899 build: rhel8 not install kernel-rpm-macros")
Fixes: c39d269f9d ("LU-12214 build: add right kernel detect")
Fixes: 78c681d9f4 ("LU-16207 build: add rpm-build BuildRequires for SLES15 SP3")

Change-Id: I151d93c0c74ab98e6589393a914acf18b5471289
Signed-off-by: Jian Yu <yujian@whamcloud.com>
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56403
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Shaun Tancheff <shaun.tancheff@hpe.com>
Reviewed-by: Qian Yingjin <qian@ddn.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
6 weeks agoLU-18082 tests: fix sanity test_154i for interop 69/56369/4
Sebastien Buisson [Mon, 16 Sep 2024 13:43:54 +0000 (15:43 +0200)]
LU-18082 tests: fix sanity test_154i for interop

sanity test_154i can only pass if server side is also patched.
So skip this subtest in interop testing.

Test-Parameters: trivial
Test-Parameters: testlist=sanity env=ONLY=154i serverversion=2.15
Fixes: f5251cb7af ("LU-18082 utils: fid2path support for path exceeding PATH_MAX")
Signed-off-by: Sebastien Buisson <sbuisson@ddn.com>
Change-Id: Ic041fcd508e9aebb4c9edbf47bb585fb99c2ef7a
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56369
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Sarah Liu <sarah@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
7 weeks agoLU-14499 lnet: Revert "LU-13368 lnet: discard the callback" 37/41937/9
Chris Horn [Wed, 17 Aug 2022 18:11:58 +0000 (12:11 -0600)]
LU-14499 lnet: Revert "LU-13368 lnet: discard the callback"

The changes introduced by LU-13368 have been shown to cause
the o2iblnd shutdown procedure to hang on lustre_rmmod
as it infinitely waits for peers to disconnect.

Those changes were also shown to cause a memory leak. Revert them.
This reverts commit babf0232273467b7199ec9a7c36047b1968913df.

HPE-bug-id: LUS-11089
Signed-off-by: Serguei Smirnov <ssmirnov@whamcloud.com>
Signed-off-by: Chris Horn <chris.horn@hpe.com>
Change-Id: I489ae4af445b18df852ec35adc958c4fac33de09
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/41937
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Frank Sehr <fsehr@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
7 weeks agoLU-18195 ldiskfs: support lost+found NO_CHECKS for Ubunutu 70/56370/2
James Simmons [Mon, 16 Sep 2024 14:46:23 +0000 (08:46 -0600)]
LU-18195 ldiskfs: support lost+found NO_CHECKS for Ubunutu

The IGET_NO_CHECKS function was expanded to handle Lustre's
lost+found. This change breaks in that you can't apply the
patch ext4-add-IGET_NO_CHECKS-flags to the Ubuntu ext4 source
code.

Discovered in building with the new patch the original patch
was not applying to the Ubuntu ext4 source code correctly.
Now it does.

Test-Parameters: trivial
Change-Id: I58692608f3b0dd8fa970242cda4943965bec0671
Signed-off-by: James Simmons <jsimmons@infradead.org>
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56370
Reviewed-by: Timothy Day <timday@amazon.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Li Dongyang <dongyangli@ddn.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
7 weeks agoLU-17428 tests: restore recovery-small/10a lru_max_age 77/56377/2
Andreas Dilger [Tue, 17 Sep 2024 00:23:16 +0000 (18:23 -0600)]
LU-17428 tests: restore recovery-small/10a lru_max_age

Restore the longer lru_max_age in recovery-small test_10a since
this otherwise prevents the client from being evicted.

Skip the console message check for stuck MDS threads on subsequent
iterations of test_10a when the test is run in a loop, since message
ratelimiting may prevent the console message from being printed.

Test-Parameters: trivial testlist=recovery-small env=ONLY=10a,ONLY_REPEAT=10
Fixes: 357cae970c ("LU-17428 ldlm: reduce default lru_max_age")
Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: Ia8ba5f83aa001d3c810e13637754b0e169dc3b9b
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56377
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Timothy Day <timday@amazon.com>
Reviewed-by: Arshad Hussain <arshad.hussain@aeoncomputing.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
7 weeks agoLU-12066 tests: activate OSTs in recovery-small/26b 56/56356/6
Andreas Dilger [Fri, 13 Sep 2024 15:38:41 +0000 (08:38 -0700)]
LU-12066 tests: activate OSTs in recovery-small/26b

In recovery-small.sh test_26b ensure that all OST connections are
active and not disconnected due to being idle.  Otherwise, the
number of OST exports for the remaining client may go down while
the test is running and they will never reconnect.

Change the wait condition to consider success if more exports have
disconnected than expected.

Move wait_client_evicted() down to test_26b() since it is the only
test case where this helper function is being used.

Minor test script style improvements.

Fixes: 5a6ceb664f07 ("LU-7236 ptlrpc: idle connections can disconnect")
Test-Parameters: trivial testlist=recovery-small env=ONLY=26b,ONLY_REPEAT=100
Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: I7b21d89ea148c4fda6527870e491f59abb3ebbe5
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56356
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Shaun Tancheff <shaun.tancheff@hpe.com>
Reviewed-by: Arshad Hussain <arshad.hussain@aeoncomputing.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
7 weeks agoLU-18215 lnet: fix sanity-lnet/260 'Original values not set' error 58/56358/4
Frank Sehr [Fri, 13 Sep 2024 19:09:35 +0000 (12:09 -0700)]
LU-18215 lnet: fix sanity-lnet/260 'Original values not set' error

Increased timeout and retries again. Bypassed the the result check on
the rest values. Resetting to the original values doesn't seem to be a
use case that is applicable anyway since only the setting of new
values is important.

Test-Parameters: trivial testlist=sanity-lnet env=ONLY=260,ONLY_REPEAT=100
Signed-off-by: Frank Sehr <fsehr@whamcloud.com>
Change-Id: I95fbe97efb2161d3991928d300f582077a2d5fff
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56358
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Serguei Smirnov <ssmirnov@whamcloud.com>
Reviewed-by: Cyril Bordage <cbordage@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
7 weeks agoLU-6142 osp: SPDX for Object Storage Proxy 97/56297/2
Timothy Day [Sun, 8 Sep 2024 04:13:19 +0000 (00:13 -0400)]
LU-6142 osp: SPDX for Object Storage Proxy

Convert from verbose license text to SPDX.

Test-Parameters: trivial
Signed-off-by: Timothy Day <timday@amazon.com>
Change-Id: I2f02d3721001fadb009414c029ed2df9d08718cb
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56297
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: James Simmons <jsimmons@infradead.org>
Reviewed-by: Arshad Hussain <arshad.hussain@aeoncomputing.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
7 weeks agoLU-6142 osc: SPDX for Object Storage Client 96/56296/2
Timothy Day [Sun, 8 Sep 2024 04:04:31 +0000 (00:04 -0400)]
LU-6142 osc: SPDX for Object Storage Client

Convert from verbose license text to SPDX.

Test-Parameters: trivial
Signed-off-by: Timothy Day <timday@amazon.com>
Change-Id: Ie674e38f12f30625826a812bda7f2a1569e8e290
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56296
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: James Simmons <jsimmons@infradead.org>
Reviewed-by: Arshad Hussain <arshad.hussain@aeoncomputing.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
7 weeks agoLU-6142 mdd: SPDX for MetaData Server 95/56295/2
Timothy Day [Sat, 7 Sep 2024 23:42:04 +0000 (19:42 -0400)]
LU-6142 mdd: SPDX for MetaData Server

Convert from verbose license text to SPDX.

Test-Parameters: trivial
Signed-off-by: Timothy Day <timday@amazon.com>
Change-Id: I6875ad3d2711666e006626854408acf9b95d2701
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56295
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: James Simmons <jsimmons@infradead.org>
Reviewed-by: Arshad Hussain <arshad.hussain@aeoncomputing.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
7 weeks agoLU-6142 lod: SPDX for Logical Object Device 93/56293/2
Timothy Day [Sat, 7 Sep 2024 20:58:15 +0000 (16:58 -0400)]
LU-6142 lod: SPDX for Logical Object Device

Convert from verbose license text to SPDX.

Test-Parameters: trivial
Signed-off-by: Timothy Day <timday@amazon.com>
Change-Id: Ia87dec25013d13a9b70a47789335ad82b1600d15
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56293
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: James Simmons <jsimmons@infradead.org>
Reviewed-by: Arshad Hussain <arshad.hussain@aeoncomputing.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
7 weeks agoLU-6142 target: SPDX for generic target code 92/56292/2
Timothy Day [Sat, 7 Sep 2024 20:46:49 +0000 (16:46 -0400)]
LU-6142 target: SPDX for generic target code

Convert from verbose license text to SPDX.

Test-Parameters: trivial
Signed-off-by: Timothy Day <timday@amazon.com>
Change-Id: Ia42c117c639750be0bc770beeade80789b2ddc54
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56292
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: James Simmons <jsimmons@infradead.org>
Reviewed-by: Arshad Hussain <arshad.hussain@aeoncomputing.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
7 weeks agoLU-4315 doc: updating ll[d-v] man page style 90/56190/3
Frederick Dilger [Wed, 28 Aug 2024 20:45:11 +0000 (14:45 -0600)]
LU-4315 doc: updating ll[d-v] man page style

Updating files to match the new code style for Lustre manual pages as
enforced by 'contrib/scripts/checkpatch-man.pl'.

This also includes other changes like removing < > or { } for singular
required arguments and placing [ ] around optional ones as well as
making all arguments CAPITAL and italicized, literal arguments are
bolded. Lines over 80 characters should be split at the natural line
end rather than the word that goes over the limit as fewer lines will
need to be modified when making changes if each sentence is on it's
own line.

Only using features that appear in groff 1.22.3 as this is the
available version is CentOS 8.

Checked files:
- ll_decode_filter_fid.8
- ll_decode_linkea.8
- lljobstat.8
- llobdstat.8
- llog_reader.8
- llsom_sync.8
- llstat.8
- llverdev.8
- llverfs.8

Test-Parameters: trivial
Signed-off-by: Frederick Dilger <fdilger@whamcloud.com>
Change-Id: Id0b09d1e1826f95d6e5fe820f61ed5e32232ce90
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56190
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Arshad Hussain <arshad.hussain@aeoncomputing.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
7 weeks agoLU-4315 doc: updating llapi-[i-l] man page style 71/56171/6
Frederick Dilger [Tue, 27 Aug 2024 17:58:46 +0000 (11:58 -0600)]
LU-4315 doc: updating llapi-[i-l] man page style

Updating files to match the new code style for Lustre manual pages as
enforced by 'contrib/scripts/checkpatch-man.pl'.

This also includes other changes like removing < > or { } for singular
required arguments and placing [ ] around optional ones as well as
making all arguments CAPITAL and italicized, literal arguments are
bolded. Lines over 80 characters should be split at the natural line
end rather than the word that goes over the limit as fewer lines will
need to be modified when making changes if each sentence is on it's
own line.

Only using features that appear in groff 1.22.3 as this is the
available version is CentOS 8.

Checked files:
- llapi_ioctl.3
- llapi_ladvise.3
- llapi_layout.7
- llapi_layout_alloc.3
- llapi_layout_comp_add.3
- llapi_layout_comp_del.3
- llapi_layout_comp_extent_get.3
- llapi_layout_comp_extent_set.3
- llapi_layout_comp_flags_clear.3
- llapi_layout_comp_flags_get.3
- llapi_layout_comp_flags_set.3
- llapi_layout_comp_id_get.3
- llapi_layout_comp_use.3
- llapi_layout_comp_use_id.3
- llapi_layout_extension_size_get.3
- llapi_layout_extension_size_set.3
- llapi_layout_file_comp_add.3
- llapi_layout_file_comp_del.3
- llapi_layout_file_create.3
- llapi_layout_file_open.3

Test-Parameters: trivial
Signed-off-by: Frederick Dilger <fdilger@whamcloud.com>
Change-Id: I9e475fa6f170a52000b220e5e90f7e6c39bee737
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56171
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Arshad Hussain <arshad.hussain@aeoncomputing.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
7 weeks agoLU-6142 misc: SPDX for Lustre headers 24/55924/4
Timothy Day [Sat, 3 Aug 2024 18:15:02 +0000 (14:15 -0400)]
LU-6142 misc: SPDX for Lustre headers

Convert from verbose license text to SPDX.

Test-Parameters: trivial
Signed-off-by: Timothy Day <timday@amazon.com>
Change-Id: I3e35d1e8d53851852ba82e87ecd060d775d63f54
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/55924
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Arshad Hussain <arshad.hussain@aeoncomputing.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
8 weeks agoNew tag 2.15.91 2.15.91 v2_15_91
Oleg Drokin [Tue, 17 Sep 2024 14:41:37 +0000 (10:41 -0400)]
New tag 2.15.91

Change-Id: I7d9a6a5bf8292dc368cd9b5356b4254e65738d47
Signed-off-by: Oleg Drokin <green@whamcloud.com>
8 weeks agoLU-18212 ioctl: LL_IOC_FID2MDTIDX on server mount point 39/56339/2
Alex Zhuravlev [Sat, 24 Aug 2024 15:37:48 +0000 (23:37 +0800)]
LU-18212 ioctl: LL_IOC_FID2MDTIDX on server mount point

add LL_IOC_FID2MDTIDX ioctl support on server's mount point.

Signed-off-by: Alex Zhuravlev <bzzz@whamcloud.com>
Change-Id: If3c8c96e75573b812688686a331a38250826cd05
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56339
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
8 weeks agoLU-18169 osc: do not use deprecated NR_UNSTABLE_NFS 62/56162/5
Qian Yingjin [Mon, 26 Aug 2024 07:50:14 +0000 (15:50 +0800)]
LU-18169 osc: do not use deprecated NR_UNSTABLE_NFS

A performance test script hanged the whole system with the
configuration of NUMA and SLES15sp3 OS version.
However, when we disalbe the unstable_check for unstable pages,
the test can be passed:
lctl set_param llite.*.unstable_stats=0

Found the root reason finally: we are using NR_UNSTABLE_NFS
wrongly, it was deprecated on SLES15sp3 after backport from the
upstream linux kernel with some its own modifications:
NR_UNSTABLE_NFS, /* NFS unstable pages - DEPRECATED DO NOT USE */

This is a special bug in SLEL15 kernel.
Thus we check whether NR_UNSTABLE_NFS is defined but was
deprecated during autoconf checking.

Moreover, the cgroups (memcg) does not work for the newer kernel,
the reason is that NR_UNSTABLE_NFS was removed, and it is wrongly
using NR_ZONE_WRITE_PENDING for memory accounting.
According to the kernel patch:
"mm/writeback: discard NR_UNSTABLE_NFS, use NR_WRITEBACK instead"
kernel v5.8-rc1 commit: 8d92890bd6b8502d6aee4b37430ae6444ade7a8c
it should account unstable pages in NR_WRITEBACK and WB_WRITEBACK.
This patch fixes it accordingly.

This patch also removes the unnecessary NR_ZONE_WRITE_PENDING
check.

Fixes: d4094475c99(LU-16699 osc: Prefer NR_ZONE_WRITE_PENDING)
Signed-off-by: Qian Yingjin <qian@ddn.com>
Change-Id: I28e36aa152f1c683ad717a94c779069caab6c54c
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56162
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Shaun Tancheff <shaun.tancheff@hpe.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: James Simmons <jsimmons@infradead.org>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
8 weeks agoLU-18208 lnet: Server VM crashed: unable to handle 22/56322/3
Frank Sehr [Tue, 10 Sep 2024 23:30:21 +0000 (16:30 -0700)]
LU-18208 lnet: Server VM crashed: unable to handle

Revert "LU-18160 lnet: ensure lnetctl ping completes in a finite time"
t seems like the patch for LU-18160 introduced crashes. Maybe the
change from wait_for_complete from timeout to interupt. Reverting that
patch solved the problem.

This reverts commit 1666840bb06bbeeb35b2f9a51f9235c36886a3c6.

Test-Parameters: trivial testlist=sanity
Signed-off-by: Frank Sehr <fsehr@whamcloud.com>
Change-Id: Ie48185eb973eee65df2810d7acf940cf6981b83e
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56322
Reviewed-by: Timothy Day <timday@amazon.com>
Reviewed-by: Serguei Smirnov <ssmirnov@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Tested-by: jenkins <devops@whamcloud.com>
8 weeks agoLU-18199 scripts: fix ksocklnd-config gateway selection logic 21/56321/2
Serguei Smirnov [Tue, 10 Sep 2024 19:25:06 +0000 (12:25 -0700)]
LU-18199 scripts: fix ksocklnd-config gateway selection logic

To avoid disruption of network connectivity during LNet start-up,
fix gateway selection logic in ksocklnd-config script to:
        1) select only pingable gateways
        2) prefer to select the gateway featured in the default route
           with the "dev" matching the interface being configured

Fixes: 7f60b2b55 ("LU-17006 lnet: set up routes for going across subnets")
Test-Parameters: trivial testlist=sanity-lnet
Signed-off-by: Serguei Smirnov <ssmirnov@whamcloud.com>
Change-Id: I7fdfcf2d7e9acae18e9a2efb34a58c17d2887683
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56321
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Tested-by: Sebastien Buisson <sbuisson@ddn.com>
Tested-by: Cyril Bordage <cbordage@whamcloud.com>
Reviewed-by: Cyril Bordage <cbordage@whamcloud.com>
Reviewed-by: Frank Sehr <fsehr@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
8 weeks agoLU-18099 tests: disable userns checking in sanity-pcc/101a 12/56312/6
Feng Lei [Tue, 10 Sep 2024 02:00:22 +0000 (10:00 +0800)]
LU-18099 tests: disable userns checking in sanity-pcc/101a

ubuntu 24.04 has an additional security restriction on
unprivileged user namespace. Disable it in sanity-pcc/101a
temporarily on ubuntu2404 or later version.

Signed-off-by: Feng Lei <flei@whamcloud.com>
Test-Parameters: trivial
Test-Parameters: clientdistro=ubuntu2404 testlist=sanity-pcc env=ONLY=101a,ONLY_REPEAT=10
Change-Id: I9da81dc02e0784f4e40f1d4d276588fb354a481c
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56312
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Sebastien Buisson <sbuisson@ddn.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
8 weeks agoLU-18093 tests: error_ignore sanity.sh test_17e failure 72/56272/2
Emoly Liu [Fri, 6 Sep 2024 02:17:08 +0000 (10:17 +0800)]
LU-18093 tests: error_ignore sanity.sh test_17e failure

Ignore sanity.sh test_17e failure on ubuntu 24.04 client.

Test-Parameters: trivial testlist=sanity env=ONLY=17e \
clientdistro=ubuntu2404 serverdistro=el9.4

Signed-off-by: Emoly Liu <emoly@whamcloud.com>
Change-Id: I5dfb9915c92d23c44b431328e8dddd087b46146d
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56272
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Li Xi <lixi@ddn.com>
Reviewed-by: Lai Siyao <lai.siyao@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
8 weeks agoLU-17468 tests: skip sanity/27Ci for old servers 64/56264/3
Andreas Dilger [Thu, 5 Sep 2024 05:57:20 +0000 (23:57 -0600)]
LU-17468 tests: skip sanity/27Ci for old servers

Skip overstriping test for servers (2.12.x) that do not support
file layout overstriping.

Test-Parameters: trivial testlist=sanity env=ONLY="27Ci 27W" serverversion=2.12
Fixes: 90c093b785 ("LU-17468 lod: component add missed pattern info")
Fixes: 5b99b881c4 ("LU-17628 lfs: add lfs_setstripe admin restrict")
Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: I67bdae4154558339df8c908cd0b5e0825d3ebbe5
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56264
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Patrick Farrell <patrick.farrell@oracle.com>
Reviewed-by: Zhenyu Xu <bobijam@hotmail.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
8 weeks agoLU-17567 tests: skip sanity/27T for older OSTs 61/56261/2
Andreas Dilger [Thu, 5 Sep 2024 02:35:30 +0000 (20:35 -0600)]
LU-17567 tests: skip sanity/27T for older OSTs

Skip sanity test_27T for OSTs < 2.14.57.71 since the test depends on
changes made to the OST code on the server.

Test-Parameters: trivial testlist=sanity env=ONLY=27T serverversion=2.14
Fixes: 73d5ee7033 ("LU-15459 llite: clear async errors on write commit sync")
Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: I15dad430c52a351e21cf85511e2c55de07740087
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56261
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Vladimir Saveliev <vladimir.saveliev@hpe.com>
Reviewed-by: Andrew Perepechko <andrew.perepechko@hpe.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
8 weeks agoLU-18195 osd-ldiskfs: use IGET_NO_CHECKS for objects under lost+found 76/56276/3
Li Dongyang [Fri, 6 Sep 2024 11:20:24 +0000 (21:20 +1000)]
LU-18195 osd-ldiskfs: use IGET_NO_CHECKS for objects under lost+found

With e2fsck older than v1.47.1-rc2-10-g849a9e6e1,
e2fsck could link ea inode under lost+found, and this
creates problems on kernels enforce ea_inode flag checks.

Check for lost+found fid and use NO_CHECKS flag
in the lfsck code path.

Add s_lpf_ino in sbi, and set it by find the lost+found
entry during ext4_fill_super().
In ext4_lookup, check if the parent is the lost+found dir.
Note lost+found won't be changed when target mounted as
lustre, so we only lookup once during mount and don't
monitor lost+found entry. Even if it's changed duing e2fsck
or mounted as ldiskfs, we could still detect at next mount.

Test-Parameters: trivial
Test-Parameters: fstype=ldiskfs serverdistro=el9.4 testlist=conf-sanity env=ONLY=61b
Change-Id: I95492fcc2f0ec4188dfb92b1dc1aa6055cab6ca8
Signed-off-by: Li Dongyang <dongyangli@ddn.com>
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56276
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Alex Zhuravlev <bzzz@whamcloud.com>
Reviewed-by: Li Xi <lixi@ddn.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
8 weeks agoLU-18095 sec: fix ACL handling on recent kernels 54/56254/11
Sebastien Buisson [Wed, 4 Sep 2024 08:36:42 +0000 (10:36 +0200)]
LU-18095 sec: fix ACL handling on recent kernels

On recent distributions like Ubuntu 24.04, the kernel imposes that
ACLs are fetched via the dedicated .get_acl operation (or
.get_inode_acl) instead of doing this via the xattr handlers.
So in ll_get_acl() explicitly fetch the xattr containing ACLs,
XATTR_NAME_ACL_ACCESS or XATTR_NAME_ACL_DEFAULT. This is going to
populate to xattr cache, hence avoiding multiple requests to the MDS.

Also fix sanity-sec test_23b to make sure variable comparisons are
correct. And fix test cleanup to avoid leftovers.

Test-Parameters: testlist=sanity-sec env=ONLY=23b,ONLY_REPEAT=50 mdscount=2 mdtcount=4 osscount=1 ostcount=8 clientcount=2
Test-Parameters: testlist=sanity-sec env=ONLY=23b,ONLY_REPEAT=50 mdscount=2 mdtcount=4 osscount=1 ostcount=8 clientcount=2 clientdistro=ubuntu2404 serverdistro=el9.4
Test-Parameters: testlist=sanity env=ONLY=103a,ONLY_REPEAT=50 mdscount=2 mdtcount=4 osscount=1 ostcount=8 clientcount=2 clientdistro=ubuntu2404 serverdistro=el9.4
Signed-off-by: Sebastien Buisson <sbuisson@ddn.com>
Change-Id: I467d5a558eaa524e823527a8798478934f65abf9
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56254
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
8 weeks agoLU-4315 doc: updating llapi-[cr-f] man page style 38/56138/3
Frederick Dilger [Fri, 23 Aug 2024 16:24:33 +0000 (10:24 -0600)]
LU-4315 doc: updating llapi-[cr-f] man page style

Updating files to match the new code style for Lustre manual pages as
enforced by 'contrib/scripts/checkpatch-man.pl'.

This also includes other changes like removing < > or { } for singular
required arguments and placing [ ] around optional ones as well as
making all arguments CAPITAL and italicized, literal arguments are
bolded. Lines over 80 characters should be split at the natural line
end rather than the word that goes over the limit as fewer lines will
need to be modified when making changes if each sentence is on it's
own line.

Only using features that appear in groff 1.22.3 as this is the
available version is CentOS 8.

Checked files:
- llapi_create_volatile_param.3
- llapi_fd2parent.3
- llapi_fid_parse.3
- llapi_fid_to_handle.3
- llapi_file_create.3
- llapi_file_create_foreign.3
- llapi_file_get_stripe.3
- llapi_file_open.3

Test-Parameters: trivial
Signed-off-by: Frederick Dilger <fdilger@whamcloud.com>
Change-Id: I9b9728e9611da455cb4ff61f424f4b976c533b36
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56138
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Arshad Hussain <arshad.hussain@aeoncomputing.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
8 weeks agoLU-4315 doc: updating lfs-[h-l] man page style 90/56090/3
Maximilian Dilger [Wed, 7 Aug 2024 09:34:02 +0000 (03:34 -0600)]
LU-4315 doc: updating lfs-[h-l] man page style

Updating files to match the new code style for Lustre manual pages as
enforced by 'contrib/scripts/checkpatch-man.pl'.

This also includes other changes like removing < > or { } for singular
required arguments and placing [ ] around optional ones as well as
making all arguments CAPITAL and italicized, literal arguments are
bolded. Lines over 80 characters should be split at the natural line
end rather than the word that goes over the limit as fewer lines will
need to be modified when making changes if each sentence is on it's
own line.

Only using features that appear in groff 1.22.3 as this is the
available version is CentOS 8.

Checked files:
- lfs-heat_get.1
- lfs-heat_set.1
- lfs-hsm.1
- lfs-hsm_action.1
- lfs-hsm_clear.1
- lfs-hsm_set.1
- lfs-hsm_state.1
- lfs-ladvise.1

Test-Parameters: trivial
Signed-off-by: Frederick Dilger <fdilger@whamcloud.com>
Change-Id: I58223c11b886fcc8abcf9aab37d72e36a5d98af3
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56090
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Arshad Hussain <arshad.hussain@aeoncomputing.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
8 weeks agoLU-4315 doc: updating lfs-[c-g] man page style 72/56072/6
Frederick Dilger [Thu, 15 Aug 2024 07:03:19 +0000 (01:03 -0600)]
LU-4315 doc: updating lfs-[c-g] man page style

Updating files to match the new code style for Lustre manual pages as
enforced by 'contrib/scripts/checkpatch-man.pl'.

This also includes other changes like removing < > or { } for singular
required arguments and placing [ ] around optional ones as well as
making all arguments CAPITAL and italicized, literal arguments are
bolded. Lines over 80 characters should be split at the natural line
end rather than the word that goes over the limit as fewer lines will
need to be modified when making changes if each sentence is on it's
own line.

Only using features that appear in groff 1.22.3 as this is the
available version is CentOS 8.

Checked files:
- lfs-changelog.1
- lfs-changelog_clear.1
- lfs-df.1
- lfs-fid2path.1
- lfs-find.1
- lfs-flushctx.1
- lfs-getdirstripe.1
- lfs-getname.1
- lfs-getsom.1
- lfs-getstripe.1

Test-Parameters: trivial
Signed-off-by: Frederick Dilger <fdilger@whamcloud.com>
Change-Id: I238e137b3d85972229d256e09f620ec8112fd5ea
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56072
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Arshad Hussain <arshad.hussain@aeoncomputing.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
8 weeks agoLU-4315 doc: updating lctl-[node] man page style 59/56059/9
Frederick Dilger [Wed, 14 Aug 2024 17:48:46 +0000 (11:48 -0600)]
LU-4315 doc: updating lctl-[node] man page style

Updating files to match the new code style for Lustre manual pages as
enforced by 'contrib/scripts/checkpatch-man.pl'.

This also includes other changes like removing < > or { } for singular
required arguments and placing [ ] around optional ones as well as
making all arguments CAPITAL and italicized, literal arguments are
bolded. Lines over 80 characters should be split at the natural line
end rather than the word that goes over the limit as fewer lines will
need to be modified when making changes if each sentence is on it's
own line.

Only using features that appear in groff 1.22.3 as this is the
available version is CentOS 8.

Checked files:
- lctl-nodemap-activate.8
- lctl-nodemap-add.8
- lctl-nodemap-add-idmap.8
- lctl-nodemap-add-range.8
- lctl-nodemap-del.8
- lctl-nodemap-del-idmap.8
- lctl-nodemap-del-range.8
- lctl-nodemap-modify.8
- lctl-nodemap-set-fileset.8
- lctl-nodemap-set-sepol.8

Test-Parameters: trivial
Signed-off-by: Frederick Dilger <fdilger@whamcloud.com>
Change-Id: I48a17b30f6e32918f5e1a0e874faaa18c39f65e0
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56059
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Maximilian Dilger <mdilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
8 weeks agoLU-4315 doc: updating lctl-[li-ne] man page style 55/56055/5
Frederick Dilger [Wed, 14 Aug 2024 05:42:04 +0000 (23:42 -0600)]
LU-4315 doc: updating lctl-[li-ne] man page style

Updating files to match the new code style for Lustre manual pages as
enforced by 'contrib/scripts/checkpatch-man.pl'.

This also includes other changes like removing < > or { } for singular
required arguments and placing [ ] around optional ones as well as
making all arguments CAPITAL and italicized, literal arguments are
bolded. Lines over 80 characters should be split at the natural line
end rather than the word that goes over the limit as fewer lines will
need to be modified when making changes if each sentence is on it's
own line.

Only using features that appear in groff 1.22.3 as this is the
available version is CentOS 8.

Checked files:
- lctl-list_param.8
- lctl-llog_cancel.8
- lctl-llog_catlist.8
- lctl-llog_info.8
- lctl-llog_print.8
- lctl-network.8

Test-Parameters: trivial
Signed-off-by: Frederick Dilger <fdilger@whamcloud.com>
Change-Id: I97cd7e8665794ea80eac4c90c472826057cc71c7
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56055
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Arshad Hussain <arshad.hussain@aeoncomputing.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
8 weeks agoLU-4315 doc: updating lctl-[e-lf] man page style 28/56028/10
Frederick Dilger [Tue, 13 Aug 2024 23:19:59 +0000 (17:19 -0600)]
LU-4315 doc: updating lctl-[e-lf] man page style

Updating files to match the new code style for Lustre manual pages as
enforced by 'contrib/scripts/checkpatch-man.pl'.

This also includes other changes like removing < > or { } for singular
required arguments and placing [ ] around optional ones as well as
making all arguments CAPITAL and italicized, literal arguments are
bolded. Lines over 80 characters should be split at the natural line
end rather than the word that goes over the limit as fewer lines will
need to be modified when making changes if each sentence is on it's
own line.

Only using features that appear in groff 1.22.3 as this is the
available version is CentOS 8.

Checked files:
- lctl-erase_lcfg.8
- lctl-fork_lcfg.8
- lctl-get_param.8
- lctl-lcfg_clear.8
- lctl-lcfg_erase.8
- lctl-lcfg_fork.8
- lctl-lfsck-query.8
- lctl-lfsck-start.8
- lctl-lfsck-stop.8

Test-Parameters: trivial
Signed-off-by: Frederick Dilger <fdilger@whamcloud.com>
Change-Id: Ie4c66833f9aa577d1b6f57cbb079211450efdd4c
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56028
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Arshad Hussain <arshad.hussain@aeoncomputing.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
8 weeks agoLU-4315 doc: updating lctl-[b-d] man page style 27/56027/6
Frederick Dilger [Tue, 13 Aug 2024 09:30:38 +0000 (03:30 -0600)]
LU-4315 doc: updating lctl-[b-d] man page style

Updating files to match the new code style for Lustre manual pages as
enforced by 'contrib/scripts/checkpatch-man.pl'. This also includes
other changes like removing < > or { } for singular required
arguments and placing [ ] around optional ones as well as making all
arguments CAPITAL and italicized, literal arguments are bolded.

Only using features that appear in groff 1.22.3 as this is the
available version is CentOS 8.

Checked files:
- lctl-barrier.8
- lctl-changelog_deregister.8
- lctl-changelog_register.8
- lctl-clear_conf.8
- lctl-del_ost.8
- lctl-device_list.8
- lctl-dl.8

Test-Parameters: trivial
Signed-off-by: Frederick Dilger <fdilger@whamcloud.com>
Change-Id: Id214e36300b1b0806e7566cf71d3f97d778f8b79
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56027
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Arshad Hussain <arshad.hussain@aeoncomputing.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
8 weeks agoLU-4315 doc: add man pages to checkpatch 54/55954/54
Frederick Dilger [Wed, 7 Aug 2024 09:34:02 +0000 (03:34 -0600)]
LU-4315 doc: add man pages to checkpatch

Add formatting rules for Lustre Manual pages to checkpatch.
This will encourage not only a uniform formatting for new manual
pages, but it will help when updating older manual pages to ensure
they are following new formatting style.

Note that there is a new AVAILABILITY section that is required for
all pages. This section should include the release and commit
versions that the subject of the page was added.
The checkpatch-man.pl script will generate version based on
what appear in the NAME section of the manual which is also
a required section. This only occurs if the AVAILABILITY
section already exists but is missing either "release X.X.X"
or "commit X.X.X-*"

Using the following pages as guidelines for creating style
rules for man pages:
https://liw.fi/manpages/
https://man7.org/linux/man-pages/man7/man-pages.7.html

Test-Parameters: trivial
Signed-off-by: Frederick Dilger <fdilger@whamcloud.com>
Change-Id: I5bd7af0cdbeb49a9edc07d0cc2f8a0dccca39a9e
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/55954
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Timothy Day <timday@amazon.com>
Reviewed-by: Arshad Hussain <arshad.hussain@aeoncomputing.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 months agoLU-15253 mdt: object version race 20/45620/8
Andriy Skulysh [Mon, 31 Aug 2020 08:14:34 +0000 (11:14 +0300)]
LU-15253 mdt: object version race

MDT object version should be obtained under LDLM lock.

Change-Id: Ib828a4065d56e80cbe488a3fdfc6c5da808b20d0
HPE-bug-id: LUS-9280
Signed-off-by: Andriy Skulysh <andriy.skulysh@hpe.com>
Signed-off-by: Shaun Tancheff <shaun.tancheff@hpe.com>
Reviewed-by: Alexander Boyko <alexander.boyko@hpe.com>
Reviewed-by: Andrew Perepechko <andrew.perepechko@hpe.com>
Reviewed-by: Vitaly Fertman <vitaly.fertman@hpe.com>
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/45620
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
2 months agoLU-16741 osc: rename ptlrpc_req_finished for component osc 94/54694/4
Arshad Hussain [Mon, 8 Apr 2024 11:24:44 +0000 (07:24 -0400)]
LU-16741 osc: rename ptlrpc_req_finished for component osc

Patch renames ptlrpc_req_finished to ptlrpc_req_put for
osc component

Test-Parameters: trivial
Signed-off-by: Arshad Hussain <arshad.hussain@aeoncomputing.com>
Change-Id: Ica158cc1704ad13a10eef834555b868e7573dfb3
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/54694
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Timothy Day <timday@amazon.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
2 months agoLU-18106 obdclass: Fix evict-by-nid for long NIDs 81/56281/2
Chris Horn [Fri, 6 Sep 2024 15:27:21 +0000 (09:27 -0600)]
LU-18106 obdclass: Fix evict-by-nid for long NIDs

LNet NIDs can be up to LNET_NIDSTR_SIZE (64) bytes (including the NUL
byte), but we are only copying UUID_MAX + 3 (43) bytes of the
userspace buffer. NIDs larger than this would get truncated, no
matching export found, and thus no eviction. Fix it by copying
LNET_NIDSTR_SIZE - 1 bytes.

Test-Parameters: trivial
Test-Parameters: testlist=conf-sanity env=ONLY=91,FORCE_LARGE_NID=true,LOAD_MODULES_REMOTE=true
Signed-off-by: Chris Horn <chris.horn@hpe.com>
Change-Id: I9ca15a89f0d41a6ca355a690a7f9fb25a80d8f82
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56281
Reviewed-by: James Simmons <jsimmons@infradead.org>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
2 months agoLU-18027 tests: skip sanity/56ei in interop testing 65/56265/3
Andreas Dilger [Thu, 5 Sep 2024 06:22:16 +0000 (00:22 -0600)]
LU-18027 tests: skip sanity/56ei in interop testing

sanity test_56ei is checking for projid on special files, but this
is not working with older servers.  Skip it in this case.

Test-Parameters: trivial testlist=sanity env=ONLY=56ei serverversion=2.14
Fixes: 501e5b2c8a ("LU-18027 lfs: lfs find handling special files")
Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: Id20b2677d84e7f3504f410262ead6e8c7e83af62
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56265
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Shaun Tancheff <shaun.tancheff@hpe.com>
Reviewed-by: Caleb Carlson <caleb.carlson@hpe.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 months agoLU-18141 obdclass: don't check for NULL before free'ing 42/56042/3
Timothy Day [Wed, 14 Aug 2024 04:56:29 +0000 (00:56 -0400)]
LU-18141 obdclass: don't check for NULL before free'ing

The common free'ing macros already check for NULL,
so we don't need to explicitly check this beforehand.

The patch has been generated with the coccinelle script below.

@@
expression E;
@@
- if (E != NULL)
(
  OBD_FREE_PTR(E);
|
  OBD_FREE(E, ...);
|
  LIBCFS_FREE(E, ...);
|
  CFS_FREE_PTR(E);
|
  CFS_FREE_PTR_ARRAY(E, ...);
)

Test-Parameters: trivial
Signed-off-by: Timothy Day <timday@amazon.com>
Change-Id: Ie5d5567fecc1f421fb0894816c880b41ea93843b
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/56042
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Arshad Hussain <arshad.hussain@aeoncomputing.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: James Simmons <jsimmons@infradead.org>
Reviewed-by: Oleg Drokin <green@whamcloud.com>