git://git.whamcloud.com - fs/lustre-release.git/atom - lustre/ldlm history

LU-17705 ptlrpc: replace synchronize_rcu() with rcu_barrier()

2024-04-04T08:14:10Z

LU-17705 ptlrpc: replace synchronize_rcu() with rcu_barrier()

synchronize_rcu() does not wait for in-flight RCU callbacks to complete,
so kmem_cache_free() can still race with kmem_cache_destroy().

Fixes: a9411a9856a ("LU-17076 nrs: wait for RCU completion")
Signed-off-by: Alex Zhuravlev 
Change-Id: I2da668c06b532a41c8ce2fe681ea17cf6f3013ef
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/54669
Tested-by: jenkins 
Tested-by: Maloo 
Reviewed-by: Oleg Drokin 
Reviewed-by: Andreas Dilger 
Reviewed-by: Shaun Tancheff 
Reviewed-by: Neil Brown 
Reviewed-by: James Simmons

lustre/ldlm/ldlm_lockd.c

LU-17680 ldlm: fix ldlm_res_hop_hash() argument

2024-03-27T23:08:04Z

LU-17680 ldlm: fix ldlm_res_hop_hash() argument

Change ldlm_res_hop_hash() to use "bits" instead of "mask" as the
last argument.  Otherwise, this is hashing all of the LDLM resources
down to very few buckets (e.g. "hash & 8" or "hash & 11") instead of
the full bit range (e.g. "hash & ((1 << 8) - 1)").
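
The effect of masking with the bit count itself, rather than with a mask derived from it, can be illustrated with a small userspace sketch. This is not the Lustre code: `count_buckets()` and the two wrappers are hypothetical helpers, and the "hashes" are simply the integers 0..nhash-1.

```c
#include <stdint.h>

#define SKETCH_MAX_BUCKETS 4096u	/* assumption: bits <= 12 in this sketch */

/* Count how many distinct buckets are reachable when the values
 * 0..nhash-1 are reduced with "hash & mask". */
static unsigned count_buckets(uint32_t mask, uint32_t nhash)
{
	unsigned char seen[SKETCH_MAX_BUCKETS] = { 0 };
	unsigned used = 0;

	if (mask >= SKETCH_MAX_BUCKETS)
		return 0;	/* outside the range this sketch supports */

	for (uint32_t h = 0; h < nhash; h++) {
		uint32_t b = h & mask;

		if (!seen[b]) {
			seen[b] = 1;
			used++;
		}
	}
	return used;
}

/* Buggy reduction: the bit count is used directly as the mask. */
static unsigned buckets_with_bits_as_mask(unsigned bits, uint32_t nhash)
{
	return count_buckets(bits, nhash);
}

/* Intended reduction: build the mask from the bit count. */
static unsigned buckets_with_proper_mask(unsigned bits, uint32_t nhash)
{
	return count_buckets((1u << bits) - 1, nhash);
}
```

With bits = 8 the buggy form can only ever produce the values 0 and 8, while the proper mask reaches all 256 buckets.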

This is causing significant slowdowns for file creation performance,
in particular sanity test_123ac was running 2x slower when creating
a large number of files because of the bad hashing.

Fixes: ce404bd07c ("LU-17174 misc: fix hash functions")
Signed-off-by: Andreas Dilger 
Change-Id: I823b370bdb7fac4e673d1b479204dd1216931fb6
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/54585
Tested-by: jenkins 
Tested-by: Maloo 
Reviewed-by: Alex Zhuravlev 
Reviewed-by: Oleg Drokin 
Reviewed-by: James Simmons 
Reviewed-by: Alexey Lyashkov

lustre/ldlm/ldlm_resource.c

LU-17392 build: compatibility updates for kernel 6.7

2024-02-29T20:55:58Z

LU-17392 build: compatibility updates for kernel 6.7

Linux commit v6.6-rc4-53-gc42d50aefd17
  mm: shrinker: add infrastructure for dynamically allocating
      shrinker

Users of struct shrinker must dynamically allocate shrinker objects
to avoid run-time warnings.

Provide a wrapper for older kernels to alloc+register shrinkers
and unregister+free.

Use get_group_info() and put_group_info() wrappers instead of
open coding the reference counting on group_info.usage.

Signed-off-by: Shaun Tancheff 
Change-Id: Ie07bdb7fe3eb6060bd84f95f860f1b53d120a605
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/53621
Tested-by: Maloo 
Tested-by: jenkins 
Reviewed-by: Oleg Drokin 
Reviewed-by: Andreas Dilger 
Reviewed-by: Jian Yu 
Reviewed-by: James Simmons

lustre/ldlm/ldlm_pool.c

LU-17589 flock: Flock blocking information becomes stale

2023-08-18T11:23:25Z

LU-17589 flock: Flock blocking information becomes stale

Blocking information keeps pointing to an already cancelled lock.

Find a new blocker on each reprocess.

Change-Id: I8d353795170f4fd0ae55dd646035cf8feb4cc162
HPE-bug-id: LUS-11784, LUS-11999
Signed-off-by: Andriy Skulysh 
Reviewed-by: Vitaly Fertman 
Reviewed-by: Alexander Zarochentsev 
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/54219
Tested-by: jenkins 
Tested-by: Maloo 
Reviewed-by: Yang Sheng 
Reviewed-by: Oleg Drokin

lustre/ldlm/ldlm_flock.c

LU-11085 tests: Add performance test for ldlm_extent code

2024-02-20T00:57:29Z

LU-11085 tests: Add performance test for ldlm_extent code

Add a new test module "ldlm_extent" which exercises the extent code by
creating multiple extent locks, and discarding them.
Each run is timed and a number of runs are combined to provide a
mean and standard deviation.

Two different tests are performed, each ramping up the number of locks
kept so that any scalability issues become visible:
1/ create lots of non-overlapping extents in
   random order, keeping up to 8000 at a time.
2/ create both random tiny extents and whole-file
   extents, alternating.  Keep up to 1,000,000.
   These are PR and so don't conflict.

Each test runs for at most 5 minutes
(30 loops of 10 seconds each = 300 seconds).
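
Combining per-run timings into a mean and standard deviation can be sketched in userspace as follows. This is only an illustration, not the test module's code: `summarize_runs()`, `struct run_stats` and `sketch_sqrt()` are hypothetical names, and using the sample (n - 1) standard deviation is an assumption.

```c
#include <stddef.h>

struct run_stats {
	double mean;
	double stddev;
};

/* Newton's method square root, so the sketch does not need libm. */
static double sketch_sqrt(double x)
{
	double r;

	if (x <= 0.0)
		return 0.0;
	r = x > 1.0 ? x : 1.0;
	for (int i = 0; i < 64; i++)
		r = 0.5 * (r + x / r);
	return r;
}

/* Welford's online algorithm: one pass, numerically stable. */
static struct run_stats summarize_runs(const double *v, size_t n)
{
	struct run_stats s = { 0.0, 0.0 };
	double mean = 0.0, m2 = 0.0;

	for (size_t i = 0; i < n; i++) {
		double delta = v[i] - mean;

		mean += delta / (double)(i + 1);
		m2 += delta * (v[i] - mean);
	}
	if (n > 0)
		s.mean = mean;
	if (n > 1)
		s.stddev = sketch_sqrt(m2 / (double)(n - 1));
	return s;
}
```

A single-pass accumulator such as this avoids storing every sample when many timed loops are combined.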

Test-Parameters: trivial env=SLOW=yes env=ONLY=842 testlist=sanity
Signed-off-by: Mr NeilBrown 
Change-Id: I552da3c64fb467cbefb7d25eee709dd038bd454f
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/54204
Reviewed-by: Andreas Dilger 
Reviewed-by: James Simmons 
Reviewed-by: Timothy Day 
Reviewed-by: Oleg Drokin 
Tested-by: jenkins 
Tested-by: Maloo

lustre/ldlm/ldlm_internal.h
lustre/ldlm/ldlm_lock.c
lustre/ldlm/ldlm_resource.c

LU-17022 obdclass: convert obd_conn_inprogress to atomic_t

2024-02-25T23:41:46Z

LU-17022 obdclass: convert obd_conn_inprogress to atomic_t

Using atomic_t for obd_conn_inprogress means we don't need to take a
spinlock.
Also send a wakeup when the value reaches zero, and wait for the wakeup
instead of using a yield() loop.
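
The pattern described above (an atomic counter, a wakeup when it drops to zero, and a sleeping waiter instead of a yield loop) can be sketched in userspace with C11 atomics and a pthread condition variable. This is an analogy only: the kernel patch uses kernel primitives, and `struct conn_tracker` and the `conn_*()` functions are hypothetical names.

```c
#include <pthread.h>
#include <stdatomic.h>

struct conn_tracker {
	atomic_int inprogress;	/* updated without taking any lock */
	pthread_mutex_t lock;	/* only serializes sleep and wakeup */
	pthread_cond_t zero;	/* signalled when inprogress hits 0 */
};

static void conn_tracker_init(struct conn_tracker *t)
{
	atomic_init(&t->inprogress, 0);
	pthread_mutex_init(&t->lock, NULL);
	pthread_cond_init(&t->zero, NULL);
}

static void conn_begin(struct conn_tracker *t)
{
	atomic_fetch_add(&t->inprogress, 1);
}

static void conn_end(struct conn_tracker *t)
{
	/* atomic_fetch_sub() returns the previous value, so 1 means
	 * this call brought the counter down to zero. */
	if (atomic_fetch_sub(&t->inprogress, 1) == 1) {
		pthread_mutex_lock(&t->lock);
		pthread_cond_broadcast(&t->zero);
		pthread_mutex_unlock(&t->lock);
	}
}

/* Sleep until no connection is in progress, instead of spinning. */
static void conn_wait_idle(struct conn_tracker *t)
{
	pthread_mutex_lock(&t->lock);
	while (atomic_load(&t->inprogress) != 0)
		pthread_cond_wait(&t->zero, &t->lock);
	pthread_mutex_unlock(&t->lock);
}
```

The waiter re-checks the counter under the mutex and the waker broadcasts under the same mutex, so a wakeup cannot slip in between the check and the sleep.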

Change-Id: I9af29e068203cde951e592c408906d121702fa18
Signed-off-by: Mr NeilBrown 
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/51906
Tested-by: jenkins 
Tested-by: Maloo 
Reviewed-by: Andreas Dilger 
Reviewed-by: Timothy Day 
Reviewed-by: Arshad Hussain 
Reviewed-by: James Simmons 
Reviewed-by: Oleg Drokin

lustre/ldlm/ldlm_lib.c

LU-17484 gss: reply error for SEC_CTX_INIT on wrong node

2024-02-08T12:44:21Z

LU-17484 gss: reply error for SEC_CTX_INIT on wrong node

When a server receives a SEC_CTX_INIT request for a target that is not
available (either stopping, or not set up yet, or moved to a failover
node), the request gets dropped. This makes the client-side RPC time
out, increasing the time it takes to establish a proper gss context
with the target, because it slows down the HA mechanism that tries
alternate failover NIDs.
Instead of dropping the SEC_CTX_INIT request, the server
needs to send back a proper error reply. The client will then be able
to immediately try alternate failover NIDs, speeding up the
mount/reconnect process and avoiding potential eviction.

Test-Parameters: trivial
Test-Parameters: kerberos=true testlist=sanity-krb5
Test-Parameters: testgroup=review-dne-selinux-ssk-part-2
Signed-off-by: Sebastien Buisson 
Change-Id: Id2cefaa7d54729a63c7be13b65d7ace579bcaa78
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/53970
Reviewed-by: Aurelien Degremont 
Reviewed-by: Andreas Dilger 
Reviewed-by: Oleg Drokin 
Tested-by: jenkins 
Tested-by: Maloo

lustre/ldlm/ldlm_lib.c

LU-17415 ldlm: lock conversion to skip cancelled locks

2024-01-11T05:28:40Z

LU-17415 ldlm: lock conversion to skip cancelled locks

ldlm_cli_inodebits_convert() should re-check that the lock is not
being cancelled, skipping such locks to avoid an assertion failure:

LustreError:
15208:0:(ldlm_lock.c:1095:ldlm_grant_lock_with_skiplist())
	ASSERTION( ldlm_is_granted(lock) ) failed:

Signed-off-by: Alex Zhuravlev 
Change-Id: If212931d8fa6a2d8f56c44714de830d5fb4a9a6b
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/53645
Tested-by: jenkins 
Tested-by: Maloo 
Reviewed-by: Andreas Dilger 
Reviewed-by: Mikhail Pershin 
Reviewed-by: Oleg Drokin

lustre/ldlm/ldlm_inodebits.c

LU-6142 ldlm: Fix style issues for ldlm folder

2024-02-12T06:07:38Z

LU-6142 ldlm: Fix style issues for ldlm folder

This patch fixes issues reported by checkpatch
for files under folder lustre/ldlm/

Test-Parameters: trivial
Signed-off-by: Arshad Hussain 
Change-Id: I3c15c6a6e3d21bce9c8609e60ec481b484f00480
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/54003
Tested-by: jenkins 
Tested-by: Maloo 
Reviewed-by: Andreas Dilger 
Reviewed-by: Timothy Day 
Reviewed-by: Oleg Drokin

lustre/ldlm/ldlm_flock.c
lustre/ldlm/ldlm_inodebits.c
lustre/ldlm/ldlm_internal.h
lustre/ldlm/ldlm_lib.c
lustre/ldlm/ldlm_lockd.c
lustre/ldlm/ldlm_pool.c
lustre/ldlm/ldlm_resource.c

LU-6142 ldlm: Fix style issues for ldlm_lock.c

2024-02-11T20:42:19Z

LU-6142 ldlm: Fix style issues for ldlm_lock.c

This patch fixes issues reported by checkpatch
for file lustre/ldlm/ldlm_lock.c

Test-Parameters: trivial
Signed-off-by: Arshad Hussain 
Change-Id: I492eacb0bf8033a78f1001a350c9fe4258729693
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/54002
Tested-by: jenkins 
Tested-by: Maloo 
Reviewed-by: Andreas Dilger 
Reviewed-by: Timothy Day 
Reviewed-by: Oleg Drokin

lustre/ldlm/ldlm_lock.c

LU-17276 ldlm: add interval in flock

2023-12-13T20:30:36Z

LU-17276 ldlm: add interval in flock

Add necessary changes for using interval tree in flock.

Signed-off-by: Yang Sheng 
Change-Id: I94c416b4215b863b54eccfe7025f2976fe40181a
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/53447
Tested-by: jenkins 
Tested-by: Maloo 
Reviewed-by: Andreas Dilger 
Reviewed-by: Alex Zhuravlev 
Reviewed-by: Oleg Drokin

lustre/ldlm/ldlm_extent.c
lustre/ldlm/ldlm_flock.c
lustre/ldlm/ldlm_internal.h
lustre/ldlm/ldlm_lock.c
lustre/ldlm/ldlm_resource.c

LU-16314 llite: Migrate LASSERTF %p to %px

2023-06-06T03:44:53Z

LU-16314 llite: Migrate LASSERTF %p to %px

This change covers lustre/ec through lustre/mgs and
converts LASSERTF statements to explicitly use %px.

Use %px to explicitly report the non-hashed pointer value in
messages printed when a kernel panic is imminent. When
analyzing a crash dump, the associated kernel address can
be used to determine the system state that led to the
system crash.

As crash dumps can be, and are, provided by customers from
production systems, the use of the kernel command line
parameter:
    no_hash_pointers
is not always possible.

Ref: Documentation/core-api/printk-formats.rst

Test-Parameters: trivial
HPE-bug-id: LUS-10945
Signed-off-by: Shaun Tancheff 
Change-Id: I708d9ef60c63f5b4006c7986599a2f39fc9e5fdf
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/51213
Tested-by: jenkins 
Tested-by: Maloo 
Reviewed-by: Petros Koutoupis 
Reviewed-by: Andreas Dilger 
Reviewed-by: Oleg Drokin

lustre/ldlm/ldlm_lockd.c
lustre/ldlm/ldlm_request.c

LU-17242 debug: use dump_stack() where possible

2024-01-09T17:17:10Z

LU-17242 debug: use dump_stack() where possible

In some cases, libcfs_debug_dumpstack() can fail to output a
stack trace - either because the needed symbols are not exported
or those symbols can't be resolved at runtime. This seems to
occur more often with newer kernels. The message appears only
as:

 Lustre: ldlm_cb01_002: service thread pid 57876 was inactive for
   40.494 seconds. The thread might be hung, or it might only be
   slow and will resume later. Dumping the stack trace for
   debugging purposes:
 Pid: 57876, comm: ldlm_cb01_002 6.1.70 #1 SMP PREEMPT_DYNAMIC
   Thu Jan  4 18:52:41 UTC 2024
 Call Trace TBD:

with no stack trace (seen on CentOS 8.5 with ml 6.1.70).

For reference, the runtime symbol lookup was added and updated in:

 b49ce7a ("LU-12400 libcfs: save_stack_trace_tsk if ARCH_STACKWALK")
 58ac9d3 ("LU-14099 build: Fix for unconfigured arch_stackwalk")

First, add a message when the symbol can't be resolved correctly.
This makes it much easier to understand why the stack trace is
missing.

Second, replace libcfs_debug_dumpstack(NULL) with dump_stack().
When the task_struct is NULL, libcfs uses the current
task_struct. This replicates the functionality of dump_stack().
Using dump_stack() is more reliable, more in line with kernel
style, and not likely to be un-exported in the future.

Finally, in lustre/osc/osc_object.c the stack isn't dumped since
there is already an LBUG().

There only remains one user of libcfs_debug_dumpstack() which
uses a task_struct other than current. This can be cleaned up
in a future patch.

Test-Parameters: trivial
Signed-off-by: Timothy Day 
Change-Id: I196c1da7e39b1a694c0cb67ecfaab58ab3e4662c
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/53625
Tested-by: jenkins 
Tested-by: Maloo 
Reviewed-by: Andreas Dilger 
Reviewed-by: James Simmons 
Reviewed-by: Patrick Farrell 
Reviewed-by: Oleg Drokin

lustre/ldlm/ldlm_lock.c
lustre/ldlm/ldlm_lockd.c

LU-16796 ldlm: Change struct ldlm_resource to use refcount_t

2023-12-12T07:16:19Z

LU-16796 ldlm: Change struct ldlm_resource to use refcount_t

This patch changes struct ldlm_resource and
struct nrs_tbf_client to use refcount_t instead of atomic_t.

This patch also changes spaces to tabs, but only near
lines of code being changed.

Signed-off-by: Arshad Hussain 
Change-Id: Ic15f27bc6281725f00bddc465668f81291aad6ec
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/53416
Tested-by: jenkins 
Tested-by: Maloo 
Reviewed-by: Andreas Dilger 
Reviewed-by: Timothy Day 
Reviewed-by: James Simmons 
Reviewed-by: Oleg Drokin

lustre/ldlm/ldlm_lock.c
lustre/ldlm/ldlm_resource.c

LU-17078 ldlm: do not spin up thread for local cancels

2023-08-31T00:07:03Z

LU-17078 ldlm: do not spin up thread for local cancels

When doing lockless IO on the client, the server is
responsible for taking LDLM locks for each IO.

Currently, the server sends these locks to a separate
thread for cancellation.  This behavior is necessary on the
client where a lock may protect a large number of cached
pages, so cancelling it in a user thread may introduce
unacceptable delays.  But the server doesn't have cached
pages, so it makes more sense for the server to do the
cancellation in the same thread.

We do this by not spinning up an ldlm_bl thread for
cancellations of local (server side only) locks.

This improves 4K DIO random read performance by about 9%.

Without patch, maximum server IOPS on 4K reads:
2864k IOPS

With patch:
3118k IOPS

This is the maximum performance achieved with many clients
and client threads doing 4K random AIO reads from different
files.

Signed-off-by: Patrick Farrell 
Change-Id: Ia996732780d278c5d0bc290c5484e3bc325a347a
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/52192
Tested-by: jenkins 
Tested-by: Maloo 
Reviewed-by: Alex Zhuravlev 
Reviewed-by: Oleg Drokin 
Reviewed-by: Andreas Dilger

lustre/ldlm/ldlm_lock.c

LU-17278 ldlm: don't grant failed lock

2023-11-09T13:29:03Z

LU-17278 ldlm: don't grant failed lock

Lock convert can re-grant a lock if it loses some bits. This
procedure can race with the import's invalidation, so the
lock can become invalid (l_granted_mode=LCK_MINMODE):
LustreError: 8637:0:(ldlm_lock.c:1095:ldlm_grant_lock_with_skiplist())
	ASSERTION( ldlm_is_granted(lock) )

Signed-off-by: Alex Zhuravlev 
Change-Id: I7bb20d62948224647d7632f2822fba44d39a7713
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/53051
Tested-by: jenkins 
Tested-by: Maloo 
Reviewed-by: Andreas Dilger 
Reviewed-by: Mikhail Pershin 
Reviewed-by: Oleg Drokin

lustre/ldlm/ldlm_inodebits.c

LU-17174 misc: fix hash functions

2023-10-10T08:38:21Z

LU-17174 misc: fix hash functions

1) The LU-16518 landing caused a bug which is visible with a debug kernel:

UBSAN: Undefined behaviour in include/linux/hash.h:81:31
shift exponent 64 is too large for 64-bit type
	'long long unsigned int'
Call Trace:
dump_stack+0x8e/0xd0
ubsan_epilogue+0x5/0x21
ldlm_export_lock_hash+0x49/0x4d [ptlrpc]
cfs_hash_bd_from_key+0x88/0x2e0 [libcfs]

2) Use the high bits instead of the low bits, as this is more accurate.
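
The UBSAN report is consistent with a zero bit count reaching a `>> (64 - bits)` style shift, which is undefined for a 64-bit type. A minimal userspace sketch of a multiplicative hash that keeps the high bits and guards that case is shown here. It is not the patch itself: `sketch_hash_high_bits()` is a hypothetical name, and returning 0 for `bits == 0` is an assumption about reasonable behaviour. The constant is the 64-bit golden ratio value used by the Linux `hash_64()` helper.

```c
#include <stdint.h>

#define SKETCH_GOLDEN_RATIO_64 0x61C8864680B583EBull

/* Multiplicative hash returning the top "bits" bits of the product.
 * The high bits of the product depend on every input bit, which is
 * why they are preferred over the low bits. */
static uint64_t sketch_hash_high_bits(uint64_t val, unsigned bits)
{
	if (bits == 0)
		return 0;	/* avoids the undefined ">> 64" shift */
	if (bits > 64)
		bits = 64;
	return (val * SKETCH_GOLDEN_RATIO_64) >> (64 - bits);
}
```

For any `bits` between 1 and 63 the result is always below `1 << bits`, so it can index a table of that size directly.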

HPE-bug-id: LUS-11925
Fixes: 239e8268 (LU-16518 misc: use fixed hash code)
Signed-off-by: Alexey Lyashkov 
Change-Id: Ie1c531ad220f44e55fbf80674a49472fb6024252
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/52611
Tested-by: jenkins 
Tested-by: Maloo 
Reviewed-by: James Simmons 
Reviewed-by: Oleg Drokin 
Reviewed-by: Timothy Day

lustre/ldlm/ldlm_flock.c
lustre/ldlm/ldlm_lockd.c

LU-13805 llite: Implement unaligned DIO connect flag

2023-10-24T18:29:27Z

LU-13805 llite: Implement unaligned DIO connect flag

ZFS servers that have not been upgraded may crash if they receive
unaligned DIO, so we need a compat flag and a test to recognize those
servers.

This patch implements that logic.

Fixes: 7194eb6431 ("LU-13805 clio: bounce buffer for unaligned DIO")
Signed-off-by: Patrick Farrell 
Change-Id: I5d6ee3fa5dca989c671417f35a981767ee55d6e2
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/51126
Tested-by: jenkins 
Tested-by: Maloo 
Reviewed-by: Sebastien Buisson 
Reviewed-by: Oleg Drokin 
Reviewed-by: Andreas Dilger

lustre/ldlm/ldlm_lib.c

LU-17188 mdt: remove \n from LDLM_DEBUG

2023-10-12T18:13:23Z

LU-17188 mdt: remove \n from LDLM_DEBUG

LDLM_DEBUG() doesn't need \n in an extra message

Test-Parameters: trivial
Signed-off-by: Alex Zhuravlev 
Change-Id: I5a62cccb0a17b3f878206e8bbec6c1fbe07c4753
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/52673
Reviewed-by: Patrick Farrell 
Reviewed-by: Lai Siyao 
Reviewed-by: Oleg Drokin 
Tested-by: jenkins 
Tested-by: Maloo

lustre/ldlm/ldlm_extent.c
lustre/ldlm/ldlm_lockd.c

LU-8802 obd: remove MAX_OBD_DEVICES

2023-05-09T05:07:36Z

LU-8802 obd: remove MAX_OBD_DEVICES

Remove this arbitrary limit by reimplementing the array as an
Xarray. An Xarray can grow and shrink dynamically, saving
memory and allowing for many more OBD devices. There is still
technically a limit, OBD_MAX_INDEX, which is xa_limit_31b.max
or around 2 billion. This is far more than is practically
useful.

This patch also adds various iterators for OBD devices, which
are used to simplify code in various places.

Remove class_obd_list() since it is unused. Rename
class_dev_by_str() to class_str2obd() to keep the pattern.
Several class_* functions have been refactored to improve
locking. The larger issue of OBD device locking will be
addressed separately.

Update the OBD device lifecycle test to try loading
more devices (about 24,000 for now).

Currently, adding an additional OBD device is an O(n^2)
operation due to the class_name2dev calls in
class_register_device(). This will be addressed in a future
patch adding a hash table for OBD device name lookups.

Further, OBD life cycle management could likely be simplified
by using Xarray marks. Right now, it is handled by a bit
field in the obd_device struct. Since the scope of the changes
needed to simplify this seems large, this will also be addressed
separately.

Test-Parameters: testlist=sanity env=ONLY=55,ONLY_REPEAT=10
Signed-off-by: Timothy Day 
Change-Id: Icb2cd94a5529e79f5d3ebd0de5e0f225cf212075
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/51040
Tested-by: jenkins 
Tested-by: Maloo 
Reviewed-by: James Simmons 
Reviewed-by: Neil Brown 
Reviewed-by: Andreas Dilger 
Reviewed-by: Oleg Drokin

lustre/ldlm/ldlm_lib.c