Whamcloud - gitweb
LU-11518 ldlm: pool fixes 63/39563/6
authorVitaly Fertman <c17818@cray.com>
Fri, 31 Jul 2020 18:30:09 +0000 (21:30 +0300)
committerOleg Drokin <green@whamcloud.com>
Sat, 19 Sep 2020 14:13:12 +0000 (14:13 +0000)
commit1806d6e8291758a835a846cd3c593e6699e56b15
tree6131049e1eebde1a1e973e71c3615c08da40432f
parent6052cc88eb1232ac3b0193f0d47881887a2dcfdc
LU-11518 ldlm: pool fixes

At the time the client side recalc period was increased up to 10secs
the grant & cancel rates started showing the speed not in seconds but
in tens of seconds.

At the pool initialization time, the server side recalc job should not
be delayed on client's recalc period.

It may happen an NS time is significant and comparable (or even more)
than the recalc period of the next NS (all the following NS's) in the
list. If the time has been already spent on the next NS, it does not
mean we want to double the delay for the original NS and recalc after
next N secs.

Make lock volume factor more fine grained (default is 100 now vs the
original 1): it is likely to cancel locks on clients twice faster than
server requested is too fast.

Protect missed pl_server_lock_volume update by the pool lock.

Replace ktime_get_real_seconds with ktime_get_seconds for the recal
interval.

Signed-off-by: Vitaly Fertman <c17818@cray.com>
Change-Id: Icba73209682a1b1d0d20c087581fad4f73ee3389
HPE-bug-id: LUS-8678
Reviewed-by: Andriy Skulysh <c17819@cray.com>
Reviewed-by: Alexey Lyashkov <c17817@cray.com>
Tested-by: Alexander Lezhoev <c17454@cray.com>
Reviewed-on: https://review.whamcloud.com/39563
Tested-by: jenkins <devops@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Gu Zheng <gzheng@ddn.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
lustre/include/lustre_dlm.h
lustre/ldlm/ldlm_pool.c
lustre/ldlm/ldlm_request.c
lustre/tests/sanity.sh