LU-17788 ptlrpc: restore watchdog revival message
author Andreas Dilger <adilger@whamcloud.com>
Sat, 27 Apr 2024 02:48:15 +0000 (20:48 -0600)
committer Oleg Drokin <green@whamcloud.com>
Tue, 21 May 2024 18:46:24 +0000 (18:46 +0000)
commit 20c09eff4d397e7158aa4408e0cb50b102cc61c0
tree 9d2d323ddd386bb50cf0f63eb995330e216f0664
parent 17c8de2b7378aa2da710b4842d71f22761ac6d3b
LU-17788 ptlrpc: restore watchdog revival message

Restore the "Service thread pid NNN completed after SSS.mmm
seconds.  This likely indicates the system was overloaded"
message that was lost during ptlrpc watchdog restructuring.

Do not rate limit this message, so that it is possible to see
when all threads are restored, even if their corresponding
"Service thread pid NNN was inactive" message was throttled.

Update recovery-small test_10a to check for these messages,
so that they are not removed again in the future.
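
The behaviour described above can be illustrated with a small standalone
sketch. This is not the code in lustre/ptlrpc/service.c: every name here
(wd_state, ratelimit, wd_fire, wd_complete, is_revival_msg) is hypothetical,
and the limiter is a simple counter assumed for illustration. It only shows
the idea that the "was inactive" warning may be throttled while the
"completed after" message is always emitted for a thread whose watchdog
fired, and how a test might look for that message in a log line.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical per-thread watchdog state; not the Lustre structure. */
struct wd_state {
	int  pid;
	bool tripped;	/* set once the inactivity timeout has fired */
};

/* Hypothetical limiter: allow at most `burst` throttled messages. */
struct ratelimit {
	int burst;
	int printed;
};

static bool ratelimit_ok(struct ratelimit *rl)
{
	if (rl->printed >= rl->burst)
		return false;
	rl->printed++;
	return true;
}

/*
 * Watchdog timeout fired. The thread is always marked as tripped, but the
 * warning itself is subject to rate limiting.
 * Returns the message length, or 0 if the message was throttled.
 */
static int wd_fire(struct wd_state *wd, struct ratelimit *rl,
		   char *buf, size_t len)
{
	assert(buf != NULL && len > 0);
	buf[0] = '\0';
	wd->tripped = true;
	if (!ratelimit_ok(rl))
		return 0;
	return snprintf(buf, len, "Service thread pid %d was inactive",
			wd->pid);
}

/*
 * Thread finished its request. If the watchdog had fired, report the
 * completion unconditionally (no rate limit), so that every revived
 * thread is visible even when its earlier warning was throttled.
 * Returns the message length, or 0 if the watchdog never fired.
 */
static int wd_complete(struct wd_state *wd, long long elapsed_ms,
		       char *buf, size_t len)
{
	assert(buf != NULL && len > 0);
	buf[0] = '\0';
	if (!wd->tripped)
		return 0;
	wd->tripped = false;
	return snprintf(buf, len,
			"Service thread pid %d completed after %lld.%03lld seconds.  "
			"This likely indicates the system was overloaded",
			wd->pid, elapsed_ms / 1000, elapsed_ms % 1000);
}

/* Roughly what a test could check for in a captured log line. */
static bool is_revival_msg(const char *line)
{
	return strstr(line, "completed after") != NULL;
}
```

In this sketch, a thread whose warning was suppressed by the limiter still
produces the completion message, which is the property the commit message
asks the test to guard.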

Test-Parameters: testlist=recovery-small env=ONLY=10a
Test-Parameters: testlist=recovery-small env=ONLY=10a
Test-Parameters: testlist=recovery-small env=ONLY=10a
Test-Parameters: testlist=recovery-small env=ONLY=10a
Test-Parameters: testlist=recovery-small env=ONLY=10a
Fixes: fc9de679a4 ("LU-9859 libcfs: add watchdog for ptlrpc service threads.")
Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: I0c7e96fb7f73ca5562a6f5ad780a79ffc83ebbe5
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/54942
Reviewed-by: Alex Zhuravlev <bzzz@whamcloud.com>
Reviewed-by: James Simmons <jsimmons@infradead.org>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Tested-by: jenkins <devops@whamcloud.com>
lustre/include/lustre_net.h
lustre/ptlrpc/service.c
lustre/tests/recovery-small.sh