LU-18594 tests: add vmstat and recovery status info 73/57573/2
author Elena Gryaznova <elena.gryaznova@hpe.com>
Mon, 23 Dec 2024 10:51:33 +0000 (13:51 +0300)
committer Oleg Drokin <green@whamcloud.com>
Sun, 2 Feb 2025 06:28:34 +0000 (06:28 +0000)
commit ba6398629c6aca2f5978b12283f554a3410092cd
tree b30163b9e9abd5af94148ff630f6f2b38b7ce33a
parent 745486da697fc577cb6b927bcfbca9cf587666e4
LU-18594 tests: add vmstat and recovery status info

The patch adds:
   -- the ability to collect vmstat and recovery status
      info. Setting VMSTAT_DELAY=value starts vmstat with
      delay=value; setting RECOVERY_STATUS_DELAY=value runs:
          lctl get_param *.*.recovery_status
      every "value" seconds on victim server nodes and their pairs;

   -- a return code check for precmd and postcmd; the test stops
      if either command fails;

   -- minor cleanup:
      slightly more verbose ha_sleep()
      a tunable tmp directory.
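
The periodic collection and the return code check described above can be
sketched roughly as follows. This is a minimal local illustration, not the
code in ha.sh: the function names sample_periodically and run_checked, the
stop-file mechanism, and the log layout are all assumptions made for the
example, and the real script may run the commands on remote nodes instead.

```shell
#!/bin/bash
# Hypothetical sketch; these helper names are illustrative and are
# not taken from lustre/tests/ha.sh.

# Run a command every $delay seconds until $stop_file exists,
# appending a timestamp and the command output to $log.
sample_periodically() {
    local delay=$1 log=$2 stop_file=$3
    shift 3
    while [ ! -e "$stop_file" ]; do
        {
            date '+%F %T'
            "$@" 2>&1
        } >> "$log"
        sleep "$delay"
    done
}

# Run a hook command (for example a precmd or postcmd) and return
# its exit status, reporting a failure so the caller can stop.
run_checked() {
    local name=$1 rc=0
    shift
    "$@" || rc=$?
    if [ "$rc" -ne 0 ]; then
        echo "$name failed with rc=$rc" >&2
    fi
    return "$rc"
}
```

Under these assumptions, a caller could start a sampler in the background
with something like
    sample_periodically "$RECOVERY_STATUS_DELAY" "$log" "$stop_file" \
        lctl get_param '*.*.recovery_status' &
and guard a hook with "run_checked precmd $cmd || exit 1".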

Test-Parameters: trivial
Signed-off-by: Elena Gryaznova <elena.gryaznova@hpe.com>
HPE-bug-id: LUS-12232
Reviewed-by: Alexander Boyko <alexander.boyko@hpe.com>
Reviewed-by: Vladimir Saveliev <vladimir.saveliev@hpe.com>
Change-Id: I4087c73f58bf58b163f164e28b267a536569268a
Reviewed-on: https://review.whamcloud.com/c/fs/lustre-release/+/57573
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
lustre/tests/ha.sh