Whamcloud - gitweb
LU-8457 pacemaker: Pacemaker script to monitor LNet 66/22266/5
author Gabriele Paciucci <gabriele.paciucci@intel.com>
Thu, 1 Sep 2016 15:54:55 +0000 (16:54 +0100)
committer Oleg Drokin <oleg.drokin@intel.com>
Tue, 7 Feb 2017 06:12:30 +0000 (06:12 +0000)
commit 9018f11cd5a1ab82353e79271163ef51db081e95
tree da1bb668017a3841bae62dbe285e8e601160109e
parent 18c95c436a55a2c7c8b8f71c0935e8d92c70c42f
LU-8457 pacemaker: Pacemaker script to monitor LNet

Add a new Pacemaker resource agent (RA) script that monitors LNet
and is compatible with ZFS- and LDISKFS-based Lustre server
installations. The RA monitors a single LNet device using
Pacemaker's clone technology.

pcs resource create [Resource Name] ocf:lustre:healthLNET
dampen=[seconds 5s]
multiplier=[number 1000]
lctl=[true|false]
device=[device name ib0]
host_list=[list of NIDs, space separated]
--clone

where:
* dampen The time to wait (dampening) for further changes to occur
* multiplier The number by which to multiply the number of
  connected ping nodes
* attempts Number of ping attempts, per host, before
  declaring it dead
* timeout How long, in seconds, to wait before declaring
  a ping lost
* lctl Option to enable lctl ping instead of the normal ping.
  The default is true
* device Device used for the LNet network. We assume the
  same device across the cluster
* host_list The list of NIDs to ping, space separated
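
The way multiplier, attempts and the host list interact can be sketched
as follows. This is an illustration under stated assumptions, not the
code of healthLNET itself: the function names are hypothetical, the
5-second timeout default is a placeholder, and the score is assumed to
be simply multiplier times the number of hosts that answered.

```shell
#!/bin/sh
# Illustrative sketch only: none of these function names come from healthLNET.

# Probe a NID once with "lctl ping <nid> <timeout>" (the lctl=true case).
# LNET_TIMEOUT is a hypothetical variable; 5 is a placeholder default.
lnet_probe() {
    lctl ping "$1" "${LNET_TIMEOUT:-5}" >/dev/null 2>&1
}

# Count the hosts that answer within "attempts" tries.
# $1 = probe command, $2 = attempts per host, remaining args = hosts.
count_reachable() {
    probe_cmd=$1
    attempts=$2
    shift 2
    active=0
    for h in "$@"; do
        try=0
        while [ "$try" -lt "$attempts" ]; do
            if "$probe_cmd" "$h"; then
                active=$((active + 1))
                break   # host is alive, stop retrying it
            fi
            try=$((try + 1))
        done
    done
    echo "$active"
}

# Assumed scoring rule: multiplier times the number of reachable hosts.
# $1 = multiplier, $2 = reachable host count.
health_score() {
    echo $(( $1 * $2 ))
}
```

With multiplier=1000 and two of three hosts reachable, this rule yields
2000. Passing the probe as an argument keeps the counting logic
testable without a live LNet.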

This script should be installed in /usr/lib/ocf/resource.d/lustre/
on both Lustre servers with permissions 755.
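
A minimal sketch of the installation step, followed by a helper that
prints a filled-in create command. The function names, the optional
root prefix and the fixed dampen/multiplier/lctl values are assumptions
made for illustration. Because the file is added with mode 0644, the
mode is set explicitly on install.

```shell
#!/bin/sh
# Sketch only: function names and the staging prefix are hypothetical.

# Copy the agent into the OCF lustre provider directory with mode 755.
# $1 = path to the healthLNET script, $2 = optional root prefix for staging.
install_agent() {
    src=$1
    root=${2:-}
    dest_dir="$root/usr/lib/ocf/resource.d/lustre"
    mkdir -p "$dest_dir" && install -m 755 "$src" "$dest_dir/healthLNET"
}

# Print (do not run) a pcs command so it can be reviewed first.
# $1 = resource name, $2 = device, $3 = space-separated NID list.
print_create_command() {
    printf 'pcs resource create %s ocf:lustre:healthLNET dampen=5s multiplier=1000 lctl=true device=%s host_list="%s" --clone\n' \
        "$1" "$2" "$3"
}
```

For example, run `install_agent contrib/scripts/pacemaker/healthLNET`
as root on each server, then review the output of
`print_create_command healthLNET-example ib0 "192.168.1.10@o2ib 192.168.1.11@o2ib"`
(the resource name and NIDs here are placeholders).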

Test-Parameters: trivial
Signed-off-by: Gabriele Paciucci <gabriele.paciucci@intel.com>
Change-Id: I6292ce36dde0083fa95cb1d047fe582bd7d53116
Reviewed-on: https://review.whamcloud.com/22266
Tested-by: Jenkins
Reviewed-by: Nathaniel Clark <nathaniel.l.clark@intel.com>
Reviewed-by: Christopher J. Morrone <morrone2@llnl.gov>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
contrib/scripts/pacemaker/healthLNET [new file with mode: 0644]