Whamcloud - gitweb
fs/lustre-release.git
2 years ago LU-11071 ldiskfs: fix ldiskfs patch for ubuntu18 44/33144/2
Li Dongyang [Wed, 12 Sep 2018 05:50:58 +0000 (15:50 +1000)]
LU-11071 ldiskfs: fix ldiskfs patch for ubuntu18

The original ext4-corrupted-inode-block-bitmaps-handling-patches.patch
missed the changes to ext4_validate_inode_bitmap(); add them back.

Test-Parameters: trivial
Signed-off-by: Li Dongyang <dongyangli@ddn.com>
Change-Id: I42ff97be5c5680be04d69e0e818b622df6bd400d
Reviewed-on: https://review.whamcloud.com/33144
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Yang Sheng <ys@whamcloud.com>
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-10899 obdclass: remove unused ll_import_cachep 19/33119/2
Andreas Dilger [Thu, 6 Sep 2018 10:24:30 +0000 (18:24 +0800)]
LU-10899 obdclass: remove unused ll_import_cachep

The ll_import_cachep is not used anywhere, and can be removed.

Test-Parameters: trivial
Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: I3f43c81ffbf1385117f9ba1f8f9881388a3ebbe5
Reviewed-on: https://review.whamcloud.com/33119
Tested-by: Jenkins
Reviewed-by: John L. Hammond <jhammond@whamcloud.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-11303 out: clean up osp_update_rpc_pack() macro 07/33107/3
Andreas Dilger [Tue, 4 Sep 2018 09:50:48 +0000 (17:50 +0800)]
LU-11303 out: clean up osp_update_rpc_pack() macro

The osp_update_rpc_pack() macro was using both an operation "name"
argument as well as an "opcode" argument, but totally ignoring the
"opcode" argument.  This allowed a caller in osp_attr_set() to pass
a non-existent opcode name to the function that was ignored.

The function also concatenated the passed name string to generate
a function name to be called (which supplied its own opcode value).
This name concatenation obscures the code functionality, since the
called functions are not actually referenced directly in the code.

Instead, pass the function name as the argument, and drop the opcode
completely.  This at least makes it possible to see where the function
is being called.  Change the macro to be upper-case, so that it is
more clear that it is a macro rather than a real function.  It would
be better to replace the macro with an actual function, but this is
not very practical due to the use of variable-length argument lists.
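
A rough Python analogy of the change described above (the real code is
a C macro; every name below is a hypothetical stand-in, not the Lustre
API). The old style builds the callee's name from a string and ignores
its opcode argument; the new style takes the function itself:

```python
# Hypothetical stand-ins for the per-operation pack helpers.
def pack_attr_set(buf, *args):
    buf.append(("attr_set", args))
    return 0


def pack_xattr_set(buf, *args):
    buf.append(("xattr_set", args))
    return 0


def update_rpc_pack_by_name(buf, name, opcode, *args):
    """Old style: derive the callee from a name; 'opcode' is ignored."""
    func = globals()["pack_" + name]  # callee never appears at the call site
    return func(buf, *args)


def update_rpc_pack(buf, func, *args):
    """New style: the caller names the function directly; no opcode."""
    return func(buf, *args)


old_buf, new_buf = [], []
# A bogus opcode is silently accepted by the old style.
update_rpc_pack_by_name(old_buf, "attr_set", "NO_SUCH_OPCODE", "mode=0644")
update_rpc_pack(new_buf, pack_attr_set, "mode=0644")
print(old_buf == new_buf)
```

With the function passed directly, a search for pack_attr_set finds
every call site, which the name-building variant hides.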

Test-Parameters: trivial
Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: I7bf5c1d0f7a9abab1d8de8046cc408961f3ebbe5
Reviewed-on: https://review.whamcloud.com/33107
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: John L. Hammond <jhammond@whamcloud.com>
Reviewed-by: Hongchao Zhang <hongchao@whamcloud.com>
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-8066 llite: use more private data in dump_pgcache 09/33009/14
NeilBrown [Tue, 11 Sep 2018 13:41:52 +0000 (09:41 -0400)]
LU-8066 llite: use more private data in dump_pgcache

The dump_page_cache debugfs file allocates and frees an 'env' in each
call to vvp_pgcache_{start,next,show}.  This is likely to be fast, but
does introduce the need to check for errors.

It is reasonable to allocate a single 'env' when the file is opened,
and use that throughout.

So create a 'seq_private' structure which stores the sbi, env, and
refcheck, and attach this to the seqfile.

Then use it throughout instead of allocating 'env' repeatedly.

Linux-commit: fc5f499af16c247c5bff68d2625a6d1ca48b3332

Test-Parameters: trivial envdefinitions="ONLY=63b" testlist=sanity
Test-Parameters: trivial envdefinitions="ONLY=118" testlist=sanity
Test-Parameters: trivial envdefinitions="ONLY=20" testlist=sanityn

Change-Id: Id7e7f5ec1729d1e5af041adcb247ca11299ed734
Signed-off-by: NeilBrown <neilb@suse.com>
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Signed-off-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-on: https://review.whamcloud.com/33009
Reviewed-by: John L. Hammond <jhammond@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-11133 dkms: Add missing ldiskfs module in dkms config 00/32800/4
Sebastien Piechurski [Mon, 20 Aug 2018 09:06:07 +0000 (11:06 +0200)]
LU-11133 dkms: Add missing ldiskfs module in dkms config

The dkms.conf file generated by the script dkms.mkconf does not
include the ldiskfs kernel module. This results in not being able to
use ldiskfs when using the lustre-ldiskfs-dkms or lustre-all-dkms
packages.
This patch adds the module in the configuration for both of these
packages.

Test-Parameters: trivial
Signed-off-by: Sebastien Piechurski <sebastien.piechurski@atos.net>
Change-Id: Icc98fba208c78c3061fe591e3315af1eb9199f4f
Reviewed-on: https://review.whamcloud.com/32800
Reviewed-by: Nathaniel Clark <nclark@whamcloud.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Minh Diep <mdiep@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-8066 llite: move /proc/fs/lustre/llite/statahead_{max, agl} to sysfs 04/32504/3
James Simmons [Fri, 10 Aug 2018 01:49:28 +0000 (21:49 -0400)]
LU-8066 llite: move /proc/fs/lustre/llite/statahead_{max, agl} to sysfs

Move statahead_max and statahead_agl files from /proc/fs/lustre/llite/*
to /sys/fs/lustre/llite/*/

This is a modified version of

Linux-commit: 4081a3201ade8b3be715ebb0b7879a2bfe5fcc98

due to the large amount of changes to the OpenSFS/Intel branch.

Change-Id: Ic16d48d0cf1baabb462952de9739544923eba8cd
Signed-off-by: Oleg Drokin <green@linuxhacker.ru>
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Signed-off-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-on: https://review.whamcloud.com/32504
Tested-by: Jenkins
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Bobi Jam <bobijam@hotmail.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-8066 llite: move /proc/fs/lustre/llite/stats_track* to 03/32503/7
James Simmons [Mon, 10 Sep 2018 17:02:17 +0000 (13:02 -0400)]
LU-8066 llite: move /proc/fs/lustre/llite/stats_track* to sysfs

Move stats_track_pid, stats_track_ppid and stats_track_gid files
from /proc/fs/lustre/llite/* to /sys/fs/lustre/llite/*/

This is a modified version of

Linux-commit: 62cf4b1315e86a1f373f77acf32a23d76428ba27

due to the large amount of changes to the OpenSFS/Intel branch.

Test-Parameters: trivial

Change-Id: I29dc2dd269f66e40a7d016ba366353ae2ca7fb51
Signed-off-by: Oleg Drokin <green@linuxhacker.ru>
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Signed-off-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-on: https://review.whamcloud.com/32503
Tested-by: Jenkins
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Ben Evans <bevans@cray.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-10124 lnet: Correctly add peer MR value while importing 38/31138/8
Sonia Sharma [Thu, 1 Feb 2018 23:40:03 +0000 (15:40 -0800)]
LU-10124 lnet: Correctly add peer MR value while importing

While adding a peer using lnetctl import, the MR (Multi-Rail) value
of the peer is not correctly imported.

Add checks for the MR value in handle_yaml_config_peer():
1. No value provided - use the default, True
2. Value other than True/False - error out
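
The two rules above can be sketched in Python as follows. This is only
an illustration, not the C code in handle_yaml_config_peer(); the
function name parse_multi_rail is hypothetical, and accepting the
values case-insensitively is an assumption:

```python
def parse_multi_rail(value):
    """Return the Multi-Rail flag for a peer entry.

    None (key absent) defaults to True; "True"/"False" map to booleans;
    anything else is rejected.
    """
    if value is None:
        return True
    text = str(value).strip().lower()  # assumption: case-insensitive match
    if text == "true":
        return True
    if text == "false":
        return False
    raise ValueError(f"invalid Multi-Rail value: {value!r}")


print(parse_multi_rail(None))
print(parse_multi_rail("False"))
```

Raising an error for unknown values, rather than silently falling back
to the default, keeps a typo in the imported YAML from changing the
peer's behaviour unnoticed.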

Change-Id: I02a21e35086f1c6f29081b464dd1a63aba692cbc
Test-Parameters: trivial
Signed-off-by: Sonia Sharma <sonia.sharma@intel.com>
Reviewed-on: https://review.whamcloud.com/31138
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Amir Shehata <ashehata@whamcloud.com>
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-4684 migrate: shrink dir layout after migration 26/31626/22
Lai Siyao [Thu, 8 Mar 2018 03:24:31 +0000 (11:24 +0800)]
LU-4684 migrate: shrink dir layout after migration

Use setxattr(XATTR_NAME_LMV) to shrink the dir layout after migration.
This may change the dir fid, because the dir may shrink to 1 stripe; if
so, replace this dir with its stripe.

Signed-off-by: Lai Siyao <lai.siyao@whamcloud.com>
Change-Id: I4fd3e83e188ac0c6c4845183c17701276fae94a8
Reviewed-on: https://review.whamcloud.com/31626
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Fan Yong <fan.yong@intel.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-4684 migrate: migrate striped directory 27/31427/26
Lai Siyao [Mon, 22 Jan 2018 18:34:54 +0000 (02:34 +0800)]
LU-4684 migrate: migrate striped directory

Migrate a striped directory in the steps below:
1. create target object if needed: if source is directory, a
   target object is always created, otherwise if source is
   already located on the target MDT, or source still has
   link on source MDT, then skip creating.
   a) if source is directory, detach source stripes and
      attach them to target.
   b) migrate source xattrs to target.
   c) if source is regular file, update PFID to target
      fid.
   d) update fid to target for all links of source
2. update namespace
   a) migrate dirent from source parent to target parent.
   b) update linkea parent fid to target parent.
   c) destroy source object.

This implementation improves on the following points:
1. all involved objects are locked to avoid races.
2. directory migration doesn't migrate its dir entries; instead
   this is done in each sub file migration, which avoids timeouts
   when migrating dir entries of a large directory, and also avoids
   touching dir entries without a lock.
3. a file/dir is migrated in one transaction, so migrate recovery
   is the same as for other operations.
4. a directory being migrated can be accessed (and modified) like a
   normal directory.
5. if migration of sub files under a directory fails, the user can
   redo the migrate to finish migration of this directory.

Disable migrate in racer.sh.

Signed-off-by: Lai Siyao <lai.siyao@whamcloud.com>
Change-Id: I7906e50a0bf55375eafdf2cf5500979dd2d94d48
Reviewed-on: https://review.whamcloud.com/31427
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Fan Yong <fan.yong@intel.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-11016 obdclass: change JobID error into warning 28/33128/2
Andreas Dilger [Fri, 7 Sep 2018 01:08:37 +0000 (09:08 +0800)]
LU-11016 obdclass: change JobID error into warning

Change the "jobid value too large" message from a console error into a
warning, since it doesn't really indicate incorrect Lustre behaviour.

Test-Parameters: trivial
Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: Ibde3b686061fc7b4daa48dae5623e60ad157e5c8
Reviewed-on: https://review.whamcloud.com/33128
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Ben Evans <bevans@cray.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-11349 tests: remove insanity test 15 21/33121/3
James Nunez [Thu, 6 Sep 2018 20:34:36 +0000 (14:34 -0600)]
LU-11349 tests: remove insanity test 15

Test 15 of the insanity test suite has not been used
in testing since 2003 and tries to run a script that no
longer exists. Thus, this test needs to be removed.

At the same time, clean up the script by converting
spaces to tabs.

Test-Parameters: trivial testlist=insanity
Signed-off-by: James Nunez <jnunez@whamcloud.com>
Change-Id: Ie3b2f88dde925d7a1c53f288215b3643d20fd5ff
Reviewed-on: https://review.whamcloud.com/33121
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Emoly Liu <emoly@whamcloud.com>
Reviewed-by: Wei Liu <sarah@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-11281 ptlrpc: race in AT early reply 71/33071/4
Hongchao Zhang [Thu, 26 Jul 2018 14:15:14 +0000 (10:15 -0400)]
LU-11281 ptlrpc: race in AT early reply

In ptlrpc_at_check_timed(), the refcount of the request may already
have dropped to zero.  ptlrpc_server_drop_request() can then continue
without holding "scp_at_lock" and free the request, overwriting its
memory with 0x5a5a5a5a5a5a5a5a.  The following
"atomic_inc_not_zero(&rq->rq_refcount)" then sees a nonzero value and
succeeds, so the freed request is used in ptlrpc_at_send_early_reply().
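
A small single-threaded Python model of why the check described above
is not enough. The helper inc_not_zero() only mimics the semantics of
atomic_inc_not_zero() and is not the kernel implementation; the
poison constant is the fill pattern quoted in the message:

```python
POISON = 0x5A5A5A5A5A5A5A5A  # fill pattern written over freed memory


def inc_not_zero(counter):
    """Bump counter[0] unless it is zero; return True if it was bumped.

    Models the return convention of atomic_inc_not_zero(); it is not
    atomic and is meant for illustration only.
    """
    if counter[0] == 0:
        return False
    counter[0] += 1
    return True


live = [1]          # request still referenced
dropped = [0]       # last reference dropped, memory not yet freed
freed = [POISON]    # memory freed and poisoned

print(inc_not_zero(live))     # reference taken
print(inc_not_zero(dropped))  # correctly refused
print(inc_not_zero(freed))    # wrongly accepted: poison is nonzero
```

The zero check only protects the window between the last put and the
free; once the memory has been poisoned the counter no longer reads as
zero, so the drop and the check have to be serialised some other way.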

Change-Id: I5d884be86de007f49b044e022ad90663b08078d7
Signed-off-by: Hongchao Zhang <hongchao@whamcloud.com>
Reviewed-on: https://review.whamcloud.com/33071
Tested-by: Jenkins
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Lai Siyao <lai.siyao@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-11010 tests: remove calls to return after skip() 32/32732/4
James Nunez [Tue, 26 Jun 2018 22:30:07 +0000 (16:30 -0600)]
LU-11010 tests: remove calls to return after skip()

The skip routine now contains a call to exit. All calls
to skip() and skip_env() should be reviewed and calls to
return() that follow skip() should be removed.

This is the third patch in a series that removes calls
to return() after skip() in the Lustre test suites.

Calls to return after skip() are removed for:
large-lun.sh
large-scale.sh
lfsck-performance.sh
lnet-selftest.sh
lustre-rsync-test.sh
mds-survey.sh
metadata-updates.sh

Test-Parameters: trivial testlist=large-lun,large-scale,lfsck-performance,lnet-selftest,lustre-rsync-test,mds-survey,metadata-updates

Signed-off-by: James Nunez <jnunez@whamcloud.com>
Change-Id: Id11dc4bbde5824976a5a4d8f94ea59efd45e46f4
Reviewed-on: https://review.whamcloud.com/32732
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Emoly Liu <emoly@whamcloud.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-11165 llite: set iflags and catch error from md_setattr() properly 60/32860/5
Wang Shilong [Mon, 23 Jul 2018 23:45:59 +0000 (07:45 +0800)]
LU-11165 llite: set iflags and catch error from md_setattr() properly

As John Hammond pointed out:
In ll_ioctl_fssetxattr() we may discard errors from md_setattr().
And we only set the local i_flags when lli_clob is not NULL.

This patch addresses the above problems.

Test-Parameters: trivial testlist=sanity-quota
Change-Id: Ib65ec61932b00f71367a8da6be67e3f94ebd5005
Signed-off-by: Wang Shilong <wshilong@ddn.com>
Reviewed-on: https://review.whamcloud.com/32860
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: John L. Hammond <jhammond@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-9268 tests: check obj distribution for PFL striping 63/26863/7
James Nunez [Wed, 27 Jun 2018 17:00:22 +0000 (11:00 -0600)]
LU-9268 tests: check obj distribution for PFL striping

Add a test to sanity-pfl to verify that new components are
distributed evenly among OSTs for composite files.

Test-Parameters: trivial osscount=3 ostcount=4 testlist=sanity-pfl
Signed-off-by: Alex Zhuravlev <alexey.zhuravlev@intel.com>
Signed-off-by: James Nunez <james.a.nunez@intel.com>
Change-Id: I0335fd376f6008529885236faad19a29eb01d5ba
Reviewed-on: https://review.whamcloud.com/26863
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Bobi Jam <bobijam@hotmail.com>
Reviewed-by: Patrick Farrell <paf@cray.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-10660 mdt: revoke lease lock for truncate 93/33093/6
Jian Yu [Mon, 10 Sep 2018 05:10:30 +0000 (22:10 -0700)]
LU-10660 mdt: revoke lease lock for truncate

A Lustre lease lock is normally used to protect file data
against concurrent access; the open lock used on the MDT side
serves this purpose. However, truncate changes file data
without revoking the lease lock.

This patch fixes the issue by acquiring the open sem,
checking the lease count and revoking the lease if there is
any pending lease on the file.
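
The sequence above (take the open semaphore, check the lease count,
revoke) can be modelled in Python as below. FileObject and all of its
fields are hypothetical; holding the semaphore across the size change
is an assumption of this sketch, not something the message states:

```python
import threading


class FileObject:
    """Hypothetical stand-in for the MDT's per-file state."""

    def __init__(self):
        self.open_sem = threading.Lock()
        self.lease_count = 0
        self.revoked = 0
        self.size = 0

    def grant_lease(self):
        with self.open_sem:
            self.lease_count += 1

    def truncate(self, new_size):
        with self.open_sem:
            if self.lease_count > 0:
                # Break outstanding leases before the data changes.
                self.revoked += self.lease_count
                self.lease_count = 0
            self.size = new_size


f = FileObject()
f.grant_lease()
f.truncate(0)
print(f.lease_count, f.revoked)
```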

Change-Id: Ia55457c6d7e1e76f98b41feef39577822304513b
Signed-off-by: Jian Yu <yujian@whamcloud.com>
Reviewed-on: https://review.whamcloud.com/33093
Tested-by: Jenkins
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Jinshan Xiong <jinshan.xiong@gmail.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago New tag 2.11.55 2.11.55 v2_11_55 v2_11_55_0
Oleg Drokin [Thu, 13 Sep 2018 04:52:03 +0000 (00:52 -0400)]
New tag 2.11.55

Change-Id: I08a290fd7cd909cf24405daf412fe9a0dd4a8757
Signed-off-by: Oleg Drokin <green@whamcloud.com>
2 years ago Revert "LU-11057 tests: exercise 'network' client mount option" 39/33139/2
Oleg Drokin [Tue, 11 Sep 2018 16:52:23 +0000 (16:52 +0000)]
Revert "LU-11057 tests: exercise 'network' client mount option"

This is causing 100% failures in review-dne-zfs-part-2
which are tracked in LU-11363

This reverts commit 76af9f62481b1ac910cd6a0f7237fe32b0657c29.

Change-Id: Ie82f848ce1f170b2c93f106c5886deadf7b4df75
Reviewed-on: https://review.whamcloud.com/33139
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Tested-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-10447 build: fix checkpatch.pl to not mangle case 65/32865/2
Andreas Dilger [Tue, 24 Jul 2018 06:06:53 +0000 (00:06 -0600)]
LU-10447 build: fix checkpatch.pl to not mangle case

Fix the checkpatch.pl script to not mangle the case of the lines
in the spelling.txt file, so that the replacement text can be
found, and is shown as intended.

Add checks for "GETSTRIPE" and "SETSTRIPE" to spelling.txt.

Sort the spelling.txt file so that it is easier to see what lines
are present in the file.

Test-Parameters: trivial
Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: Id3a3319bc62898fbdc94573512cd6a665b13cd11
Reviewed-on: https://review.whamcloud.com/32865
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-8066 llite: move /proc/fs/lustre/llite/checksum_pages to sysfs 02/32502/8
James Simmons [Thu, 16 Aug 2018 23:48:25 +0000 (19:48 -0400)]
LU-8066 llite: move /proc/fs/lustre/llite/checksum_pages to sysfs

Move checksum_pages file from /proc/fs/lustre/llite/*
to /sys/fs/lustre/llite/*/

This is a modified version of

Linux-commit: 40cc864a9fcd48d7f3a67421a8e578e8d4cbe308

due to the large amount of changes to the OpenSFS/Intel branch.

Change-Id: Ice48c69503680ac995b60844a8d0d7ae2bddd102
Signed-off-by: Oleg Drokin <green@linuxhacker.ru>
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Signed-off-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-on: https://review.whamcloud.com/32502
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Sonia Sharma <sharmaso@whamcloud.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
2 years ago LU-11012 doc: improve the lfs-setstripe.1 man page 63/32463/5
Andreas Dilger [Tue, 24 Jul 2018 05:25:30 +0000 (23:25 -0600)]
LU-11012 doc: improve the lfs-setstripe.1 man page

Restructure the lfs-setstripe.1 man page to be more like a standard
man page.  The DESCRIPTION section provides a high-level overview of
the command, and describes the various different ways that it can be
used.  The various options are described in the STRIPE_OPTIONS and
COMPONENT_OPTIONS sections.

Describe how default layouts are inherited for plain and composite
files, and that different pools can be specified for each component.

Add description of the --copy option.

Test-Parameters: trivial
Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: I7ba3f3f4dba6825f283dd6874ae1988914c774fb
Reviewed-on: https://review.whamcloud.com/32463
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Joseph Gmitter <jgmitter@whamcloud.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-11309 lnd: Clean up logging 96/33096/4
Amir Shehata [Thu, 30 Aug 2018 20:30:46 +0000 (13:30 -0700)]
LU-11309 lnd: Clean up logging

No need to output an error in ksocknal_tx_done() as this error is
tracked in lnet.
No need to keep a cookie in the connection. It's always set to
the message. This will allow us to set the msg's health status
properly before calling lnet_finalize().

Test-Parameters: trivial
Signed-off-by: Amir Shehata <ashehata@whamcloud.com>
Change-Id: I168c8ca32192ba4f8f4480e8e7384d45f0a062a7
Reviewed-on: https://review.whamcloud.com/33096
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Doug Oucharek <dougso@me.com>
Reviewed-by: Sonia Sharma <sharmaso@whamcloud.com>
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-11283 lnet: fix setting health value manually 74/33074/3
Amir Shehata [Fri, 24 Aug 2018 19:08:31 +0000 (12:08 -0700)]
LU-11283 lnet: fix setting health value manually

Changed the health value option letter to 't' instead
of 'h' to avoid a conflict with the help option.
Don't increment optind unnecessarily.
In the handling code in api-ni.c there was a missing return.
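
The option-letter conflict can be reproduced with Python's getopt
module. The option strings below are hypothetical and are not taken
from lnetctl; they only show why one letter cannot serve both as a
bare help flag and as an option that takes a value:

```python
import getopt


def parse(argv, optstring):
    """Parse short options and return them as a dict."""
    opts, _ = getopt.getopt(argv, optstring)
    return dict(opts)


# New scheme: 'h' is a flag (help), 't' takes the health value.
print(parse(["-t", "50"], "ht:"))
print(parse(["-h"], "ht:"))

# Old scheme: 'h' takes the health value, so a bare -h cannot mean help.
try:
    parse(["-h"], "h:")
except getopt.GetoptError as err:
    print("error:", err)
```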

Test-Parameters: trivial
Signed-off-by: Amir Shehata <ashehata@whamcloud.com>
Change-Id: Ib56f19d1781e44297158c64c5b26004220a1db21
Reviewed-on: https://review.whamcloud.com/33074
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Sonia Sharma <sharmaso@whamcloud.com>
Reviewed-by: Doug Oucharek <dougso@me.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-11245 flr: lfs mirror dump command 97/32997/5
Bobi Jam [Tue, 14 Aug 2018 04:53:15 +0000 (12:53 +0800)]
LU-11245 flr: lfs mirror dump command

Add an "lfs mirror dump" command to dump the content of one mirror
of a mirrored file.

Usage:

lfs mirror dump <-N|--mirror-id <mirror_id>>
[-o|--outfile <output_file> ] <mirrored_file>

Options:

--mirror-id <mirror_id>
  Specify the mirror by mirror_id to dump its content.
--outfile <output_file>
  Dump the content into the file output_file; if not specified, the
  content will be dumped to the standard output stream.

Test-Parameters: testlist=sanity-flr
Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Change-Id: I2846b1c1968d9cc302c07b57a05687af50530d53
Reviewed-on: https://review.whamcloud.com/32997
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Jinshan Xiong <jinshan.xiong@gmail.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-11057 tests: exercise 'network' client mount option 90/32590/10
Sebastien Buisson [Thu, 31 May 2018 09:22:11 +0000 (11:22 +0200)]
LU-11057 tests: exercise 'network' client mount option

Test the 'network' client mount option, which limits connections
from the client to the network NID specified.

This option is currently not compatible with Dynamic Peer Discovery.
test_31 will have to be updated once Dynamic Peer Discovery is
modified so that it does not conflict with 'network' client mount
option.

Test-Parameters: trivial testlist=sanity-sec
Signed-off-by: Sebastien Buisson <sbuisson@ddn.com>
Change-Id: I4631a034137a346d0bb4efed20983e2a8bd7b0ec
Reviewed-on: https://review.whamcloud.com/32590
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Amir Shehata <ashehata@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-10961 ldlm: don't cancel DoM locks before replay 91/32791/6
Mikhail Pershin [Thu, 5 Jul 2018 10:18:06 +0000 (13:18 +0300)]
LU-10961 ldlm: don't cancel DoM locks before replay

Weigh DoM locks before lock replay, as is done for
OSC EXTENT locks, and don't cancel locks with data.

Add DoM replay tests for file creation and write cases.

Signed-off-by: Mikhail Pershin <mpershin@whamcloud.com>
Change-Id: If706835455c2b25c01cfcde033fccc4f3c67ee24
Reviewed-on: https://review.whamcloud.com/32791
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Patrick Farrell <paf@cray.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-11270 mdt: better DOM data discard 26/33026/6
Mikhail Pershin [Sat, 18 Aug 2018 18:50:53 +0000 (21:50 +0300)]
LU-11270 mdt: better DOM data discard

Discard DoM data once all objects are put and unlocked, so
the current thread waits for the discard but resources are
not blocked by it. Also cancel the discard lock immediately.

The patch also adds the missing part of lock prolongation for DoM
locks and fixes tgt_blocking_ast() so that it does not miss a call to
ldlm_server_blocking_ast() if lu_env_init() fails.

Signed-off-by: Mikhail Pershin <mpershin@whamcloud.com>
Change-Id: I09aaa08e28d2a280ca1f9939315311c45d0334b5
Reviewed-on: https://review.whamcloud.com/33026
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-9855 include: replace OBD_CHECK_DEV by obd_check_dev 01/33101/2
Aya Mahfouz [Sat, 1 Sep 2018 19:03:55 +0000 (15:03 -0400)]
LU-9855 include: replace OBD_CHECK_DEV by obd_check_dev

Static inline functions are preferred over macros. Hence, the function
obd_check_dev was introduced. obd_check_dev replaces the macro
OBD_CHECK_DEV. All functions that call obd_check_dev store the return
values and return them if they represent an error code.
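
The caller pattern described above, modelled in Python for
illustration. The real obd_check_dev() is a C static inline; the
-ENODEV return for a missing device and the caller setup_device() are
assumptions of this sketch:

```python
import errno


def obd_check_dev(obd):
    """Return 0 for a usable device, a negative errno otherwise."""
    if obd is None:
        return -errno.ENODEV  # assumption: error used for a NULL device
    return 0


def setup_device(obd):
    """Hypothetical caller: store the result and return it on error."""
    rc = obd_check_dev(obd)
    if rc:
        return rc
    # ... continue with the real work once the device is known good ...
    return 0


print(setup_device(None))
print(setup_device(object()))
```

Unlike a macro that returns from the enclosing function behind the
caller's back, the function version keeps the early return visible at
each call site.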

Some of the changes were carried out manually while others were done
using coccinelle.

Linux-commit: 4ca1b8fbe609b7c52aeaa78a5f4678efcc9b8d73

Test-Parameters: trivial

Signed-off-by: Aya Mahfouz <mahfouz.saif.elyazal@gmail.com>
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Signed-off-by: James Simmons <uja.ornl@yahoo.com>
Change-Id: Iad96cc151a711892d950bd2ccaa93b13d6b23ae5
Reviewed-on: https://review.whamcloud.com/33101
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Ben Evans <bevans@cray.com>
2 years ago LU-11328 ldlm: fix compile problem with --disable-lru-size 03/33103/2
Li Dongyang [Mon, 3 Sep 2018 07:36:34 +0000 (17:36 +1000)]
LU-11328 ldlm: fix compile problem with --disable-lru-size

Add the missing type change for ldlm_pool_recalc().

Test-Parameters: trivial
Signed-off-by: Li Dongyang <dongyangli@ddn.com>
Change-Id: I5b96dbe79e9b202888cf0f23a947c2d6c0f0d8d8
Reviewed-on: https://review.whamcloud.com/33103
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
2 years ago LU-10740 tests: disable tests for replay-dne-zfs-part-4 06/33106/4
Andreas Dilger [Mon, 3 Sep 2018 23:08:08 +0000 (07:08 +0800)]
LU-10740 tests: disable tests for replay-dne-zfs-part-4

Disable a few tests, by adding them to the ALWAYS_EXCEPT list,
that are constantly failing for replay-single so that
review-dne-zfs-part-4 might normally pass.

Add a potential fix for replay-ost-single test_6 and test_7 that
calculates the log size for DNE+ZFS systems more accurately with
multiple MDTs.

Test-Parameters: trivial testgroup=review-dne-zfs-part-4
Test-Parameters: trivial testgroup=review-dne-zfs-part-3
Test-Parameters: testlist=sanity-sec
Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Signed-off-by: James Nunez <jnunez@whamcloud.com>
Change-Id: I2316baeacbb18686d2958265f7128e95033ebbe5
Reviewed-on: https://review.whamcloud.com/33106
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: John L. Hammond <jhammond@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-11304 misc: update all url links to whamcloud 94/33094/2
Wang Shilong [Thu, 30 Aug 2018 11:46:36 +0000 (19:46 +0800)]
LU-11304 misc: update all url links to whamcloud

Even though the old links redirect to whamcloud automatically,
we should update them to the new whamcloud links to
avoid any further confusion.

Test-parameters: trivial
Change-Id: Ida7161a062d822141bf0c1fdf20b2098a21ea9e7
Signed-off-by: Wang Shilong <wshilong@ddn.com>
Reviewed-on: https://review.whamcloud.com/33094
Reviewed-by: Peter Jones <pjones@whamcloud.com>
Tested-by: Jenkins
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
2 years ago LU-11279 lod: reset ostlist properly in lod_get_default_lov_striping 69/33069/7
Wang Shilong [Fri, 24 Aug 2018 06:49:06 +0000 (14:49 +0800)]
LU-11279 lod: reset ostlist properly in lod_get_default_lov_striping

The ostlist might have been allocated previously and should be
reset properly; otherwise it will pollute the new
default setting and cause unexpected behavior.

Test-Parameters: trivial testlist=sanity,sanity,sanity,sanity,sanity,sanity,sanity,sanity,sanity
Change-Id: I9b7acb5f05ec4b371da99f68b9647f0b75cd7021
Signed-off-by: Wang Shilong <wshilong@ddn.com>
Reviewed-on: https://review.whamcloud.com/33069
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Bobi Jam <bobijam@hotmail.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-11006 lnet: fix show peer yaml tree with no peer 20/32320/2
Sonia Sharma [Tue, 8 May 2018 04:08:03 +0000 (21:08 -0700)]
LU-11006 lnet: fix show peer yaml tree with no peer

When no peer exists, the root created for the peer
yaml tree should be deleted, and lnetctl peer show
should not display anything.

Currently lnetctl peer show shows the root string "peer"
even when there is no peer. This creates issues when
starting lnet using /etc/lnet.conf derived from the
existing configuration.
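
A minimal Python sketch of the intended behaviour, using a plain dict
in place of the YAML tree. The function build_show_tree() and the
per-peer key are illustrative only and are not taken from the lnetctl
sources:

```python
def build_show_tree(peers):
    """Return the tree to be emitted; omit the 'peer' root when empty."""
    tree = {}
    if peers:
        tree["peer"] = [{"primary nid": nid} for nid in peers]
    return tree


print(build_show_tree([]))
print(build_show_tree(["10.0.0.1@tcp"]))
```

With no peers the result has no "peer" key at all, so a configuration
file generated from it does not contain an empty "peer" root.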

Change-Id: Ie310a49e60386b579b48898b032467b1bc112da9
Test-Parameters: trivial
Signed-off-by: Sonia Sharma <sonia.sharma@intel.com>
Reviewed-on: https://review.whamcloud.com/32320
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Amir Shehata <ashehata@whamcloud.com>
Reviewed-by: Olaf Weber <olaf.weber@hpe.com>
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-8130 libcfs: prepare rhashtable support 02/32102/27
James Simmons [Sun, 19 Aug 2018 13:25:52 +0000 (09:25 -0400)]
LU-8130 libcfs: prepare rhashtable support

Linux has a resizable hashtable implementation in lib,
so we should use that instead of having one in libcfs.
In the process we gain lockless lookup, which should be
a performance boost. All modern distributions that Lustre
supports have rhashtable support, but a few pieces are
missing for systems running a 4.4 kernel. The other
target platforms have the full implementation we need.

Test-Parameters: trivial

Change-Id: I63d5b7dae9d52eed12dbefed8ca6062af33efd30
Signed-off-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-on: https://review.whamcloud.com/32102
Reviewed-by: John L. Hammond <jhammond@whamcloud.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
2 years ago LU-10654 lnet: lnetctl doesn't error out on mistyped options 65/31265/3
Sonia Sharma [Sun, 11 Feb 2018 18:02:18 +0000 (10:02 -0800)]
LU-10654 lnet: lnetctl doesn't error out on mistyped options

Running an lnetctl command to add/delete a peer/net/route should
error out on mistyped options.

This patch adds the changes in lnetctl.c to make lnetctl
error out on mistyped options.
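
The desired behaviour, sketched with Python's getopt module. The
--nid option and the run() helper are hypothetical and only stand in
for a command's option table; this is not the lnetctl.c code:

```python
import getopt


def run(argv):
    """Return (exit_code, result); any unknown option is an error."""
    try:
        opts, extra = getopt.getopt(argv, "", ["nid="])
    except getopt.GetoptError as err:
        return 1, str(err)
    if extra:
        return 1, "unexpected argument: " + extra[0]
    return 0, dict(opts)


print(run(["--nid", "10.0.0.1@tcp"]))
print(run(["--nidd", "10.0.0.1@tcp"]))
```

The second call fails instead of silently ignoring the misspelled
option, which is the behaviour the patch asks of lnetctl.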

Change-Id: Ib8ae54bea919d6ff235b4ca3a23807a809f8962c
Test-Parameters: trivial
Signed-off-by: Sonia Sharma <sonia.sharma@intel.com>
Reviewed-on: https://review.whamcloud.com/31265
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Dmitry Eremin <dmitry.eremin@intel.com>
Reviewed-by: Amir Shehata <ashehata@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years ago LU-11273 lnet: update logging 44/33044/2
Amir Shehata [Tue, 21 Aug 2018 19:29:27 +0000 (12:29 -0700)]
LU-11273 lnet: update logging

Add the retry count when logging message sending/resending.
Make timed out responses visible on net error.
Log cases when a message is not resent.

Test-Parameters: trivial
Signed-off-by: Amir Shehata <ashehata@whamcloud.com>
Change-Id: I0908d495c8ba54754fa77b0fc3b5df59317bb2e8
Reviewed-on: https://review.whamcloud.com/33044
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Olaf Weber <olaf.weber@hpe.com>
Reviewed-by: Doug Oucharek <dougso@me.com>
Reviewed-by: Sonia Sharma <sharmaso@whamcloud.com>
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11195 lod: Mark comps cached on replay of layout change 04/32904/2
Ann Koehler [Mon, 30 Jul 2018 21:02:59 +0000 (16:02 -0500)]
LU-11195 lod: Mark comps cached on replay of layout change

Replay of a layout change request on a PFL file leaves the object
in an unexpected state: Some components can have llc_stripe set
but ldo_comp_cached is not set in the object. The next layout
change request on the same object will LBUG when it tries to free
the comp entries.

The fix is to set ldo_comp_cached on replay so subsequent layout
change requests will use the in memory components rather than
fetching them from disk.

Signed-off-by: Ann Koehler <amk@cray.com>
Change-Id: I8eaee5614c7f2f6e6a3f2c51de93a65422a3122b
Reviewed-on: https://review.whamcloud.com/32904
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Bobi Jam <bobijam@hotmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-10818 obdecho: don't set ma_need in echo_attr_get_complex() 97/33097/2
Nikitas Angelinas [Fri, 31 Aug 2018 08:04:18 +0000 (11:04 +0300)]
LU-10818 obdecho: don't set ma_need in echo_attr_get_complex()

echo_attr_get_complex() copies ma_need to a local variable, masks
MA_* values other than MA_INODE if MA_INODE is set in ma_need,
and restores the saved value of ma_need before the function exits.
This does not seem to be useful, and triggers an assertion in
echo_big_lmm_get() when MA_LOV and/or MA_LMV is set in ma_need.

Signed-off-by: Nikitas Angelinas <nangelinas@cray.com>
Cray-bug-id: LUS-6252
Reviewed-by: Patrick Farrell <paf@cray.com>
Reviewed-by: Andrew Perepechko <c17827@cray.com>
Change-Id: I3f5a01b57bdd83937f19fd1fa392b53f7b316455
Reviewed-on: https://review.whamcloud.com/33097
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Jenkins
Reviewed-by: Lai Siyao <lai.siyao@whamcloud.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11057 obd: check '-o network' and peer discovery conflict 62/32562/5
Sebastien Buisson [Fri, 25 May 2018 16:15:25 +0000 (01:15 +0900)]
LU-11057 obd: check '-o network' and peer discovery conflict

The "-o network=net" client mount option is not taken into account
when LNet dynamic peer discovery is active.
Check if LNet dynamic peer discovery is active on the local node. If it
is, return an error if the "-o network=net" option is specified.

This patch will have to be reverted when the incompatibility between
"-o network=net" client mount option and LNet dynamic peer discovery
is resolved.

Signed-off-by: Sebastien Buisson <sbuisson@ddn.com>
Change-Id: I0520e58b22b7adecf797fbd351506c2f8712dc85
Reviewed-on: https://review.whamcloud.com/32562
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Amir Shehata <ashehata@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11272 lnet: router handling 43/33043/3
Amir Shehata [Tue, 21 Aug 2018 19:23:26 +0000 (12:23 -0700)]
LU-11272 lnet: router handling

Re-create the md and mdh if the router checker ping times out.
When re-transmitting a message do so even if the peer is marked down
to fulfill the message's retry quota.

Test-Parameters: trivial
Signed-off-by: Amir Shehata <ashehata@whamcloud.com>
Change-Id: I7b2a1ec6602dac9a112f4d318b0512f68f923969
Reviewed-on: https://review.whamcloud.com/33043
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Olaf Weber <olaf.weber@hpe.com>
Reviewed-by: Sonia Sharma <sharmaso@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11271 lnd: conditionally set health status 42/33042/2
Amir Shehata [Tue, 21 Aug 2018 19:15:30 +0000 (12:15 -0700)]
LU-11271 lnd: conditionally set health status

For specific error scenarios a more accurate health status is set
per transmit. These shouldn't be overwritten in
kiblnd_txlist_done().

Test-Parameters: trivial
Signed-off-by: Amir Shehata <ashehata@whamcloud.com>
Change-Id: I6c3ad6660aa654d32e823b29ebe3aedb9fc5508e
Reviewed-on: https://review.whamcloud.com/33042
Tested-by: Jenkins
Reviewed-by: Olaf Weber <olaf.weber@hpe.com>
Reviewed-by: Sonia Sharma <sharmaso@whamcloud.com>
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11275 llite: check truncate race for DOM pages 87/33087/2
Mikhail Pershin [Tue, 28 Aug 2018 10:06:21 +0000 (13:06 +0300)]
LU-11275 llite: check truncate race for DOM pages

In ll_dom_finish_open(), check that the vmpage mapping still
exists after locking, and exit otherwise. The mapping can
disappear if the page has been truncated concurrently.

Signed-off-by: Mikhail Pershin <mpershin@whamcloud.com>
Change-Id: Ib6ef551673a40ad99baaa9bd620225c65ce34454
Reviewed-on: https://review.whamcloud.com/33087
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
2 years agoLU-11282 osd-zfs: drop cache immediately 72/33072/7
Alex Zhuravlev [Fri, 24 Aug 2018 10:43:43 +0000 (14:43 +0400)]
LU-11282 osd-zfs: drop cache immediately

Drop the cache immediately if this is requested via:
  lctl set_param osd-zfs.*.readcache_max_filesize=<bytes>

Dropping the cache at read is almost free, but may take a few
cycles at write, as we have to find the corresponding dbufs.

Change-Id: I107fc1bf5a8d7655da4054048ff07d3dffa9d4d8
Signed-off-by: Alex Zhuravlev <bzzz@whamcloud.com>
Reviewed-on: https://review.whamcloud.com/33072
Tested-by: Jenkins
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-9855 obdclass: simplify md_stats code 22/32822/12
James Simmons [Fri, 10 Aug 2018 16:19:30 +0000 (12:19 -0400)]
LU-9855 obdclass: simplify md_stats code

The md_stats code is layered in many levels of macros that make
the code difficult to read and can introduce undetected
errors. This peels away the macro wrappers by replacing them with
the function lprocfs_exp_count_increment(), which doesn't care
about the order of the function pointers in struct md_ops. The
other change is to the macros used for initializing the counters:
lprocfs_init_mps_stats() is replaced with very simple
handling in lprocfs_alloc_md_stats().

Change-Id: I036ce4518ffb08d53e2d27bcdea564a4c799181d
Signed-off-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-on: https://review.whamcloud.com/32822
Reviewed-by: Ben Evans <bevans@cray.com>
Reviewed-by: John L. Hammond <jhammond@whamcloud.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11110 ofd: remove obdfilter.*.* symlinks in few releases 52/32752/5
Emoly Liu [Mon, 2 Jul 2018 06:53:00 +0000 (14:53 +0800)]
LU-11110 ofd: remove obdfilter.*.* symlinks in few releases

Add a #if LUSTRE_VERSION_CODE < OBD_OCD_VERSION(2, 14, 53, 0) check
around the obdfilter.*.* symlinks creation code to keep them in
place for another few releases, so that old test scripts that use
them will not break, then remove them.
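
A rough sketch of how such a version gate behaves, under the
assumption that the version code packs one field per byte. EX_VERSION()
and ex_symlinks_enabled() are hypothetical stand-ins, not the Lustre
macros themselves, and the check is done at run time here only so it
can be exercised; the patch uses a preprocessor #if.

```c
/* Hypothetical packing: major.minor.patch.fix, one byte each. */
#define EX_VERSION(major, minor, patch, fix)			\
	(((unsigned int)(major) << 24) + ((unsigned int)(minor) << 16) + \
	 ((unsigned int)(patch) << 8) + (unsigned int)(fix))

/* Version at which the compatibility symlinks stop being created. */
#define EX_SYMLINK_CUTOFF EX_VERSION(2, 14, 53, 0)

/*
 * Returns 1 while the running version is older than the cutoff,
 * that is, while the compatibility symlinks would still be created.
 */
static int ex_symlinks_enabled(unsigned int version_code)
{
	return version_code < EX_SYMLINK_CUTOFF;
}
```

Because the fields are packed most-significant first, a plain integer
comparison orders versions correctly as long as each field fits in a
byte.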

Change-Id: I703c7ec3af8434b0de8b7cbed19c2c32611f6b18
Signed-off-by: Emoly Liu <emoly@whamcloud.com>
Reviewed-on: https://review.whamcloud.com/32752
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: James Nunez <jnunez@whamcloud.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11075 ldlm: correct logic in ldlm_prepare_lru_list() 60/32660/2
John L. Hammond [Thu, 7 Jun 2018 17:08:42 +0000 (12:08 -0500)]
LU-11075 ldlm: correct logic in ldlm_prepare_lru_list()

In ldlm_prepare_lru_list() fix an (x != a || x != b) type error and
correct a use after free.
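
For readers unfamiliar with this class of bug, a small illustration
follows. The enum and function names are hypothetical and are not the
ones used in ldlm_prepare_lru_list(); the point is only that
(x != a || x != b) is true for every x when a and b differ, so the
usual intent is (x != a && x != b).

```c
#include <stdbool.h>

/* Hypothetical result codes, for illustration only. */
enum ex_result { EX_CONTINUE, EX_KEEP, EX_SKIP };

/* Buggy form: no value can equal both EX_KEEP and EX_SKIP at once,
 * so at least one of the two inequalities always holds. */
static bool ex_buggy_should_stop(enum ex_result r)
{
	return r != EX_KEEP || r != EX_SKIP;
}

/* Likely intended form: true only when r is neither value. */
static bool ex_fixed_should_stop(enum ex_result r)
{
	return r != EX_KEEP && r != EX_SKIP;
}
```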

Signed-off-by: John L. Hammond <john.hammond@intel.com>
Change-Id: I4e34e531260295805c4461e7d8d98675400f1148
Reviewed-on: https://review.whamcloud.com/32660
Tested-by: Jenkins
Reviewed-by: Mike Pershin <mpershin@whamcloud.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-10365 tests: set proper paths headers for sanity test 400a 37/31737/8
James Simmons [Tue, 21 Aug 2018 18:39:57 +0000 (14:39 -0400)]
LU-10365 tests: set proper paths headers for sanity test 400a

For the case when /usr/include/lustre doesn't exist sanity test
400a attempts to use the lustre user land headers located in the
source tree. Some of the lustre user land headers are wrappers
around the UAPI headers so we need to include those paths as well.

A test move was done in the linux kernel that moved the UAPI headers
to their proper place. Errors were reported mainly due to
linux/types.h being missing. This could be the reason Ubuntu18 fails
the sanity 400a test.

Test-Parameters: trivial clientdistro=ubuntu1604 testlist=sanity envdefinitions=ONLY=400a

Change-Id: If17da7d9fc4cedb3b9c18feaafbee47d1f94d49b
Signed-off-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-on: https://review.whamcloud.com/31737
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: James Nunez <jnunez@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-6142 obdclass: Fix style issues for lu_ref.c 81/33081/4
Arshad Hussain [Sat, 25 Aug 2018 22:00:07 +0000 (03:30 +0530)]
LU-6142 obdclass: Fix style issues for lu_ref.c

This patch fixes issues reported by checkpatch
for file lustre/obdclass/lu_ref.c

Change-Id: I8733fcac454685704b327219ba4afb096d3943c3
Test-Parameters: trivial
Signed-off-by: Arshad Hussain <arshad.super@gmail.com>
Reviewed-on: https://review.whamcloud.com/33081
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
2 years agoLU-6142 obdclass: Fix style issues for lustre_handles.c 80/33080/2
Arshad Hussain [Sat, 25 Aug 2018 21:39:24 +0000 (03:09 +0530)]
LU-6142 obdclass: Fix style issues for lustre_handles.c

This patch fixes issues reported by checkpatch
for file lustre/obdclass/lustre_handles.c

Change-Id: I6e6ad8c56e225dcdd3707bf5f3b233eda3f90320
Test-Parameters: trivial
Signed-off-by: Arshad Hussain <arshad.super@gmail.com>
Reviewed-on: https://review.whamcloud.com/33080
Tested-by: Jenkins
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
2 years agoLU-6142 obdclass: Fix style issues for lustre_peer.c 79/33079/3
Arshad Hussain [Sat, 25 Aug 2018 18:25:59 +0000 (23:55 +0530)]
LU-6142 obdclass: Fix style issues for lustre_peer.c

This patch fixes issues reported by checkpatch
for file lustre/obdclass/lustre_peer.c

Change-Id: I6cf95dfdd709974cae62626ac50a3507588f425d
Test-Parameters: trivial
Signed-off-by: Arshad Hussain <arshad.super@gmail.com>
Reviewed-on: https://review.whamcloud.com/33079
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
2 years agoLU-11255 kernel: kernel update [SLES12 SP3 4.4.143-94.47] 65/33065/3
Jian Yu [Fri, 24 Aug 2018 06:36:34 +0000 (23:36 -0700)]
LU-11255 kernel: kernel update [SLES12 SP3 4.4.143-94.47]

Update SLES12 SP3 kernel to 4.4.143-94.47.

Test-Parameters: mdtfilesystemtype=ldiskfs ostfilesystemtype=ldiskfs \
clientdistro=sles12sp3 ossdistro=sles12sp3 mdsdistro=sles12sp3 \
testgroup=review-ldiskfs

Change-Id: I8b2c99c9a65149f1b149fa91351970034d6f7a47
Signed-off-by: Jian Yu <yujian@whamcloud.com>
Reviewed-on: https://review.whamcloud.com/33065
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Yang Sheng <ys@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11125 ofd: decrease message level 85/32985/5
Mikhail Pershin [Mon, 13 Aug 2018 14:34:30 +0000 (17:34 +0300)]
LU-11125 ofd: decrease message level

The "destroys_in_progress already cleared" message
in ofd_create_hdl() may be the result of high load on the OST
server prior to failover. It is not an error, so decrease
its level to D_HA from D_ERROR.

Signed-off-by: Mikhail Pershin <mpershin@whamcloud.com>
Change-Id: Id5142672a61244a6362be3778d0769baafc87b86
Reviewed-on: https://review.whamcloud.com/32985
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Lai Siyao <lai.siyao@whamcloud.com>
2 years agoLU-10509 mdd: don't set size attr for DOM file 08/33008/2
Mikhail Pershin [Wed, 15 Aug 2018 21:15:03 +0000 (00:15 +0300)]
LU-10509 mdd: don't set size attr for DOM file

When a client truncates a file, it calls ll_md_setattr() followed by
ll_setattr_ost() to set the size on the OSTs. With a DOM file, that
causes a setattr on the MDT first, including the size, and then a
PUNCH RPC on the same object. That was considered a non-optimized
situation, and LU-11033 is intended to improve it, but with ZFS there
is a check in the OSD that skips the truncate if the size is already
the same. Therefore the real file blocks are not actually truncated,
so a sparse write beyond the end of the file will expose old data in
the hole instead of zeroes.

The quick patch checks whether mdd_attr_set() is going to set the SIZE
attr for a DOM file and clears the LA_SIZE bit, assuming a truncate
will follow.

The complete solution for this will be implemented under LU-11033.

Signed-off-by: Mikhail Pershin <mpershin@whamcloud.com>
Change-Id: I47873dccf4270e5f0338f7b6696aa5969cfb9444
Reviewed-on: https://review.whamcloud.com/33008
Tested-by: Jenkins
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: John L. Hammond <jhammond@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-10018 protocol: MDT as a statfs proxy 36/29136/91
Alex Zhuravlev [Thu, 21 Sep 2017 15:24:18 +0000 (18:24 +0300)]
LU-10018 protocol: MDT as a statfs proxy

The MDT can act as a proxy for statfs data. This should
make df faster (RTT vs RTT*(#MDTs+1)) and enable
idling connections, so that clients don't connect to
each OST just to report statfs data. The protocol
has been changed slightly to let the MDT differentiate
self and aggregated statfs.

Also, obd_statfs has got a new field "granted" where
the OST reports how much space has been granted to the
requesting MDT, so that space can be added to available
space.

The client's NID is used to distribute MDS_STATFS among
MDTs.

Change-Id: I59e03cb5abf809ae8820f874ec51dd2b74e1806c
Signed-off-by: Alex Zhuravlev <alexey.zhuravlev@intel.com>
Reviewed-on: https://review.whamcloud.com/29136
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Mike Pershin <mpershin@whamcloud.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-8708 osc: enable/disable OSC grant shrink 03/23203/18
Bobi Jam [Mon, 17 Oct 2016 09:50:41 +0000 (17:50 +0800)]
LU-8708 osc: enable/disable OSC grant shrink

Add an OSC proc interface to enable/disable client's grant shrink
feature.

lctl get_param osc.*.grant_shrink
lctl set_param osc.*.grant_shrink={0,1}

Change-Id: I7974b3bf1c4f9c294dd0d4871d09b1a2e45a8d78
Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-on: https://review.whamcloud.com/23203
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11212 lod: preserve mirror ID on mirror extension 38/32938/7
Bobi Jam [Tue, 12 Jun 2018 11:28:16 +0000 (19:28 +0800)]
LU-11212 lod: preserve mirror ID on mirror extension

When merging/expanding existing mirrors of a FLR file, we need to keep
each existing mirror's mirror ID.

Signed-off-by: Bobi Jam <bobijam.xu@intel.com>
Change-Id: If139076c37c33bb1a330e1a5e997f8f56015fd9a
Reviewed-on: https://review.whamcloud.com/32938
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Jinshan Xiong <jinshan.xiong@gmail.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11238 lod: refine obj avoid collect for FLR 95/32995/4
Bobi Jam [Tue, 14 Aug 2018 03:12:12 +0000 (11:12 +0800)]
LU-11238 lod: refine obj avoid collect for FLR

When a FLR file is being created, the MDS tries to allocate objects
for the first components of all mirrors. In this declare phase,
the objects for those components have been allocated, but the
component's ID and init flag are not set until the exec phase,
lod_create()->lod_striped_create(), so lod_collect_avoidance() should
take this scenario into account.

This patch also adds some debug messages.

Test-Parameters: testlist=sanity-flr,sanity-flr,sanity-flr,sanity-flr,sanity-flr
Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Change-Id: I16ef2da44f6db06a8e0bc67ae2646cdc3ff3bb63
Reviewed-on: https://review.whamcloud.com/32995
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: John L. Hammond <jhammond@whamcloud.com>
Reviewed-by: Jinshan Xiong <jinshan.xiong@gmail.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11259 test: correct fail_loc names in replay-{single,dual} 15/33015/2
John L. Hammond [Thu, 16 Aug 2018 14:37:23 +0000 (09:37 -0500)]
LU-11259 test: correct fail_loc names in replay-{single,dual}

Some comments in replay-single and replay-dual confusingly referred to
OBD_FAIL_OUT_UPDATE_NET_REP as OBD_FAIL_OBJ_UPDATE_NET_REP or
OBD_FAIL_UPDATE_OBJ_NET_REP. Correct these.

Test-Parameters: trivial
Signed-off-by: John L. Hammond <jhammond@whamcloud.com>
Change-Id: Ib724e8151ba0ea34a5dacf2f148673a52dc37824
Reviewed-on: https://review.whamcloud.com/33015
Tested-by: Jenkins
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: James Nunez <jnunez@whamcloud.com>
2 years agoLU-8066 tests: don't access /proc/sys/lnet/debug directly 05/33005/2
Andreas Dilger [Wed, 15 Aug 2018 17:39:15 +0000 (11:39 -0600)]
LU-8066 tests: don't access /proc/sys/lnet/debug directly

In replay-single test_70e use "lctl set_param" to set the debug mask
rather than writing into the /proc/sys/lnet/debug file directly, since
this tunable moved to sysfs in commit v2_10_51_0-12-g7092309f32.

Clean up the test code style in test_70e as well.

Test-Parameters: trivial testlist=replay-single
Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: I65c2bd9643fc6fc54a5de7b6404d316c0ff12537
Reviewed-on: https://review.whamcloud.com/33005
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Doug Oucharek <dougso@me.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11240 gnilnd: Replace KGNILND_BUILD_REV 90/32990/2
Chris Horn [Tue, 5 Jun 2018 19:32:32 +0000 (14:32 -0500)]
LU-11240 gnilnd: Replace KGNILND_BUILD_REV

The current format of the gnilnd version string causes a compilation
error. Since gnilnd doesn't really need its own version string we just
replace it with LUSTRE_VERSION_STRING.

Cray-bug-id: LUS-6072
Test-Parameters: trivial
Signed-off-by: Chris Horn <hornc@cray.com>
Change-Id: I6f45df2566853a6f4c2078cf72c7eac7a52f3fad
Reviewed-on: https://review.whamcloud.com/32990
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-by: Chuck Fossen <chuckf@cray.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11135 mdt: LASSERT(lu_object_exists(o)) fails 03/32803/4
Andriy Skulysh [Thu, 3 May 2018 10:07:22 +0000 (13:07 +0300)]
LU-11135 mdt: LASSERT(lu_object_exists(o)) fails

mdt_object_find() can return a valid nonexistent object.
Its return value needs to be checked additionally for existence.
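
The pattern can be sketched in plain C as follows. All names here
(ex_object, ex_object_find(), ex_object_read()) are hypothetical
stand-ins used for illustration, not the MDT API: the lookup hands back
a usable handle even when the object has not been created, so the
caller has to test existence separately before using it.

```c
#include <errno.h>
#include <stddef.h>

/* Hypothetical object: a handle can be valid while exists == 0. */
struct ex_object {
	int exists;
	int value;
};

/* Hypothetical lookup: returns a handle for any in-range index,
 * whether or not the object behind it has been created. */
static struct ex_object *ex_object_find(struct ex_object *table,
					size_t count, size_t idx)
{
	return idx < count ? &table[idx] : NULL;
}

/* A successful lookup is not enough: check existence as well. */
static int ex_object_read(struct ex_object *table, size_t count,
			  size_t idx, int *out)
{
	struct ex_object *obj = ex_object_find(table, count, idx);

	if (obj == NULL)
		return -EINVAL;
	if (!obj->exists)
		return -ENOENT;

	*out = obj->value;
	return 0;
}
```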

Change-Id: Ib1f5bd5289a69e29437db520706591929bf55830
Cray-bug-id: LUS-6192
Signed-off-by: Andriy Skulysh <c17819@cray.com>
Reviewed-on: https://review.whamcloud.com/32803
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Alexandr Boyko <c17825@cray.com>
Reviewed-by: Andrew Perepechko <c17827@cray.com>
Reviewed-by: Alex Zhuravlev <bzzz@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11056 lwp: fix lwp reconnection issue 36/32536/6
Hongchao Zhang [Thu, 24 May 2018 20:09:27 +0000 (16:09 -0400)]
LU-11056 lwp: fix lwp reconnection issue

After the OST or MDT is restarted, the lwp reconnection can
fail with -EALREADY because the connect count in the connection
request is less than the value saved in the corresponding export
at MDT0000, which could cause the system to hang.

The patch also changes lustre_lwp_connect to use OBD_CONNECT_MDS_MDS
flag only when the connection is between MDTs.

Change-Id: I9ae7b4faadc65fdaa78458a06315b1739d144feb
Signed-off-by: Hongchao Zhang <hongchao.zhang@intel.com>
Reviewed-on: https://review.whamcloud.com/32536
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Mike Pershin <mpershin@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-10855 llog: use llog_common_cat_ops 12/31812/3
John L. Hammond [Tue, 27 Mar 2018 17:17:10 +0000 (12:17 -0500)]
LU-10855 llog: use llog_common_cat_ops

Remove changelog_orig_logops, hsm_actions_logops, and
osp_mds_ost_orig_logops, replacing each with llog_common_cat_ops.

Signed-off-by: John L. Hammond <john.hammond@intel.com>
Change-Id: Ia19337350452f9793b3ea9a56343ef3a065c1f83
Reviewed-on: https://review.whamcloud.com/31812
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Mike Pershin <mpershin@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-10824 llite: don't use ll_mnt to get fstype name 25/33025/3
James Simmons [Sat, 18 Aug 2018 16:07:12 +0000 (12:07 -0400)]
LU-10824 llite: don't use ll_mnt to get fstype name

Originally Lustre would report either 'lustre' or 'llite' in the
fstype proc file. This required us to query struct super_block,
but it's been a very long time since that was the case. This also
removes a direct use of ll_mnt. The fix is to simply report 'lustre'.

Test-Parameters: trivial

Change-Id: Ia766c8e0a027e58a48de8fa6e2756238e20312b2
Signed-off-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-on: https://review.whamcloud.com/33025
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Bobi Jam <bobijam@hotmail.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-6142 obdclass: Fix style issues for statfs_pack.c 81/32981/2
Arshad Hussain [Sun, 12 Aug 2018 04:18:54 +0000 (09:48 +0530)]
LU-6142 obdclass: Fix style issues for statfs_pack.c

This patch fixes issues reported by checkpatch
for file lustre/obdclass/statfs_pack.c

Change-Id: I7a34dd87875ab049c3339022f3153fb07937021e
Test-Parameters: trivial
Signed-off-by: Arshad Hussain <arshad.super@gmail.com>
Reviewed-on: https://review.whamcloud.com/32981
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
2 years agoLU-6142 osd-ldiskfs: Fix style issues for osd_handler.c 18/32818/5
Arshad Hussain [Sat, 14 Jul 2018 15:27:17 +0000 (20:57 +0530)]
LU-6142 osd-ldiskfs: Fix style issues for osd_handler.c

This patch fixes issues reported by checkpatch
for file lustre/osd-ldiskfs/osd_handler.c

Change-Id: Ifd6468acc75b59a4324385c68af1175a74a3c312
Test-Parameters: trivial
Signed-off-by: Arshad Hussain <arshad.super@gmail.com>
Reviewed-on: https://review.whamcloud.com/32818
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
2 years agoLU-11266 build: update changelog for Ubuntu 20/33020/3
Minh Diep [Fri, 17 Aug 2018 19:10:08 +0000 (12:10 -0700)]
LU-11266 build: update changelog for Ubuntu

Record the version that we are building.

Test-Parameters: trivial

Change-Id: Ib1c2e74774d8a6caa6c3f70814affb53cf8cd22e
Signed-off-by: Minh Diep <mdiep@whamcloud.com>
Reviewed-on: https://review.whamcloud.com/33020
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11009 test: add version check to test_102 40/32340/7
Wei Liu [Wed, 9 May 2018 19:53:11 +0000 (12:53 -0700)]
LU-11009 test: add version check to test_102

Skip test_102 if the server version is less than or equal to 2.9.53.

Test-Parameters:trivial testlist=conf-sanity envdefinitions=ONLY=102 serverjob=lustre-b2_9 serverbuildno=22
Signed-off-by: Wei Liu <sarah@whamcloud.com>
Change-Id: I1964a7a5df8b910652b2fe774703d7b62f953e95
Reviewed-on: https://review.whamcloud.com/32340
Reviewed-by: James Nunez <jnunez@whamcloud.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11040 utils: improve mount usage/man page 81/32481/6
Andreas Dilger [Wed, 25 Jul 2018 07:15:40 +0000 (01:15 -0600)]
LU-11040 utils: improve mount usage/man page

Improve the description of the mount.lustre.8 man page and usage:
- provide separate SYNOPSIS for client and server mount commands
- move "acl" option out of general options into server-only options,
  since client option was removed and ACLs are only controlled by MDS
- correct "CLIENT OPTIONS" section to be named "SERVER OPTIONS"
- add checksum, lruresize, lazystatfs, 32bitapi, user_fid2path usage
- mark the default values of the options in the usage message

Test-Parameters: trivial
Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: I28fe0f13d363e0a26ffcbc1ba9923e4fd35804f0
Reviewed-on: https://review.whamcloud.com/32481
Tested-by: Jenkins
Reviewed-by: Emoly Liu <emoly@whamcloud.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: James Nunez <jnunez@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11096 osd: wrap new blk integrity stuff 25/32725/9
Alex Zhuravlev [Thu, 9 Aug 2018 22:26:35 +0000 (18:26 -0400)]
LU-11096 osd: wrap new blk integrity stuff

to be able to build Lustre against kernels with no blk integrity.

Change-Id: I050020e94524f4519fdf46a22f0d847979754291
Signed-off-by: Alex Zhuravlev <bzzz@whamcloud.com>
Reviewed-on: https://review.whamcloud.com/32725
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Chris Horn <hornc@cray.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-8066 lov: fix lov.*.stripeoffset printing 19/33019/3
Andreas Dilger [Fri, 17 Aug 2018 18:48:44 +0000 (12:48 -0600)]
LU-8066 lov: fix lov.*.stripeoffset printing

The move of lov.*.stripeoffset from /proc to /sys in commit 3c900918
reverted the printing of stripeoffset from a signed value to an
unsigned value, which is broken for the common value of "-1".  This
was previously fixed in LU-9611 commit f93276d9.
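
To illustrate why the signedness of the format matters, here is a
small sketch. It assumes, purely for illustration, that the offset is
stored in a 64-bit unsigned field where the "-1" default is the
all-ones bit pattern; the helper names are hypothetical.

```c
#include <inttypes.h>
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>

/* Broken variant: the all-ones pattern prints as a huge number. */
static int ex_show_offset_unsigned(char *buf, size_t len, uint64_t off)
{
	return snprintf(buf, len, "%" PRIu64 "\n", off);
}

/* Fixed variant: the same bits, interpreted as signed, print as -1. */
static int ex_show_offset_signed(char *buf, size_t len, uint64_t off)
{
	return snprintf(buf, len, "%" PRId64 "\n", (int64_t)off);
}
```

Passing (uint64_t)-1 to the first helper yields
18446744073709551615, while the second yields -1, which is the value
users and scripts expect to see.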

Test-Parameters: trivial
Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: Ib61305ddbf902dd74ac0e16c0c2fe6920052ddf4
Reviewed-on: https://review.whamcloud.com/33019
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11215 tests: replace "large_xattr" with "ea_inode" 12/33012/2
Li Dongyang [Thu, 16 Aug 2018 06:26:12 +0000 (16:26 +1000)]
LU-11215 tests: replace "large_xattr" with "ea_inode"

Change the test scripts over to using the "ea_inode" name, since
this is what the upstream e2fsprogs is using.  The "large_xattr"
feature name was only ever used in the Lustre-patched e2fsprogs.

Don't try to turn off "ea_inode" feature on the targets anymore,
it's not supported by upstream e2fsprogs.

e2fsprogs commit: 5b72578279fe2470e682692a15d70a43d9289e0f

Test-Parameters: trivial testlist=conf-sanity
Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Signed-off-by: Li Dongyang <dongyangli@ddn.com>
Change-Id: I83bd303827fa28050d1d6d2416b2d630dc94ec12
Reviewed-on: https://review.whamcloud.com/33012
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
2 years agoLU-4256 test: add lustre-rsync-test 2b to ALWAYS_EXCEPT 06/33006/3
John L. Hammond [Wed, 15 Aug 2018 20:06:16 +0000 (15:06 -0500)]
LU-4256 test: add lustre-rsync-test 2b to ALWAYS_EXCEPT

This test continues to fail at a low rate so disable it.

Test-Parameters: trivial testlist=lustre-rsync-test
Signed-off-by: John L. Hammond <jhammond@whamcloud.com>
Change-Id: I8fe4d039e8edd0552e56ee9451cc05f08cb34c8d
Reviewed-on: https://review.whamcloud.com/33006
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11244 build: apply IB_OPTIONS to debian rules 96/32996/2
Jinshan Xiong [Tue, 14 Aug 2018 03:33:33 +0000 (20:33 -0700)]
LU-11244 build: apply IB_OPTIONS to debian rules

IB_OPTIONS should be honored when making the Debian package.

Signed-off-by: Jinshan Xiong <jinshan.xiong@uber.com>
Change-Id: Ibc16a5428d47f072499c39a62ea457c922ae7352
Reviewed-on: https://review.whamcloud.com/32996
Tested-by: Jenkins
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
Tested-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-by: Thomas Stibor <t.stibor@gsi.de>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Martin Schroeder <martin.h.schroeder@intel.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11227 lod: lod_sync: don't attempt sync to inactive targets 64/32964/5
Robin Humble [Thu, 9 Aug 2018 05:33:04 +0000 (15:33 +1000)]
LU-11227 lod: lod_sync: don't attempt sync to inactive targets

chgrp on a client triggers lod_sync(), which in turn loops over OST/MDT
targets with dt_sync(). dt_sync() fails with -ENOTCONN when targets
have been deactivated (i.e. set to active=0). The client retries
infinitely, causing the client process to hang and considerable MDS
network traffic, load, and disk I/O.

The fix is to not attempt dt_sync() to OST/MDT targets that have been
deactivated and also (because of possible races) to ignore connection
errors.
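
The shape of the fix can be sketched as below. The structure, the stub
sync callbacks, and ex_sync_all() are hypothetical illustrations and
not the lod code; only -ENOTCONN is treated as a connection error
here, since that is the one named above.

```c
#include <errno.h>
#include <stddef.h>

/* Hypothetical target descriptor. */
struct ex_target {
	int active;				/* 0 when deactivated */
	int (*sync)(struct ex_target *tgt);	/* per-target sync call */
};

/* Stub sync callbacks standing in for real targets. */
static int ex_sync_ok(struct ex_target *tgt)      { (void)tgt; return 0; }
static int ex_sync_notconn(struct ex_target *tgt) { (void)tgt; return -ENOTCONN; }
static int ex_sync_eio(struct ex_target *tgt)     { (void)tgt; return -EIO; }

/*
 * Sync every active target. Deactivated targets are skipped, and
 * -ENOTCONN from a target that raced with deactivation is ignored,
 * so the caller is not pushed into retrying forever.
 */
static int ex_sync_all(struct ex_target *tgts, size_t count)
{
	size_t i;

	for (i = 0; i < count; i++) {
		int rc;

		if (!tgts[i].active)
			continue;

		rc = tgts[i].sync(&tgts[i]);
		if (rc == -ENOTCONN)
			continue;
		if (rc != 0)
			return rc;
	}
	return 0;
}
```

Other errors, such as -EIO in the stub above, are still returned to
the caller.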

Tested with Lustre 2.10.4.

Signed-off-by: Robin Humble <plaguedbypenguins@gmail.com>
Change-Id: I617509cf7944541489f4fd9762c233b771132165
Reviewed-on: https://review.whamcloud.com/32964
Reviewed-by: Alex Zhuravlev <bzzz@whamcloud.com>
Tested-by: Jenkins
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: John L. Hammond <jhammond@whamcloud.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11226 flr: mirror resync regression 68/32968/5
Bobi Jam [Thu, 9 Aug 2018 06:35:49 +0000 (14:35 +0800)]
LU-11226 flr: mirror resync regression

There is a glitch in the lfs mirror resync tool in commit
0e5c12ac29a9622e8ca05d5e39cd5e2a721ace93: the resync write needs to be
restricted to the component's extent.
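
As a sketch of what restricting a write to an extent means, the helper
below clamps a write of count bytes at position pos to a half-open
extent [start, end). The structure and function are hypothetical and
are not taken from the lfs source.

```c
#include <stdint.h>

/* Hypothetical component extent, half-open: [start, end). */
struct ex_extent {
	uint64_t start;
	uint64_t end;
};

/*
 * Return how many of the requested bytes may be written at pos
 * without leaving the extent. Returns 0 if pos is outside it.
 */
static uint64_t ex_clamp_write(const struct ex_extent *ext,
			       uint64_t pos, uint64_t count)
{
	uint64_t room;

	if (pos < ext->start || pos >= ext->end)
		return 0;

	room = ext->end - pos;
	return count < room ? count : room;
}
```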

Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Change-Id: Ifbd3f16b2f621407b31c7fe37ce9745de48fcc99
Reviewed-on: https://review.whamcloud.com/32968
Tested-by: Jenkins
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11146 lustre: fix setstripe for specific osts upon dir 14/32814/16
Wang Shilong [Wed, 11 Jul 2018 14:11:47 +0000 (22:11 +0800)]
LU-11146 lustre: fix setstripe for specific osts upon dir

The LOV_USER_MAGIC_SPECIFIC function is broken and was not
available for directories.

1) llite doesn't handle the LOV_USER_MAGIC_SPECIFIC case
properly for dir {set,get}_stripe, and the ioctl
LL_IOC_LOV_SETSTRIPE did not allocate a large enough buffer
to copy the OST list from userspace.

2) lod_get_default_lov_striping() did not handle the
LOV_USER_MAGIC_SPECIFIC type, so newly created
files/dirs did not inherit the parent setting correctly.

3) There is no test case covering the lfs setstripe
'-o' interface, which makes it hard to figure out
when this function was broken.

Change-Id: Icc2ee60a474e5e565db12b35a9a38fde65b05bbd
Signed-off-by: Wang Shilong <wshilong@ddn.com>
Reviewed-on: https://review.whamcloud.com/32814
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Bobi Jam <bobijam@hotmail.com>
Reviewed-by: Jian Yu <yujian@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-8066 llite: move /proc/fs/lustre/llite/uuid to sysfs 01/32501/9
James Simmons [Sun, 29 Jul 2018 14:34:19 +0000 (10:34 -0400)]
LU-8066 llite: move /proc/fs/lustre/llite/uuid to sysfs

Move uuid file from /proc/fs/lustre/llite/*
to /sys/fs/lustre/llite/*/

This is a modified version of

Linux-commit: ec55a6299990efa969dfc00d95c72444ff1e3461

due to the large amount of changes to the OpenSFS/Intel branch.

Change-Id: I2dc13c248879f554f9f7ed6dc62a6772a59f6f35
Signed-off-by: Oleg Drokin <green@linuxhacker.ru>
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Signed-off-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-on: https://review.whamcloud.com/32501
Reviewed-by: Bobi Jam <bobijam@hotmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
2 years agoLU-8215 tests: sanity-benchmark/iozone should wait for space recovery 99/20499/2
Alex Zhuravlev [Mon, 30 May 2016 10:45:51 +0000 (14:45 +0400)]
LU-8215 tests: sanity-benchmark/iozone should wait for space recovery

Otherwise it may fail due to a transient state where the space consumed
by the previous run hasn't been recovered yet. This happens on the tiny
filesystems used in local setups.

Change-Id: I04b3ce096621583629277c1e52c64a1551bc8ace
Signed-off-by: Alex Zhuravlev <alexey.zhuravlev@intel.com>
Reviewed-on: https://review.whamcloud.com/20499
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: James Nunez <jnunez@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11201 lfsck: check linkea entry validity 58/32958/2
Lai Siyao [Sun, 22 Jul 2018 21:45:23 +0000 (05:45 +0800)]
LU-11201 lfsck: check linkea entry validity

Invalid linkea data may lead to an infinite loop in linkea iteration.
Check linkea entry validity on unpack, and if the entry is not unpacked,
check entry length validity.
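
The length check can be illustrated with a userspace sketch over length-prefixed records. The record layout here (a 2-byte big-endian total length followed by payload) and the SKETCH_MIN_RECLEN bound are assumptions chosen for illustration, not the real linkea definitions.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical minimum record size: 2-byte length plus 1 payload byte. */
#define SKETCH_MIN_RECLEN 3

/*
 * Count length-prefixed records in buf. Each record starts with a
 * 2-byte big-endian total length. Returns the record count, or
 * -EINVAL if a length is too small to make progress or overruns buf.
 */
static int sketch_count_entries(const uint8_t *buf, size_t len)
{
    size_t off = 0;
    int count = 0;

    assert(buf != NULL || len == 0);
    while (off < len) {
        size_t reclen;

        if (len - off < 2)
            return -EINVAL;     /* truncated length field */
        reclen = ((size_t)buf[off] << 8) | buf[off + 1];
        /* reclen == 0 would never advance: the infinite-loop case */
        if (reclen < SKETCH_MIN_RECLEN || reclen > len - off)
            return -EINVAL;
        off += reclen;
        count++;
    }
    return count;
}
```

Without the reclen check, a corrupted zero length would leave off unchanged and the loop would never terminate.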

Test-Parameters: trivial mdscount=2 mdtcount=4 testlist=sanity-lfsck
Signed-off-by: Lai Siyao <lai.siyao@whamcloud.com>
Change-Id: I8e1890ed64fab38b85149ebbfecce04caaf41e17
Reviewed-on: https://review.whamcloud.com/32958
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Hongchao Zhang <hongchao@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11154 llite: use proper flags for FS_IOC_{FSSET,FSGET}XATTR 28/32828/6
Wang Shilong [Wed, 18 Jul 2018 08:30:28 +0000 (16:30 +0800)]
LU-11154 llite: use proper flags for FS_IOC_{FSSET,FSGET}XATTR

Two problems addressed by this patch:

1) struct fsxattr fsx_xflags has its own flag definitions,
FS_XFLAG_XXX, so we should use the proper conversion macro for
it. Here we used the wrong constant for the project inherit flag.

2) FS_XFLAG_PROJINHERIT is not a valid VFS inode flag. Looking
at the current Linux code, local filesystems set the project inherit
flag in their private flags; we should do the same in Lustre.
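
A small sketch of why the two flag namespaces need an explicit conversion. The SKETCH_* constants are local copies that, to my knowledge, mirror FS_XFLAG_PROJINHERIT and FS_PROJINHERIT_FL from linux/fs.h; the helper names are hypothetical and are not the macros used in the patch.

```c
#include <assert.h>
#include <stdint.h>

/* Local copies for a self-contained sketch; believed to match linux/fs.h. */
#define SKETCH_XFLAG_PROJINHERIT 0x00000200u  /* fsxattr.fsx_xflags bit */
#define SKETCH_PROJINHERIT_FL    0x20000000u  /* inode flags bit */

/* Translate fsx_xflags bits into inode flag bits (project inherit only). */
static uint32_t sketch_xflags_to_iflags(uint32_t xflags)
{
    uint32_t iflags = 0;

    if (xflags & SKETCH_XFLAG_PROJINHERIT)
        iflags |= SKETCH_PROJINHERIT_FL;
    return iflags;
}

/* Translate inode flag bits back into fsx_xflags bits. */
static uint32_t sketch_iflags_to_xflags(uint32_t iflags)
{
    uint32_t xflags = 0;

    if (iflags & SKETCH_PROJINHERIT_FL)
        xflags |= SKETCH_XFLAG_PROJINHERIT;
    return xflags;
}
```

Because the two bits differ, testing fsx_xflags against the inode flag constant silently never matches, which is the kind of mix-up item 1 describes.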

Test-Parameters: trivial testlist=sanity-quota,sanity-quota,sanity-quota
Change-Id: I453db8ed074e8008f0ec145c726d7577121422e6
Signed-off-by: Wang Shilong <wshilong@ddn.com>
Reviewed-on: https://review.whamcloud.com/32828
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Olaf Weber <olaf.weber@hpe.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-9120 lnet: LNet Health/Resiliency Feature
Oleg Drokin [Tue, 21 Aug 2018 16:15:26 +0000 (12:15 -0400)]
LU-9120 lnet: LNet Health/Resiliency Feature

The LNet Health/Resiliency feature adds the ability for LNet
to try out different interfaces available to it if message
sending fails. It maintains the health of each remote and local
interface and selects the best local interface to send from and the
best remote interface to send to.

Merge commit '958ef71f33fa925e6657f9902702cd3677e15ec9'

Change-Id: I9ca740654c48d642fe130f98a60c5c59b9b4ebe1
Signed-off-by: Amir Shehata <ashehata@whamcloud.com>
2 years agoLU-10686 tests: stop running sanity-pfl test 9 45/32945/2
James Nunez [Mon, 6 Aug 2018 21:26:25 +0000 (15:26 -0600)]
LU-10686 tests: stop running sanity-pfl test 9

sanity-pfl test 9 consistently fails when run on a Lustre
file system with a single MDS. We need to add test 9 to
the ALWAYS_EXCEPT list and, thus, stop running the test
until a fix for the underlying problem can be found.

Test-Parameters: trivial mdscount=1 mdtcount=1 testlist=sanity-pfl
Test-Parameters: mdscount=2 mdtcount=2 testlist=sanity-pfl
Signed-off-by: James Nunez <jnunez@whamcloud.com>
Change-Id: Ife4b3c044e2777bb9b9010e0be7c00549a683fdc
Reviewed-on: https://review.whamcloud.com/32945
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Wei Liu <sarah@whamcloud.com>
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11200 libcfs: handle DECLARE_TIMER reduced to two arguments 39/32939/5
James Simmons [Mon, 6 Aug 2018 17:56:55 +0000 (13:56 -0400)]
LU-11200 libcfs: handle DECLARE_TIMER reduced to two arguments

In the Linux kernel there exist two ways to initialize a
struct timer_list. One method is with setup_timer() and the other is
with the DEFINE_TIMER macro. In earlier kernels both methods used
callbacks with an argument of type unsigned long. In kernels 4.15+
both methods of initialization use a struct timer_list pointer as the
callback argument. During the 4.14 development phase
setup_timer() used struct timer_list as the argument for its callback
but DEFINE_TIMER was still using unsigned long. Additionally, when
DEFINE_TIMER did move to using struct timer_list it reduced the number
of arguments to the macro. This patch handles the 4.14 kernel state of
development for the timer API.
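
The two callback styles can be mimicked in userspace as below. This is a mock for illustration only: struct sketch_timer, struct sketch_device and sketch_from_timer() are hypothetical and do not reproduce the kernel headers or the libcfs compatibility macros.

```c
#include <assert.h>
#include <stddef.h>

/* Userspace mock of the newer API: the callback receives the timer itself. */
struct sketch_timer {
    void (*function)(struct sketch_timer *);
};

/* Hypothetical object that embeds a timer, as kernel code usually does. */
struct sketch_device {
    int expired;
    struct sketch_timer timer;
};

/* Recover the enclosing object from the embedded timer (container_of idiom). */
#define sketch_from_timer(ptr, type, member) \
    ((type *)((char *)(ptr) - offsetof(type, member)))

/* New-style callback: takes a pointer to the timer. */
static void sketch_dev_timeout(struct sketch_timer *t)
{
    struct sketch_device *dev =
        sketch_from_timer(t, struct sketch_device, timer);

    dev->expired++;
}

/* Old-style callback: an opaque unsigned long carries the object pointer. */
static void sketch_dev_timeout_legacy(unsigned long data)
{
    sketch_dev_timeout(&((struct sketch_device *)data)->timer);
}
```

A compatibility layer then only has to pick which of the two entry points to register, depending on what the build-time checks detect.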

Test-Parameters: trivial

Change-Id: I1c509838153328ed4bbdfa50468a396e13037d50
Signed-off-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-on: https://review.whamcloud.com/32939
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Li Dongyang <dongyangli@ddn.com>
Reviewed-by: Yang Sheng <ys@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11014 mdc: remove obsolete intent opcodes 61/32361/6
John L. Hammond [Fri, 11 May 2018 17:04:02 +0000 (12:04 -0500)]
LU-11014 mdc: remove obsolete intent opcodes

In enum ldlm_intent_flags, remove the obsolete constants IT_UNLINK,
IT_TRUNC, IT_EXEC, IT_PIN, IT_SETXATTR. Remove any handling code for
these opcodes.

Signed-off-by: John L. Hammond <john.hammond@intel.com>
Change-Id: I66f20e4c881cb77a481805a148a33f1c2daa5f0c
Reviewed-on: https://review.whamcloud.com/32361
Reviewed-by: Fan Yong <fan.yong@intel.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Mike Pershin <mpershin@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-8066 lod: migrate from proc to sysfs 98/32198/6
James Simmons [Sat, 28 Jul 2018 15:54:38 +0000 (11:54 -0400)]
LU-8066 lod: migrate from proc to sysfs

Move the lod module from using proc for most single value files
to sysfs. Create the default attrs for dt_devices which can be
used for other server side devices.

Change-Id: I734f01ef0d9f0c18efc141c835e4cf8ad2365250
Signed-off-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-on: https://review.whamcloud.com/32198
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Sonia Sharma <sharmaso@whamcloud.com>
2 years agoLU-11121 mdt: take discard lock at cleanup stage 30/29930/21
Mikhal Pershin [Fri, 3 Nov 2017 09:38:04 +0000 (12:38 +0300)]
LU-11121 mdt: take discard lock at cleanup stage

Call mdt_dom_check_and_discard() after mdt_object_unlock() to
avoid possible deadlock if some third lock is conflicting with
both like in the scenario below:
 thread1: mdt_object_lock() with some bits
 thread2: take conflicting lock and wait
 thread1: mdt_dom_check_and_discard() with bits conflicting
          with thread2 causes deadlock.

The patch also enables the DoM layout in racer to test it on a regular
basis. Another minor update uses 'trap' in related tests.

Test-Parameters: mdssizegb=20 mdtcount=1 mdscount=1 testlist=sanity-dom,dom-performance,racer,racer,racer
Signed-off-by: Mikhail Pershin <mpershin@whamcloud.com>
Change-Id: I63bedabb4a82cfa2f01e126d35dc8c2a89d64f56
Reviewed-on: https://review.whamcloud.com/29930
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Alex Zhuravlev <bzzz@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11175 osc: serialize access to idle_timeout vs cleanup 83/32883/4
Alex Zhuravlev [Thu, 26 Jul 2018 07:52:38 +0000 (11:52 +0400)]
LU-11175 osc: serialize access to idle_timeout vs cleanup

Use LPROCFS_CLIMP_CHECK() and LPROCFS_CLIMP_EXIT(), as cl_import
can disappear due to umount.

Change-Id: I2a067f416691f39cde13cfae8f64ed5769d92041
Signed-off-by: Alex Zhuravlev <bzzz@whamcloud.com>
Reviewed-on: https://review.whamcloud.com/32883
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
2 years agoLU-6142 obdclass: Fix style issues for acl.c 51/32851/5
Arshad Hussain [Sun, 22 Jul 2018 03:00:27 +0000 (08:30 +0530)]
LU-6142 obdclass: Fix style issues for acl.c

This patch fixes issues reported by checkpatch
for file lustre/obdclass/acl.c

Change-Id: I00d4535123fb6677863bfd10937df5039ee7a339
Test-Parameters: trivial
Signed-off-by: Arshad Hussain <arshad.super@gmail.com>
Reviewed-on: https://review.whamcloud.com/32851
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
2 years agoLU-6142 osd-ldiskfs: Fix style issues for osd_iam_lfix.c 49/32849/6
Arshad Hussain [Sat, 21 Jul 2018 19:35:19 +0000 (01:05 +0530)]
LU-6142 osd-ldiskfs: Fix style issues for osd_iam_lfix.c

This patch fixes issues reported by checkpatch
for file lustre/osd-ldiskfs/osd_iam_lfix.c

Change-Id: I9d32231e397689dd3806fecf106bc1ce2f1439a4
Test-Parameters: trivial
Signed-off-by: Arshad Hussain <arshad.super@gmail.com>
Reviewed-on: https://review.whamcloud.com/32849
Tested-by: Jenkins
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: James Simmons <uja.ornl@yahoo.com>
2 years agoLU-11116 llog: error handling cleanup 80/32780/2
Alexander Boyko [Wed, 4 Jul 2018 10:41:52 +0000 (06:41 -0400)]
LU-11116 llog: error handling cleanup

llog_cat_new_log() needs some error handling cleanup.
Save and restore the thread's lgi_cookie when using it, to prevent
conflicts/corruption with llog_process_thread().

Signed-off-by: Alexander Boyko <c17825@cray.com>
Change-Id: I12fdfe1a72e77cfeb5ad464b8582db68a7bcfe16
Cray-bug-id: LUS-4780
Reviewed-on: https://review.whamcloud.com/32780
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andriy Skulysh <c17819@cray.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Mike Pershin <mpershin@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
2 years agoLU-11224 obd: use correct ip_compute_csum() version 53/32953/2
James Simmons [Tue, 7 Aug 2018 17:20:54 +0000 (13:20 -0400)]
LU-11224 obd: use correct ip_compute_csum() version

The Linux kernel provides a generic platform-independent version
of ip_compute_csum() as well as platform-optimized versions. Some
platforms disable the generic version in favor of
the optimized one. If the generic version is disabled and
the checksum.h header from asm-generic is used, we end up
with an undefined symbol error when loading the obdclass module.
The solution is to use the platform-specific checksum.h header,
which handles using the generic or optimized version for us.
As a bonus we get better performance with the right kernel
configuration.

Test-Parameters: trivial

Change-Id: Ia0cfc9f4363bb61d5e381790655423ff5f91d9be
Signed-off-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-on: https://review.whamcloud.com/32953
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Li Xi <lixi@ddn.com>
Reviewed-by: Li Dongyang <dongyangli@ddn.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
2 years agoLU-9325 ptlrpc: replace simple_strtol with kstrtol 85/32785/8
James Simmons [Thu, 5 Jul 2018 03:56:02 +0000 (23:56 -0400)]
LU-9325 ptlrpc: replace simple_strtol with kstrtol

Eventually simple_strtol() will be removed, so replace its use in
ptlrpc with the kstrtoXXX() class of functions.
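
The practical difference is stricter error reporting. A userspace analogue built on strtol() might look like the sketch below; sketch_strict_strtol() is a hypothetical helper, not the kernel function, and it differs in details (for example, strtol() skips leading whitespace).

```c
#include <assert.h>
#include <errno.h>
#include <stdlib.h>

/*
 * Userspace analogue of a kstrtol()-style parser: the whole string must
 * be a number (one trailing newline is tolerated), and failures are
 * reported through the return value instead of being silently ignored.
 */
static int sketch_strict_strtol(const char *s, int base, long *res)
{
    char *end;
    long val;

    assert(s != NULL && res != NULL);
    errno = 0;
    val = strtol(s, &end, base);
    if (end == s)
        return -EINVAL;     /* no digits at all */
    if (errno == ERANGE)
        return -ERANGE;     /* overflow or underflow */
    if (*end == '\n')
        end++;
    if (*end != '\0')
        return -EINVAL;     /* trailing garbage such as "12abc" */
    *res = val;
    return 0;
}
```

A simple_strtol()-style call would accept "12abc" as 12, whereas this form rejects it and leaves *res untouched.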

Change-Id: I41b44c5dc329832a901c1772a9ba0608df30282a
Signed-off-by: James Simmons <uja.ornl@yahoo.com>
Reviewed-on: https://review.whamcloud.com/32785
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Nikitas Angelinas <nikitas.angelinas@gmail.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
2 years agoLU-9120 lnet: LNet Health/Resiliency Feature 23/33023/1
Amir Shehata [Sat, 18 Aug 2018 01:23:53 +0000 (18:23 -0700)]
LU-9120 lnet: LNet Health/Resiliency Feature

The LNet Health/Resiliency feature adds the ability for LNet
to try out different interfaces available to it if message
sending fails. It maintains the health of each remote and local
interface and selects the best local interface to send from and the
best remote interface to send to.
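
As a rough illustration of health-based selection, the sketch below keeps one health value per interface and picks the highest. The struct, the step size and the tie-breaking rule are all assumptions made for this example and do not describe the actual LNet selection algorithm.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical per-interface record; not the real LNet structures. */
struct sketch_ni {
    const char *name;
    int health;     /* higher means healthier */
};

/* Return the index of the healthiest interface, or -1 if the list is empty.
 * Ties go to the earliest entry. */
static int sketch_pick_best(const struct sketch_ni *nis, size_t count)
{
    size_t i;
    int best = -1;

    for (i = 0; i < count; i++)
        if (best < 0 || nis[i].health > nis[best].health)
            best = (int)i;
    return best;
}

/* Lower the health of an interface after a failed send (floor at 0). */
static void sketch_note_failure(struct sketch_ni *ni)
{
    assert(ni != NULL);
    if (ni->health > 0)
        ni->health--;
}
```

After a failure on the current best interface, the next call to sketch_pick_best() naturally moves on to a different one.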

Signed-off-by: Amir Shehata <ashehata@whamcloud.com>
Change-Id: Ibcbbc34f8acfc3afb36ffe73eb27d69c147d02ce

2 years agoLU-9120 lnet: health error simulation 51/32951/13
Amir Shehata [Sun, 5 Aug 2018 21:37:29 +0000 (14:37 -0700)]
LU-9120 lnet: health error simulation

Modified the error simulation code to simulate health errors for
testing purposes. The specific error can be set. If multiple
errors are configured then one at random is chosen from the set.

EX:
lctl net_drop_add -s *@tcp -d *@tcp -m GET -i 1 -e local_interrupt

The -e option can be repeated multiple times to specify different
errors to simulate. The available errors are:
local_interrupt
local_dropped
local_aborted
local_no_route
local_error
local_timeout
remote_error
remote_dropped
remote_timeout
network_timeout
random

A -n ("--random") option has been added to randomize error generation
for drop rules. It relies on the interval value provided via -i: a
random number no bigger than the interval is generated. If the number
is smaller than half of the interval then the rule isn't matched,
otherwise it is.
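
The matching rule can be written down as a short sketch. The function names are hypothetical, rand() stands in for whatever random source the kernel code uses, and integer halving of the interval is an assumption.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdlib.h>

/* Draw a number in [0, interval]; rand() is a stand-in random source. */
static unsigned int sketch_draw(unsigned int interval)
{
    return (unsigned int)rand() % (interval + 1u);
}

/*
 * Decide whether a drop rule matches for a given draw: values below
 * half of the interval do not match, everything else does.
 */
static bool sketch_random_match(unsigned int draw, unsigned int interval)
{
    assert(draw <= interval);
    return draw >= interval / 2;
}
```

Each call site on the send path would make its own draw, so the decision does not depend on how many times matching has already been attempted for the message.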

This is needed because drop matching can happen multiple
times in the path of sending the message, and using a time-based
or rate-based rule will not result in even error generation across the
multiple calls.

Signed-off-by: Amir Shehata <ashehata@whamcloud.com>
Change-Id: If070e29f68c3de10100a9d5eaa49d10cdb76a59a
Reviewed-on: https://review.whamcloud.com/32951
Tested-by: Jenkins
Reviewed-by: Sonia Sharma <sharmaso@whamcloud.com>
Reviewed-by: Olaf Weber <olaf.weber@hpe.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
2 years agoLU-9120 lnet: print recovery queues content 50/32950/12
Amir Shehata [Sun, 5 Aug 2018 21:25:47 +0000 (14:25 -0700)]
LU-9120 lnet: print recovery queues content

Add commands to lnetctl to print recovery queues content from
user space.

Associated code to handle the IOCTL is added in LNet module.

for local NIs:
lnetctl debug recovery --local

for peer NIs:
lnetctl debug recovery --peer

Test-Parameters: forbuildonly
Signed-off-by: Amir Shehata <ashehata@whamcloud.com>
Change-Id: Id136d506772d95381fd5d8346d772177442a84fb
Reviewed-on: https://review.whamcloud.com/32950
Tested-by: Jenkins
Reviewed-by: Sonia Sharma <sharmaso@whamcloud.com>
Reviewed-by: Olaf Weber <olaf.weber@hpe.com>
2 years agoLU-9120 lnet: add global health statistics 49/32949/12
Amir Shehata [Sun, 5 Aug 2018 21:16:49 +0000 (14:16 -0700)]
LU-9120 lnet: add global health statistics

Add global health statistics and print them from lnetctl:

lnetctl stats show

lnet_selftest passes the statistics block over the wire. This,
unfortunately, creates an unnecessary backwards compatibility link
for lnet_selftest, which shouldn't be there. This patch breaks
this backwards compatibility, which means lnet_selftest will
not work with older selftest modules.

Test-Parameters: forbuildonly
Signed-off-by: Amir Shehata <ashehata@whamcloud.com>
Change-Id: I4a171c4f3cf13a1e8ab0d607d3b328352f727380
Reviewed-on: https://review.whamcloud.com/32949
Reviewed-by: Olaf Weber <olaf.weber@hpe.com>
Tested-by: Jenkins
Reviewed-by: Sonia Sharma <sharmaso@whamcloud.com>
2 years agoLU-9120 lnet: set health value from user space 63/32863/14
Amir Shehata [Tue, 24 Jul 2018 00:11:07 +0000 (17:11 -0700)]
LU-9120 lnet: set health value from user space

Add commands to lnetctl to set the health value.

for local NIs:
 lnetctl net set --nid <nid> --health <value>

for peer NIs:
 lnetctl peer set --nid <nid> --health <value>

Test-Parameters: forbuildonly
Signed-off-by: Amir Shehata <ashehata@whamcloud.com>
Change-Id: I06e1238df54c94bcfecadd84fbaa30cc1ce4dd68
Reviewed-on: https://review.whamcloud.com/32863
Tested-by: Jenkins
Reviewed-by: Sonia Sharma <sharmaso@whamcloud.com>
Reviewed-by: Olaf Weber <olaf.weber@hpe.com>
2 years agoLU-9120 lnet: show peer ni health stats 83/32783/15
Amir Shehata [Wed, 4 Jul 2018 18:49:38 +0000 (11:49 -0700)]
LU-9120 lnet: show peer ni health stats

Added another section in the peer ni show output for the health
statistics.

Test-Parameters: forbuildonly
Signed-off-by: Amir Shehata <ashehata@whamcloud.com>
Change-Id: I7ab3a9343972622d90a984c4f8c0b096b15ecbdc
Reviewed-on: https://review.whamcloud.com/32783
Reviewed-by: Olaf Weber <olaf.weber@hpe.com>
Reviewed-by: Sonia Sharma <sharmaso@whamcloud.com>
Tested-by: Jenkins