Whamcloud - gitweb
fs/lustre-release.git
Jian Yu [Fri, 8 Nov 2013 07:18:34 +0000 (15:18 +0800)]
LU-4219 mdd: limit os_namelen to the max of NAME_MAX

On an ldiskfs-based Lustre filesystem, pathconf(".", _PC_NAME_MAX)
returns 255 (NAME_MAX). On a zfs-based Lustre filesystem,
however, it returns 256 (MAXNAMELEN).
Since Lustre cannot handle names longer than
NAME_MAX, this patch caps the value of os_namelen in
mdd_statfs() at NAME_MAX.
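
The cap described above can be sketched in a few lines of C. This is
only an illustration: clamp_namelen() is a hypothetical helper, not the
actual mdd_statfs() code, and the NAME_MAX fallback is an assumption for
build environments where <limits.h> does not expose it.

```c
#include <limits.h>	/* NAME_MAX on POSIX systems */

#ifndef NAME_MAX
#define NAME_MAX 255	/* assumed fallback, matches the Linux value */
#endif

/* Hypothetical helper: cap the name length reported by the backing
 * filesystem (e.g. 256 from ZFS) at NAME_MAX. */
unsigned int clamp_namelen(unsigned int backend_namelen)
{
	if (backend_namelen > (unsigned int)NAME_MAX)
		return (unsigned int)NAME_MAX;
	return backend_namelen;
}
```

With this, a backend value of 256 is reported as NAME_MAX, while
smaller values pass through unchanged.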

Test-Parameters: envdefinitions=SLOW=yes \
mdtfilesystemtype=zfs mdsfilesystemtype=zfs ostfilesystemtype=zfs \
ostcount=2 testlist=posix

Signed-off-by: Jian Yu <jian.yu@intel.com>
Change-Id: I13b5398eff5b15c74b0e08e2a3394b79a3ff6a44
Reviewed-on: http://review.whamcloud.com/8217
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
Bruno Faccini [Fri, 8 Nov 2013 08:24:21 +0000 (09:24 +0100)]
LU-4093 tests: prevent zombie requests when stopping CT

Prevent in-flight requests from becoming zombies when the copytool
is stopped, which sometimes caused failures in subsequent
sanity-hsm sub-tests.

Signed-off-by: Bruno Faccini <bruno.faccini@intel.com>
Change-Id: I3274e5864469251e6f2040377b84429e4f218f47
Reviewed-on: http://review.whamcloud.com/8157
Tested-by: Jenkins
Reviewed-by: John L. Hammond <john.hammond@intel.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: jacques-Charles Lafoucriere <jacques-charles.lafoucriere@cea.fr>
Reviewed-by: Aurelien Degremont <aurelien.degremont@cea.fr>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
wang di [Thu, 31 Oct 2013 07:01:15 +0000 (00:01 -0700)]
LU-4181 tests: disable lnet_selftest.sh for DNE.

Disable lnet_selftest for DNE to make review pass with DNE.

Signed-off-by: wang di <di.wang@intel.com>
Change-Id: Ie4abd794d0184955af84752f0a9c64181b85f1ec
Reviewed-on: http://review.whamcloud.com/8130
Reviewed-by: Doug Oucharek <doug.s.oucharek@intel.com>
Tested-by: Jenkins
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Tags: 2.5.51, v2_5_51, v2_5_51_0
Oleg Drokin [Wed, 6 Nov 2013 07:02:57 +0000 (02:02 -0500)]
New tag 2.5.51

Change-Id: Ib4389ce656c878704b329472ab3a848e1aff6b53

wang di [Tue, 29 Oct 2013 07:01:22 +0000 (00:01 -0700)]
LU-4149 tests: disable sanity_scrub(7, 15) for dne test.

Disable sanity_scrub tests 7 and 15 for DNE testing, so that DNE
runs pass on Maloo for now.

Signed-off-by: wang di <di.wang@intel.com>
Change-Id: Ie6553f44655c8f38451ed2cb1086e5848ca033a4
Reviewed-on: http://review.whamcloud.com/8101
Tested-by: Jenkins
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
Andreas Dilger [Tue, 1 Oct 2013 20:20:07 +0000 (14:20 -0600)]
LU-20 kernel: remove obsolete kernel patches

The jbd2-jcberr and lustre_version patches have been obsolete since
Lustre 2.2.  They were kept around to allow transitioning between
Lustre 1.8/2.1 and 2.4/2.5 but can be removed from Lustre 2.6, since
2.6 does not allow a direct upgrade path anyway.

Signed-off-by: Andreas Dilger <andreas.dilger@intel.com>
Change-Id: I5406272fe22580e48679c18dea562001d23ebbe5
Reviewed-on: http://review.whamcloud.com/7881
Reviewed-by: James Simmons <uja.ornl@gmail.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Minh Diep <minh.diep@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
Vladimir Saveliev [Mon, 28 Oct 2013 19:20:21 +0000 (23:20 +0400)]
LU-3920 tests: check MDS version before testing HSM feature

Sanity tests 65y and 229 fail when MDS does not have HSM
support. Check MDS version and skip the tests in that case.

Xyratex-bug-id: MRP-1417
Signed-off-by: Vladimir Saveliev <vladimir_saveliev@xyratex.com>
Change-Id: I80496ac67ec1b6b8032f465a0b5914bb12839d3b
Reviewed-on: http://review.whamcloud.com/8121
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
Reviewed-by: Faccini Bruno <bruno.faccini@intel.com>
Fan Yong [Wed, 4 Sep 2013 14:32:51 +0000 (22:32 +0800)]
LU-3335 osd: self repair LMA by the RPC service thread

The current OI scrub uses the FID in the LMA to locate the OI mapping
and repair inconsistent OI mappings. Such inode-table-based
scanning does not guarantee that every OI inconsistency can be repaired:
it cannot handle an inode without an LMA or with a corrupted LMA, and it
cannot find dummy OI mappings (the object for the OI mapping does not exist).

In principle, after the inode-table-based scan, the OI scrub could
make another OI-table-based scan to resolve the issues mentioned
above. But such a scan would be nonlinear, so performance may
be bad.

For now, there is a relatively simple solution, although
it is not perfect. The RPC service thread can fix an
inconsistency it finds by itself: when it verifies the LMA against the
given FID and finds no LMA, it generates an LMA from the
given FID and stores the new LMA as the inode xattr. An inode
without an LMA is a rare case, so this additional work
for the RPC service thread will not affect performance much.

Signed-off-by: Fan Yong <fan.yong@intel.com>
Change-Id: Ic6b8b533e56b6a0e388d5c38de7b113cb9bd6dad
Reviewed-on: http://review.whamcloud.com/7144
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Alex Zhuravlev <alexey.zhuravlev@intel.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Reviewed-by: Mike Pershin <mike.pershin@intel.com>
Bruno Faccini [Thu, 31 Oct 2013 23:50:27 +0000 (00:50 +0100)]
LU-3973 tests: use -P option of df to avoid line breaks

The df command sometimes breaks the output for one file system
across lines to keep columns aligned. This causes field parsing
to fail. Use the -P option to select the POSIX (portable) output
format, which ensures a single line per file system.

Signed-off-by: Bruno Faccini <bruno.faccini@intel.com>
Change-Id: Ibc9d4b8f50857e680a5590dbbfe312fce137bcb6
Reviewed-on: http://review.whamcloud.com/7915
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: jacques-Charles Lafoucriere <jacques-charles.lafoucriere@cea.fr>
Reviewed-by: James Nunez <james.a.nunez@intel.com>
Vladimir Saveliev [Tue, 29 Oct 2013 16:01:23 +0000 (20:01 +0400)]
LU-2479 ldiskfs: do not check dir max size for regular files

ldiskfs_append() is used not only to extend a directory but also to
maintain an iam container. In the latter case ldiskfs_append() should not
check the max directory size. The iam container is distinguished by
being a regular file.

Signed-off-by: Vladimir Saveliev <vladimir_saveliev@xyratex.com>
Change-Id: I205b0b1149d7bd477b3965ce68a546dfd9dc6a6f
Reviewed-on: http://review.whamcloud.com/8137
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Alexey Lyashkov <alexey_lyashkov@xyratex.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Andreas Dilger [Thu, 10 Oct 2013 18:32:34 +0000 (12:32 -0600)]
LU-2528 misc: quiet warning for 2.1/2.5 connections

The Lustre 2.5.4 client will print a warning about connections with
2.1.3 servers, yet they are still supposed to be interoperable.
Increase the window of warning to be up to Lustre 2.5.50, since we
do not intend to allow interoperability between 2.1 and 2.6 systems.
The warning dates from the time when major releases like Lustre 1.4 and
1.8 were many years apart and had no chance of interoperating.

Only print this message once per client, to avoid flooding the console
for connections to many servers or frequent network reconnections.
Server versions should all be nearly the same in any case.
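
The two ideas in this message, a warning window and a print-once
guard, can be sketched as below. This is not the Lustre code:
VERSION_CODE(), warn_version_mismatch() and the max_offset parameter are
hypothetical names, and the version packing is an assumption chosen only
to make the comparison easy.

```c
#include <stdbool.h>
#include <stdio.h>

/* Assumed packing of major.minor.patch into one integer. */
#define VERSION_CODE(major, minor, patch) \
	(((unsigned int)(major) << 16) | ((unsigned int)(minor) << 8) | \
	 (unsigned int)(patch))

/* Warn at most once per process when the client and server versions
 * differ by more than max_offset. Returns true if a warning was printed. */
bool warn_version_mismatch(unsigned int client, unsigned int server,
			   unsigned int max_offset)
{
	static bool warned;	/* print-once guard */
	unsigned int diff = client > server ? client - server
					    : server - client;

	if (diff <= max_offset || warned)
		return false;

	warned = true;
	fprintf(stderr, "server version %u.%u.%u differs too much from "
		"client version %u.%u.%u\n",
		server >> 16, (server >> 8) & 0xff, server & 0xff,
		client >> 16, (client >> 8) & 0xff, client & 0xff);
	return true;
}
```

Widening the window then amounts to passing a larger max_offset, and
reconnections to many servers produce a single console message.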

Signed-off-by: Andreas Dilger <andreas.dilger@intel.com>
Change-Id: I503523f80144c00c5ea1d312cc9204d9123ebbe5
Reviewed-on: http://review.whamcloud.com/7916
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
Reviewed-by: Jian Yu <jian.yu@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
Li Xi [Sat, 31 Aug 2013 13:39:36 +0000 (06:39 -0700)]
LU-3858 test: fix sanity 27A failure caused by stripe delay

When the stripe of the root directory is cleared, the stripe update
might be delayed by the processing of the config log.
This causes sanity test 27A to fail.

Make the test suite wait for the update of the root directory
stripe.

Signed-off-by: Li Xi <lixi@ddn.com>
Change-Id: Ib4f96d9d4d0ae8a0b54e3e306a7abed758dbdf39
Reviewed-on: http://review.whamcloud.com/7495
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Reviewed-by: Faccini Bruno <bruno.faccini@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
Hongchao Zhang [Sun, 21 Jul 2013 21:40:37 +0000 (05:40 +0800)]
LU-2613 recovery: free open/close request promptly

- For a non-create open or a committed open, the open request
  should be freed along with the close request as soon as the
  close is done, regardless of whether the transno of the open/close
  is greater than the last committed transno known by the client.

- Move committed open requests into a dedicated list, which
  avoids scanning a huge replay list on receiving each
  reply (when there are many open files).

Signed-off-by: Niu Yawei <yawei.niu@intel.com>
Signed-off-by: Hongchao Zhang <hongchao.zhang@intel.com>
Change-Id: I1a25a35fe7a16681368d92d16964680b6209b3ee
Reviewed-on: http://review.whamcloud.com/6665
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Alex Zhuravlev <alexey.zhuravlev@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
Dmitry Eremin [Fri, 30 Aug 2013 18:29:50 +0000 (22:29 +0400)]
LU-3968 lbuild: Extend script with build for Xeon Phi card

Automatically download, compile and produce Lustre client RPMs
for Xeon Phi(TM) card if "--mpss-version" option is specified
for contrib/lbuild/lbuild script.

Also try to compile with Xeon Phi(TM) OFED if it's available.

Signed-off-by: Dmitry Eremin <dmitry.eremin@intel.com>
Change-Id: Ida07d764dc824c13f22ffb53d24e2c6f79ce3573
Reviewed-on: http://review.whamcloud.com/7066
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
Reviewed-by: Brian J. Murrell <brian.murrell@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
James Nunez [Thu, 31 Oct 2013 19:43:25 +0000 (13:43 -0600)]
LU-4016 tests: use user and group names in metadata-updates

pdsh eats %u passed to stat executed on remote nodes.  Avoid this
problem by using the user and group names instead of numerical
identifiers in metadata-updates ownership checks.

Test-Parameters: testlist=metadata-updates
Signed-off-by: James Nunez <james.a.nunez@intel.com>
Signed-off-by: Stephen Champion <schamp@sgi.com>
Change-Id: Id0a283d58873bb64772f28cbc28c5a6abad9a48e
Reviewed-on: http://review.whamcloud.com/8052
Tested-by: Jenkins
Reviewed-by: James Simmons <uja.ornl@gmail.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
Nathaniel Clark [Thu, 26 Sep 2013 17:39:49 +0000 (13:39 -0400)]
LU-2887 tests: disable SLOW for ZFS in parallel-scale

Because all ZFS pools are on the same physical device in autotest,
ZFS-backed OSTs run VERY slowly, so disable SLOW.  Also decrease the
load on the ZFS-backed MDS.

Test-Parameters: envdefinitions=SLOW=yes,ENABLE_QUOTA=yes mdtfilesystemtype=zfs mdsfilesystemtype=zfs ostfilesystemtype=zfs testlist=parallel-scale
Signed-off-by: Nathaniel Clark <nathaniel.l.clark@intel.com>
Change-Id: I99d980adcf4259f8431c351ce6fecb1088419130
Reviewed-on: http://review.whamcloud.com/7778
Tested-by: Jenkins
Reviewed-by: Alex Zhuravlev <alexey.zhuravlev@intel.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Jian Yu <jian.yu@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
Emoly Liu [Wed, 9 Oct 2013 15:51:36 +0000 (23:51 +0800)]
LU-4060 test: skip sanityn.sh test_74 on older than 2.4.93 MDS

Interop testing of a 2.5 client with a 2.4 server shows that sanityn.sh
test_74 needs to be skipped for Lustre MDS versions older than 2.4.93.

Test-Parameters: testlist=sanityn clientdistro=el6 clientarch=x86_64 serverdistro=el6 serverarch=x86_64 serverjob=lustre-b2_4 serverbuildno=46
Signed-off-by: Emoly Liu <emoly.liu@intel.com>
Change-Id: I92d29a8db3ed5d145bd7db7bd18f5593ba80576c
Reviewed-on: http://review.whamcloud.com/7911
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Reviewed-by: Jian Yu <jian.yu@intel.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Jinshan Xiong [Mon, 30 Sep 2013 21:24:17 +0000 (14:24 -0700)]
LU-3321 osc: Adjustment on osc LRU for performance

Add and discard pages from LRU in batch.

Signed-off-by: Jinshan Xiong <jinshan.xiong@intel.com>
Change-Id: I90287b73c05066d420a31bff21866dfa1ffec665
Reviewed-on: http://review.whamcloud.com/7890
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Niu Yawei <yawei.niu@intel.com>
Reviewed-by: Lai Siyao <lai.siyao@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
Jinshan Xiong [Mon, 30 Sep 2013 21:09:26 +0000 (14:09 -0700)]
LU-3321 clio: revert LU-2622 for removing global env list

Revert "LU-2622 obdclass: Remove the global cl_env list"

This reverts commit d5390bbde9b525b13ab91f5610abe1c4bb98eff9.

Performance dropped a lot when the memory reclaim process kicked
in, because ll_releasepage() was called to destroy Lustre pages.
Allocating cl_env and keys on the fly turned out to have a large
overhead, so we have to revert this patch.

The problem of LU-2622 will be solved in another patch later on.

Signed-off-by: Jinshan Xiong <jinshan.xiong@intel.com>
Change-Id: If5f9faeabbe78d3a88b25bf986a2c9a2ecc9ba12
Reviewed-on: http://review.whamcloud.com/7888
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Niu Yawei <yawei.niu@intel.com>
Reviewed-by: Lai Siyao <lai.siyao@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
Nathaniel Clark [Mon, 28 Oct 2013 17:56:18 +0000 (13:56 -0400)]
LU-4156 wireshark: Fix build against wireshark 1.10.x

Some of the API calls that were deprecated in 1.6 and 1.8 have been
removed in 1.10.  This patch uses the new API calls (these work in 1.6
and later).

Signed-off-by: Nathaniel Clark <nathaniel.l.clark@intel.com>
Change-Id: I83a469b973007a58d81b2460eec9da418c172b4a
Reviewed-on: http://review.whamcloud.com/8079
Tested-by: Jenkins
Reviewed-by: Doug Oucharek <doug.s.oucharek@intel.com>
Reviewed-by: Amir Shehata <amir.shehata@intel.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
Reviewed-by: Frank Heckes <Frank.Heckes@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
Henri Doreau [Fri, 6 Sep 2013 12:24:09 +0000 (14:24 +0200)]
LU-3971 hsm: Copytool code cleanup

Minor refactoring of the bandwidth controlling code.
Deletion of a superfluous select() call on regular files.

Signed-off-by: Henri Doreau <henri.doreau@cea.fr>
Change-Id: Iae550bb69c1524865b38a92d9b7674fce2f58258
Reviewed-on: http://review.whamcloud.com/7583
Reviewed-by: John L. Hammond <john.hammond@intel.com>
Reviewed-by: jacques-Charles Lafoucriere <jacques-charles.lafoucriere@cea.fr>
Reviewed-by: Faccini Bruno <bruno.faccini@intel.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
yangsheng [Wed, 23 Oct 2013 16:01:48 +0000 (00:01 +0800)]
LU-4118 kernel: kernel update [RHEL6.4 2.6.32-358.23.2.el6]

Update RHEL6.4 kernel to 2.6.32-358.23.2.el6.

Signed-off-by: yang sheng <yang.sheng@intel.com>
Change-Id: Icfe1416916f98c165ec6517d7f7a7a28b0167e4a
Reviewed-on: http://review.whamcloud.com/8053
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
Reviewed-by: Bobi Jam <bobijam@gmail.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
wang di [Thu, 29 Aug 2013 07:00:12 +0000 (00:00 -0700)]
LU-3850 obdecho: create remote dir from echo client

Because the echo client can only talk to the local MDD, it
sends the create-remote-dir request to the MDT where the dir is
located, which differs from the normal remote create
request. This patch fixes a few problems in this process:

1. Do not set the parent during ah_init, which would make the
local OSD access the remote parent.

2. Set the correct attributes for the remote parent in the declare
phase.

3. Set root_fid for the other MDTs, which is needed by echo_client
to resolve the path on non-MDT0.

4. Set child_base_id in oi_id directly to avoid the sanity
check in ostid_set_id; child_base_id is actually
used as the base of the name rather than as a FID.

5. Disable rmdir on a remote MDT for the echo client, i.e. the echo
client can only unlink a directory on the MDT it
is attached to, so that lctl test_rmdir is consistent
with normal rmdir.

Signed-off-by: wang di <di.wang@intel.com>
Change-Id: I16df8f0211afae670bfeea1ba1087a77a6b775da
Reviewed-on: http://review.whamcloud.com/7502
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Reviewed-by: Alex Zhuravlev <alexey.zhuravlev@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
Prakash Surya [Mon, 22 Oct 2012 23:34:29 +0000 (16:34 -0700)]
LU-2139 osc: Use SOFT_SYNC to urge server commit

This change adds a BRW page flag, OBD_BRW_SOFT_SYNC. This flag is
intended to urge a server to commit a client's unstable pages to
stable storage. A client will add this flag to any BRW requests while
it is in a state where it has "many" unstable pages pinned in its cache.

The server, upon receiving a page with this flag set, *should* begin
an async commit. The idea being that, with the proactive async commit,
the client's pinned unstable pages will transition into a stable state
faster than they would have otherwise. Although, the server side portion
of this agreement is still unimplemented, so the OBD_BRW_SOFT_SYNC flag
will currently fall on deaf ears.
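
The client-side decision described above could look roughly like the
following sketch. The flag value, the threshold parameter and the
function name are placeholders for illustration; the real
OBD_BRW_SOFT_SYNC value and the definition of "many" are set by the
Lustre code, not here.

```c
#include <stdint.h>

/* Placeholder bit, NOT the real OBD_BRW_SOFT_SYNC value. */
#define SKETCH_BRW_SOFT_SYNC	(1u << 0)

/* Add the soft-sync hint to a BRW request's flags while the client
 * holds more unstable pages than the (assumed) threshold. */
uint32_t brw_flags_for_request(uint32_t flags, long unstable_pages,
			       long soft_sync_threshold)
{
	if (unstable_pages > soft_sync_threshold)
		flags |= SKETCH_BRW_SOFT_SYNC;
	return flags;
}
```

Because the flag is only a hint, a server that ignores it still
behaves correctly; the pages simply become stable later.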

Signed-off-by: Prakash Surya <surya1@llnl.gov>
Change-Id: I08946a5ec6afab390655643f6d3c8320bf010aea
Reviewed-on: http://review.whamcloud.com/4375
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Jinshan Xiong <jinshan.xiong@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
Prakash Surya [Mon, 22 Oct 2012 20:38:11 +0000 (13:38 -0700)]
LU-2139 osc: Track number of "unstable" pages per osc

This change adds simple accounting hooks for "unstable" pages on a per
OSC basis. Now, in addition to the per filesystem tracking, each OSC
will maintain a running total of its unstable pages. These counters are
exported through the proc interface, and can be read using the lctl
command.

For example:

    # Read number of unstable pages contained by each OSC
    lctl get_param osc.*.unstable_stats

The motivation for this change is in anticipation of implementing a
"soft sync" functionality, urging servers to commit these unstable
pages to stable storage. The per OSC accounting allows a client to
limit the soft sync request to only the OSCs which have outstanding
unstable pages.

Signed-off-by: Prakash Surya <surya1@llnl.gov>
Change-Id: Ife1f90515cb60ba9f85daba5d50483562f2697d1
Reviewed-on: http://review.whamcloud.com/4374
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Jinshan Xiong <jinshan.xiong@intel.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
Prakash Surya [Wed, 2 Oct 2013 21:32:48 +0000 (14:32 -0700)]
LU-2139 osc: Track and limit "unstable" pages

This change adds a global counter to track the number of "unstable"
pages held by a given client, along with per file system counters. An
"unstable" page is defined as a page which has been sent to the server
as part of a bulk request, but is uncommitted to stable storage.

In addition to simply tracking the unstable pages, they now also count
towards the maximum number of "pinned" pages on the system at any given
time. Thus, a client will now be bound on the number of dirty and
unstable pages it can pin in memory. Previously only dirty pages were
accounted for in this limit.
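
As a rough sketch of this accounting rule (the struct and function
names are hypothetical, and the real code uses kernel counters rather
than plain longs), the limit check now sums both kinds of pages:

```c
#include <stdbool.h>

/* Hypothetical per-client page counters. */
struct page_counts {
	long dirty;	/* modified, not yet sent to the server */
	long unstable;	/* sent, but not yet committed on the server */
};

/* A new page may be pinned only while dirty + unstable pages stay
 * below the limit; previously only dirty pages were counted. */
bool may_pin_page(const struct page_counts *c, long max_pinned)
{
	return c->dirty + c->unstable < max_pinned;
}
```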

In addition to tracking the number of unstable pages in Lustre, the
NR_UNSTABLE_NFS memory zone is also incremented and decremented for
easy monitoring using the "NFS_Unstable:" field in /proc/meminfo.
This field is also used internally by the kernel to limit the total
amount of unstable pages on the system.

The motivation for this change is twofold. First, the client must not
allow itself to disconnect from an OST while still holding unstable
pages. Otherwise, these unstable pages can get lost due to an OST
failure, and replay is not possible due to the disconnect via unmount.

Secondly, the client needs a mechanism to prevent it from allocating too
much of its available RAM to unreclaimable pages pinned by the ptlrpc
layer. If this case occurs, out of memory events can trigger as a side
effect, which we need to avoid.

The current number of unstable pages accounted for on a per file system
granularity is exported by the unstable_stats proc file, contained under
each file system's llite namespace. An example of retrieving this
information is below:

$ lctl get_param llite.*.unstable_stats

Signed-off-by: Prakash Surya <surya1@llnl.gov>
Change-Id: I67ab9ee244e6d6081069e3c3d5b2e493ddb97874
Reviewed-on: http://review.whamcloud.com/6284
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Jinshan Xiong <jinshan.xiong@intel.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
Minh Diep [Fri, 18 Oct 2013 20:12:52 +0000 (13:12 -0700)]
LU-4107 build: fix lustre_user.h to C++ compatible

Fix anonymous union.
Fix warnings
Workaround offsetof due to a bug in g++
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=14932

Signed-off-by: Minh Diep <minh.diep@intel.com>
Change-Id: I4c1b8165ab012f4226aabb60009b48063669e271
Reviewed-on: http://review.whamcloud.com/8033
Reviewed-by: John L. Hammond <john.hammond@intel.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Reviewed-by: Dmitry Eremin <dmitry.eremin@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
James Nunez [Mon, 28 Oct 2013 02:50:01 +0000 (20:50 -0600)]
LU-3985 tests: Archive access must be done from an agent

In sanity-hsm, all access to the archive backend must be
done from an agent. Tests 111a and 111b call the
copytool_remove_backend routine that calls find on the
archive backend. With this patch, the call to find is
done on the agent with a call to "do_facet $SINGLEAGT".

Signed-off-by: James Nunez <james.a.nunez@intel.com>
Change-Id: I537fe5e019861908612140b3c96ea2018a738e46
Reviewed-on: http://review.whamcloud.com/8076
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: John L. Hammond <john.hammond@intel.com>
Reviewed-by: Jinshan Xiong <jinshan.xiong@intel.com>
Reviewed-by: Aurelien Degremont <aurelien.degremont@cea.fr>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
Sebastien Buisson [Thu, 3 Oct 2013 07:28:09 +0000 (09:28 +0200)]
LU-4055 build: fix 'error handling' issues

Fix 'error handling' issues found by Coverity version 6.6.1:

- Unchecked return value (CHECKED_RETURN):
  calling a function without checking its return value.

- Argument cannot be negative (NEGATIVE_RETURNS):
  a negative value used as an argument to a function expecting a
  positive value.
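
Both defect classes can be illustrated with a small userspace example.
This is a generic sketch, not code from the patch;
read_file_checked() is a hypothetical helper.

```c
#include <errno.h>
#include <fcntl.h>
#include <unistd.h>

/* Read up to len bytes from path into buf.
 * Returns the number of bytes read, or -errno on failure. */
long read_file_checked(const char *path, char *buf, size_t len)
{
	ssize_t n;
	int saved_errno;
	int fd = open(path, O_RDONLY);

	/* NEGATIVE_RETURNS: open() may return -1, which must never be
	 * passed on to read() or close(). */
	if (fd < 0)
		return -errno;

	/* CHECKED_RETURN: inspect the result of read() instead of
	 * assuming that it succeeded. */
	n = read(fd, buf, len);
	saved_errno = errno;
	close(fd);

	return n < 0 ? -saved_errno : (long)n;
}
```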

Signed-off-by: Sebastien Buisson <sebastien.buisson@bull.net>
Change-Id: I1ea3428febee741be15215fcc89266a814e5fe1b
Reviewed-on: http://review.whamcloud.com/7842
Reviewed-by: Dmitry Eremin <dmitry.eremin@intel.com>
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: James Nunez <james.a.nunez@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
Thomas Stibor [Thu, 17 Oct 2013 08:10:01 +0000 (10:10 +0200)]
LU-4012 gss: upcall fails due to removed generic cache calls

With Linux kernel versions >= 3.4.X, the generic cache registration
routines cache_register() and cache_unregister() are removed.
This affects the GSS upcall routines for the Kerberos
support in Lustre. Before (Linux kernel <= 3.2.X) the
generic routines were coded as:

    int cache_register(struct cache_detail *cd)
    {
            return cache_register_net(cd, &init_net);
    }

    void cache_unregister(struct cache_detail *cd)
    {
            cache_unregister_net(cd, &init_net);
    }

To keep compatibility with former kernel versions,
cache_register_net() and cache_unregister_net() can
be used instead of the removed
cache_register() and cache_unregister().

Signed-off-by: Thomas Stibor <thomas@stibor.net>
Change-Id: I3eb012a1883c463ecafe3646e7e706ab96d91c5c
Reviewed-on: http://review.whamcloud.com/7770
Tested-by: Jenkins
Reviewed-by: Nathaniel Clark <nathaniel.l.clark@intel.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Andrew Korty <ajk@iu.edu>
Niu Yawei [Thu, 29 Aug 2013 03:45:32 +0000 (23:45 -0400)]
LU-3843 clio: add debug message in osc_completion()

Replace LASSERT with LASSERTF in osc_completion(), so that we get
more information when the assertion is triggered.
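
The benefit of an assertion that carries a formatted message can be
shown with a userspace analogue. SKETCH_ASSERTF and checked_state() are
made-up names for illustration; this is not Lustre's LASSERTF, and the
condition checked in osc_completion() is not reproduced here.

```c
#include <stdio.h>
#include <stdlib.h>

/* Userspace analogue of an assertion with a formatted message: on
 * failure it prints the condition plus caller-supplied context. */
#define SKETCH_ASSERTF(cond, fmt, ...)					\
	do {								\
		if (!(cond)) {						\
			fprintf(stderr, "ASSERTION(%s) failed: " fmt,	\
				#cond, __VA_ARGS__);			\
			abort();					\
		}							\
	} while (0)

/* Example use: report the offending values, not just the failure. */
int checked_state(int state, int expected)
{
	SKETCH_ASSERTF(state == expected, "state %d, expected %d\n",
		       state, expected);
	return state;
}
```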

Signed-off-by: Niu Yawei <yawei.niu@intel.com>
Change-Id: I1b365050d5c2348b4cdc83f2ea71f36cce7f9456
Reviewed-on: http://review.whamcloud.com/7494
Reviewed-by: Bobi Jam <bobijam@gmail.com>
Tested-by: Jenkins
Reviewed-by: Jinshan Xiong <jinshan.xiong@intel.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
James Nunez [Mon, 1 Jul 2013 22:18:59 +0000 (16:18 -0600)]
LU-3491 llite: Replace printing of i_ino with ll_inode2fid()

The printing of i_ino/i_generation in llite messages is not nearly so
useful as printing the full inode FID, since i_ino is a "compressed"
version of the FID and there may be duplicate values for i_ino in some
cases (especially if running on a 32-bit client).
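
For illustration, a FID can be rendered as a three-part bracketed
string. The struct below only mirrors the sequence/object-id/version
layout of a FID under hypothetical names, and the exact output format is
an assumption rather than a copy of Lustre's own format macro.

```c
#include <stdint.h>
#include <stdio.h>

/* Hypothetical mirror of a FID: sequence, object id, version. */
struct sketch_fid {
	uint64_t seq;
	uint32_t oid;
	uint32_t ver;
};

/* Format the full FID into buf; returns the snprintf() result. */
int fid_to_string(const struct sketch_fid *fid, char *buf, size_t len)
{
	return snprintf(buf, len, "[0x%llx:0x%x:0x%x]",
			(unsigned long long)fid->seq,
			(unsigned int)fid->oid, (unsigned int)fid->ver);
}
```

Unlike a 32-bit i_ino, the string keeps every bit of the identifier,
so two distinct objects never print the same value.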

All instances of printing i_ino/i_generation are replaced with
the FID using ll_inode2fid(). All instances, except for one, of
printing just i_ino were replaced by printing the FID. In all
CERROR lines touched by the i_ino replacements, the device name
or fsname was added at the beginning of the message if it was not
already there.

Signed-off-by: James Nunez <james.a.nunez@intel.com>
Change-Id: Ia32af1357729bf1951e0b713e75986c8177dd946
Reviewed-on: http://review.whamcloud.com/6848
Tested-by: Jenkins
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: John L. Hammond <john.hammond@intel.com>
James Simmons [Fri, 25 Oct 2013 14:14:20 +0000 (10:14 -0400)]
LU-1199 rpm: use %defattr instead of %attr for every file

Every file in the spec file is installed with %attr(-,root,root)
explicitly. Use %defattr once per %files section instead.

Signed-off-by: Jeff Mahoney <jeffm@suse.com>
Signed-off-by: James Simmons <uja.ornl@gmail.com>
Change-Id: Ia37757db39a6ee81fb0acd910d7d3204f3c194b1
Reviewed-on: http://review.whamcloud.com/5486
Tested-by: Jenkins
Reviewed-by: Christopher J. Morrone <chris.morrone.llnl@gmail.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
James Simmons [Mon, 14 Oct 2013 15:26:29 +0000 (11:26 -0400)]
LU-3570 libcfs: accelerate crc32c with pclmulqdq

Use the hardware-provided PCLMULQDQ instruction to accelerate
the CRC32C checksum. This instruction is present starting with
Intel Westmere and AMD Bulldozer CPUs.

Also reorganize the assembler code of the other PCLMUL implementation
to use common assembler macros from inst.h.
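
For reference, the checksum that any accelerated version must
reproduce can be computed with a plain bitwise loop. The sketch below is
a slow software reference, not the PCLMULQDQ code from this patch;
crc32c_sw() is a hypothetical name, and the seed and final-inversion
convention shown here is the common one but may differ from what a
particular caller in Lustre or the kernel expects.

```c
#include <stddef.h>
#include <stdint.h>

/* Bitwise CRC32C (Castagnoli), reflected polynomial 0x82F63B78.
 * Pass 0 as crc for a fresh checksum. */
uint32_t crc32c_sw(uint32_t crc, const unsigned char *buf, size_t len)
{
	int bit;

	crc = ~crc;			/* initial inversion */
	while (len--) {
		crc ^= *buf++;
		for (bit = 0; bit < 8; bit++)
			/* shift out one bit, xor the polynomial if it was set */
			crc = (crc >> 1) ^ (0x82F63B78u & (0u - (crc & 1u)));
	}
	return ~crc;			/* final inversion */
}
```

The standard check input "123456789" yields 0xE3069283 with this
convention, which makes it a convenient self-test for faster variants.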

Signed-off-by: James Simmons <uja.ornl@gmail.com>
Signed-off-by: Dmitry Eremin <dmitry.eremin@intel.com>
Change-Id: I2099ff8af3591b3e2267cb30486d7a4f0a3e90c0
Reviewed-on: http://review.whamcloud.com/6927
Tested-by: Jenkins
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
Alexander.Boyko [Wed, 25 Sep 2013 09:18:26 +0000 (13:18 +0400)]
LU-4007 libcfs: fix for removing lcw at lcw_dispatch_main

lcw_dispatch_main() creates a zombies list and moves the lcw onto it,
but then removes the entry from another list in the zombies loop.

Signed-off-by: Alexander Boyko <alexander_boyko@xyratex.com>
Xyratex-bug-id: MRP-1179
Change-Id: I9daec3d9b3dbbf244a28b829cd724c8ec60c4b6c
Reviewed-on: http://review.whamcloud.com/7755
Reviewed-by: Dmitry Eremin <dmitry.eremin@intel.com>
Tested-by: Jenkins
Reviewed-by: Doug Oucharek <doug.s.oucharek@intel.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Reviewed-by: Liang Zhen <liang.zhen@intel.com>
Reviewed-by: Isaac Huang <he.huang@intel.com>
Bob Glossman [Thu, 31 Oct 2013 18:10:24 +0000 (11:10 -0700)]
LU-4178 tests: disable HSM sanity subtests

Turn off subtests with a high failure rate in sanity-hsm.
If fixes for these failures land, the tests may be turned on again later.

Signed-off-by: Bob Glossman <bob.glossman@intel.com>
Change-Id: Iaeedd2946f33075242caad851c7f473d86cbcb04
Reviewed-on: http://review.whamcloud.com/8122
Reviewed-by: Nathaniel Clark <nathaniel.l.clark@intel.com>
Tested-by: Jenkins
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Maloo <hpdd-maloo@intel.com>
Mikhail Pershin [Wed, 23 Oct 2013 18:17:51 +0000 (22:17 +0400)]
LU-4135 mdt: restore mdt_save_lock() functionality

Missed updates of mti_has_trans cause mdt_save_lock() to just decref
the lock and never try to save it as intended.

This patch restores the updating of the mti_has_trans variable and
fixes an issue with a NULL req in the mdt_export_cleanup() codepath.

Signed-off-by: Mikhail Pershin <mike.pershin@intel.com>
Change-Id: I2e4ccae018635842df1e02d72a639b0ca1fbdb04
Reviewed-on: http://review.whamcloud.com/8048
Tested-by: Hudson
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Alex Zhuravlev <alexey.zhuravlev@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-1458 tests: wait for dbench to start 14/7914/3
Nathaniel Clark [Thu, 10 Oct 2013 15:46:54 +0000 (11:46 -0400)]
LU-1458 tests: wait for dbench to start

Use wait_for_function instead of just guessing that 20s will be long
enough for dbench to start.  Wait for up to 6min in 5 second
intervals.  This also ensures that dbench will run for a little while
after it starts.
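
The polling approach described above can be sketched in C as follows. This
is only an illustration of "poll at a fixed interval up to a maximum wait";
it is not the test-framework's wait_for_function, and every name in it
(wait_for_condition, countdown_check, fake_sleep) is hypothetical.

```c
#include <stdbool.h>

/* Hypothetical predicate type: returns true once the condition holds. */
typedef bool (*check_fn)(void *arg);
/* Hypothetical sleep hook so the sketch can run without real delays. */
typedef void (*sleep_fn)(unsigned int seconds);

/*
 * Poll check() every `interval` seconds until it succeeds or `max_wait`
 * seconds have elapsed.  Returns the number of seconds waited on
 * success, or -1 on timeout.
 */
int wait_for_condition(check_fn check, void *arg, sleep_fn do_sleep,
                       unsigned int max_wait, unsigned int interval)
{
    unsigned int waited = 0;

    for (;;) {
        if (check(arg))
            return (int)waited;
        if (waited >= max_wait)
            return -1;
        do_sleep(interval);
        waited += interval;
    }
}

/* Example predicate: succeeds on the Nth call (stands in for "dbench runs"). */
struct countdown {
    int calls_left;
};

bool countdown_check(void *arg)
{
    struct countdown *c = arg;

    return --c->calls_left <= 0;
}

/* No-op sleep for demonstration; a real caller would wrap sleep(3). */
void fake_sleep(unsigned int seconds)
{
    (void)seconds;
}
```

With a 360 second limit and a 5 second interval, the caller returns as soon
as the predicate holds instead of always paying a fixed 20 second guess.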

Signed-off-by: Nathaniel Clark <nathaniel.l.clark@intel.com>
Change-Id: Ic9c61b487c4013cccd620491d617c10e2206fd62
Reviewed-on: http://review.whamcloud.com/7914
Tested-by: Hudson
Reviewed-by: Bobi Jam <bobijam@gmail.com>
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-2842 mdt: Do not lbug on invalid fid entry in name 17/7617/3
Oleg Drokin [Wed, 11 Sep 2013 18:26:43 +0000 (14:26 -0400)]
LU-2842 mdt: Do not lbug on invalid fid entry in name

Instead of crashing the MDS when the name exists but has an invalid fid,
we should just return an error and let lfsck deal with it later.

Change-Id: Iea19dac75e9dc75c43a18efa757fc1c5e4d1743e
Signed-off-by: Oleg Drokin <oleg.drokin@intel.com>
Reviewed-on: http://review.whamcloud.com/7617
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: wangdi <di.wang@intel.com>
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
11 years agoLU-2906 ldlm: use accessor macros for l_flags 63/7963/4
Bruce Korb [Thu, 17 Oct 2013 23:34:23 +0000 (16:34 -0700)]
LU-2906 ldlm: use accessor macros for l_flags

Convert almost all of the ldlm lock's l_flags references from direct
bit twiddling to using bit-specific macros.  A few multi-bit
operations are left as an exercise for the reader.

The changes are mostly in ldlm, but also in llite, osc and quota.
This patch also fixes a typo in the LDLM_{SET,CLEAR}_FLAG macros and:

- Adds a multi-bit (mask) test.
- Removes the now obsolete LDLM_AST_FLAGS and LDLM_INHERIT_FLAGS defines.
- Removes the obsolete LDLM_FL_HIDE_LOCK_MASK define.
- Renames the "local_only" mask to "off_wire" since it is confusingly
  similar to a flag that (I think) means, "do not copy this lock over
  the wire."  The "local_only/off_wire" mask is also never used.  It's
  mostly informational.

Wireshark output moved to "lustre_dlm_flags_wshark.c" and only
bits that can actually appear "on the wire" are emitted.
The "packet-lustre.c" code that references these bits now gets
emitted into that file.  e.g. the "local_only" bit is never put
on the wire, so references to it in wireshark are gone.
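
For readers unfamiliar with the pattern, a minimal sketch of bit-specific
accessor macros follows.  The flag names, values and macro names are
hypothetical and do not reproduce the real LDLM_FL_* definitions; the
mask test is assumed here to mean "any of the given bits is set".

```c
#include <stdint.h>

/* Hypothetical flag bits; not the real LDLM_FL_* values. */
#define DEMO_FL_BLOCKED   (1ULL << 0)
#define DEMO_FL_CANCELING (1ULL << 1)
#define DEMO_FL_LOCAL     (1ULL << 2)

struct demo_lock {
    uint64_t l_flags;
};

/* Generic helpers; with a multi-bit mask, TEST is true if any bit is set. */
#define DEMO_TEST_FLAG(lk, bits)  (((lk)->l_flags & (bits)) != 0)
#define DEMO_SET_FLAG(lk, bits)   ((lk)->l_flags |= (bits))
#define DEMO_CLEAR_FLAG(lk, bits) ((lk)->l_flags &= ~(bits))

/* Bit-specific accessors, so callers never spell out the bit arithmetic. */
#define demo_is_canceling(lk)    DEMO_TEST_FLAG(lk, DEMO_FL_CANCELING)
#define demo_set_canceling(lk)   DEMO_SET_FLAG(lk, DEMO_FL_CANCELING)
#define demo_clear_canceling(lk) DEMO_CLEAR_FLAG(lk, DEMO_FL_CANCELING)
```

A call site then reads demo_set_canceling(lock) rather than
lock->l_flags |= DEMO_FL_CANCELING, which makes flag uses easy to grep.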

Signed-off-by: Bruce Korb <bruce.korb@gmail.com>
Reviewed-by: Keith Mannthey <Keith.Mannthey@intel.com>
Change-Id: I2527c46835e434f1009cf83919a203a358b04737
Reviewed-on: http://review.whamcloud.com/7963
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Doug Oucharek <doug.s.oucharek@intel.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
11 years agoLU-3679 lnet: reflect down routes in /proc/sys/lnet/routes 57/7857/4
Chris Horn [Wed, 23 Oct 2013 17:12:40 +0000 (12:12 -0500)]
LU-3679 lnet: reflect down routes in /proc/sys/lnet/routes

We consider routes "down" if the router is down or the router
NI for the target network is down. This should be reflected
in the output of /proc/sys/lnet/routes
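
The rule above can be written as a small predicate.  This is a sketch
under assumptions: struct demo_route and its fields are hypothetical
stand-ins, not the LNet data structures, and the "up"/"down" strings are
only illustrative of what a routes listing might print.

```c
#include <stdbool.h>

/* Hypothetical, simplified model of a route. */
struct demo_route {
    bool router_alive;     /* is the router itself up? */
    bool target_ni_alive;  /* is the router's NI for the target network up? */
};

/* A route is up only if the router and its NI for the target net are up. */
bool demo_route_is_up(const struct demo_route *r)
{
    return r->router_alive && r->target_ni_alive;
}

/* State string a routes listing could show for this route. */
const char *demo_route_state(const struct demo_route *r)
{
    return demo_route_is_up(r) ? "up" : "down";
}
```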

Signed-off-by: Chris Horn <hornc@cray.com>
Change-Id: I82ee769d88aec92f1690ad9c095e32c9a9f9e282
Reviewed-on: http://review.whamcloud.com/7857
Reviewed-by: Cory Spitz <spitzcor@cray.com>
Reviewed-by: Isaac Huang <he.huang@intel.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-3485 tests: add resize2fs testing for MDT and OST filesystems 96/7796/6
Jian Yu [Fri, 18 Oct 2013 15:09:56 +0000 (23:09 +0800)]
LU-3485 tests: add resize2fs testing for MDT and OST filesystems

This patch adds a test case into conf-sanity.sh to test running
resize2fs on MDT and OST filesystems. Currently, only offline
resizing is possible until LU-137 is finished. Before and after
resize2fs is used, e2fsck is run to verify that no corruption
is introduced during the process.

Signed-off-by: Jian Yu <jian.yu@intel.com>
Change-Id: I0957cba7fdd76276331c40b746cd87efac10ecc4
Reviewed-on: http://review.whamcloud.com/7796
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Nathaniel Clark <nathaniel.l.clark@intel.com>
Reviewed-by: Niu Yawei <yawei.niu@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-1199 obdclass: build and install llog_test.ko properly 93/5493/3
James Simmons [Wed, 18 Sep 2013 13:49:21 +0000 (09:49 -0400)]
LU-1199 obdclass: build and install llog_test.ko properly

There's a hack in lustre.spec.in to copy the llog_test.ko module into
/lib/modules.

It's only needed because that module is built unconditionally and
is expected to only be installed if --enable-tests is configured.

The right way to do this is to build it only if --enable-tests
is configured; installation then comes automatically.

The conditional in the spec file for --enable-tests already exists,
so it will be packaged correctly without the hack.

Signed-off-by: Jeff Mahoney <jeffm@suse.com>
Signed-off-by: James Simmons <uja.ornl@gmail.com>
Change-Id: I2705848e11a96ab6769ebba4481d2d0621f53d3e
Reviewed-on: http://review.whamcloud.com/5493
Tested-by: Hudson
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Dmitry Eremin <dmitry.eremin@intel.com>
Reviewed-by: Christopher J. Morrone <chris.morrone.llnl@gmail.com>
Reviewed-by: Stephen Champion <schamp@sgi.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-2800 autoconf: remove LIBCFS_SOCK_MAP_FD_2ARG 06/5406/7
James Simmons [Wed, 11 Sep 2013 17:07:17 +0000 (13:07 -0400)]
LU-2800 autoconf: remove LIBCFS_SOCK_MAP_FD_2ARG

v2.6.27 added a second argument to sock_map_fd. Earlier versions are
out of scope so we can remove the test for the number of arguments.

Signed-off-by: James Simmons <uja.ornl@gmail.com>
Signed-off-by: Jeff Mahoney <jeffm@suse.com>
Change-Id: I4e12da8c102b5d682db9658187330ae601b00a21
Reviewed-on: http://review.whamcloud.com/5406
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
11 years agoLU-4100 utils: Remove extra space of lfs getstripe 92/7992/3
Li Xi [Fri, 18 Oct 2013 03:44:35 +0000 (20:44 -0700)]
LU-4100 utils: Remove extra space of lfs getstripe

'lfs getstripe $dir' command prints an extra space at the end of
the output line, which is particularly annoying for parsing in
scripts.

This patch fixes this problem.

Signed-off-by: Li Xi <lixi@ddn.com>
Change-Id: Iee50939f2b93bbd8d1c506bb90bc761a9f4dbeaf
Reviewed-on: http://review.whamcloud.com/7992
Tested-by: Hudson
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-4113 gss: uncatched error in gss_svc_upcall causes drops 60/7960/2
Thomas Stibor [Wed, 16 Oct 2013 14:00:06 +0000 (16:00 +0200)]
LU-4113 gss: uncatched error in gss_svc_upcall causes drops

Since kernel version 2.6.20 the function cache_check() in
net/sunrpc/cache.c can return the error -ETIMEDOUT. This
error should cause the cache to be rechecked. In the current
implementation gss_svc_upcall_handle_init() does not
handle the error, so it falls through the switch statement
and returns via GOTO(out, rc = SECSVC_DROP).
As a consequence this error propagates to
ERROR:do_nego_rpc(): status: -110 (Unknown error 18446744073709551506)
ERROR:lgssc_negotiation(): negotiation rpc error: -85
ERROR:lgssc_kr_negotiate(): key 08786b66: failed to negotiation
and causes the client to hang after switching to flavor krb5i/krb5p.
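
One way to handle such a return value is sketched below.  This is an
assumption about the general shape of the fix, not the patch itself: the
lookup hook, the verdict enum and the bounded retry count are all
hypothetical, and the real code operates on the sunrpc cache in the kernel.

```c
#include <errno.h>

enum demo_verdict { DEMO_OK, DEMO_DROP };

/* Hypothetical lookup hook standing in for the kernel's cache_check(). */
typedef int (*demo_cache_check_fn)(void *ctx);

/*
 * Recheck the cache when the lookup reports -ETIMEDOUT instead of letting
 * that value fall through to the "drop" path.  max_retries bounds the loop.
 */
enum demo_verdict demo_handle_init(demo_cache_check_fn check, void *ctx,
                                   int max_retries)
{
    int attempt;

    for (attempt = 0; attempt <= max_retries; attempt++) {
        int rc = check(ctx);

        switch (rc) {
        case 0:
            return DEMO_OK;
        case -ETIMEDOUT:
            continue;          /* recheck the cache */
        default:
            return DEMO_DROP;  /* any other error drops the request */
        }
    }
    return DEMO_DROP;          /* still timing out after all retries */
}

/* Scripted lookup used to exercise the sketch. */
struct demo_script {
    const int *results;
    int pos;
};

int demo_scripted_check(void *ctx)
{
    struct demo_script *s = ctx;

    return s->results[s->pos++];
}
```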

Signed-off-by: Thomas Stibor <thomas@stibor.net>
Change-Id: I7301bea70d96193e0f4a332c6e4b34ba326134ff
Reviewed-on: http://review.whamcloud.com/7960
Tested-by: Hudson
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Reviewed-by: Nathaniel Clark <nathaniel.l.clark@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andrew Korty <ajk@iu.edu>
Reviewed-by: Ken Hornstein <kenh@cmf.nrl.navy.mil>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-3373 tests: small fixes for sanity tests 48/7948/2
yangsheng [Mon, 14 Oct 2013 16:59:04 +0000 (00:59 +0800)]
LU-3373 tests: small fixes for sanity tests

--test_24d fix typo
--test_24A cleanup work directory in advance
--test_27k make the message clearer
--test-framework enforce LANG to en_US

Signed-off-by: yang sheng <yang.sheng@intel.com>
Change-Id: I61c227028b44a198482b4e5c9a9f715af08c3e5f
Reviewed-on: http://review.whamcloud.com/7948
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Jian Yu <jian.yu@intel.com>
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-4098 lmv: kernel crash due to misconfigured MDT 41/7941/3
Dmitry Eremin [Mon, 14 Oct 2013 11:43:27 +0000 (15:43 +0400)]
LU-4098 lmv: kernel crash due to misconfigured MDT

There are a few places that access lmv->tgts[] without checking for NULL.
This usually happens when the MDT is configured starting from index 1
instead of 0. For example:
  mkfs.lustre --reformat --mgs --mdt --index=1 /dev/sdd1
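
The kind of guard this calls for can be sketched as follows.  The types,
the accessor name and the choice of -ENODEV are hypothetical; the sketch
only shows a sparse target table whose slot 0 is empty, as in the
--index=1 example above.

```c
#include <errno.h>
#include <stddef.h>

struct demo_tgt {
    int active;
};

struct demo_lmv {
    struct demo_tgt **tgts;  /* sparse table: slots may be NULL */
    size_t tgts_size;
};

/*
 * Look up a target by index.  Returns 0 and stores the target in *out,
 * or -ENODEV if the index is out of range or the slot is empty.
 */
int demo_lmv_get_tgt(const struct demo_lmv *lmv, size_t idx,
                     struct demo_tgt **out)
{
    if (idx >= lmv->tgts_size || lmv->tgts[idx] == NULL)
        return -ENODEV;
    *out = lmv->tgts[idx];
    return 0;
}
```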

Signed-off-by: Dmitry Eremin <dmitry.eremin@intel.com>
Change-Id: I7d9bc8876bb0b2c2669050904d4629069b61e639
Reviewed-on: http://review.whamcloud.com/7941
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Reviewed-by: Alex Zhuravlev <alexey.zhuravlev@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-3319 procfs: remove unsed proc entry for target handler 29/7929/2
James Simmons [Fri, 11 Oct 2013 16:08:45 +0000 (12:08 -0400)]
LU-3319 procfs: remove unsed proc entry for target handler

With 3.10 and above kernels we end up using a new data
structure for proc handling. In the target handler code
we have an unused lprocfs_vars, so we remove it to avoid
compile issues.

Signed-off-by: James Simmons <uja.ornl@gmail.com>
Change-Id: I22d3d3b1aed852df93e448d3b793884344d1719a
Reviewed-on: http://review.whamcloud.com/7929
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Mike Pershin <mike.pershin@intel.com>
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-4067 osd: apply PTR_ERR() to right pointer 87/7887/2
John L. Hammond [Tue, 8 Oct 2013 23:23:15 +0000 (18:23 -0500)]
LU-4067 osd: apply PTR_ERR() to right pointer

In ofd_precreate_objects(), osd_index_ea_insert(), and
osd_object_create() ensure that PTR_ERR() is applied to the right
pointer. Also convert some explicit casts of ERR_PTRs to ERR_CAST()
calls.

Signed-off-by: John L. Hammond <john.hammond@intel.com>
Change-Id: I846b10543ae426652ad72c277d2c0f0bc259fd6e
Reviewed-on: http://review.whamcloud.com/7887
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Mike Pershin <mike.pershin@intel.com>
Reviewed-by: Alex Zhuravlev <alexey.zhuravlev@intel.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
11 years agoLU-4078 mdc: fix bad ERR_PTR usage in mdc_locks.c 86/7886/2
John L. Hammond [Tue, 8 Oct 2013 22:23:09 +0000 (17:23 -0500)]
LU-4078 mdc: fix bad ERR_PTR usage in mdc_locks.c

In mdc_intent_open_pack() return an ERR_PTR() rather than NULL when
ldlm_prep_enqueue_req() fails. In mdc_intent_getattr_async() check the
return value of mdc_intent_getattr_pack() using IS_ERR(). Clean up the
includes in mdc_locks.c.
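
A userspace sketch of the convention being fixed follows.  The demo_*
helpers only imitate the kernel's ERR_PTR(), PTR_ERR() and IS_ERR() so the
example can run outside the kernel; demo_pack() and demo_send() are
hypothetical and are not the mdc functions named above.

```c
#include <errno.h>
#include <stdint.h>
#include <stdlib.h>

#define DEMO_MAX_ERRNO 4095

/* Userspace imitations of the kernel's ERR_PTR()/PTR_ERR()/IS_ERR(). */
static inline void *demo_err_ptr(long err)
{
    return (void *)(intptr_t)err;
}

static inline long demo_ptr_err(const void *p)
{
    return (long)(intptr_t)p;
}

static inline int demo_is_err(const void *p)
{
    return (uintptr_t)p >= (uintptr_t)-DEMO_MAX_ERRNO;
}

struct demo_req {
    int opcode;
};

/* Packs a request; reports failure as an encoded error, never as NULL. */
struct demo_req *demo_pack(int fail_with)
{
    struct demo_req *req;

    if (fail_with != 0)
        return demo_err_ptr(fail_with);
    req = malloc(sizeof(*req));
    if (req == NULL)
        return demo_err_ptr(-ENOMEM);
    req->opcode = 1;
    return req;
}

/* Caller: test with demo_is_err() and decode the same pointer. */
int demo_send(int fail_with)
{
    struct demo_req *req = demo_pack(fail_with);

    if (demo_is_err(req))
        return (int)demo_ptr_err(req);
    free(req);
    return 0;
}
```

Note that demo_is_err(NULL) is false, so a callee that returns NULL on
failure is not caught by a caller that only checks for an encoded error.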

Signed-off-by: John L. Hammond <john.hammond@intel.com>
Change-Id: I89348389ccd102d1226caf16b315ac9b2a96b307
Reviewed-on: http://review.whamcloud.com/7886
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Nathaniel Clark <nathaniel.l.clark@intel.com>
11 years agoLU-4064 hsm: create files to be imported on agent node 85/7885/2
John L. Hammond [Tue, 8 Oct 2013 20:52:23 +0000 (15:52 -0500)]
LU-4064 hsm: create files to be imported on agent node

In sanity-hsm test 13 create the files to be imported on the agent
node rather than the local node.

Signed-off-by: John L. Hammond <john.hammond@intel.com>
Change-Id: I3f50cc5269512d3b99644119ea18272d3a80f8a7
Reviewed-on: http://review.whamcloud.com/7885
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: jacques-Charles Lafoucriere <jacques-charles.lafoucriere@cea.fr>
Reviewed-by: Jinshan Xiong <jinshan.xiong@intel.com>
Reviewed-by: James Nunez <james.a.nunez@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-4071 build: Ignore semicolon for Intel compiler 64/7864/4
Shuichi Ihara [Sun, 6 Oct 2013 14:17:27 +0000 (23:17 +0900)]
LU-4071 build: Ignore semicolon for Intel compiler

The build fails when gcc is installed but the Intel compiler
is defined for mpicc. Ignore the semicolon in this case.

Signed-off-by: Shuichi Ihara <sihara@ddn.com>
Change-Id: Ia3306b32c8d886f7a3123ff3c6c967343f0c26b0
Reviewed-on: http://review.whamcloud.com/7864
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Hudson
Reviewed-by: Minh Diep <minh.diep@intel.com>
Reviewed-by: Dmitry Eremin <dmitry.eremin@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
11 years agoLU-4070 build: RPM build errors 60/7860/2
Dmitry Eremin [Sat, 5 Oct 2013 10:27:04 +0000 (14:27 +0400)]
LU-4070 build: RPM build errors

Move %{_bindir}/mcreate and %{_bindir}/munlink files under
condition %{with lustre_tests}.

Signed-off-by: Dmitry Eremin <dmitry.eremin@intel.com>
Change-Id: I888086f672f690d9e3518f16edbf090edc8d0f14
Reviewed-on: http://review.whamcloud.com/7860
Tested-by: Hudson
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Minh Diep <minh.diep@intel.com>
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
11 years agoLU-4066 test: use $LUSTRE to locate server modules 54/7854/2
John L. Hammond [Fri, 4 Oct 2013 16:56:43 +0000 (11:56 -0500)]
LU-4066 test: use $LUSTRE to locate server modules

In test-framework.sh's init_test_env look for $LUSTRE/mdt/mdt.ko
rather than `$dirname $0`/../mdt/mdt.ko to determine if the server
modules are present when they cannot be found in /lib/modules.

Signed-off-by: John L. Hammond <john.hammond@intel.com>
Change-Id: Ib16eee4d7811afcb310c7b0fd4a401ea5af2e7ad
Reviewed-on: http://review.whamcloud.com/7854
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Jian Yu <jian.yu@intel.com>
Reviewed-by: Emoly Liu <emoly.liu@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-4052 build: simplify autogen.sh 40/7840/4
Christopher J. Morrone [Wed, 2 Oct 2013 23:14:03 +0000 (16:14 -0700)]
LU-4052 build: simplify autogen.sh

The concept of conditional subdirectories dates back to the days
when we had Lustre spread across multiple CVS repositories.  For
years now we have used git, and the tree is entirely available
at all times.

Therefore we can simplify autogen.sh a bit more.

Change-Id: I3d7c8e5e93b57c492bb61554a546ceb6d90b2ffe
Signed-off-by: Christopher J. Morrone <morrone2@llnl.gov>
Reviewed-on: http://review.whamcloud.com/7840
Tested-by: Hudson
Reviewed-by: Dmitry Eremin <dmitry.eremin@intel.com>
Reviewed-by: Minh Diep <minh.diep@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-4046 build: fix 'uninitialized variables' errors 21/7821/2
Sebastien Buisson [Wed, 2 Oct 2013 08:48:09 +0000 (10:48 +0200)]
LU-4046 build: fix 'uninitialized variables' errors

Fix 'uninitialized variables' defects found by Coverity version 6.5.1:
Uninitialized scalar variable (UNINIT)
Using uninitialized value.

Signed-off-by: Sebastien Buisson <sebastien.buisson@bull.net>
Change-Id: I4b5a3293e7d01436a82b6f7cea50b84c60660b3a
Reviewed-on: http://review.whamcloud.com/7821
Tested-by: Hudson
Reviewed-by: jacques-Charles Lafoucriere <jacques-charles.lafoucriere@cea.fr>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Emoly Liu <emoly.liu@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-4036 autoconf: don't check ext4 src when disable server 18/7818/2
Minh Diep [Tue, 1 Oct 2013 19:25:33 +0000 (12:25 -0700)]
LU-4036 autoconf: don't check ext4 src when disable server

We should not check for the ext4 source to decide whether to build
the server code when the user explicitly passes --disable-server.

Signed-off-by: Minh Diep <minh.diep@intel.com>
Change-Id: I04b45cf0c0837a7b8760b7ff46645cabfc2c6ebd
Reviewed-on: http://review.whamcloud.com/7818
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Hudson
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
11 years agoLU-3880 tests: Allow error_ignore to accept any bug number 91/7791/2
James Nunez [Fri, 27 Sep 2013 22:00:21 +0000 (16:00 -0600)]
LU-3880 tests: Allow error_ignore to accept any bug number

error_ignore is called when a test encounters an error that is
supposed to be ignored. The first argument to error_ignore is a
bug number, given as justification for why the error should be
ignored. The routine assumes this is a bugzilla bug number and
automatically attaches a “bz” prefix to it.
This patch removes the assumption that all bug numbers are bugzilla
numbers and moves adding the “bz” to the bug number to the
individual calls to error_ignore. Now, we can use “LU” bug
numbers with error_ignore.

Signed-off-by: James Nunez <james.a.nunez@intel.com>
Change-Id: I40c47ccf6af76e76bff99c67816e0d2aebd37e71
Reviewed-on: http://review.whamcloud.com/7791
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Minh Diep <minh.diep@intel.com>
Reviewed-by: Jian Yu <jian.yu@intel.com>
11 years agoLU-3764 tests: Make calls to error_ignore not trigger FAIL 82/7782/3
James Nunez [Thu, 26 Sep 2013 21:21:42 +0000 (15:21 -0600)]
LU-3764 tests: Make calls to error_ignore not trigger FAIL

Currently, error_ignore calls error_noexit with a status of IGNORE
and a message/comment. error_noexit writes the comment to
the $LOGDIR/err file. If there is an err file, the pass
routine assumes the test failed and reports the test status as FAIL.
So, for errors that should be ignored, we don't want to be
writing the comment out to the err file.

Now, any message with an "IGNORE" status is written to
$LOGDIR/ignore file and, thus, the test will no longer be
classified as FAIL and the error will really be ignored.

Signed-off-by: James Nunez <james.a.nunez@intel.com>
Change-Id: I30f5387105f6cbce54050682b16387f73ab541d6
Reviewed-on: http://review.whamcloud.com/7782
Reviewed-by: Minh Diep <minh.diep@intel.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Reviewed-by: Jian Yu <jian.yu@intel.com>
Reviewed-by: John L. Hammond <john.hammond@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-3996 mgs: Don't close llog that we failed to open 42/7742/4
Oleg Drokin [Tue, 24 Sep 2013 14:32:44 +0000 (10:32 -0400)]
LU-3996 mgs: Don't close llog that we failed to open

Attempting to do so leads to a crash:
LustreError: 8136:0:(llog_osd.c:241:llog_osd_read_header()) MGS-osd: error reading log header from [0xa:0xa:0x0]: rc = -14
LustreError: 8136:0:(mgs_llog.c:1386:record_start_log()) MGS: can't start log lustre-params: rc = -14
BUG: unable to handle kernel NULL pointer dereference at 00000000000000b8
IP: [<ffffffffa07f9e99>] llog_handle_put+0x9/0x70 [obdclass]
PGD 8120c8067 PUD 7b8c65067 PMD 0
Oops: 0000 [#1] SMP
...
Call Trace:
 [<ffffffffa07f9f5d>] llog_close+0x5d/0x190 [obdclass]
 [<ffffffffa0e98e8d>] mgs_write_log_direct_all+0x28d/0x7a0 [mgs]
 [<ffffffffa06c8951>] ? libcfs_debug_msg+0x41/0x50 [libcfs]
 [<ffffffffa0e99d3d>] mgs_write_log_param+0x99d/0x2980 [mgs]
 [<ffffffff81167610>] ? cache_alloc_refill+0x1c0/0x240
 [<ffffffffa0ea00b0>] mgs_setparam+0xde0/0x11a0 [mgs]
 [<ffffffffa0e8c7e2>] mgs_iocontrol+0x10c2/0x16e0 [mgs]
 [<ffffffffa07f70a7>] ? obd_ioctl_getdata+0x477/0x1120 [obdclass]
 [<ffffffffa080e344>] class_handle_ioctl+0x12e4/0x1e70 [obdclass]
 [<ffffffffa06c8951>] ? libcfs_debug_msg+0x41/0x50 [libcfs]
 [<ffffffffa07f62ab>] obd_class_ioctl+0x4b/0x190 [obdclass]
 [<ffffffff81195352>] vfs_ioctl+0x22/0xa0
 [<ffffffff811954f4>] do_vfs_ioctl+0x84/0x580
 [<ffffffff81195a71>] sys_ioctl+0x81/0xa0
 [<ffffffff8100b072>] system_call_fastpath+0x16/0x1b
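
The pattern that avoids this crash can be sketched as below.  All names are
hypothetical and the sketch is not the mgs code; it only shows a caller
that returns early on a failed open so that close never sees a NULL handle.

```c
#include <errno.h>
#include <stddef.h>

struct demo_log {
    int open_count;
};

/* Hypothetical open: on failure returns a negative errno, *out stays NULL. */
int demo_log_open(struct demo_log *backing, int fail_with,
                  struct demo_log **out)
{
    *out = NULL;
    if (fail_with != 0)
        return fail_with;
    backing->open_count++;
    *out = backing;
    return 0;
}

/* Dereferences its argument, so it must never be given a NULL handle. */
void demo_log_close(struct demo_log *log)
{
    log->open_count--;
}

int demo_write_record(struct demo_log *backing, int fail_with)
{
    struct demo_log *log;
    int rc = demo_log_open(backing, fail_with, &log);

    if (rc != 0)
        return rc;  /* nothing was opened, so there is nothing to close */
    /* ... records would be written here ... */
    demo_log_close(log);
    return 0;
}
```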

Change-Id: I2bd72c130a0530cc9d218556b5b6cc9c14efc2ea
Signed-off-by: Oleg Drokin <oleg.drokin@intel.com>
Reviewed-on: http://review.whamcloud.com/7742
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Reviewed-by: Mike Pershin <mike.pershin@intel.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
11 years agoLU-2800 autoconf: remove LIBCFS_CPUMASK_SIZE 93/7693/2
James Simmons [Wed, 18 Sep 2013 14:16:27 +0000 (10:16 -0400)]
LU-2800 autoconf: remove LIBCFS_CPUMASK_SIZE

The function cpumask_size has been in the linux kernel
since 2.6.28. Also HAVE_CPUMASK_SIZE is not even used
anywhere in the source tree. We can remove this obsolete
test.

Signed-off-by: James Simmons <uja.ornl@gmail.com>
Change-Id: Ie6090055105dcaa55fc031bf52e5a25cd7bcf321
Reviewed-on: http://review.whamcloud.com/7693
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Minh Diep <minh.diep@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-3055 libcfs: remove page_collection::pc_lock 60/7660/2
Liang Zhen [Sun, 15 Sep 2013 15:09:53 +0000 (23:09 +0800)]
LU-3055 libcfs: remove page_collection::pc_lock

page_collection::pc_lock is supposed to protect against races between
functions called by smp_call_function(). However, we have not had
this use case for ages and page_collection only lives on the stack
of a thread, so it is safe to remove it.

Signed-off-by: Liang Zhen <liang.zhen@intel.com>
Change-Id: I89ca46e6593838d85fc0b91d1ff7c35cc28cf4ce
Reviewed-on: http://review.whamcloud.com/7660
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Bobi Jam <bobijam@gmail.com>
Reviewed-by: Sebastien Buisson <sebastien.buisson@bull.net>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-3616 mdt: deny write on executed file during restore 36/7636/7
Bruno Faccini [Mon, 14 Oct 2013 09:27:37 +0000 (11:27 +0200)]
LU-3616 mdt: deny write on executed file during restore

After commit c42b426c87c3d3b1dc9eda612cc831293dc80d68, from Gerrit
patch/Change-Id Ic8f82ddc9a56206307c2e5be2523fb7ce42b8638
(at http://review.whamcloud.com/3035) for LU-1338 (now HSM-5), it
is possible to write to a file that is being executed while it is
being restored. This patch reverts to the expected behavior (ETXTBSY).
It also adds test_30c to sanity-hsm to verify this behavior.

Signed-off-by: Bruno Faccini <bruno.faccini@intel.com>
Change-Id: Id1e2f624dd02c2bec18ffe6cb1ae9198c1cfd23a
Reviewed-on: http://review.whamcloud.com/7636
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: John L. Hammond <john.hammond@intel.com>
Reviewed-by: jacques-Charles Lafoucriere <jacques-charles.lafoucriere@cea.fr>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-3936 ldlm: ldlm_cancel_stale_locks()) ASSERTION( count > 0 ) failed 26/7626/2
Andriy Skulysh [Thu, 12 Sep 2013 07:10:59 +0000 (10:10 +0300)]
LU-3936 ldlm: ldlm_cancel_stale_locks()) ASSERTION( count > 0 ) failed

The number of granted locks can be very large.
Use 64-bit math to calculate the number of locks to cancel.
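
To illustrate the class of bug, here is a sketch with a hypothetical
percentage calculation; the actual formula in ldlm_cancel_stale_locks() is
not shown in this message and is not reproduced here.  With 32-bit
arithmetic the intermediate product can wrap, even to zero, which is how a
"count > 0" style assertion could fire.

```c
#include <stdint.h>

/* 32-bit arithmetic: the intermediate product wraps for large counts. */
uint32_t demo_to_cancel_32(uint32_t granted, uint32_t percent)
{
    return granted * percent / 100u;
}

/* 64-bit arithmetic: widen before multiplying so the product cannot wrap. */
uint64_t demo_to_cancel_64(uint32_t granted, uint32_t percent)
{
    return (uint64_t)granted * percent / 100u;
}
```

For example, 2147483648 granted locks at 2 percent gives a product of
exactly 2^32, which wraps to 0 in 32 bits but is 42949672 when computed
in 64 bits.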

Change-Id: Iac0b52dd578baf9955b50597d0832ac8715895f1
Xyratex-bug-id: MRP-1296
Signed-off-by: Andriy Skulysh <Andriy_Skulysh@xyratex.com>
Reviewed-by: Alexey Lyashkov <Alexey_Lyashkov@xyratex.com>
Reviewed-by: Alexander Boyko <Alexander_Boyko@xyratex.com>
Reviewed-by: Vitaly Fertman <Vitaly_Fertman@xyratex.com>
Reviewed-on: http://review.whamcloud.com/7626
Tested-by: Hudson
Reviewed-by: Alexander Boyko <alexander_boyko@xyratex.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-3806 obdclass: add LCT_SERVER_SESSION for server session 12/7412/6
wang di [Wed, 21 Aug 2013 07:04:43 +0000 (00:04 -0700)]
LU-3806 obdclass: add LCT_SERVER_SESSION for server session

Add LCT_SERVER_SESSION for server sessions, and separate the
server session flag from LCT_SESSION, so as to avoid allocating
session info for the client stack on each server request when the
client and server are on the same node.

Signed-off-by: Wang Di <di.wang@intel.com>
Change-Id: I808c3f58cd7a03ebc166e51fe1e32ea34ae0e3e8
Reviewed-on: http://review.whamcloud.com/7412
Reviewed-by: John L. Hammond <john.hammond@intel.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Mike Pershin <mike.pershin@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-3692 api: More flexible logging API. 72/7572/7
Henri Doreau [Fri, 6 Sep 2013 07:43:48 +0000 (09:43 +0200)]
LU-3692 api: More flexible logging API.

Allow applications to register their own logging callback instead of
writing every log message to stderr/stdout. Registered callbacks are
global to the process, just like the log level.
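
A minimal sketch of a process-global logging callback follows.  The names
(demo_log_callback_set, demo_log, demo_capture_cb) and the callback
signature are hypothetical and are not the liblustreapi interface added by
this patch; the sketch only shows the registration pattern.

```c
#include <stdarg.h>
#include <stdio.h>

/* Hypothetical callback signature. */
typedef void (*demo_log_cb)(int level, const char *fmt, va_list ap);

/* Default behavior: write every message to stderr. */
static void demo_default_cb(int level, const char *fmt, va_list ap)
{
    fprintf(stderr, "<%d> ", level);
    vfprintf(stderr, fmt, ap);
    fputc('\n', stderr);
}

/* Both the callback and the level are global to the process. */
static demo_log_cb demo_cb = demo_default_cb;
static int demo_log_level = 3;

/* Install a callback and return the previous one; NULL restores the default. */
demo_log_cb demo_log_callback_set(demo_log_cb cb)
{
    demo_log_cb old = demo_cb;

    demo_cb = cb ? cb : demo_default_cb;
    return old;
}

/* Emit a message through whichever callback is currently registered. */
void demo_log(int level, const char *fmt, ...)
{
    va_list ap;

    if (level > demo_log_level)
        return;
    va_start(ap, fmt);
    demo_cb(level, fmt, ap);
    va_end(ap);
}

/* Example application callback: keep the last message in a buffer. */
static char demo_last_msg[128];

static void demo_capture_cb(int level, const char *fmt, va_list ap)
{
    (void)level;
    vsnprintf(demo_last_msg, sizeof(demo_last_msg), fmt, ap);
}
```

An application would call demo_log_callback_set(demo_capture_cb) once at
startup and could then route messages to syslog or its own log file.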

Signed-off-by: Henri Doreau <henri.doreau@cea.fr>
Change-Id: I5ccfd1e24f1786408ce67b2cd87d53443bcd028a
Reviewed-on: http://review.whamcloud.com/7572
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: John L. Hammond <john.hammond@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-3624 iokit: fix mds-survey for multiple MDT 58/7558/3
Gregoire Pichon [Thu, 5 Sep 2013 08:29:53 +0000 (10:29 +0200)]
LU-3624 iokit: fix mds-survey for multiple MDT

The mds-survey script has several issues when called with multiple
MDT targets. This patch fixes the issues.

- make echo_client devices attached to different MDTs work in
  different directories
- correctly compute global statistics from each MDT stats
- fix mds-survey output so that the number of fields displayed is
  constant. This allows parsing the output for future plotting.
- fix lctl test_xxx commands to correctly handle specified
  parent directory

Signed-off-by: Gregoire Pichon <gregoire.pichon@bull.net>
Change-Id: I3ef0b82b59124f2246db535b074be94c7235936a
Reviewed-on: http://review.whamcloud.com/7558
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Tested-by: Hudson
Reviewed-by: Minh Diep <minh.diep@intel.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Reviewed-by: wangdi <di.wang@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-3790 tests: test_pios_ssf failed with 24 97/7397/2
Konstantin Sandalov [Tue, 20 Aug 2013 16:29:24 +0000 (19:29 +0300)]
LU-3790 tests: test_pios_ssf failed with 24

A wrong parameter was used to get free space in the space_check() function.

Signed-off-by: Konstantin Sandalov <konstantin_sandalov@xyratex.com>
Change-Id: I34abf6d63f081a515aaf95248bf5c4fc0896e024
Reviewed-on: http://review.whamcloud.com/7397
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Jian Yu <jian.yu@intel.com>
Reviewed-by: Emoly Liu <emoly.liu@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-3815 tests: HSM sanity test suites 74/7374/11
Jinshan Xiong [Tue, 1 Oct 2013 19:09:11 +0000 (12:09 -0700)]
LU-3815 tests: HSM sanity test suites

Reevaluate the exception list.
Enable all sanity-hsm tests except #34, #35, #36 and #40.

Signed-off-by: Jinshan Xiong <jinshan.xiong@intel.com>
Change-Id: I66dbf6f5119fc75b379752a33a8d5e44562ff985
Reviewed-on: http://review.whamcloud.com/7374
Tested-by: Hudson
Reviewed-by: Aurelien Degremont <aurelien.degremont@cea.fr>
Reviewed-by: jacques-Charles Lafoucriere <jacques-charles.lafoucriere@cea.fr>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Jian Yu <jian.yu@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-3963 lnet: convert cfs_atomic primitives 70/7070/3
James Simmons [Thu, 17 Oct 2013 13:05:43 +0000 (09:05 -0400)]
LU-3963 lnet: convert cfs_atomic primitives

This patch converts all cfs_atomic primitives in the lnet/
directory.

Signed-off-by: Liu Xuezhao <xuezhao.liu@emc.com>
Signed-off-by: Peng Tao <tao.peng@emc.com>
Signed-off-by: James Simmons <uja.ornl@gmail.com>
Change-Id: I81426bd7a8a1a97ea5f198d99193649fbbf5389a
Reviewed-on: http://review.whamcloud.com/7070
Tested-by: Hudson
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-1538 tests: clean up runtests code style 39/5739/8
Andreas Dilger [Sat, 16 Mar 2013 04:44:11 +0000 (22:44 -0600)]
LU-1538 tests: clean up runtests code style

Use $(...) instead of `...` for sub-shells.
Improve some error messages and code style.
Use $DIR/$tdir for tests instead of $MOUNT/$$.

Test-Parameters: envdefinitions=SLOW=yes  testlist=runtests
Signed-off-by: Andreas Dilger <andreas.dilger@intel.com>
Change-Id: I4f3eb4a3e288c6fc45ae6ef1fb2008e8e6babe84
Reviewed-on: http://review.whamcloud.com/5739
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
Reviewed-by: Emoly Liu <emoly.liu@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-2158 lvfs: remove the fsfilt and lvfs layer 12/5512/15
James Simmons [Fri, 27 Sep 2013 14:24:42 +0000 (10:24 -0400)]
LU-2158 lvfs: remove the fsfilt and lvfs layer

The fsfilt/lvfs layer has been obsoleted by the osd
layer. This patch moves the remaining fsfilt code
related to ldiskfs to osd-ldiskfs where it belongs.
The remaining bits of the lvfs code are moved to the
ptlrpc layer where they are used. The last of the procfs
code is integrated into the obdclass layer as well.

Signed-off-by: James Simmons <uja.ornl@gmail.com>
Signed-off-by: Mikhail Pershin <mike.pershin@intel.com>
Change-Id: Ie72103639fbcbbcedd0f810ac68044b38a0f0b65
Reviewed-on: http://review.whamcloud.com/5512
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Jian Yu <jian.yu@intel.com>
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-1199 rpm: remove SLES10 support 90/5490/3
James Simmons [Wed, 18 Sep 2013 13:44:24 +0000 (09:44 -0400)]
LU-1199 rpm: remove SLES10 support

SLES10 is out of scope for Lustre 2.4 so we can drop
the spec file hacks.

Signed-off-by: Jeff Mahoney <jeffm@suse.com>
Signed-off-by: James Simmons <uja.ornl@gmail.com>
Change-Id: I94ea2164a4b339ab26825058678b54859d2617a8
Reviewed-on: http://review.whamcloud.com/5490
Tested-by: Hudson
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Minh Diep <minh.diep@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-2800 autoconf: remove LN_5ARGS_SYSCTL_PROC_HANDLER test 11/5411/9
James Simmons [Tue, 17 Sep 2013 17:48:24 +0000 (13:48 -0400)]
LU-2800 autoconf: remove LN_5ARGS_SYSCTL_PROC_HANDLER test

ctl_table.proc_handler has taken 5 arguments since v2.6.32.

We remove the test and dead code. Also eliminate the ll_* prefixed
compatibility shims and empty macros. I've left the LL_PROC_PROTO
macro because it definitely simplifies the appearance of the handler
definitions.

Signed-off-by: Jeff Mahoney <jeffm@suse.com>
Signed-off-by: James Simmons <uja.ornl@gmail.com>
Change-Id: I8c5fc509e1c8bc52705090f5eedf8fb10d549cf5
Reviewed-on: http://review.whamcloud.com/5411
Tested-by: Hudson
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Minh Diep <minh.diep@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-2800 autoconf: remove LIBCFS_FUNC_DUMP_TRACE test 03/5403/9
James Simmons [Wed, 11 Sep 2013 16:47:53 +0000 (12:47 -0400)]
LU-2800 autoconf: remove LIBCFS_FUNC_DUMP_TRACE test

dump_trace has been upstream since v2.6.23. The reliable argument was
added in v2.6.25. The other bits were added in v2.6.19.
We can remove the test and dead code.

Signed-off-by: Jeff Mahoney <jeffm@suse.com>
Signed-off-by: James Simmons <uja.ornl@gmail.com>
Change-Id: I3b998f62ce4908adf377acd395725493dd717c3a
Reviewed-on: http://review.whamcloud.com/5403
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
11 years agoLU-4072 tests: Decrease load on MDT for ZFS in sanity/24v 70/7870/2
Nathaniel Clark [Mon, 7 Oct 2013 15:50:04 +0000 (11:50 -0400)]
LU-4072 tests: Decrease load on MDT for ZFS in sanity/24v

Due to performance of ZFS, reduce the number of file creates until
LU-2887/LU-4072 are resolved.

Signed-off-by: Nathaniel Clark <nathaniel.l.clark@intel.com>
Change-Id: I15bc5b69d8ff114bf3cd7fa874c238ecbe29f6f9
Reviewed-on: http://review.whamcloud.com/7870
Tested-by: Hudson
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
Reviewed-by: Jian Yu <jian.yu@intel.com>
Reviewed-by: Alex Zhuravlev <alexey.zhuravlev@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-2887 tests: Reduce performance expectation for zfs 48/7848/2
Nathaniel Clark [Thu, 3 Oct 2013 21:20:04 +0000 (17:20 -0400)]
LU-2887 tests: Reduce performance expectation for zfs

Drop ZFS Performance expectation to below observed values.  This is a
temporary measure until ZFS performance issues are dealt with.

Signed-off-by: Nathaniel Clark <nathaniel.l.clark@intel.com>
Change-Id: I86c025e15ba67e86269344faa2ea56be0966a873
Reviewed-on: http://review.whamcloud.com/7848
Tested-by: Hudson
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Jian Yu <jian.yu@intel.com>
Reviewed-by: Niu Yawei <yawei.niu@intel.com>
Reviewed-by: Alex Zhuravlev <alexey.zhuravlev@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-3626 tests: More time to allow mds-ost sync being finished 76/7676/3
wang di [Mon, 16 Sep 2013 14:02:41 +0000 (07:02 -0700)]
LU-3626 tests: More time to allow mds-ost sync being finished

Because mds-ost unlink log synchronization is not enforced by any
timeout, allow more time for the mds-ost sync to finish in
replay-single test_20b. Also add more information to the test to
help us understand which MDT slows down the synchronization.

Signed-off-by: wang di <di.wang@intel.com>
Change-Id: I5f63540e602ad29cd221de6dac0c22546629a9cd
Reviewed-on: http://review.whamcloud.com/7676
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Reviewed-by: Jian Yu <jian.yu@intel.com>
11 years agoLU-3467 mdt: call MDT handlers via unified request handler 73/6973/19
Mikhail Pershin [Fri, 12 Jul 2013 04:53:07 +0000 (08:53 +0400)]
LU-3467 mdt: call MDT handlers via unified request handler

Move MDT to the unified request handler. All MDT-specific handlers
are switched to use tgt_session_info instead of mdt_thread_info.
To minimize the amount of changes, mdt_thread_info is initialized
just from tgt_session_info in most cases for now.

Signed-off-by: Mikhail Pershin <mike.pershin@intel.com>
Change-Id: I72e342cc900dc3dd34ca709542ad67de5015dd1f
Reviewed-on: http://review.whamcloud.com/6973
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Alex Zhuravlev <alexey.zhuravlev@intel.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
11 years agoLU-3161 lod: Added proper error handling instead of LASSERT 34/7234/9
Gaurav Mahajan [Fri, 26 Jul 2013 08:33:06 +0000 (14:03 +0530)]
LU-3161 lod: Added proper error handling instead of LASSERT

LASSERTs are replaced with proper error handling in
lod_initialize_objects.

Signed-off-by: Gaurav Mahajan <gmahajan@ddn.com>
Change-Id: Ia8fbbb4956765835bf46b46b3a7addc99cb191fa
Reviewed-on: http://review.whamcloud.com/7234
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Alex Zhuravlev <alexey.zhuravlev@intel.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
11 years agoLU-4023 build: wrong type used 56/7856/3
Dmitry Eremin [Fri, 4 Oct 2013 19:49:14 +0000 (23:49 +0400)]
LU-4023 build: wrong type used

Fix PPC build, because it needs to use LPX64 instead of %#llx.
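
As an illustration only (not the patch itself): a fixed `ll` length
modifier can mismatch a 64-bit type that some platforms define as
`unsigned long`, which is presumably why a width-matched macro such as
LPX64 is needed. The sketch below uses the standard C analogue PRIx64;
demo_format_hex() is a hypothetical helper, not a Lustre function.

```c
#include <inttypes.h>
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>

/*
 * Hypothetical helper: format a 64-bit value as "0x..." hex.
 * PRIx64 expands to the conversion specifier that matches uint64_t
 * on the build platform, so no cast or fixed "ll" modifier is needed.
 * Returns the snprintf() result (characters that would be written).
 */
static int demo_format_hex(char *buf, size_t len, uint64_t value)
{
	return snprintf(buf, len, "%#" PRIx64, value);
}
```

The format string is assembled at compile time from adjacent string
literals, so the same source builds cleanly whichever underlying type
the platform uses for uint64_t.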

Signed-off-by: Dmitry Eremin <dmitry.eremin@intel.com>
Change-Id: I0f4756a1e529b114d535604e122500230fb8f3ef
Reviewed-on: http://review.whamcloud.com/7856
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
11 years agoLU-4050 llite: NFS reexport issue 50/7850/2
Dmitry Eremin [Fri, 4 Oct 2013 09:10:07 +0000 (13:10 +0400)]
LU-4050 llite: NFS reexport issue

Suppress erroneous/confusing messages when NFS
is out of sync and requests old data.

Signed-off-by: Dmitry Eremin <dmitry.eremin@intel.com>
Change-Id: Idf382769fe3b5b502ca6f545fab6b72fcc62527c
Reviewed-on: http://review.whamcloud.com/7850
Tested-by: Hudson
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
11 years agoSplit 2.5 and 2.6 development branches apart 2.5.50 v2_5_50 v2_5_50_0
Oleg Drokin [Fri, 11 Oct 2013 22:32:01 +0000 (18:32 -0400)]
Split 2.5 and 2.6 development branches apart

New tag 2.5.50 for what will eventually become 2.6.0

Change-Id: I6ba45bb595f43fb0dfc842b1ff21c7dc73513e14

11 years agoLU-3240 llite: Do not send parent dir fid in getattr by fid 10/7910/3
Oleg Drokin [Thu, 10 Oct 2013 00:44:10 +0000 (20:44 -0400)]
LU-3240 llite: Do not send parent dir fid in getattr by fid

Sending getattr by fid in this case is pointless, as the parent
might have long changed and we have no control over it, but it's
irrelevant anyway, since we already have the child fid.

Signed-off-by: Oleg Drokin <oleg.drokin@intel.com>
Change-Id: I8ea05e931cd85c126274b024a440137c309e5eb7
Reviewed-on: http://review.whamcloud.com/7910
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Dmitry Eremin <dmitry.eremin@intel.com>
Reviewed-by: wangdi <di.wang@intel.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
11 years agoLU-3240 mdc: Check for all attributes validity in revalidate 60/6460/8
Alexander.Boyko [Mon, 27 May 2013 08:25:51 +0000 (12:25 +0400)]
LU-3240 mdc: Check for all attributes validity in revalidate

GETATTR needs to return attributes protected by different bits, so
we need to ensure that we have locks with all of those bits, not
just the UPDATE bit.

Xyratex-bug-id: MRP-1052
Signed-off-by: Alexey Lyashkov <alexey_lyashkov@xyratex.com>
Signed-off-by: Oleg Drokin <oleg.drokin@intel.com>
Change-Id: I0e27a33341c424a8666bdbf3cdb83b1b4fe0058f
Reviewed-on: http://review.whamcloud.com/6460
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Dmitry Eremin <dmitry.eremin@intel.com>
Reviewed-by: wangdi <di.wang@intel.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
11 years agoLU-3027 lov: to not modify lov lock when sublock is canceled 41/7841/3
Jinshan Xiong [Thu, 3 Oct 2013 01:14:16 +0000 (18:14 -0700)]
LU-3027 lov: to not modify lov lock when sublock is canceled

Otherwise it will cause wrong lock to be enqueued.

This patch introduces lls_ever_canceled to solve a potential
performance problem.

Signed-off-by: Jinshan Xiong <jinshan.xiong@intel.com>
Change-Id: I1ea62902c9d1a468f5952f30a8dc5dec22b8bb1c
Reviewed-on: http://review.whamcloud.com/7841
Reviewed-by: Bobi Jam <bobijam@gmail.com>
Reviewed-by: Lai Siyao <lai.siyao@intel.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-3999 hsm: Rename several HSM files in /proc for MDT. 40/7740/7
Aurelien Degremont [Tue, 24 Sep 2013 12:10:09 +0000 (14:10 +0200)]
LU-3999 hsm: Rename several HSM files in /proc for MDT.

Some files in /proc available for the MDT are misnamed, which could
be confusing or troublesome for sysadmins. This patch renames them.

`actions' is used when referring to HSM requests stored in MDT llog.
`active_requests' is used when referring to HSM requests currently
being handled by a copytool, on an agent node.

Also rename some HSM variable names and functions so that they
match the proc file names more consistently.

sanity-hsm tests have been updated accordingly.

Signed-off-by: Aurelien Degremont <aurelien.degremont@cea.fr>
Change-Id: If2b5c24e06d480554040d8f69b09236e0ddd98e9
Reviewed-on: http://review.whamcloud.com/7740
Tested-by: Hudson
Reviewed-by: John L. Hammond <john.hammond@intel.com>
Reviewed-by: Faccini Bruno <bruno.faccini@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-4059 tests: Archive access must be done from agent 51/7851/2
James Nunez [Fri, 4 Oct 2013 14:01:26 +0000 (08:01 -0600)]
LU-4059 tests: Archive access must be done from agent

In sanity-hsm test_11, the size of an archived file is
requested from a client and not necessarily from an agent.
So, this patch now makes the call to stat from an agent.

Signed-off-by: James Nunez <james.a.nunez@intel.com>
Change-Id: I2f7ef294a0e9e28301312ad8529c6178279f5c19
Reviewed-on: http://review.whamcloud.com/7851
Tested-by: Hudson
Reviewed-by: Jinshan Xiong <jinshan.xiong@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: jacques-Charles Lafoucriere <jacques-charles.lafoucriere@cea.fr>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-3587 tests: add sanity test for crash in changelog 51/7751/8
Bob Glossman [Tue, 24 Sep 2013 22:31:54 +0000 (15:31 -0700)]
LU-3587 tests: add sanity test for crash in changelog

This patch adds a sanity test to verify that the previous fixes committed
for this bug prevent a crash during a changelog fetch after renaming
one very long filename to another.

Test-Parameters: testlist=sanity envdefinitions=ONLY=160b

Signed-off-by: Bob Glossman <bob.glossman@intel.com>
Change-Id: I559bbd1ba9210f21ac58246366c61ee33f34ae1a
Reviewed-on: http://review.whamcloud.com/7751
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Reviewed-by: John L. Hammond <john.hammond@intel.com>
Reviewed-by: Jian Yu <jian.yu@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-3866 hsm: permission checks on HSM operations 65/7565/3
John L. Hammond [Thu, 5 Sep 2013 17:44:27 +0000 (12:44 -0500)]
LU-3866 hsm: permission checks on HSM operations

In the LL_IOC_HSM_CT_START case of ll_dir_ioctl() require
CAP_SYS_ADMIN, since the local handler for this ioctl may modify the
global KUC table.

In the MDC HSM handlers that do not pack a real suppgid, use -1 rather
than 0 for the suppgid in mdt_body.

In mdt_hsm_release() and the MDT HSM RPC handlers require a
read-write client mount for all operations except restore, get HSM
state, and get HSM actions.  Require CAP_SYS_ADMIN for
MDS_HSM_PROGRESS, MDS_HSM_CT_REGISTER, and
MDS_HSM_CT_UNREGISTER. Require CAP_SYS_ADMIN in mdt_hsm_state_set()
for setting flags not in HSM_USER_MASK.

Add per-coordinator bit masks (cdt_{user,group,other}_request_mask)
indexed by the HSMA constants to govern permissions on the various
requests types. By default each mask is set to allow restore only.
Add files /proc/fs/lustre/mdt/*/hsm/{user,group,other}_request_mask to
get and set these masks.
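
A minimal sketch of how such per-class request masks can be checked,
for illustration only. All names below (demo_action, demo_class,
demo_coordinator, and the helper functions) are hypothetical stand-ins
and are not the identifiers or constant values used by the patch.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-ins for the HSMA action constants. */
enum demo_action {
	DEMO_ARCHIVE = 0,
	DEMO_RESTORE = 1,
	DEMO_REMOVE  = 2,
	DEMO_CANCEL  = 3,
};

/* Hypothetical requester classes, mirroring user/group/other. */
enum demo_class {
	DEMO_USER,
	DEMO_GROUP,
	DEMO_OTHER,
	DEMO_CLASS_COUNT,
};

/* One bit mask per requester class; bit N permits action N. */
struct demo_coordinator {
	uint64_t request_mask[DEMO_CLASS_COUNT];
};

static inline uint64_t demo_action_bit(enum demo_action action)
{
	return UINT64_C(1) << action;
}

/* Default policy from the description above: allow restore only. */
static void demo_coordinator_init(struct demo_coordinator *cdt)
{
	int i;

	for (i = 0; i < DEMO_CLASS_COUNT; i++)
		cdt->request_mask[i] = demo_action_bit(DEMO_RESTORE);
}

/* A request is allowed when its action bit is set in the class mask. */
static bool demo_request_allowed(const struct demo_coordinator *cdt,
				 enum demo_class cls,
				 enum demo_action action)
{
	return (cdt->request_mask[cls] & demo_action_bit(action)) != 0;
}
```

Keeping one mask per class means an administrator can widen
permissions for one class (for example, allowing archive for group
members) without affecting the others.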

Signed-off-by: John L. Hammond <john.hammond@intel.com>
Change-Id: Ifcb3c0950ebb11187cce62f15abbe8746f1ff7c2
Reviewed-on: http://review.whamcloud.com/7565
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Aurelien Degremont <aurelien.degremont@cea.fr>
Reviewed-by: Faccini Bruno <bruno.faccini@intel.com>
Reviewed-by: Jinshan Xiong <jinshan.xiong@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-4024 tests: cleanup file system before DNE tests. 01/7801/3
wang di [Thu, 17 Jul 2014 10:50:03 +0000 (03:50 -0700)]
LU-4024 tests: cleanup file system before DNE tests.

Cleanup filesystem before DNE tests in conf-sanity.

Test-Parameters: mdtcount=2 mdscount=2
Signed-off-by: wang di <di.wang@intel.com>
Change-Id: I02c025f1ba1af3983f70694ccf99bfb82645cf14
Reviewed-on: http://review.whamcloud.com/7801
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Jinshan Xiong <jinshan.xiong@intel.com>
11 years agoLU-4056 hsm: reorder MDT proc cleanup 46/7846/3
John L. Hammond [Thu, 3 Oct 2013 19:35:00 +0000 (14:35 -0500)]
LU-4056 hsm: reorder MDT proc cleanup

In mdt_procfs_init() remove the call to ptlrpc_lprocfs_register_obd().
The single file that this provides (../mdt/*/stats) is backed by the
MDT OBD's obd_svc_stats which are not tallied. In mdt_procfs_fini()
remove the corresponding call to ptlrpc_lprocfs_unregister_obd() and
move the call to hsm_cdt_procfs_fini() to before
lprocfs_obd_cleanup(). This prevents a use after free of the entries
in the MDT's hsm subdirectory.

Signed-off-by: John L. Hammond <john.hammond@intel.com>
Change-Id: Iad6e7738047377660f0077c9368f79d96f903d02
Reviewed-on: http://review.whamcloud.com/7846
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Faccini Bruno <bruno.faccini@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-4054 llog: don't leak llog handle in llog_cat_process_cb() 47/7847/2
John L. Hammond [Thu, 3 Oct 2013 21:05:44 +0000 (16:05 -0500)]
LU-4054 llog: don't leak llog handle in llog_cat_process_cb()

An early return from llog_cat_process_cb() was leaking the llog
handle. Fix this by not doing that.
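
The general pattern can be sketched as follows. This is an
illustration under assumed names, not the actual llog code:
demo_handle, demo_open(), demo_close() and demo_process_cb() are
hypothetical, and demo_open_count exists only to make a leak visible.

```c
#include <stddef.h>

/* Hypothetical handle type; demo_open_count tracks outstanding opens. */
struct demo_handle {
	int id;
};

static int demo_open_count;
static struct demo_handle demo_slot;

static struct demo_handle *demo_open(int id)
{
	demo_slot.id = id;
	demo_open_count++;
	return &demo_slot;
}

static void demo_close(struct demo_handle *h)
{
	(void)h;
	demo_open_count--;
}

/*
 * Hypothetical callback: every path after demo_open() leaves through
 * the "out" label, so the handle is always released. A bare
 * "return 0" in the skip branch would leave demo_open_count at 1.
 */
static int demo_process_cb(int id, int skip)
{
	struct demo_handle *h = demo_open(id);
	int rc = 0;

	if (skip)
		goto out;

	rc = h->id;	/* placeholder for the real processing */
out:
	demo_close(h);
	return rc;
}
```

Routing the early exit through a single cleanup label keeps the
acquire and release paired regardless of which branch is taken.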

Signed-off-by: John L. Hammond <john.hammond@intel.com>
Change-Id: I01e8b5f389a097f17bb1370aaec7382bc9d840c9
Reviewed-on: http://review.whamcloud.com/7847
Tested-by: Hudson
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Reviewed-by: jacques-Charles Lafoucriere <jacques-charles.lafoucriere@cea.fr>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Mike Pershin <mike.pershin@intel.com>
11 years agoLU-4023 build: wrong type used 99/7799/5
Dmitry Eremin [Sun, 29 Sep 2013 09:58:42 +0000 (13:58 +0400)]
LU-4023 build: wrong type used

Fixed implicit conversion from 'unsigned long long' to 'int'.

Signed-off-by: Dmitry Eremin <dmitry.eremin@intel.com>
Change-Id: I331a8ebe1fb9ef53ebd4fc92603b4100f006ee2b
Reviewed-on: http://review.whamcloud.com/7799
Tested-by: Hudson
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-4002 hsm: avoid layout lock on unlink and rename onto 92/7792/3
John L. Hammond [Sat, 28 Sep 2013 00:04:37 +0000 (17:04 -0700)]
LU-4002 hsm: avoid layout lock on unlink and rename onto

In mdt_reint_unlink() and mdt_reint_rename() do not take a FULL inode
lock on the file to be unlinked or the rename target. This avoids a
potential deadlock when unlinking a file under HSM restore.

Signed-off-by: John L. Hammond <johnlockwoodhammond@gmail.com>
Change-Id: Id6b11b9f5c1433221670d154d65d9cec30e260af
Reviewed-on: http://review.whamcloud.com/7792
Reviewed-by: Jinshan Xiong <jinshan.xiong@intel.com>
Reviewed-by: jacques-Charles Lafoucriere <jacques-charles.lafoucriere@cea.fr>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-4010 lov: Don't wait for active target with OBD_STATFS_NODELAY 62/7762/4
Andriy Skulysh [Wed, 25 Sep 2013 19:18:47 +0000 (22:18 +0300)]
LU-4010 lov: Don't wait for active target with OBD_STATFS_NODELAY

Commit 6c3c5c7b breaks OBD_STATFS_NODELAY behaviour.
It adds an unnecessary delay when running the df command with
inactive OSTs. We shouldn't try to recover the connection to the OST
in this case.

Change-Id: Ia04cfb76b393df4f0c4c8d841b48d20306025c92
Signed-off-by: Andriy Skulysh <Andriy_Skulysh@xyratex.com>
Reviewed-on: http://review.whamcloud.com/7762
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-3864 lov: return minimal FIEMAP for released files 84/7584/12
Bruno Faccini [Sun, 29 Sep 2013 07:27:32 +0000 (09:27 +0200)]
LU-3864 lov: return minimal FIEMAP for released files

Since st_blocks = 0 is returned for released files, FIEMAP
should at least return a minimal mapping to make users aware
that the file contains data but that it is not immediately available.
This will make coreutils and tools such as tar happy and have
them presume the file is sparse.

Also, add a new test_228 in sanity-hsm to verify it works
for "[cp,tar] --sparse" commands.

Also fix an LBUG ("lov_fiemap()) ASSERTION( fm_local ) failed")
likely to occur under no-object/ENOMEM conditions and now also
for released files.

Signed-off-by: Bruno Faccini <bruno.faccini@intel.com>
Change-Id: Idc59f755d407fd34e7095898c68b4aceb5bf9c28
Reviewed-on: http://review.whamcloud.com/7584
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Reviewed-by: Aurelien Degremont <aurelien.degremont@cea.fr>
11 years agoLU-3882 hsm: Prevent duplicate CT registrations 12/7612/5
Henri Doreau [Wed, 11 Sep 2013 12:16:46 +0000 (14:16 +0200)]
LU-3882 hsm: Prevent duplicate CT registrations

Associate copytool registration to a given MDC import so that
multiple mounts of the same filesystem do not lead to having the
copytool registered multiple times.

Signed-off-by: Henri Doreau <henri.doreau@cea.fr>
Change-Id: I5767df50331675a1650dda1ab5fc8440aad2a52e
Reviewed-on: http://review.whamcloud.com/7612
Reviewed-by: John L. Hammond <john.hammond@intel.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Jinshan Xiong <jinshan.xiong@intel.com>
Reviewed-by: Oleg Drokin <oleg.drokin@intel.com>
11 years agoLU-3750 mdt: fix typo in mdt_txn_stop_cb() 87/7787/4
Mikhail Pershin [Fri, 27 Sep 2013 15:13:30 +0000 (19:13 +0400)]
LU-3750 mdt: fix typo in mdt_txn_stop_cb()

A wrong check was added to mdt_txn_stop_cb() to identify the object
for version update. Due to that typo, the version is not updated for
all local objects, which leads to replay-vbr.sh failures.

Test-Parameters: testgroup=review,failover
Signed-off-by: Mikhail Pershin <mike.pershin@intel.com>
Change-Id: I9c41f7208d7aa8cf307858b9c6f51ae73c1081e4
Reviewed-on: http://review.whamcloud.com/7787
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: wangdi <di.wang@intel.com>