Whamcloud - gitweb
fs/lustre-release.git
11 years agoLU-797 tests: fix ost-pools test timeout issues on b1_8
Liu Ying [Mon, 14 Jan 2013 15:10:45 +0000 (23:10 +0800)]
LU-797 tests: fix ost-pools test timeout issues on b1_8

The test time of the ost-pools subtests is unreasonably long.

test_14 fills an OST to 90% full, regardless of the OST size.
Skip the test if the amount of data to be written is too large
to run in a practical time.

test_18 creates 3x3x30000 files to compare performance with/without
pools enabled.  Instead of creating a fixed number of files, use
createmany to run for a specific (short) time to measure
performance.

test_23 tried to fill all OSTs 100% full.  Split this test into two:
- test_23a to test quota with a file in a pool
- test_23b to test OOS with a file striped over pool

The following patches are merged into this one:
- LU-797 tests: speed up ost-pools tests
(master patch eea698c944283b755882d8f504d2fcc8ea371bd8)
- LU-797 tests: skip ost-pools.sh 23b when SLOW=no
(master patch f7b4054cfc1d30fbbfd56acfe4b5a7a334de8212)
- LU-797 tests: process lfs df output properly
(master patch b1a1ec6300a5ec3925b725d5d2b783314dff3f8)
- LU-797 tests: improve test_23b of ost-pools.sh
(master patch 6dd41a43e3cdff1b2e0713cfc163734889d8650a)

Test-Parameters: envdefinitions=SLOW=yes testlist=ost-pools

Signed-off-by: Andreas Dilger <andreas.dilger@intel.com>
Signed-off-by: Niu Yawei <niu@whamcloud.com>
Signed-off-by: Liu Ying <emoly.liu@intel.com>
Change-Id: I391e641664890e7172d2ed1da815894e656826ce
Reviewed-on: http://review.whamcloud.com/4898
Tested-by: Hudson
Reviewed-by: Niu Yawei <yawei.niu@intel.com>
Reviewed-by: Jian Yu <jian.yu@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
11 years agoLU-1779 tests: fix run_one_logged() to log SKIP status
Yu Jian [Sat, 5 Jan 2013 07:28:02 +0000 (15:28 +0800)]
LU-1779 tests: fix run_one_logged() to log SKIP status

In the current test framework, only those tests which are in the
$ALWAYS_EXCEPT list are logged with SKIP status, other skipped
tests are all logged with PASS status.

This patch fixes the above issue by setting the SKIP status in
pass() and logging the status in run_one_logged().

Test-Parameters: clientarch=x86_64 serverarch=x86_64 testlist=mmp
Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: I766cf6d2bb984b6097b20d2c089925890b86f9b1
Reviewed-on: http://review.whamcloud.com/4955
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Minh Diep <minh.diep@intel.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
11 years agoLU-958 tests: debug_mb set incorrectly for smp or vm
Emoly Liu [Tue, 8 Jan 2013 08:29:15 +0000 (16:29 +0800)]
LU-958 tests: debug_mb set incorrectly for smp or vm

For cpus with number of cores or for some VMs, number of possible CPUs
in the system could be greater than number of cpu reported by getconf.
Added check for maximum debug buffer size.
Added check if that "possible" exists, if not - use old method.

The patch of LU-1249 is also invloved to auto correct improper debug
buffer size setting.

port of patch 2ccb34d882b01305794e7780b6dd691179ddae7e
port of patch 28817cbd133c626042f9b142600c03187ba4a7ce
Xyratex-bug-id: MRP-219 incorrect settings for debug_mb

Test-Parameters: clientarch=x86_64 serverarch=x86_64 testlist=mmp

Signed-off-by: Denis Kondratenko <Denis_Kondratenko@xyratex.com>
Signed-off-by: Bobi Jam <bobijam@gmail.com>
Signed-off-by: Liu Ying <emoly.liu@intel.com>
Change-Id: Ib1e39a26e4d4af8e599d6e5fcdb1fecff8a6f4fa
Reviewed-on: http://review.whamcloud.com/4962
Reviewed-by: Jian Yu <jian.yu@intel.com>
Tested-by: Hudson
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
11 years agoLU-1526 tests: new sanity-quota tests
Yu Jian [Tue, 8 Jan 2013 07:52:55 +0000 (15:52 +0800)]
LU-1526 tests: new sanity-quota tests

This patch is ported from commit d6f2a9f to add new sanity-quota.sh
for the interoprability testing with new quota architecture,
meanwhile, the old sanity-quota.sh is reserved for the interoprability
with old server.

The patch also contains the fixups for LU-2174, LU-2283, LU-2284,
LU-2329 and LU-2526.

Test-Parameters: envdefinitions=SLOW=yes,ENABLE_QUOTA=yes clientarch=x86_64 serverarch=x86_64 testlist=conf-sanity,sanity-quota
Signed-off-by: Niu Yawei <niu@whamcloud.com>
Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: I5d6834e5b7c257f0a44f45710674b2a236039bf1
Reviewed-on: http://review.whamcloud.com/4915
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
Tested-by: Hudson
Reviewed-by: Niu Yawei <yawei.niu@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
11 years agoLU-1625 test: reduce test duration for nfs mode
Emoly Liu [Tue, 8 Jan 2013 03:16:27 +0000 (11:16 +0800)]
LU-1625 test: reduce test duration for nfs mode

There isn't much value to run long duration in nfs mode.
Cut down IOR test as well.
Based on original work by Minh Diep.

port of patch f518a40d96d3431f3d68de9eac99ea33498894c7
port of patch e69d9852bc095695ceecb219b84bd8a48d5aa10c

Test-Parameters: envdefinitions=SLOW=yes,ENABLE_QUOTA=yes \
testlist=parallel-scale-nfsv3,parallel-scale-nfsv4

Signed-off-by: Keith Mannthey <keith@whamcloud.com>
Change-Id: I308d2dcedcb86bbc86b3d2b875e91ceeb2b96f6e
Signed-off-by: Liu Ying <emoly.liu@intel.com>
Reviewed-on: http://review.whamcloud.com/4949
Reviewed-by: Jian Yu <jian.yu@intel.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Li Wei <wei.g.li@intel.com>
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
11 years agoLU-1484 lprocfs: refine LC_PROCFS_USERS check
Bobi Jam [Wed, 9 Jan 2013 01:16:08 +0000 (09:16 +0800)]
LU-1484 lprocfs: refine LC_PROCFS_USERS check

In some RHEL patched 2.6.18 kernels, pde_users member is added in
another struct proc_dir_entry_aux instead of in struct proc_dir_entry
in later kernel version of 2.6.23.

Signed-off-by: Bobi Jam <bobijam.xu@intel.com>
Change-Id: Icee65893b2fbf4d0c3b3e957cb038be99aaf6eb8
Reviewed-on: http://review.whamcloud.com/4976
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Niu Yawei <yawei.niu@intel.com>
Reviewed-by: Lai Siyao <lai.siyao@intel.com>
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
11 years agoLU-2550 osc: set resend count properly
Niu Yawei [Mon, 7 Jan 2013 09:12:33 +0000 (04:12 -0500)]
LU-2550 osc: set resend count properly

The resend count of new io request should be set properly
in osc_brw_redo_request().

Signed-off-by: Niu Yawei <yawei.niu@intel.com>
Change-Id: I31285df00513ad02befd84d9d37cfcbb48055bb0
Reviewed-on: http://review.whamcloud.com/4964
Reviewed-by: Bobi Jam <bobijam@gmail.com>
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
11 years agoLU-1526 tests: Handle OFD procfs changes
Yu Jian [Tue, 8 Jan 2013 08:09:02 +0000 (16:09 +0800)]
LU-1526 tests: Handle OFD procfs changes

In order to interop with 2.4 server, the following procfs entry
changes need to be handled:

- obdfilter.*.mntdev -> osd-*.*.mntdev
- obdfilter.*.<cache_related> -> osd-*.*.<cache_related>

Test-Parameters: envdefinitions=SLOW=yes,ENABLE_QUOTA=yes clientarch=x86_64 serverarch=x86_64 testlist=lfsck,sanity
Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: I74bfaa1e6d68203951de31676f23fbd8250ec652
Reviewed-on: http://review.whamcloud.com/4958
Reviewed-by: Li Wei <wei.g.li@intel.com>
Tested-by: Hudson
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
11 years agoLU-2420 tests: Have POSIX testing on b1_8
Emoly Liu [Thu, 27 Dec 2012 06:46:39 +0000 (14:46 +0800)]
LU-2420 tests: Have POSIX testing on b1_8

Have POSIX testing automated on b1_8 as it is on master.
LU-2274 port is included to change the baseline filesystem to ext3
for POSIX testing on SLES distro.
Also, this patch adds add_group() and add_user() functions into the
test-framework.sh. They are used by setup_posix_users() in posix.sh.

port of b2_1 patch f55a2af51af0bbb1d97e6987a45ca501adbc4ab6
port of b2_1 patch f9531ab0803cbafcb68003f3470307e4b826129f

Test-Parameters: envdefinitions=SLOW=yes testlist=posix

Signed-off-by: Liu Ying <emoly.liu@intel.com>
Change-Id: I8e3e8d5e87b13f4fd2d0b972d2161e9f1afbc4e9
Reviewed-on: http://review.whamcloud.com/4894
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Jian Yu <jian.yu@intel.com>
Reviewed-by: Minh Diep <minh.diep@intel.com>
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
11 years agoLU-1842 quota: t-f changes for new quota
Niu Yawei [Fri, 28 Dec 2012 07:19:57 +0000 (15:19 +0800)]
LU-1842 quota: t-f changes for new quota

Add new quota functions in the t-f according to new quota
architecutre, at the same time, old functions are kept for
the interoprability with old server.

The patch also adds version_code() and lustre_version_code()
functions into the test framework.

One minor defect fixed:
- in facet_up(), add '-x' option to grep for exact matching,
otherwise this function will not work in single node test;

Test-Parameters: envdefinitions=ENABLE_QUOTA=yes testlist=sanity-quota

Signed-off-by: Niu Yawei <niu@whamcloud.com>
Change-Id: I7d96b7ea7cd14331aee7cfcca711a4e876025e2f
Signed-off-by: Yu Jian <yujian@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/4897
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Niu Yawei <yawei.niu@intel.com>
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
11 years agoLU-1762 tests: get correct MMP update and check intervals
Yu Jian [Fri, 4 Jan 2013 15:40:43 +0000 (23:40 +0800)]
LU-1762 tests: get correct MMP update and check intervals

This patch fixes the get_mmp_update_interval() and
get_mmp_check_interval() in mmp.sh to get the correct
MMP update and check intervals from both the old and
new outputs of debugfs.

The patch also improves test_8() to increase the running
time of e2fsck to allow mount operation to be started
before e2fsck operation stops.

Test-Parameters: testlist=mmp
Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: If614659e3b7fe45c4b406d5541b1c2944b3c37ce
Reviewed-on: http://review.whamcloud.com/4953
Tested-by: Hudson
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Minh Diep <minh.diep@intel.com>
11 years agoLU-1526 tests: add --index support to the test framework
Yu Jian [Thu, 27 Dec 2012 04:57:25 +0000 (12:57 +0800)]
LU-1526 tests: add --index support to the test framework

This patch improves mkfs_opts() in test-framework.sh to
add --index option for MDT and OST targets automatically.

Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: Ic00bf4c498b336e283ae86e4543f43709eda01aa
Reviewed-on: http://review.whamcloud.com/4893
Tested-by: Hudson
Reviewed-by: Li Wei <wei.g.li@intel.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
11 years agoLU-1526 tests: Support for MDS-initiated OST_DESTROYs
Yu Jian [Sat, 5 Jan 2013 10:10:11 +0000 (18:10 +0800)]
LU-1526 tests: Support for MDS-initiated OST_DESTROYs

This patch is backported from commit af5f388 of LU-1303 to
support interoperating with 2.4 server.

The patch makes sure the tests work with MDSs that destroy OST
objects asynchronously on behalf of clients.

Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: I8d8cb9e3699b6e7f63af106a5f45363f61f3ce7c
Reviewed-on: http://review.whamcloud.com/4959
Tested-by: Hudson
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Reviewed-by: Li Wei <wei.g.li@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
11 years agoLU-1075 tests: auster detect directory as script name
Emoly Liu [Fri, 28 Dec 2012 05:54:00 +0000 (13:54 +0800)]
LU-1075 tests: auster detect directory as script name

Auster should use -f or detect the existing of script name
rather use -e which will treat directorysame as script.
Also, the patch of LU-412 is included to fix test script
lookup in auster.

port of master patch 0a79b541ad736bb296ea051e58b667c6195731a1
port of master patch 39b98cb351866da5648ea1a2216c108f8791226f

Signed-off-by: Minh Diep <mdiep@whamcloud.com>
Signed-off-by: Li Wei <liwei@whamcloud.com>
Signed-off-by: Liu Ying <emoly.liu@intel.com>
Change-Id: Id7d526b73051124b8b76deca974345573c6faf2b
Reviewed-on: http://review.whamcloud.com/4895
Tested-by: Hudson
Reviewed-by: Jian Yu <jian.yu@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Li Wei <wei.g.li@intel.com>
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
11 years agoLU-1520 ldlm: Revert "improve ldlm_pools_shrink algorithm"
Johann Lombardi [Tue, 11 Dec 2012 15:47:59 +0000 (10:47 -0500)]
LU-1520 ldlm: Revert "improve ldlm_pools_shrink algorithm"

This reverts commit c861cc7e0b6f7e82fd55b9658dd29578f97b5607
The patch should land on master first.

Change-Id: I9b3739defed6bf315646f8a107d3218414a14d25
Reviewed-on: http://review.whamcloud.com/4799
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
Tested-by: Johann Lombardi <johann.lombardi@intel.com>
11 years agoLU-676 tests: machinefile option for mpirun via a variable
Jay J. Lan [Tue, 18 Oct 2011 19:04:07 +0000 (12:04 -0700)]
LU-676 tests: machinefile option for mpirun via a variable

Not all MPI implementations pass the host file to mpirun via the same
option. Common options are -machinefile and -hostfile.

This problem can be resolved by using a variable MACHINEFILE_OPTION
instead. A default value is assigned if the variable not defined.

Signed-off-by: Jay J Lan <jay.j.lan@nasa.gov>
Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Change-Id: I3362a6e62a27318cff733aea2f99b1356b3ff02e
Reviewed-on: http://review.whamcloud.com/1540
Tested-by: Hudson
Reviewed-by: Jian Yu <jian.yu@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
11 years agoLU-1322 llite: revalidate dentry if f_version is 0
Peng Tao [Mon, 25 Jun 2012 13:31:18 +0000 (21:31 +0800)]
LU-1322 llite: revalidate dentry if f_version is 0

If a file is lseek()ed before i_version changes, it is possible
for application to call into ll_readdir and have f_pos pointing
to some garbage data and cause kernel hang.

Signed-off-by: Peng Tao <tao.peng@emc.com>
Change-Id: I49ab94ad5c63c3029d5ad96e27e38e124a135ed8
Reviewed-on: http://review.whamcloud.com/3181
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Lai Siyao <laisiyao@whamcloud.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
11 years agoLU-1520 ldlm: improve ldlm_pools_shrink algorithm
Hongchao Zhang [Tue, 4 Sep 2012 15:27:09 +0000 (23:27 +0800)]
LU-1520 ldlm: improve ldlm_pools_shrink algorithm

1, shrink namespaces by batches of 64 namespaces, the batch is
   implemented as list
2, limit number of simultaneously shrinking threads to 32 threads
3, have ldlm_pools_recalc to operate with namespaces similar to
   ldlm_pools_shrink
4, use glboal counters of unused locks on client and granted
   locks on servers to avoid iterating over namespaces

Change-Id: I5fb3f56748ae10961c50b4b06c300c9c7f5fca87
Signed-off-by: Vladimir Saveliev <valdimir.saveliev@oracle.com>
Signed-off-by: Hongchao Zhang <hongchao.zhang@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/3270
Tested-by: Hudson
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
11 years agoLU-1901 ldiskfs: compile error for sles11 when JBD2_DEBUG is on
Vladimir Saveliev [Sun, 9 Sep 2012 08:57:03 +0000 (12:57 +0400)]
LU-1901 ldiskfs: compile error for sles11 when JBD2_DEBUG is on

The only change is the below hunk for ext4/inode.c:ext4_forget():
        jbd_debug(4, "forgetting bh %p: is_metadata = %d, mode %o, "
-                 "data mode %xn",
+                 "data mode %Lxn",
                  bh, is_metadata, inode->i_mode,
                  test_opt(inode->i_sb, DATA_FLAGS));

It is needed because for sles11 s_mount_opt of struct ext4_sb_info
is changed to unsigned long long.

Signed-off-by: Vladimir Saveliev <vladimir.saveliev@oracle.com>
Change-Id: I59646b821b83eed08a67124a9f52ab8dcb9b46ae
Reviewed-on: http://review.whamcloud.com/3943
Tested-by: Hudson
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
11 years agoLU-2371 ptlrpc: add support for -EINPROGRESS
Niu Yawei [Fri, 13 Jan 2012 08:33:22 +0000 (00:33 -0800)]
LU-2371 ptlrpc: add support for -EINPROGRESS

Backport patches from LU-904, LU-1329 and LU-1788 to introduce
support for -EINPROGRESS in lustre 1.8. This is needed for
quota interoperability with 2.4 servers.

Signed-off-by: Johann Lombardi <johann@whamcloud.com>
Change-Id: I9136112ca82dbf6caba41c2d41643ec646372852
Signed-off-by: Niu Yawei <niu@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/4655
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
11 years agoLU-1064 mds: fix mds_lookup lma removal error path
Andreas Dilger [Sat, 1 Dec 2012 07:15:34 +0000 (00:15 -0700)]
LU-1064 mds: fix mds_lookup lma removal error path

In commit 1fd243c89e3b221d40ce74b8ef47f1bca760c8f9 if an error is hit
removing the "lma" xattr from an updated 2.x inode, then the open
transaction handle would never be committed, and the MDS would hang.

This is unlikely to be a problem, as the only errors fsfilt_set_md()
will hit that are not programming bugs are due to IO errors from the
underlying disk (which is an even bigger problem).

Make sure that the transaction is committed, even after an error.

Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: If7d19a2337a12efafacd20d5c4e5c00e85300c1e
Reviewed-on: http://review.whamcloud.com/4729
Tested-by: Hudson
Reviewed-by: Iurii Golovach <Iurii_Golovach@xyratex.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
11 years agoLU-1064 mds: journal locking issue was fixed
Iurii Golovach [Fri, 16 Mar 2012 03:06:10 +0000 (20:06 -0700)]
LU-1064 mds: journal locking issue was fixed

During the downgrade procedure locking during
lma data removal is observed. The patch fix
this issue by moving the lma remove under the
mutex.

Reviewed-by: Vitaly Fertman <vfertman@xyratex.com>
Reviewed-by: Andrew Perepechko <aperepechko@xyratex.com>
Xyratex-bug-id: MRP-251

Signed-off-by: Iurii Golovach <Iurii_Golovach@xyratex.com>
Change-Id: I01e7bda03e3b0dfae92b7e03672c56b23a73989d
Reviewed-on: http://review.whamcloud.com/2077
Tested-by: Hudson
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Keith Mannthey <kemannthey@gmail.com>
11 years agoLU-1517 ptlrpc: throw net error to ptlrpc for bulk
Alexander.Boyko [Mon, 17 Sep 2012 13:48:59 +0000 (17:48 +0400)]
LU-1517 ptlrpc: throw net error to ptlrpc for bulk

Start reconnect and resend if network error occures
for the bulk transfer.

Signed-off-by: Alexander Boyko <alexander_boyko@xyratex.com>
Change-Id: I0cf2ee1230a039336f081fbb520c1ce768882088
Xyratex-bug-id: MRP-523
Reviewed-on: http://review.whamcloud.com/3102
Tested-by: Hudson
Reviewed-by: Liang Zhen <liang@whamcloud.com>
Reviewed-by: Keith Mannthey <keith.mannthey@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Cory Spitz <spitzcor@cray.com>
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
11 years agoLU-588 ldiskfs: Don't release super block buffer_head too early
Jeremy Filizetti [Mon, 30 Jul 2012 16:10:45 +0000 (12:10 -0400)]
LU-588 ldiskfs: Don't release super block buffer_head too early

If the super block buffer_head is released prior to MMP stopping
kmmpd can get a zeroed buffer_head and exit. The following code
causes kmmpd to exit when the s_feature_incompat is zero:

if (!(le32_to_cpu(es->s_feature_incompat) &
LDISKFS_FEATURE_INCOMPAT_MMP)) {
ldiskfs_warning(sb, "kmmpd being stopped since MMP feature"
                             " has been disabled.");
LDISKFS_SB(sb)->s_mmp_tsk = NULL;
goto failed;
}

A deadlock can occur with the kthread_stop_lock mutex because
ldiskfs_put_super calls kthread_stop on an already stopped thread
(kmmpd) so it waits for completion of the thread stopping before
releasing the kthread_stop_lock.

This is the result of a race with the kmmpd thread setting s_mmp_tsk
to NULL and ldiskfs_put_super in another thread to checks s_mmp_tsk
for NULL prior to calling kthread_stop.

Signed-off-by: Jeremy Filizetti <jeremy.filizetti@gmail.com>
Change-Id: Ia15d8ff829705a5d51dea4f86e40ba7c5745a9c5
Reviewed-on: http://review.whamcloud.com/3172
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Yu Jian <yujian@whamcloud.com>
11 years agoLU-1770 ptlrpc: introducing OBD_CONNECT_FLOCK_OWNER flag
Iurii.Golovach [Tue, 16 Oct 2012 13:39:07 +0000 (16:39 +0300)]
LU-1770 ptlrpc: introducing OBD_CONNECT_FLOCK_OWNER flag

After applying flock policy fix into the 1.8 users met with an issue
when 1.8 clients with a fixed flock policy recognized incorrectly by
2.x servers.
This flag is intended to present 1.8 clients with fixed flock policy
to let 2.x servers make flock policy recognition correctly.
Patches with functionality changes were attached on review at LU-1575

Xyratex-bug-id: MRP-489

Reviewed-by: Alexey Lyashkov <alexey_lyashkov@xyratex.com>
Reviewed-by: Andriy Skulysh <andriy_skulysh@xyratex.com>
Signed-off-by: Iurii Golovach <iurii_golovach@xyratex.com>
Change-Id: I0b203a7e181310c2888ae5bbe8c90ca0a5bbe549
Reviewed-on: http://review.whamcloud.com/3723
Reviewed-by: Cory Spitz <spitzcor@cray.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
11 years agoLU-630 lnet: only router checks peer health
Lai Siyao [Mon, 5 Dec 2011 07:28:39 +0000 (15:28 +0800)]
LU-630 lnet: only router checks peer health

The peer health code is designed for router, so a ~rtr node always
assumes peers to be alive.

Signed-off-by: Lai Siyao <laisiyao@whamcloud.com>
Change-Id: Iacdc7359c69e0f172de0914048b35bd6fe06133e
Reviewed-on: http://review.whamcloud.com/4287
Tested-by: Hudson
Reviewed-by: Isaac Huang <he.huang@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
11 years agoLU-1306 ldlm: LBUG at ldlm_lock.c:213
Andriy Skulysh [Wed, 11 Apr 2012 11:55:28 +0000 (14:55 +0300)]
LU-1306 ldlm: LBUG at ldlm_lock.c:213

Protect l_flags with locking to prevent race on
signal reception.

Xyratex-bug-id: MRP-420
Signed-off-by: Andriy Skulysh <Andriy_Skulysh@xyratex.com>
Reviewed-by: Vitaly Fertman <vitaly_fertman@xyratex.com>
Reviewed-by: Iurii Golovach <iurii_golovach@xyratex.com>
Change-Id: I98ba5e6e7a287090f6bd2a270c89a7671875bb9a
Reviewed-on: http://review.whamcloud.com/2727
Reviewed-by: Iurii Golovach <Iurii_Golovach@xyratex.com>
Reviewed-by: Cory Spitz <spitzcor@cray.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
11 years agoLU-1585 lnet: Fix an incorrect timestamp calculation in lst.c
Doug Oucharek [Thu, 26 Jul 2012 05:21:46 +0000 (22:21 -0700)]
LU-1585 lnet: Fix an incorrect timestamp calculation in lst.c

The operation in routine lst_timeval_diff() (in lst.c) has
a bug. It uses tv_sec where it should be using tv_usec.

Signed-off-by: Doug Oucharek <doug@whamcloud.com>
Change-Id: I898dacd3d4a2c84594148d0514fda731d24a49bf
Reviewed-on: http://review.whamcloud.com/3474
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Liang Zhen <liang@whamcloud.com>
Reviewed-by: Isaac Huang <he.huang@intel.com>
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
11 years agoLU-1969 release: bump version to 1.8.8.60-wc1 v1_8_8_60_WC1
Johann Lombardi [Mon, 17 Sep 2012 19:14:14 +0000 (21:14 +0200)]
LU-1969 release: bump version to 1.8.8.60-wc1

Bump version to 1.8.8.60-wc1.

Signed-off-by: Johann Lombardi <johann@whamcloud.com>
Change-Id: I6c8e6191afc21f674b92ebe65a869fb76bb52bd4
Reviewed-on: http://review.whamcloud.com/4014

11 years agoLU-1789 protocol: reserve connect flag for lightweight conn
Johann Lombardi [Tue, 4 Sep 2012 07:52:00 +0000 (09:52 +0200)]
LU-1789 protocol: reserve connect flag for lightweight conn

Reserve connection flag for lightweight connection support.
Although this feature will never be supported on 1.8, it still avoids
flag conflicts.

Signed-off-by: Johann Lombardi <johann@whamcloud.com>
Change-Id: I1b943579d8c46cce0d70a5df45d35874bf63b29b
Reviewed-on: http://review.whamcloud.com/3852

11 years agoLU-1675: fix fid for ll_get_parent
Alexander.Boyko [Thu, 26 Jul 2012 06:16:51 +0000 (10:16 +0400)]
LU-1675: fix fid for ll_get_parent

When nfs reexport occured between 1.8 client and 2.0 server,
nfs client do mkdir a, cd a, and get -521 error, ll_get_parent()
fail with -22 error, and mds print
"mdt_body_unpack()) Invalid fid: [0x2010e0901000001:0x0:0x4000]".
This patch fix fid translation for the ptlrpc request/reply.

Signed-off-by: Alexander Boyko <alexander_boyko@xyratex.com>
Xyratex-bug-id: MRP-522
Change-Id: Ib1a79e36ead478ebb874e7e04761bc43f33a410f
Reviewed-on: http://review.whamcloud.com/3475
Tested-by: Hudson
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
Reviewed-by: Cory Spitz <spitzcor@cray.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years agoLU-1674 llite: opencreate without mode can crash llite
Liang Zhen [Thu, 26 Jul 2012 00:38:07 +0000 (08:38 +0800)]
LU-1674 llite: opencreate without mode can crash llite

User should specify mode for opencreate but if they don't llite will
LBUG, which is not good.

Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Change-Id: I5c4044da6dda2a902bc48f408f6aaf8d02dd82a4
Reviewed-on: http://review.whamcloud.com/3469
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Liang Zhen <liang@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years agoLU-1838 llite: trusted. xattr is invisible to non-root
Fan Yong [Thu, 6 Sep 2012 16:18:05 +0000 (00:18 +0800)]
LU-1838 llite: trusted. xattr is invisible to non-root

Filter out all invalid xattrs in listxattr.
This includes trusted. xattrs that can cause
unnecessary "EPERM" in subsequent getxattr operations.

Signed-off-by: Fan Yong <yong.fan@whamcloud.com>
Signed-off-by: Bob Glossman <bogl@whamcloud.com>
Change-Id: I9613444adcdb14067a775f68f951af7a9b941e9a
Reviewed-on: http://review.whamcloud.com/3892
Tested-by: Hudson
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
11 years agoLU-1754 kernel: Kernel update [RHEL6.3 2.6.32-279.5.1.el6]
yangsheng [Fri, 24 Aug 2012 16:31:03 +0000 (00:31 +0800)]
LU-1754 kernel: Kernel update [RHEL6.3 2.6.32-279.5.1.el6]

Update RHEL6.3 kernel to 2.6.32-279.5.1.el6(client only).

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: Idffcaf6efa53119c1d093b852a37453d8c9b4116
Reviewed-on: http://review.whamcloud.com/3774
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years agoLU-919 obdclass: remove hard coded 0x5a5a5a
Niu Yawei [Wed, 11 Jan 2012 04:24:59 +0000 (20:24 -0800)]
LU-919 obdclass: remove hard coded 0x5a5a5a

We assert atomic_t value with hard coded 0x5a5a5a in several places,
which could result in false assertion failure when the reference count
getting very large in some extreme case.

The hard coded 0x5a5a5a should be replaced by LI_POISON.

Signed-off-by: Bruno Faccini <bruno.faccini@bull.net>
Signed-off-by: Niu Yawei <niu@whamcloud.com>
Change-Id: Ia1105e48326c20a37d887ba9dc926ea300d97741
Reviewed-on: http://review.whamcloud.com/1954
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years agoLU-1720 kernel: Quota doesn't work over 4TB on single OST
yangsheng [Fri, 10 Aug 2012 13:07:29 +0000 (21:07 +0800)]
LU-1720 kernel: Quota doesn't work over 4TB on single OST

Fix previous kernel update patch wrong update chunk.

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: Ice06fd70ad6f034dddb5aae5291c613e04b18d99
Reviewed-on: http://review.whamcloud.com/3599
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years agoLU-1782 quota: ignore sb_has_quota_active() in OFED's header
Shuichi Ihara [Thu, 23 Aug 2012 16:36:46 +0000 (01:36 +0900)]
LU-1782 quota: ignore sb_has_quota_active() in OFED's header

sb_has_quota_active() and sb_any_quota_active() are defined
in ofed's backport headers, but if these are reffered on
RHEL5's kernel, quota is broken. So, it ignores them.

Signed-off-by: Shuichi Ihara <sihara@ddn.com>
Change-Id: Ic78799bc5d948b583b4a515479d5091381c63185
Reviewed-on: http://review.whamcloud.com/3764
Reviewed-by: Niu Yawei <niu@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years agoLU-1496 ptlrpc: prolong rw locks even IO RPCs are finished
Bobi Jam [Thu, 21 Jun 2012 04:37:49 +0000 (12:37 +0800)]
LU-1496 ptlrpc: prolong rw locks even IO RPCs are finished

Refresh rw lock again after IO RPCs are finished to leave a time
window for clients to cancel covering dlm locks.

This is a part of LU-874 back port.

Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Change-Id: I5cd185870e601a66bce21b3cc3c91f5f800b4c27
Reviewed-on: http://review.whamcloud.com/3157
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Jinshan Xiong <jinshan.xiong@whamcloud.com>
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years agoLU-1115 kernel: software raid6 related BUG
yangsheng [Wed, 2 May 2012 10:35:41 +0000 (18:35 +0800)]
LU-1115 kernel: software raid6 related BUG

Software raid6 hit BUGON in fs/bio.c:222 when raid chunk > 64k.
We pull upstream patch: 5b99c2ffa980528a197f26c7d876cceeccce8dd5
to deal with this issue.

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: I5330bc161e7cf5364a614547949323fc9a3ee7e3
Reviewed-on: http://review.whamcloud.com/2625
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
11 years agoLU-359 llite: no close error if application has known failure
Fan Yong [Wed, 1 Aug 2012 13:02:36 +0000 (21:02 +0800)]
LU-359 llite: no close error if application has known failure

Don't return error again when close if the application has known
former write failure to avoid potenical rdundant error handling,
like confused error message.

Signed-off-by: Fan Yong <yong.fan@whamcloud.com>
Change-Id: I62d9cd83fc03fad22c994f2a77774ca113a6c057
Reviewed-on: http://review.whamcloud.com/596
Reviewed-by: Niu Yawei <niu@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Jinshan Xiong <jinshan.xiong@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years agoLU-1488 mdc: fix fid_res_name_eq() issue.
yangsheng [Sun, 5 Aug 2012 19:42:48 +0000 (03:42 +0800)]
LU-1488 mdc: fix fid_res_name_eq() issue.

Original error message:
LustreError: 25302:0:(namei.c:256:ll_mdc_blocking_ast())
ns: lustre-MDT0000-mdc-ffff81021762a000 lock:
The issue cause by commit ef8bd11416bae8c03a65682f3a10a4da39922b45.
fid_res_name_eq() use wrong way to compare fid & res_name.

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: Iacba148b6c3ba7fa775d2b9a4a58bdbf67434d7c
Reviewed-on: http://review.whamcloud.com/3522
Tested-by: Hudson
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years agoLU-1511 kernel: kernel update [RHEL5.8 2.6.18-308.11.1.el5]
yangsheng [Tue, 12 Jun 2012 15:57:16 +0000 (23:57 +0800)]
LU-1511 kernel: kernel update [RHEL5.8 2.6.18-308.11.1.el5]

Update RHEL5.8 kernel to 2.6.18-308.11.1.el5.

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: I9d93c9666af54ed8c7d6d9ff33154929c94afe2e
Reviewed-on: http://review.whamcloud.com/3096
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years agoLU-1563 quota: Put lqs properly in quota_pending_commit()
Niu Yawei [Tue, 26 Jun 2012 09:35:01 +0000 (02:35 -0700)]
LU-1563 quota: Put lqs properly in quota_pending_commit()

In quota_pending_commit(), always check if pending > 0 to figure
out if a lqs is held from quota_check_common(), otherwise, we
could find a lqs in quota_pending_commit(), then put it twice.

Signed-off-by: Niu Yawei <niu@whamcloud.com>
Change-Id: Ia653f41d721c002bbfbebcaf688b9943dde256bf
Reviewed-on: http://review.whamcloud.com/3187
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
11 years agoLU-1535 ldlm: backport fix for LU-1128
Lai Siyao [Tue, 19 Jun 2012 08:58:41 +0000 (16:58 +0800)]
LU-1535 ldlm: backport fix for LU-1128

Backport fix for LU-1128 to 1.8:
For ldlm server pool shrinker, we just use it to decrease SLV,
but never reclaim any memory directly, so it should always return
-1 to inform the kernel to break the shrink loop.

Signed-off-by: Lai Siyao <laisiyao@whamcloud.com>
Change-Id: I1c841e7485375017d33e93c59f2318318fae299c
Reviewed-on: http://review.whamcloud.com/3138
Reviewed-by: Niu Yawei <niu@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years agoLU-1459 llite: Don't LBUG when import has LUSTRE_IMP_NEW state
Jeremy Filizetti [Thu, 31 May 2012 14:30:00 +0000 (10:30 -0400)]
LU-1459 llite: Don't LBUG when import has LUSTRE_IMP_NEW state

When a disabled OSC/OST is configured in the system at mount
time, a client will LBUG if calling "lfs check servers".
Disabling the LBUG causes client to return -EIO instead.

Signed-off-by: Jeremy Filizetti <jeremy.filizetti@gmail.com>
Change-Id: Ib689eb37c20d1012728abb7c35aee15f30604d54
Reviewed-on: http://review.whamcloud.com/2993
Tested-by: Hudson
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years agoLU-1459 llite: Don't use unitialized variable
Jeremy Filizetti [Tue, 5 Jun 2012 00:14:19 +0000 (20:14 -0400)]
LU-1459 llite: Don't use unitialized variable

Currently lov_connect_obd prints warning messages using
an unitialized stack variable. The message also only
prints a uuid instead of a meaningful target name.

Signed-off-by: Jeremy Filizetti <jeremy.filizetti@gmail.com>
Change-Id: I7549acdd45ab70e3528d2f0c153a27a502b0404a
Reviewed-on: http://review.whamcloud.com/2992
Tested-by: Hudson
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years agoLU-1448 llite: Prevent NULL pointer dereference on disabled OSC
Jeremy Filizetti [Thu, 31 May 2012 12:26:28 +0000 (08:26 -0400)]
LU-1448 llite: Prevent NULL pointer dereference on disabled OSC

When a file system is mounted with a disabled OSC reading the import
information from the proc file system can result in a NULL pointer
dereference. The Lustre import on a disabled OSC with remain
in the LUSTRE_IMP_NEW state and imp_connection will remain NULL.

Signed-off-by: Jeremy Filizetti <jeremy.filizetti@gmail.com>
Change-Id: Ib416b2d706ac9797715db2c0ea4f4eaa79bceb22
Reviewed-on: http://review.whamcloud.com/2977
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years agoLU-1438 quota: quota active checking is missed on slave
Niu Yawei [Fri, 8 Jun 2012 04:55:40 +0000 (21:55 -0700)]
LU-1438 quota: quota active checking is missed on slave

On quota slave, we missed checking if quota is enabled in the
quota_check_common() and several other places. Which could cause
slave retry acquire quota in quota_chk_acq_common() infinitely
when the quota is already turned off on master.

Signed-off-by: Niu Yawei <niu@whamcloud.com>
Change-Id: I707bc34684e95f2a0beec99548dc2d78a4ce8bbf
Reviewed-on: http://review.whamcloud.com/3060
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years agoLU-1438 quota: fix race in quota_chk_acq_common()
Niu Yawei [Mon, 28 May 2012 09:12:08 +0000 (02:12 -0700)]
LU-1438 quota: fix race in quota_chk_acq_common()

quota_check_common() & qctxt_adjust_qunit() uses different way
to check if quota is enforced on certain ID, which could result
in infinite loop in quota_chk_acq_common() when the QB/QI_SET
flag is cleared just after checking.

This patch used a non-instrusive way to fix this rare race.

Signed-off-by: Niu Yawei <niu@whamcloud.com>
Change-Id: I7212e9fc85e98a40e36d2773c02f838ca68339bb
Reviewed-on: http://review.whamcloud.com/2927
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years agoLU-814 tests: remove leading spaces from $WRITE_DISJOINT
Yu Jian [Tue, 22 May 2012 11:38:42 +0000 (19:38 +0800)]
LU-814 tests: remove leading spaces from $WRITE_DISJOINT

In functions.sh, the WRITE_DISJOINT variable is defined as follows:

    WRITE_DISJOINT=${WRITE_DISJOINT:-\
        $(which write_disjoint 2> /dev/null || true)}

This will assign WRITE_DISJOINT with a value leading with spaces,
which causes "[: too many arguments" issue while checking the
variable. The PARALLEL_GROUPLOCK variable also has the same issue.
This patch fixes it.

Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: I424af2acd4ef79ea67830010d17b6904769c2ca4
Reviewed-on: http://review.whamcloud.com/2866
Tested-by: Hudson
Reviewed-by: Minh Diep <mdiep@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years agoLU-121 test: Change framework to only use the short hostname. 1.8.8-wc1 v1_8_8_WC1 v1_8_8_WC1_RC3
chris [Mon, 7 May 2012 12:34:49 +0000 (08:34 -0400)]
LU-121 test: Change framework to only use the short hostname.

This means stripping of everything off the name after and including
the first.

This change is only designed to make the .yml files consistent. The
log files will append the fully qualified name and appear to do this
consistantly.

We can create a jira to make the log and yml files consistant, but
the reality is that the yml files have a short life and are only
used to send to Maloo.

This change will allow automated posting of results and automated
testing to begin.

1. Carries out the above using hostname -s
2. Adds in a quick change so that LUSTRE_BUILD in yaml.sh can be a
reference to the source rather than just the lustre version string
which is recorded and written to the yaml anyway as LUSTER_VERSION

Additionally a couple of other changes sneaked in.

1. Allows the review information to be applied to the yaml output
file by way of exporting the variable CODE_REVIEW_YAML to be a yaml
description for maloo

2. The addition of a couple of fixes to make the permissions for yaml
files be allow-all. This permissions are bracketed and so do not
change any other parts of the code.

Signed-off-by: Chris Gearing <chris@whamcloud.com>
Change-Id: I4b2431030afd206bc83490f5c81fd04e57937aad
Reviewed-on: http://review.whamcloud.com/2663
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Yu Jian <yujian@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years agoLU-458 debug: print client profile name correctly
Yu Jian [Mon, 21 May 2012 06:23:28 +0000 (14:23 +0800)]
LU-458 debug: print client profile name correctly

This patch reverts commit 48c2f667236e2f41f9fd0224b5de7a83517b3180,
which does not print client profile name correctly and introduces
a new defect that the client profile is not deleted properly.

In ll_put_super(), the memory space pointed to by profilenm is
in fact freed inside lustre_common_put_super(sb), which is called
before LCONSOLE_WARN(). In order to print the client profile name
in LCONSOLE_WARN(), we need copy the contents of profilenm to a
temporary storage before freeing profilenm.

Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: I9ce2f304f3bad4761d2e3c857e4cdd5df6269c38
Reviewed-on: http://review.whamcloud.com/2841
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years agoLU-1424 kernel: Kernel update [RHEL6.2 2.6.32-220.17.1.el6]
yangsheng [Mon, 21 May 2012 15:25:21 +0000 (23:25 +0800)]
LU-1424 kernel: Kernel update [RHEL6.2 2.6.32-220.17.1.el6]

Update RHEL6.2 patchless client support to 2.6.32-220.17.1.el6.

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: Ib6a164ecd7beb225107883fc21394056d2ce06bf
Reviewed-on: http://review.whamcloud.com/2848
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-458 debug: use profilenm before running class_del_profile() v1_8_8_WC1_RC2
Yu Jian [Wed, 16 May 2012 04:57:41 +0000 (12:57 +0800)]
LU-458 debug: use profilenm before running class_del_profile()

This patch fixes the defect in ll_put_super() which uses profilenm
after running class_del_profile(profilenm).

Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: Ida357c6c970f3b6bf1bbe0060a71d17e65323aa2
Reviewed-on: http://review.whamcloud.com/2799
Reviewed-by: Yang Sheng <ys@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
12 years agoLU-425 tests: fix the issue of using "grep -w"
Yu Jian [Thu, 12 Apr 2012 15:38:17 +0000 (23:38 +0800)]
LU-425 tests: fix the issue of using "grep -w"

This patch fixes the following issue while using "grep -w"
to do exact match:

$ echo /mnt/nbp0-2 | grep -w /mnt/nbp0
/mnt/nbp0-2

Per the description of "-w" option:
-w, --word-regexp
Select only those lines containing matches that form whole words.
The test is that the matching substring must either be at the
beginning of the line, or preceded by a non-word constituent
character. Similarly, it must be either at the end of the line
or followed by a non-word constituent character. Word-constituent
characters are letters, digits, and the underscore.

So, the hyphen "-" character is a non-word constituent character
and "grep -w" does not do exact match on strings which contain it.

Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: I91962910033f561cc9c9a82bd88bbb6dff5594af
Reviewed-on: http://review.whamcloud.com/2528
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Minh Diep <mdiep@whamcloud.com>
Reviewed-by: Wei Liu <sarah@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-1340 release: get ready for 1.8.8-wc1 RC1
Johann Lombardi [Wed, 9 May 2012 15:49:07 +0000 (17:49 +0200)]
LU-1340 release: get ready for 1.8.8-wc1 RC1

Change lustre version to 1.8.8-wc1 for RC1

Signed-off-by: Johann Lombardi <johann@whamcloud.com>
Change-Id: I96448a4e05c3cdbe8945642c53d007a51d0137eb
Reviewed-on: http://review.whamcloud.com/2696

12 years agoLU-1374 kernel: Kernel update [RHEL5.8 2.6.18-308.4.1.el5]
yangsheng [Fri, 4 May 2012 16:06:30 +0000 (00:06 +0800)]
LU-1374 kernel: Kernel update [RHEL5.8 2.6.18-308.4.1.el5]

Update RHEL5.8 kernel to 2.6.18-308.4.1.el5.

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: I1304dda0fae60c86ea67862095856a1c741ee3c3
Reviewed-on: http://review.whamcloud.com/2651
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
12 years agoLU-458 debug: quiet too noisy console messages at mount
yangsheng [Mon, 26 Mar 2012 17:09:15 +0000 (01:09 +0800)]
LU-458 debug: quiet too noisy console messages at mount

Quiet a number of extra debug messages printed to the console after a
remount or recovery. They provide no value and just add to the general
confusion of reading Lustre debug messages.

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: I5d5352d55d5a91f9fd4c55d077eebf1fdab61f80
Reviewed-on: http://review.whamcloud.com/2381
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-1358 kernel: Kernel update [RHEL6.2 2.6.32-220.13.1.el6]
yangsheng [Wed, 2 May 2012 05:57:11 +0000 (13:57 +0800)]
LU-1358 kernel: Kernel update [RHEL6.2 2.6.32-220.13.1.el6]

Update RHEL6.2 patchless client to 2.6.32-220.13.1.el6.

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: I290f985bb6cbcc6c46bc1821c7d87819479eb1be
Reviewed-on: http://review.whamcloud.com/2623
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-814 test: automated NFS over lustre testing
Minh Diep [Fri, 20 Apr 2012 18:57:37 +0000 (11:57 -0700)]
LU-814 test: automated NFS over lustre testing

Provide setup nfs within auster framework
Note: this change includes LU-1134, LU-1213

Signed-off-by: Minh Diep <mdiep@whamcloud.com>
Change-Id: If28a237a23cd448c7d8b9a772a4b8951d94697ef
Reviewed-on: http://review.whamcloud.com/2593
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Bob Glossman <bogl@whamcloud.com>
Tested-by: Bob Glossman <bogl@whamcloud.com>
Reviewed-by: Yu Jian <yujian@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-1312 kernel: crash at boot time in isci driver
yangsheng [Tue, 24 Apr 2012 19:50:34 +0000 (03:50 +0800)]
LU-1312 kernel: crash at boot time in isci driver

Restore SG_ALL to default value to avoid crash isci.

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: I855ba8c7669b749fded51f3b0316f115d18e0fcd
Reviewed-on: http://review.whamcloud.com/2595
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
12 years agoLU-1335 build: include lustre srpm in build
Minh Diep [Thu, 19 Apr 2012 00:18:46 +0000 (17:18 -0700)]
LU-1335 build: include lustre srpm in build

Add a support for building lustre-*.src.rpm

Signed-off-by: Minh Diep <mdiep@whamcloud.com>
Change-Id: I301915c107e50ea5d1a3275ecc631f07aa2b78be
Reviewed-on: http://review.whamcloud.com/2576
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Brian J. Murrell <brian@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-447 add lctl --net XXX push
Wally Wang [Tue, 30 Aug 2011 23:43:53 +0000 (16:43 -0700)]
LU-447 add lctl --net XXX push

In order to clear out peer/conn data in the gnilnd for testing after
adding lnet_notify(see LU-446), we need an alternative to
lctl --net gni del_peer - as this nukes the peer and results in false
lnd_query failures.

Change-Id: Ie8029953a2881c0d6e3ac250101d2d4374bbf3c1
Signed-off-by: Wally Wang <wang@cray.com>
Reviewed-on: http://review.whamcloud.com/1311
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Liang Zhen <liang@whamcloud.com>
Reviewed-by: Lai Siyao <laisiyao@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Reviewed-by: James Simmons <uja.ornl@gmail.com>
12 years agoLU-1340 release: bump version to 1.8.7.81-wc1 v1_8_7_81_WC1
Johann Lombardi [Fri, 20 Apr 2012 13:55:14 +0000 (15:55 +0200)]
LU-1340 release: bump version to 1.8.7.81-wc1

Yet another build which brings us closer to 1.8.8-wc1 RC1.

Signed-off-by: Johann Lombardi <johann@whamcloud.com>
Change-Id: I7bac34ab4d29dc459171265319807c221a564c1a
Reviewed-on: http://review.whamcloud.com/2590

12 years agoLU-995 utils: make lfs getstripe directory output consistent.
Hongchao Zhang [Thu, 15 Mar 2012 04:03:07 +0000 (12:03 +0800)]
LU-995 utils: make lfs getstripe directory output consistent.

"lfs getstripe" should report the global default for any fields with
a value that means "use the default". This patch introduces the
following functionality:

1. If "lfs getstripe" is called on a directory and finds that striping
   EA is not set, the filesystem's defaults are looked up and printed.
2. If the striping EA is set, but the striping count and/or striping
   size has a value that means "use the default" (count = 0 and/or
   size = 0), the filesystem's default for that specific striping
   attribute is looked up and printed.
3. A new option to "lfs getstripe" is introduced; the "--raw" or
   "-R" option. If this option is specified, the previous two checks
   are skipped. In other words, if the striping EA is not set, 0, 0,
   -1, will be printed for the striping count, size, and offset
   respectively. Also, if the striping EA is set, the values will be
   printed without first converting them into their respective
   defaults.

This patch relies on the /proc filesystem to determine each
filesystem's default striping attributes, and a cache is maintained
which holds the default values for the last filesystem queried.

See Also:
Bugzilla #23802, https://bugzilla.lustre.org/show_bug.cgi?id=23802

Signed-off-by: Prakash Surya <surya1@llnl.gov>
Signed-off-by: Hongchao Zhang <hongchao.zhang@whamcloud.com>
Change-Id: Ic5b616dd83775c0a15be5c060b11cfbba05c4fbb
Reviewed-on: http://review.whamcloud.com/2117
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-891 test: waiting import state for next step.
yangsheng [Fri, 2 Mar 2012 14:46:47 +0000 (22:46 +0800)]
LU-891 test: waiting import state for next step.

Anyway, There still has a rare chance that the request meet
a invalid import after recovery. So we should waiting import
restore to a certain state and then doing next operation.

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: I20bed347a16755ccaf102d4c67b0a7e87b1318a1
Reviewed-on: http://review.whamcloud.com/2248
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-734 tests: add sub-tests into recovery-*-scale tests
Yu Jian [Wed, 11 Apr 2012 06:41:49 +0000 (14:41 +0800)]
LU-734 tests: add sub-tests into recovery-*-scale tests

This patch adds sub-tests into the recovery-*-scale tests
so that test results and logs could be gathered properly
and uploaded to Maloo.

The patch also does some cleanup works on the test scripts
and moves some common functions into test-framework.sh.

Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: I514143e1fa29aad289d215174dbc68d2740da73b
Reviewed-on: http://review.whamcloud.com/2508
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Minh Diep <mdiep@whamcloud.com>
Reviewed-by: Cliff White <cliffw@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-577 tests: FAIL replay-single test_70b rundbench load
James Simmons [Wed, 18 Apr 2012 14:09:14 +0000 (10:09 -0400)]
LU-577 tests: FAIL replay-single test_70b rundbench load

Test 70b for replay-single assumes that lustre is mounted on
/mnt/lustre which is not the case for us. This patch passes
the proper MOUNT. The test also was not using the standard
DIR/tdir setup which had generated data files not being
cleaned up. Increased the sleep period to match dbench's
warm up period. This gives dbench a change to start up when
using many clients.

Signed-off-by: James Simmons <uja.ornl@gmail.com>
Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: I3a793db35aa21d57220d0de1a9e92486e65ae21a
Reviewed-on: http://review.whamcloud.com/2518
Tested-by: Hudson
Reviewed-by: Wei Liu <sarah@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-1062 tests: incorrect path to configuration file
Iurii Golovach [Tue, 20 Mar 2012 00:53:27 +0000 (17:53 -0700)]
LU-1062 tests: incorrect path to configuration file

this is a bit modified port of the
http://review.whamcloud.com/#change,1877
(author Andreas Dilger)

Xyratex-bug-id: MRP-480

Reviewed-by: Sergey Glushchenko <Sergey_Glushchenko@xyratex.com>
Reviewed-by: Andriy Skulysh <andriy_skulysh@xyratex.com>
Signed-off-by: Iurii Golovach <iurii_golovach@xyratex.com>
Change-Id: I10b42de2c2d453e23142c01290742153dff7262a
Reviewed-on: http://review.whamcloud.com/2419
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-427 test: Test failure on test suite lfsck
Yu Jian [Tue, 10 Apr 2012 10:23:02 +0000 (18:23 +0800)]
LU-427 test: Test failure on test suite lfsck

- Reset $MDSDB & $OSTDB in generate_db(). Else they will
  stale if user redefine $SHARED_DIRECTORY.
- Add a function check_shared_dir() to ensure
  $SHARED_DIRECTORY is shared among tests nodes.
- Fix check_logdir() and check_write_access() to avoid using
  node.$(hostname).yml files which should not be deleted.

Signed-off-by: Yang Sheng <ys@whamcloud.com>
Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: Ie7d1d08c0d2c701fa9fb74ef8b252fa8b31bf111
Reviewed-on: http://review.whamcloud.com/2498
Tested-by: Hudson
Reviewed-by: Wei Liu <sarah@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-313 tests: re-enable lfsck test to run by default
Andreas Dilger [Tue, 10 Apr 2012 08:53:29 +0000 (16:53 +0800)]
LU-313 tests: re-enable lfsck test to run by default

Due to bug 13698, the lfsck part of the lfsck.sh test script was
disabled by default. After the fixes in LU-113 were landed, lfsck
should work again. Remove SKIP_LFSCK checks so lfsck.sh actually runs
lfsck instead of silently skipping it unless SKIP_LFSCK=no is set.

Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: I430f7398b2d21db0d0755726fdcb6053f25b4b10
Reviewed-on: http://review.whamcloud.com/2497
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Wei Liu <sarah@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-1052 kernel: Kernel update [RHEL5.8 2.6.18-308.1.1.el5]
yangsheng [Wed, 22 Feb 2012 10:27:42 +0000 (18:27 +0800)]
LU-1052 kernel: Kernel update [RHEL5.8 2.6.18-308.1.1.el5]

Update RHEL5.8 kernel to 2.6.18-308.1.1.el5.

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: Id8394bc1cb57d9555e4cda1312200a23b0915c84
Reviewed-on: http://review.whamcloud.com/2175
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Tested-by: Hudson
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: James Simmons <uja.ornl@gmail.com>
12 years agoLU-935 quota: break early when b/i_unit_sz exceeded upper limit
Niu Yawei [Mon, 19 Dec 2011 04:12:55 +0000 (20:12 -0800)]
LU-935 quota: break early when b/i_unit_sz exceeded upper limit

While expanding b/i_unit_sz in dquot_create_oqaq(), we'd break the loop
early when the b/i_unit_sz exceeded upper limit, otherwise, qaq_b/iunit_sz
could be overflow and result in endless loop.

Signed-off-by: Niu Yawei <niu@whamcloud.com>
Change-Id: I6046c8b86cad00c8c2fa2d24d0e21821a3c02209
Reviewed-on: http://review.whamcloud.com/1887
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-1241 kernel: Kernel update [RHEL6.2 2.6.32-220.7.1.el6]
yangsheng [Wed, 28 Mar 2012 20:28:59 +0000 (04:28 +0800)]
LU-1241 kernel: Kernel update [RHEL6.2 2.6.32-220.7.1.el6]

Update RHEL6.2 kernel to 2.6.32-220.7.1.el6.(patchless client)

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: I0933095b4e412dbb4ff826f45196d3b6aa80aecc
Reviewed-on: http://review.whamcloud.com/2401
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-847 quota: client retrieve quota usage directly
Niu Yawei [Fri, 21 Oct 2011 08:30:46 +0000 (01:30 -0700)]
LU-847 quota: client retrieve quota usage directly

Current 'lfs quota' sends getquota RPC to MDS, and MDS is responsible
for retrieving disk usage from all targets, this scheme will be
changed to client retrieving disk usage from all targets directly.

This patch addresses the compatibility issue as well: If the getquota
returned by MDS has QIF_SPACE, client just trust the disk usage
returned by MDS, otherwise, client has to issue RPCs to collect disk
usage by itself.

Signed-off-by: Niu Yawei <niu@whamcloud.com>
Signed-off-by: Johann Lombardi <johann@whamcloud.com>
Change-Id: I35526c453033809d2f23b1806b4783f5011e3fa2
Reviewed-on: http://review.whamcloud.com/1570
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
12 years agoLU-974 protocol: change OBD_CONNECT_GRANT_PARAM
Andreas Dilger [Tue, 13 Mar 2012 20:31:38 +0000 (14:31 -0600)]
LU-974 protocol: change OBD_CONNECT_GRANT_PARAM

Change the OBD_CONNECT_GRANT_PARAM flag value to avoid conflict
with the OBD_CONNECT_UMASK flag from LU-974.  While that patch is
not yet landed to our release tree, it is in use in production at
some customers.  While the risk of conflict is currently low, it
is easier to change the GRANT_PARAM value since only in use on the
orion branch, and isn't even handled by the client there yet.

Add (hopefully) clear comments for OBD_CONNECT and obd_connect_data
to ensure that they are not modified in some incompatible way across
branches.

Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: I503892c3b595c0272b0941fa58a16a496318cab0
Reviewed-on: http://review.whamcloud.com/2298
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-649 io: DIO doesn't need lock i_mutex
Johann Lombardi [Wed, 28 Mar 2012 12:10:35 +0000 (14:10 +0200)]
LU-649 io: DIO doesn't need lock i_mutex

Revert "LU-649 io: DIO doesn't need lock i_mutex"
This reverts commit 4ebec99e309fcafd1be8a0b10673d69a083ae87f.

There is a concern that this patch is wrong, so i'm reverting it until
the situation is clarified.

Signed-off-by: Johann Lombardi <johann@whamcloud.com>
Change-Id: Ia7e12eaeeb87ef30d073e28391a25ffcd8ad146f
Reviewed-on: http://review.whamcloud.com/2398

12 years agoLU-249 mds: Increase the maximum number of threads on MDS
James Simmons [Tue, 27 Mar 2012 12:46:56 +0000 (08:46 -0400)]
LU-249 mds: Increase the maximum number of threads on MDS

Currently the largest number of the threads that can exist on
the MDS is 512. For very large system is those threads can
easily be exhausted and end up back logging the system. This
patch allows the user to define the upper limit of how many
threads are created on the MDS. If not selected the default
behavior of 512 is kept.

Change-Id: Ie48eb00f48ddcd98cc518178ba5e1da9ea1e40eb
Signed-off-by: James Simmons <uja.ornl@gmail.com>
Reviewed-on: http://review.whamcloud.com/2354
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-1018 test: Reduce compilebench run time
Minh Diep [Tue, 24 Jan 2012 17:30:32 +0000 (09:30 -0800)]
LU-1018 test: Reduce compilebench run time

We decided to reduce the run time for compilebench
so it would run under 1h on VM nodes. This is similar
to master

Signed-off-by: Minh Diep <mdiep@whamcloud.com>
Change-Id: Ie664ad131e4fbce9443eb78d06cbe36d6002b881
Reviewed-on: http://review.whamcloud.com/2005
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Cliff White <cliffw@whamcloud.com>
Reviewed-by: Wei Liu <sarah@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-554 LNet: add gnilnd awareness to LNet
Chris Horn [Tue, 31 Jan 2012 18:31:44 +0000 (12:31 -0600)]
LU-554 LNet: add gnilnd awareness to LNet

This allows servers on any network to talk to gnilnd routers.
See Oracle 23884 attachment 31892

Change-Id: I6989c987b2bd0d60058a290567f8ef0bee56420b
Signed-off-by: Chris Horn <hornc@cray.com>
Reviewed-on: http://review.whamcloud.com/1179
Reviewed-by: James Simmons <uja.ornl@gmail.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-1221 ldiskfs: Replace sysname with nodename in MMP
Niu Yawei [Fri, 16 Mar 2012 03:22:59 +0000 (20:22 -0700)]
LU-1221 ldiskfs: Replace sysname with nodename in MMP

sysname holds "Linux" by default, i.e. what appears when
doing a "uname -s"; nodename should be used to print the
machine's hostname, i.e. what is returned when doing a
"uname -n" or "hostname", and what
gethostname(2)/sethostname(2) manipulate, in order to
notify the administrator of the node which is contending
to mount the filesystem.

Andreas says this was introduced when porting the MMP
patches from RHEL5 to RHEL6, and then also pushed upstream
to ext4; a patch for upstream ext4 has already been
submitted.

MRP-222
Signed-off-by: Nikitas Angelinas <nikitas_angelinas@xyratex.com>
Signed-off-by: Niu Yawei <niu@whamcloud.com>
Change-Id: Ie5c8e99ae536c2239134140589d678d3a4f92639
Reviewed-on: http://review.whamcloud.com/2337
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-1198 idl: move FID VER to DLM resource name[1]
Andreas Dilger [Thu, 8 Mar 2012 14:22:59 +0000 (22:22 +0800)]
LU-1198 idl: move FID VER to DLM resource name[1]

Until Lustre 1.8.7/2.1.1 the FID version was packed into name[2].

However, this leaves very little room in the LDLM resource name
for other uses.  The upcoming quota code needs to store another
FID into the LDLM resource to allow directory tree quotas, and
managed by the DLM.

The 32-bit VER, which is currently always 0, is moved into the high
bits of name[1] along with the 32-bit OID, to avoid consuming the
name[2] field.  Since future use of the FID version (including
snapshots, pools, etc) will need changes on the client side anyway,
there will never be non-zero VER on an existing client.

Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: If1e500cfb277dfc25bc056bb0c5763e48e7d500c
Reviewed-on: http://review.whamcloud.com/2275
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-164 Update testframework to restart sessions at next suite
Chris Gearing [Thu, 12 May 2011 09:42:54 +0000 (10:42 +0100)]
LU-164 Update testframework to restart sessions at next suite

Update the test framework so that it can append to yaml files rather
than begin a new series and allow auster to begin at a selected
test_suite.

auster has a new switch '-S suite-name' which allows the caller to
start a test group part way through.

The idea behind all of this is that a test session that fails at a
test suite can be restarted at the next suite.

Change-Id: Ifc84184506cff5ee11e5dc570ce17baab4d439a4
Signed-off-by: Chris Gearing <chris@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/538
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Robert Read <rread@whamcloud.com>
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
12 years agoLU-983 llite: align readahead to 1M after ra_max adjustment
wangdi [Thu, 19 Jan 2012 05:43:00 +0000 (21:43 -0800)]
LU-983 llite: align readahead to 1M after ra_max adjustment

Align the readahead pages only if ria_start != 0, otherwise the
readahead pages will be cut to zero. And it can be checked by
checking (ret > ((ria->ria_start + ret) % PTLRPC_MAX_BRW_PAGES))

Add test_101e for checking small reads for small size files.

Put 101a to Exception list, because it will do random small
reads(64k) on the file, then checking pages being discarded.
But in current implementation, we will always extend small size
read to 1M during read-ahead, which might cause too much unused
pages being read, then being discarded later.

Remove big rpc check in 101c, because the test will do random small
size(64K) read, so it only need make sure there are no small size
RPC during the read. Besides, the original big RPC check is wrong,
since it can not make sure all of RPC exists in this case.

Signed-off-by: Wang Di <di.wang@whamcloud.com>
Change-Id: If86975274f36050b9252304c447f057e99f0c5ea
Reviewed-on: http://review.whamcloud.com/1983
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-432 multiop test needs $(PTHREAD_LIBS)
Christos Triantafyllidis [Sun, 19 Jun 2011 22:31:43 +0000 (01:31 +0300)]
LU-432 multiop test needs $(PTHREAD_LIBS)

Added $(PTHREAD_LIBS) to multiop_LDADD for multiop test.

Change-Id: I95ce209fc033122ac89bb4ee74f8e095e5727b17
Signed-off-by: Christos Triantafyllidis <ctria@grid.auth.gr>
Reviewed-on: http://review.whamcloud.com/978
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Tested-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-1116 kernel: Support RHEL6.2 patchless client.
yangsheng [Tue, 21 Feb 2012 01:46:52 +0000 (09:46 +0800)]
LU-1116 kernel: Support RHEL6.2 patchless client.

Update RHEL6.2 kernel to 2.6.32-220.4.2(client).

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: I654d8d56b14196acf8d27b31a5af8c34812d8b99
Reviewed-on: http://review.whamcloud.com/2171
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-987 build: Fail to create ldisk rpms
Minh Diep [Tue, 7 Feb 2012 17:56:06 +0000 (09:56 -0800)]
LU-987 build: Fail to create ldisk rpms

The autoMakefile.am in ldsikfs does not define the
BUILD_SERVER flag so make rpms fails. This patch
simply set the flag to true since ldiskfs will most
likely be used only on servers
Signed-off-by: Minh Diep <mdiep@whamcloud.com>
Change-Id: Ic42eb543658f97bbff9c2f360d8693da840a8a36
Reviewed-on: http://review.whamcloud.com/2113
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-746 obdfilter-survey FAIL: test_1b ost4: hndls expected > 8, have 2
Lai Siyao [Tue, 18 Oct 2011 05:08:47 +0000 (22:08 -0700)]
LU-746 obdfilter-survey FAIL: test_1b ost4: hndls expected > 8, have 2

obdfilter-survey.sh checks jbd proc stats after survey, but it
doesn't take obd cleanup time into account, so the stats
collected may not reflect the data of survey test. Save run time
of survey test, and collect stats of (run_time/4) ago to fix this.

Change-Id: Ib0b55693d13da172ca476ca79013a8a2e9dd81ca
Signed-off-by: Lai Siyao <laisiyao@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/1534
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Liang Zhen <liang@whamcloud.com>
12 years agoLU-322 tests: Pool test fixes for large number of OSTS
Minh Diep [Tue, 13 Dec 2011 01:41:04 +0000 (17:41 -0800)]
LU-322 tests: Pool test fixes for large number of OSTS

This patch fixes issues with large numbers of OSTs with the
ost-pools test. we need to use hexidecimal numbers for
the pool args since the OSTs UUID are named with hex numbers.
Currently the test does pass but erros can be seen in the logs
when more than 9 OSTs exist.

Signed-off-by: Minh Diep <mdiep@whamcloud.com>
Change-Id: I5db7540ee710c528af33d768a105a55bd09229b3
Reviewed-on: http://review.whamcloud.com/1841
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Yu Jian <yujian@whamcloud.com>
Reviewed-by: James Simmons <uja.ornl@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-952 quota: follow locking order of quota code
Niu Yawei [Thu, 5 Jan 2012 05:30:54 +0000 (21:30 -0800)]
LU-952 quota: follow locking order of quota code

The locking order of quota code is: i_mutex > dqonoff_sem >
journal_lock > dqptr_sem > dquot->dq_lock > dqio_mutex, so we
should call the ll_vfs_dq_init() after journal started to avoid
deadlock.

Signed-off-by: Niu Yawei <niu@whamcloud.com>
Change-Id: I7f805abc3105e338dd60bc632d860be5389d87ec
Reviewed-on: http://review.whamcloud.com/1919
Tested-by: Hudson
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
12 years agoLU-358 Add the branch and commiti-id to the yml.
Chris Gearing [Thu, 26 May 2011 10:45:21 +0000 (11:45 +0100)]
LU-358 Add the branch and commiti-id to the yml.

yaml.sh updated to provide the information requested in the JIRA the
change is simple and requires that LUSTRE_BRANCH and LUSTRE_REVISION
are defined, if they are not defined then the elements are left blank

Signed-off-by: Chris Gearing <chris@whamcloud.com>
Change-Id: Id18f3d5c68d3703af3b76628d7d43289c57bf485
Reviewed-on: http://review.whamcloud.com/1400
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Yu Jian <yujian@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-534 test: nfsread_orphan_file test
Bobi Jam [Tue, 20 Dec 2011 01:19:52 +0000 (09:19 +0800)]
LU-534 test: nfsread_orphan_file test

Reproducer of bug17764.

In order to hit the assertion the lustre file has to be open twice
from lustre clients, and once from nfs mount.  Then unlink and close
opened lustre file and read the file opened on nfs.

Bugzilla: 17764
Author: Elena Gryanova <elena.gryaznova@oracle.com>
Signed-off-by: Elena Gryanova <elena.gryaznova@oracle.com>
Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Change-Id: I05d8f04a2bd7c64864e1b82a07ef26a170887714
Reviewed-on: http://review.whamcloud.com/1895
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-534 mds: correct assertion
Vladimir Saveliev [Thu, 24 Nov 2011 18:50:04 +0000 (21:50 +0300)]
LU-534 mds: correct assertion

Orphan inode can be reached on mds_open when opening by fid which
takes place on accessing files via nfs correct the assertion
correspondingly.

Bugzilla: 17764
Author: Vladimir Saveliev <vladimir.saveliev@oracle.com>
Signed-off-by: Vladimir Saveliev <vladimir.saveliev@oracle.com>
Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Change-Id: Id6cde949003c722b56fe7ccec43b2d8b8a0667bb
Reviewed-on: http://review.whamcloud.com/1894
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-955 build: fix bad lustre-backend-fs dependency
Minh Diep [Mon, 9 Jan 2012 16:51:37 +0000 (08:51 -0800)]
LU-955 build: fix bad lustre-backend-fs dependency

Fix an incorrect RPM package dependency if Lustre RPMs are built
with "make rpms" with client only
Signed-off-by: Minh Diep <mdiep@whamcloud.com>
Change-Id: I1e00a2d2710ec96036cbad0a3539f704251f880a
Reviewed-on: http://review.whamcloud.com/1931
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-805 quota: lfs quota doesn't print grace time correctly
Niu Yawei [Tue, 8 Nov 2011 10:01:05 +0000 (02:01 -0800)]
LU-805 quota: lfs quota doesn't print grace time correctly

Lustre always trigger grace time when the allocated qunit exceeding
softlimit, however, user tools 'lfs quota' only print grace time
when the total usage greater than softlimit, so sometimes user can't
tell if the softlimit is already exceeded from 'lfs quota' output.

This patch changes the 'lfs quota' to use the data get from kernel
instead of comparing usage with softlimit.

Signed-off-by: Niu Yawei <niu@whamcloud.com>
Change-Id: I320cd0fafcb92e8b91f688b2c8bb50bc44442a09
Reviewed-on: http://review.whamcloud.com/1667
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-649 io: DIO doesn't need lock i_mutex
Lai Siyao [Mon, 12 Dec 2011 15:12:42 +0000 (07:12 -0800)]
LU-649 io: DIO doesn't need lock i_mutex

There is a dead lock if a file is written with with normal IO, and
read with DIO at the mean time:
regular write: extent lock -> i_mutex
DIO read: i_mutex -> server take extent lock

i_mutex taken at DIO read is redundant, because server will take
extent lock to serialize with write and truncate. Removing it can
avoid the deadlock.

Change-Id: I280559cebae5d04e4fd943bf3fbe82d67e07834b
Signed-off-by: Lai Siyao <laisiyao@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/1829
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Niu Yawei <niu@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-882 quota: Quota code compares unsigned < 0
Niu Yawei [Tue, 29 Nov 2011 04:59:38 +0000 (20:59 -0800)]
LU-882 quota: Quota code compares unsigned < 0

Port from b23858.

In check_cur_qunit(), it checks "if (limit + record < 0)", however,
the limit is unsigned, so this check will be always false, and when
limit is smaller than -record, following "limit += record" will make
limit a unreasonable large value.

This patch also fixed a similar defect in dqacq_handler().

Signed-off-by: Vladimir Saveliev <vladimir.saveliev@oracle.com>
Signed-off-by: Niu Yawei <niu@whamcloud.com>
Change-Id: Ibe1d643ea3b310b2e55c05a8c200ba5e0137ee27
Reviewed-on: http://review.whamcloud.com/1750
Tested-by: Hudson
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-620 llite: add delete/remove_from_page_cache check
Bobi Jam [Fri, 4 Nov 2011 09:18:19 +0000 (17:18 +0800)]
LU-620 llite: add delete/remove_from_page_cache check

Later 2.6.32 kernel use memory cgroup feature but does not export
truncate_complete_page but export delete_from_page_cache or
remove_from_page_cache, we need properly use them for pachless client
code.

Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Change-Id: I4b95da2b4cac7f2c2f63b69896dd91c3544a5473
Reviewed-on: http://review.whamcloud.com/1649
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-792 lbuild-rhel5 should use redhat's SRPM repo
Michael MacDonald [Tue, 25 Oct 2011 14:23:28 +0000 (10:23 -0400)]
LU-792 lbuild-rhel5 should use redhat's SRPM repo

Download EL5 kernel .src.rpm packages from redhat's repo instead
of trying to use CentOS's often-outdated repo.

Change-Id: I45c59894c649f3290a8ce1f0d3f1f033395567a4
Signed-off-by: Michael MacDonald <mjmac@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/1595
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Brian J. Murrell <brian@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-424 tests: conf-sanity test 55, 56, 58 failure fixes
Yu Jian [Sat, 31 Dec 2011 09:04:41 +0000 (17:04 +0800)]
LU-424 tests: conf-sanity test 55, 56, 58 failure fixes

The MGS service was not started in conf-sanity test 55, 56 and 58
with separate MGS and MDT configuration. This patch fix the issue.

Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: Id82121f645d1903934bcf00328627947e943c45c
Reviewed-on: http://review.whamcloud.com/955
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-by: Minh Diep <mdiep@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>