Whamcloud - gitweb
fs/lustre-release.git
11 years ago LU-1526 tests: Support for MDS-initiated OST_DESTROYs
Yu Jian [Sat, 5 Jan 2013 10:10:11 +0000 (18:10 +0800)]
LU-1526 tests: Support for MDS-initiated OST_DESTROYs

This patch is backported from commit af5f388 of LU-1303 to
support interoperating with 2.4 server.

The patch makes sure the tests work with MDSs that destroy OST
objects asynchronously on behalf of clients.
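
When objects are destroyed asynchronously, a test cannot assume they are gone as soon as the unlink returns; it has to poll. A minimal sketch of such polling follows. The helper names `wait_until` and `file_gone` are hypothetical and are not taken from the Lustre test framework, and a temporary file stands in for an OST object.

```shell
# Hypothetical helper: retry a command once per second until it succeeds
# or the limit (in seconds) is reached.
wait_until() {
    limit=$1
    shift
    elapsed=0
    while ! "$@"; do
        [ "$elapsed" -ge "$limit" ] && return 1
        sleep 1
        elapsed=$((elapsed + 1))
    done
    return 0
}

# Hypothetical check standing in for "the OST object has been destroyed".
file_gone() { [ ! -e "$1" ]; }

# A background job removes the marker after a delay, the way a server
# might destroy an object some time after the client's unlink.
marker=$(mktemp)
( sleep 1; rm -f "$marker" ) &
wait_until 2 file_gone "$marker" && echo "object gone"
wait
```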

Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: I8d8cb9e3699b6e7f63af106a5f45363f61f3ce7c
Reviewed-on: http://review.whamcloud.com/4959
Tested-by: Hudson
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Reviewed-by: Li Wei <wei.g.li@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
11 years ago LU-1075 tests: auster detect directory as script name
Emoly Liu [Fri, 28 Dec 2012 05:54:00 +0000 (13:54 +0800)]
LU-1075 tests: auster detect directory as script name

Auster should use -f to detect the existence of the script name
rather than -e, which treats a directory the same as a script.
Also, the patch for LU-412 is included to fix test script
lookup in auster.
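
The difference between the two tests can be seen in a small sketch. The helper `is_test_script` and the file names are hypothetical; this is not auster's actual lookup code.

```shell
# Hypothetical helper: report whether a name refers to a test script.
is_test_script() {
    # -f is true only for regular files; -e would also accept directories.
    [ -f "$1" ]
}

demo_dir=$(mktemp -d)
touch "$demo_dir/sanity.sh"
mkdir "$demo_dir/sanity"

[ -e "$demo_dir/sanity" ] && echo "-e accepts the directory"
is_test_script "$demo_dir/sanity" || echo "-f rejects the directory"
is_test_script "$demo_dir/sanity.sh" && echo "-f accepts the script"
rm -rf "$demo_dir"
```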

port of master patch 0a79b541ad736bb296ea051e58b667c6195731a1
port of master patch 39b98cb351866da5648ea1a2216c108f8791226f

Signed-off-by: Minh Diep <mdiep@whamcloud.com>
Signed-off-by: Li Wei <liwei@whamcloud.com>
Signed-off-by: Liu Ying <emoly.liu@intel.com>
Change-Id: Id7d526b73051124b8b76deca974345573c6faf2b
Reviewed-on: http://review.whamcloud.com/4895
Tested-by: Hudson
Reviewed-by: Jian Yu <jian.yu@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Li Wei <wei.g.li@intel.com>
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
11 years ago LU-1520 ldlm: Revert "improve ldlm_pools_shrink algorithm"
Johann Lombardi [Tue, 11 Dec 2012 15:47:59 +0000 (10:47 -0500)]
LU-1520 ldlm: Revert "improve ldlm_pools_shrink algorithm"

This reverts commit c861cc7e0b6f7e82fd55b9658dd29578f97b5607
The patch should land on master first.

Change-Id: I9b3739defed6bf315646f8a107d3218414a14d25
Reviewed-on: http://review.whamcloud.com/4799
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
Tested-by: Johann Lombardi <johann.lombardi@intel.com>
11 years ago LU-676 tests: machinefile option for mpirun via a variable
Jay J. Lan [Tue, 18 Oct 2011 19:04:07 +0000 (12:04 -0700)]
LU-676 tests: machinefile option for mpirun via a variable

Not all MPI implementations pass the host file to mpirun via the same
option. Common options are -machinefile and -hostfile.

This problem can be resolved by using a variable MACHINEFILE_OPTION
instead. A default value is assigned if the variable is not defined.
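
A minimal sketch of the idea, assuming `-machinefile` as the default (the commit message does not say which value is used) and a hypothetical wrapper `build_mpirun_cmd` that only prints the command line instead of running mpirun:

```shell
# Assumed default; callers can override it, e.g. MACHINEFILE_OPTION=-hostfile.
MACHINEFILE_OPTION=${MACHINEFILE_OPTION:-"-machinefile"}

# Hypothetical wrapper that builds and prints the mpirun command line.
build_mpirun_cmd() {
    # $1: host file, remaining arguments: program and its arguments
    hostfile=$1
    shift
    echo "mpirun $MACHINEFILE_OPTION $hostfile $*"
}

build_mpirun_cmd /tmp/hosts ./write_disjoint -f /mnt/lustre/file
```

Sites whose MPI implementation expects a different flag only need to export the variable before running the tests.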

Signed-off-by: Jay J Lan <jay.j.lan@nasa.gov>
Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Change-Id: I3362a6e62a27318cff733aea2f99b1356b3ff02e
Reviewed-on: http://review.whamcloud.com/1540
Tested-by: Hudson
Reviewed-by: Jian Yu <jian.yu@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
11 years ago LU-1322 llite: revalidate dentry if f_version is 0
Peng Tao [Mon, 25 Jun 2012 13:31:18 +0000 (21:31 +0800)]
LU-1322 llite: revalidate dentry if f_version is 0

If a file is lseek()ed before i_version changes, it is possible
for an application to call into ll_readdir with f_pos pointing
to garbage data, causing a kernel hang.

Signed-off-by: Peng Tao <tao.peng@emc.com>
Change-Id: I49ab94ad5c63c3029d5ad96e27e38e124a135ed8
Reviewed-on: http://review.whamcloud.com/3181
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Lai Siyao <laisiyao@whamcloud.com>
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
11 years ago LU-1520 ldlm: improve ldlm_pools_shrink algorithm
Hongchao Zhang [Tue, 4 Sep 2012 15:27:09 +0000 (23:27 +0800)]
LU-1520 ldlm: improve ldlm_pools_shrink algorithm

1, shrink namespaces in batches of 64 namespaces; a batch is
   implemented as a list
2, limit the number of simultaneously shrinking threads to 32
3, have ldlm_pools_recalc operate on namespaces in the same way as
   ldlm_pools_shrink
4, use global counters of unused locks on clients and granted
   locks on servers to avoid iterating over namespaces

Change-Id: I5fb3f56748ae10961c50b4b06c300c9c7f5fca87
Signed-off-by: Vladimir Saveliev <valdimir.saveliev@oracle.com>
Signed-off-by: Hongchao Zhang <hongchao.zhang@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/3270
Tested-by: Hudson
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
11 years ago LU-1901 ldiskfs: compile error for sles11 when JBD2_DEBUG is on
Vladimir Saveliev [Sun, 9 Sep 2012 08:57:03 +0000 (12:57 +0400)]
LU-1901 ldiskfs: compile error for sles11 when JBD2_DEBUG is on

The only change is the below hunk for ext4/inode.c:ext4_forget():
        jbd_debug(4, "forgetting bh %p: is_metadata = %d, mode %o, "
-                 "data mode %x\n",
+                 "data mode %Lx\n",
                  bh, is_metadata, inode->i_mode,
                  test_opt(inode->i_sb, DATA_FLAGS));

It is needed because for sles11 s_mount_opt of struct ext4_sb_info
is changed to unsigned long long.

Signed-off-by: Vladimir Saveliev <vladimir.saveliev@oracle.com>
Change-Id: I59646b821b83eed08a67124a9f52ab8dcb9b46ae
Reviewed-on: http://review.whamcloud.com/3943
Tested-by: Hudson
Reviewed-by: Andreas Dilger <andreas.dilger@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
11 years ago LU-2371 ptlrpc: add support for -EINPROGRESS
Niu Yawei [Fri, 13 Jan 2012 08:33:22 +0000 (00:33 -0800)]
LU-2371 ptlrpc: add support for -EINPROGRESS

Backport patches from LU-904, LU-1329 and LU-1788 to introduce
support for -EINPROGRESS in lustre 1.8. This is needed for
quota interoperability with 2.4 servers.

Signed-off-by: Johann Lombardi <johann@whamcloud.com>
Change-Id: I9136112ca82dbf6caba41c2d41643ec646372852
Signed-off-by: Niu Yawei <niu@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/4655
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
11 years ago LU-1064 mds: fix mds_lookup lma removal error path
Andreas Dilger [Sat, 1 Dec 2012 07:15:34 +0000 (00:15 -0700)]
LU-1064 mds: fix mds_lookup lma removal error path

In commit 1fd243c89e3b221d40ce74b8ef47f1bca760c8f9 if an error is hit
removing the "lma" xattr from an updated 2.x inode, then the open
transaction handle would never be committed, and the MDS would hang.

This is unlikely to be a problem, as the only errors fsfilt_set_md()
will hit that are not programming bugs are due to IO errors from the
underlying disk (which is an even bigger problem).

Make sure that the transaction is committed, even after an error.

Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: If7d19a2337a12efafacd20d5c4e5c00e85300c1e
Reviewed-on: http://review.whamcloud.com/4729
Tested-by: Hudson
Reviewed-by: Iurii Golovach <Iurii_Golovach@xyratex.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
11 years ago LU-1064 mds: journal locking issue was fixed
Iurii Golovach [Fri, 16 Mar 2012 03:06:10 +0000 (20:06 -0700)]
LU-1064 mds: journal locking issue was fixed

During the downgrade procedure, a locking issue is observed
during lma data removal. This patch fixes the issue by moving
the lma removal under the mutex.

Reviewed-by: Vitaly Fertman <vfertman@xyratex.com>
Reviewed-by: Andrew Perepechko <aperepechko@xyratex.com>
Xyratex-bug-id: MRP-251

Signed-off-by: Iurii Golovach <Iurii_Golovach@xyratex.com>
Change-Id: I01e7bda03e3b0dfae92b7e03672c56b23a73989d
Reviewed-on: http://review.whamcloud.com/2077
Tested-by: Hudson
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Keith Mannthey <kemannthey@gmail.com>
11 years ago LU-1517 ptlrpc: throw net error to ptlrpc for bulk
Alexander.Boyko [Mon, 17 Sep 2012 13:48:59 +0000 (17:48 +0400)]
LU-1517 ptlrpc: throw net error to ptlrpc for bulk

Start reconnect and resend if a network error occurs
during the bulk transfer.

Signed-off-by: Alexander Boyko <alexander_boyko@xyratex.com>
Change-Id: I0cf2ee1230a039336f081fbb520c1ce768882088
Xyratex-bug-id: MRP-523
Reviewed-on: http://review.whamcloud.com/3102
Tested-by: Hudson
Reviewed-by: Liang Zhen <liang@whamcloud.com>
Reviewed-by: Keith Mannthey <keith.mannthey@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Cory Spitz <spitzcor@cray.com>
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
11 years ago LU-588 ldiskfs: Don't release super block buffer_head too early
Jeremy Filizetti [Mon, 30 Jul 2012 16:10:45 +0000 (12:10 -0400)]
LU-588 ldiskfs: Don't release super block buffer_head too early

If the super block buffer_head is released prior to MMP stopping,
kmmpd can get a zeroed buffer_head and exit. The following code
causes kmmpd to exit when s_feature_incompat is zero:

        if (!(le32_to_cpu(es->s_feature_incompat) &
              LDISKFS_FEATURE_INCOMPAT_MMP)) {
                ldiskfs_warning(sb, "kmmpd being stopped since MMP feature"
                                " has been disabled.");
                LDISKFS_SB(sb)->s_mmp_tsk = NULL;
                goto failed;
        }

A deadlock can occur with the kthread_stop_lock mutex because
ldiskfs_put_super calls kthread_stop on an already stopped thread
(kmmpd), so it waits for completion of the thread stopping before
releasing the kthread_stop_lock.

This is the result of a race between the kmmpd thread setting s_mmp_tsk
to NULL and ldiskfs_put_super in another thread checking s_mmp_tsk
for NULL prior to calling kthread_stop.

Signed-off-by: Jeremy Filizetti <jeremy.filizetti@gmail.com>
Change-Id: Ia15d8ff829705a5d51dea4f86e40ba7c5745a9c5
Reviewed-on: http://review.whamcloud.com/3172
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Yu Jian <yujian@whamcloud.com>
11 years ago LU-1770 ptlrpc: introducing OBD_CONNECT_FLOCK_OWNER flag
Iurii.Golovach [Tue, 16 Oct 2012 13:39:07 +0000 (16:39 +0300)]
LU-1770 ptlrpc: introducing OBD_CONNECT_FLOCK_OWNER flag

After the flock policy fix was applied to 1.8, users ran into an issue
where 1.8 clients with the fixed flock policy were recognized
incorrectly by 2.x servers.
This flag is intended to identify 1.8 clients with the fixed flock
policy so that 2.x servers can recognize the flock policy correctly.
Patches with functionality changes were attached for review at LU-1575.

Xyratex-bug-id: MRP-489

Reviewed-by: Alexey Lyashkov <alexey_lyashkov@xyratex.com>
Reviewed-by: Andriy Skulysh <andriy_skulysh@xyratex.com>
Signed-off-by: Iurii Golovach <iurii_golovach@xyratex.com>
Change-Id: I0b203a7e181310c2888ae5bbe8c90ca0a5bbe549
Reviewed-on: http://review.whamcloud.com/3723
Reviewed-by: Cory Spitz <spitzcor@cray.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
11 years ago LU-630 lnet: only router checks peer health
Lai Siyao [Mon, 5 Dec 2011 07:28:39 +0000 (15:28 +0800)]
LU-630 lnet: only router checks peer health

The peer health code is designed for routers, so a non-router (~rtr)
node always assumes peers to be alive.

Signed-off-by: Lai Siyao <laisiyao@whamcloud.com>
Change-Id: Iacdc7359c69e0f172de0914048b35bd6fe06133e
Reviewed-on: http://review.whamcloud.com/4287
Tested-by: Hudson
Reviewed-by: Isaac Huang <he.huang@intel.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
11 years ago LU-1306 ldlm: LBUG at ldlm_lock.c:213
Andriy Skulysh [Wed, 11 Apr 2012 11:55:28 +0000 (14:55 +0300)]
LU-1306 ldlm: LBUG at ldlm_lock.c:213

Protect l_flags with locking to prevent race on
signal reception.

Xyratex-bug-id: MRP-420
Signed-off-by: Andriy Skulysh <Andriy_Skulysh@xyratex.com>
Reviewed-by: Vitaly Fertman <vitaly_fertman@xyratex.com>
Reviewed-by: Iurii Golovach <iurii_golovach@xyratex.com>
Change-Id: I98ba5e6e7a287090f6bd2a270c89a7671875bb9a
Reviewed-on: http://review.whamcloud.com/2727
Reviewed-by: Iurii Golovach <Iurii_Golovach@xyratex.com>
Reviewed-by: Cory Spitz <spitzcor@cray.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
11 years ago LU-1585 lnet: Fix an incorrect timestamp calculation in lst.c
Doug Oucharek [Thu, 26 Jul 2012 05:21:46 +0000 (22:21 -0700)]
LU-1585 lnet: Fix an incorrect timestamp calculation in lst.c

The operation in routine lst_timeval_diff() (in lst.c) has
a bug. It uses tv_sec where it should be using tv_usec.

Signed-off-by: Doug Oucharek <doug@whamcloud.com>
Change-Id: I898dacd3d4a2c84594148d0514fda731d24a49bf
Reviewed-on: http://review.whamcloud.com/3474
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Liang Zhen <liang@whamcloud.com>
Reviewed-by: Isaac Huang <he.huang@intel.com>
Reviewed-by: Johann Lombardi <johann.lombardi@intel.com>
11 years ago LU-1969 release: bump version to 1.8.8.60-wc1 v1_8_8_60_WC1
Johann Lombardi [Mon, 17 Sep 2012 19:14:14 +0000 (21:14 +0200)]
LU-1969 release: bump version to 1.8.8.60-wc1

Bump version to 1.8.8.60-wc1.

Signed-off-by: Johann Lombardi <johann@whamcloud.com>
Change-Id: I6c8e6191afc21f674b92ebe65a869fb76bb52bd4
Reviewed-on: http://review.whamcloud.com/4014

11 years ago LU-1789 protocol: reserve connect flag for lightweight conn
Johann Lombardi [Tue, 4 Sep 2012 07:52:00 +0000 (09:52 +0200)]
LU-1789 protocol: reserve connect flag for lightweight conn

Reserve connection flag for lightweight connection support.
Although this feature will never be supported on 1.8, it still avoids
flag conflicts.

Signed-off-by: Johann Lombardi <johann@whamcloud.com>
Change-Id: I1b943579d8c46cce0d70a5df45d35874bf63b29b
Reviewed-on: http://review.whamcloud.com/3852

11 years ago LU-1675: fix fid for ll_get_parent
Alexander.Boyko [Thu, 26 Jul 2012 06:16:51 +0000 (10:16 +0400)]
LU-1675: fix fid for ll_get_parent

When an NFS re-export occurs between a 1.8 client and a 2.0 server
and the NFS client runs mkdir a, cd a, it gets a -521 error;
ll_get_parent() fails with a -22 error, and the MDS prints
"mdt_body_unpack()) Invalid fid: [0x2010e0901000001:0x0:0x4000]".
This patch fixes fid translation for the ptlrpc request/reply.

Signed-off-by: Alexander Boyko <alexander_boyko@xyratex.com>
Xyratex-bug-id: MRP-522
Change-Id: Ib1a79e36ead478ebb874e7e04761bc43f33a410f
Reviewed-on: http://review.whamcloud.com/3475
Tested-by: Hudson
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
Reviewed-by: Cory Spitz <spitzcor@cray.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years ago LU-1674 llite: opencreate without mode can crash llite
Liang Zhen [Thu, 26 Jul 2012 00:38:07 +0000 (08:38 +0800)]
LU-1674 llite: opencreate without mode can crash llite

Users should specify a mode for opencreate, but if they don't, llite
will LBUG, which is not good.

Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Change-Id: I5c4044da6dda2a902bc48f408f6aaf8d02dd82a4
Reviewed-on: http://review.whamcloud.com/3469
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Liang Zhen <liang@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years ago LU-1838 llite: trusted. xattr is invisible to non-root
Fan Yong [Thu, 6 Sep 2012 16:18:05 +0000 (00:18 +0800)]
LU-1838 llite: trusted. xattr is invisible to non-root

Filter out all invalid xattrs in listxattr.
This includes trusted. xattrs that can cause
unnecessary "EPERM" in subsequent getxattr operations.

Signed-off-by: Fan Yong <yong.fan@whamcloud.com>
Signed-off-by: Bob Glossman <bogl@whamcloud.com>
Change-Id: I9613444adcdb14067a775f68f951af7a9b941e9a
Reviewed-on: http://review.whamcloud.com/3892
Tested-by: Hudson
Reviewed-by: Bob Glossman <bob.glossman@intel.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
11 years ago LU-1754 kernel: Kernel update [RHEL6.3 2.6.32-279.5.1.el6]
yangsheng [Fri, 24 Aug 2012 16:31:03 +0000 (00:31 +0800)]
LU-1754 kernel: Kernel update [RHEL6.3 2.6.32-279.5.1.el6]

Update RHEL6.3 kernel to 2.6.32-279.5.1.el6(client only).

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: Idffcaf6efa53119c1d093b852a37453d8c9b4116
Reviewed-on: http://review.whamcloud.com/3774
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years ago LU-919 obdclass: remove hard coded 0x5a5a5a
Niu Yawei [Wed, 11 Jan 2012 04:24:59 +0000 (20:24 -0800)]
LU-919 obdclass: remove hard coded 0x5a5a5a

We assert atomic_t values against a hard-coded 0x5a5a5a in several places,
which could result in a false assertion failure when the reference count
gets very large in some extreme cases.

The hard-coded 0x5a5a5a should be replaced by LI_POISON.

Signed-off-by: Bruno Faccini <bruno.faccini@bull.net>
Signed-off-by: Niu Yawei <niu@whamcloud.com>
Change-Id: Ia1105e48326c20a37d887ba9dc926ea300d97741
Reviewed-on: http://review.whamcloud.com/1954
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years ago LU-1720 kernel: Quota doesn't work over 4TB on single OST
yangsheng [Fri, 10 Aug 2012 13:07:29 +0000 (21:07 +0800)]
LU-1720 kernel: Quota doesn't work over 4TB on single OST

Fix the wrong update chunk in the previous kernel update patch.

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: Ice06fd70ad6f034dddb5aae5291c613e04b18d99
Reviewed-on: http://review.whamcloud.com/3599
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years ago LU-1782 quota: ignore sb_has_quota_active() in OFED's header
Shuichi Ihara [Thu, 23 Aug 2012 16:36:46 +0000 (01:36 +0900)]
LU-1782 quota: ignore sb_has_quota_active() in OFED's header

sb_has_quota_active() and sb_any_quota_active() are defined
in OFED's backport headers, but if these are referenced on
a RHEL5 kernel, quota is broken. So this patch ignores them.

Signed-off-by: Shuichi Ihara <sihara@ddn.com>
Change-Id: Ic78799bc5d948b583b4a515479d5091381c63185
Reviewed-on: http://review.whamcloud.com/3764
Reviewed-by: Niu Yawei <niu@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years ago LU-1496 ptlrpc: prolong rw locks even IO RPCs are finished
Bobi Jam [Thu, 21 Jun 2012 04:37:49 +0000 (12:37 +0800)]
LU-1496 ptlrpc: prolong rw locks even IO RPCs are finished

Refresh rw lock again after IO RPCs are finished to leave a time
window for clients to cancel covering dlm locks.

This is part of the LU-874 backport.

Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Change-Id: I5cd185870e601a66bce21b3cc3c91f5f800b4c27
Reviewed-on: http://review.whamcloud.com/3157
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Jinshan Xiong <jinshan.xiong@whamcloud.com>
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years ago LU-1115 kernel: software raid6 related BUG
yangsheng [Wed, 2 May 2012 10:35:41 +0000 (18:35 +0800)]
LU-1115 kernel: software raid6 related BUG

Software raid6 hits a BUG_ON in fs/bio.c:222 when the raid chunk is > 64k.
We pull in upstream patch 5b99c2ffa980528a197f26c7d876cceeccce8dd5
to deal with this issue.

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: I5330bc161e7cf5364a614547949323fc9a3ee7e3
Reviewed-on: http://review.whamcloud.com/2625
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
11 years ago LU-359 llite: no close error if application has known failure
Fan Yong [Wed, 1 Aug 2012 13:02:36 +0000 (21:02 +0800)]
LU-359 llite: no close error if application has known failure

Don't return the error again on close if the application already knows
about a former write failure, to avoid potential redundant error
handling, such as confusing error messages.

Signed-off-by: Fan Yong <yong.fan@whamcloud.com>
Change-Id: I62d9cd83fc03fad22c994f2a77774ca113a6c057
Reviewed-on: http://review.whamcloud.com/596
Reviewed-by: Niu Yawei <niu@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Jinshan Xiong <jinshan.xiong@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years ago LU-1488 mdc: fix fid_res_name_eq() issue.
yangsheng [Sun, 5 Aug 2012 19:42:48 +0000 (03:42 +0800)]
LU-1488 mdc: fix fid_res_name_eq() issue.

Original error message:
LustreError: 25302:0:(namei.c:256:ll_mdc_blocking_ast())
ns: lustre-MDT0000-mdc-ffff81021762a000 lock:
The issue is caused by commit ef8bd11416bae8c03a65682f3a10a4da39922b45.
fid_res_name_eq() uses the wrong way to compare fid and res_name.

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: Iacba148b6c3ba7fa775d2b9a4a58bdbf67434d7c
Reviewed-on: http://review.whamcloud.com/3522
Tested-by: Hudson
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years ago LU-1511 kernel: kernel update [RHEL5.8 2.6.18-308.11.1.el5]
yangsheng [Tue, 12 Jun 2012 15:57:16 +0000 (23:57 +0800)]
LU-1511 kernel: kernel update [RHEL5.8 2.6.18-308.11.1.el5]

Update RHEL5.8 kernel to 2.6.18-308.11.1.el5.

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: I9d93c9666af54ed8c7d6d9ff33154929c94afe2e
Reviewed-on: http://review.whamcloud.com/3096
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years ago LU-1563 quota: Put lqs properly in quota_pending_commit()
Niu Yawei [Tue, 26 Jun 2012 09:35:01 +0000 (02:35 -0700)]
LU-1563 quota: Put lqs properly in quota_pending_commit()

In quota_pending_commit(), always check if pending > 0 to figure
out if a lqs is held from quota_check_common(), otherwise, we
could find a lqs in quota_pending_commit(), then put it twice.

Signed-off-by: Niu Yawei <niu@whamcloud.com>
Change-Id: Ia653f41d721c002bbfbebcaf688b9943dde256bf
Reviewed-on: http://review.whamcloud.com/3187
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
11 years ago LU-1535 ldlm: backport fix for LU-1128
Lai Siyao [Tue, 19 Jun 2012 08:58:41 +0000 (16:58 +0800)]
LU-1535 ldlm: backport fix for LU-1128

Backport fix for LU-1128 to 1.8:
For ldlm server pool shrinker, we just use it to decrease SLV,
but never reclaim any memory directly, so it should always return
-1 to inform the kernel to break the shrink loop.

Signed-off-by: Lai Siyao <laisiyao@whamcloud.com>
Change-Id: I1c841e7485375017d33e93c59f2318318fae299c
Reviewed-on: http://review.whamcloud.com/3138
Reviewed-by: Niu Yawei <niu@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years ago LU-1459 llite: Don't LBUG when import has LUSTRE_IMP_NEW state
Jeremy Filizetti [Thu, 31 May 2012 14:30:00 +0000 (10:30 -0400)]
LU-1459 llite: Don't LBUG when import has LUSTRE_IMP_NEW state

When a disabled OSC/OST is configured in the system at mount
time, a client will LBUG if calling "lfs check servers".
Disabling the LBUG causes client to return -EIO instead.

Signed-off-by: Jeremy Filizetti <jeremy.filizetti@gmail.com>
Change-Id: Ib689eb37c20d1012728abb7c35aee15f30604d54
Reviewed-on: http://review.whamcloud.com/2993
Tested-by: Hudson
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years ago LU-1459 llite: Don't use uninitialized variable
Jeremy Filizetti [Tue, 5 Jun 2012 00:14:19 +0000 (20:14 -0400)]
LU-1459 llite: Don't use uninitialized variable

Currently lov_connect_obd prints warning messages using
an uninitialized stack variable. The message also only
prints a uuid instead of a meaningful target name.

Signed-off-by: Jeremy Filizetti <jeremy.filizetti@gmail.com>
Change-Id: I7549acdd45ab70e3528d2f0c153a27a502b0404a
Reviewed-on: http://review.whamcloud.com/2992
Tested-by: Hudson
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years ago LU-1448 llite: Prevent NULL pointer dereference on disabled OSC
Jeremy Filizetti [Thu, 31 May 2012 12:26:28 +0000 (08:26 -0400)]
LU-1448 llite: Prevent NULL pointer dereference on disabled OSC

When a file system is mounted with a disabled OSC, reading the import
information from the proc file system can result in a NULL pointer
dereference. The Lustre import on a disabled OSC will remain
in the LUSTRE_IMP_NEW state and imp_connection will remain NULL.

Signed-off-by: Jeremy Filizetti <jeremy.filizetti@gmail.com>
Change-Id: Ib416b2d706ac9797715db2c0ea4f4eaa79bceb22
Reviewed-on: http://review.whamcloud.com/2977
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years ago LU-1438 quota: quota active checking is missed on slave
Niu Yawei [Fri, 8 Jun 2012 04:55:40 +0000 (21:55 -0700)]
LU-1438 quota: quota active checking is missed on slave

On the quota slave, we missed checking whether quota is enabled in
quota_check_common() and several other places, which could cause
the slave to retry acquiring quota in quota_chk_acq_common() infinitely
when quota is already turned off on the master.

Signed-off-by: Niu Yawei <niu@whamcloud.com>
Change-Id: I707bc34684e95f2a0beec99548dc2d78a4ce8bbf
Reviewed-on: http://review.whamcloud.com/3060
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years ago LU-1438 quota: fix race in quota_chk_acq_common()
Niu Yawei [Mon, 28 May 2012 09:12:08 +0000 (02:12 -0700)]
LU-1438 quota: fix race in quota_chk_acq_common()

quota_check_common() and qctxt_adjust_qunit() use different ways
to check if quota is enforced on a certain ID, which could result
in an infinite loop in quota_chk_acq_common() when the QB/QI_SET
flag is cleared just after checking.

This patch uses a non-intrusive way to fix this rare race.

Signed-off-by: Niu Yawei <niu@whamcloud.com>
Change-Id: I7212e9fc85e98a40e36d2773c02f838ca68339bb
Reviewed-on: http://review.whamcloud.com/2927
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years ago LU-814 tests: remove leading spaces from $WRITE_DISJOINT
Yu Jian [Tue, 22 May 2012 11:38:42 +0000 (19:38 +0800)]
LU-814 tests: remove leading spaces from $WRITE_DISJOINT

In functions.sh, the WRITE_DISJOINT variable is defined as follows:

    WRITE_DISJOINT=${WRITE_DISJOINT:-\
        $(which write_disjoint 2> /dev/null || true)}

This assigns WRITE_DISJOINT a value with leading spaces,
which causes a "[: too many arguments" error when checking the
variable. The PARALLEL_GROUPLOCK variable has the same issue.
This patch fixes it.
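
The effect can be reproduced with a small sketch. It uses `sh` in place of `write_disjoint` so that the lookup succeeds on any system, and the variable names `TOOL`, `padded` and `clean` are hypothetical. Keeping the expansion on one line is only one possible fix; the commit message does not say which one the patch uses.

```shell
unset TOOL
# Continuation inside ${VAR:-...}: the backslash-newline is removed, but
# the indentation of the next line becomes part of the default value.
padded=${TOOL:-\
    $(command -v sh 2> /dev/null || true)}

# Same expansion kept on one line: no indentation leaks in.
clean=${TOOL:-$(command -v sh 2> /dev/null || true)}

printf 'padded=[%s]\nclean=[%s]\n' "$padded" "$clean"

# Quoting the expansion inside [ ] is a further safeguard.
[ -n "$clean" ] && echo "tool found"
```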

Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: I424af2acd4ef79ea67830010d17b6904769c2ca4
Reviewed-on: http://review.whamcloud.com/2866
Tested-by: Hudson
Reviewed-by: Minh Diep <mdiep@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years ago LU-121 test: Change framework to only use the short hostname. 1.8.8-wc1 v1_8_8_WC1 v1_8_8_WC1_RC3
chris [Mon, 7 May 2012 12:34:49 +0000 (08:34 -0400)]
LU-121 test: Change framework to only use the short hostname.

This means stripping everything off the name after and including
the first ".".

This change is only designed to make the .yml files consistent. The
log files will append the fully qualified name and appear to do this
consistently.

We can create a jira to make the log and yml files consistent, but
the reality is that the yml files have a short life and are only
used to send to Maloo.

This change will allow automated posting of results and automated
testing to begin.

1. Carries out the above using hostname -s
2. Adds in a quick change so that LUSTRE_BUILD in yaml.sh can be a
reference to the source rather than just the lustre version string
which is recorded and written to the yaml anyway as LUSTER_VERSION
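
For the local host, `hostname -s` gives the short name directly. The same stripping can be applied to an arbitrary name with parameter expansion, as in this sketch; the helper `short_hostname` and the example names are hypothetical and not part of the test framework.

```shell
# Hypothetical helper: drop everything from the first "." onward.
short_hostname() {
    echo "${1%%.*}"
}

short_hostname client-1.lab.example.com
short_hostname client-1
```

Both calls print `client-1`, so names that are already short pass through unchanged.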

Additionally a couple of other changes sneaked in.

1. Allows the review information to be applied to the yaml output
file by way of exporting the variable CODE_REVIEW_YAML to be a yaml
description for maloo

2. The addition of a couple of fixes to make the permissions for yaml
files allow-all. These permission changes are bracketed and so do not
change any other parts of the code.
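
One common way to bracket a permission change is to save the umask, relax it, write the file, and restore it. The sketch below assumes that is what "bracketed" means here; the helper `write_allow_all`, the file name and the yaml line are hypothetical.

```shell
# Hypothetical helper: write text to a new file that every user can read
# and write, without leaving the relaxed umask in place afterwards.
write_allow_all() {
    # $1: target file, $2: text to write
    saved_umask=$(umask)
    umask 0000              # new files get mode 0666
    printf '%s\n' "$2" > "$1"
    umask "$saved_umask"    # close the bracket: restore the old mask
}

outdir=$(mktemp -d)
write_allow_all "$outdir/results.yml" "lustre_version: example"
ls -l "$outdir/results.yml"
```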

Signed-off-by: Chris Gearing <chris@whamcloud.com>
Change-Id: I4b2431030afd206bc83490f5c81fd04e57937aad
Reviewed-on: http://review.whamcloud.com/2663
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Yu Jian <yujian@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years ago LU-458 debug: print client profile name correctly
Yu Jian [Mon, 21 May 2012 06:23:28 +0000 (14:23 +0800)]
LU-458 debug: print client profile name correctly

This patch reverts commit 48c2f667236e2f41f9fd0224b5de7a83517b3180,
which does not print client profile name correctly and introduces
a new defect that the client profile is not deleted properly.

In ll_put_super(), the memory space pointed to by profilenm is
in fact freed inside lustre_common_put_super(sb), which is called
before LCONSOLE_WARN(). In order to print the client profile name
in LCONSOLE_WARN(), we need to copy the contents of profilenm to
temporary storage before freeing profilenm.

Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: I9ce2f304f3bad4761d2e3c857e4cdd5df6269c38
Reviewed-on: http://review.whamcloud.com/2841
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years ago LU-1424 kernel: Kernel update [RHEL6.2 2.6.32-220.17.1.el6]
yangsheng [Mon, 21 May 2012 15:25:21 +0000 (23:25 +0800)]
LU-1424 kernel: Kernel update [RHEL6.2 2.6.32-220.17.1.el6]

Update RHEL6.2 patchless client support to 2.6.32-220.17.1.el6.

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: Ib6a164ecd7beb225107883fc21394056d2ce06bf
Reviewed-on: http://review.whamcloud.com/2848
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years ago LU-458 debug: use profilenm before running class_del_profile() v1_8_8_WC1_RC2
Yu Jian [Wed, 16 May 2012 04:57:41 +0000 (12:57 +0800)]
LU-458 debug: use profilenm before running class_del_profile()

This patch fixes the defect in ll_put_super() which uses profilenm
after running class_del_profile(profilenm).

Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: Ida357c6c970f3b6bf1bbe0060a71d17e65323aa2
Reviewed-on: http://review.whamcloud.com/2799
Reviewed-by: Yang Sheng <ys@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
12 years ago LU-425 tests: fix the issue of using "grep -w"
Yu Jian [Thu, 12 Apr 2012 15:38:17 +0000 (23:38 +0800)]
LU-425 tests: fix the issue of using "grep -w"

This patch fixes the following issue while using "grep -w"
to do exact match:

$ echo /mnt/nbp0-2 | grep -w /mnt/nbp0
/mnt/nbp0-2

Per the description of "-w" option:
-w, --word-regexp
Select only those lines containing matches that form whole words.
The test is that the matching substring must either be at the
beginning of the line, or preceded by a non-word constituent
character. Similarly, it must be either at the end of the line
or followed by a non-word constituent character. Word-constituent
characters are letters, digits, and the underscore.

So, the hyphen "-" character is a non-word constituent character
and "grep -w" does not do exact match on strings which contain it.
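
The behaviour, and one way to get a true exact match, can be sketched as follows. Using `-x` together with `-F` is an alternative chosen for this illustration, not necessarily what the patch does, and the mount list is hypothetical.

```shell
# Hypothetical mount list, one mount point per line.
mounts='/mnt/nbp0-2
/mnt/nbp0'

# -w treats "-" as a word boundary, so both lines match: prints 2.
printf '%s\n' "$mounts" | grep -c -w /mnt/nbp0

# -x requires the whole line to match and -F disables regex
# interpretation, so only the exact mount point matches: prints 1.
printf '%s\n' "$mounts" | grep -c -x -F /mnt/nbp0
```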

Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: I91962910033f561cc9c9a82bd88bbb6dff5594af
Reviewed-on: http://review.whamcloud.com/2528
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Minh Diep <mdiep@whamcloud.com>
Reviewed-by: Wei Liu <sarah@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-1340 release: get ready for 1.8.8-wc1 RC1
Johann Lombardi [Wed, 9 May 2012 15:49:07 +0000 (17:49 +0200)]
LU-1340 release: get ready for 1.8.8-wc1 RC1

Change lustre version to 1.8.8-wc1 for RC1

Signed-off-by: Johann Lombardi <johann@whamcloud.com>
Change-Id: I96448a4e05c3cdbe8945642c53d007a51d0137eb
Reviewed-on: http://review.whamcloud.com/2696

12 years agoLU-1374 kernel: Kernel update [RHEL5.8 2.6.18-308.4.1.el5]
yangsheng [Fri, 4 May 2012 16:06:30 +0000 (00:06 +0800)]
LU-1374 kernel: Kernel update [RHEL5.8 2.6.18-308.4.1.el5]

Update RHEL5.8 kernel to 2.6.18-308.4.1.el5.

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: I1304dda0fae60c86ea67862095856a1c741ee3c3
Reviewed-on: http://review.whamcloud.com/2651
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
12 years agoLU-458 debug: quiet too noisy console messages at mount
yangsheng [Mon, 26 Mar 2012 17:09:15 +0000 (01:09 +0800)]
LU-458 debug: quiet too noisy console messages at mount

Quiet a number of extra debug messages printed to the console after a
remount or recovery. They provide no value and just add to the general
confusion of reading Lustre debug messages.

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: I5d5352d55d5a91f9fd4c55d077eebf1fdab61f80
Reviewed-on: http://review.whamcloud.com/2381
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-1358 kernel: Kernel update [RHEL6.2 2.6.32-220.13.1.el6]
yangsheng [Wed, 2 May 2012 05:57:11 +0000 (13:57 +0800)]
LU-1358 kernel: Kernel update [RHEL6.2 2.6.32-220.13.1.el6]

Update RHEL6.2 patchless client to 2.6.32-220.13.1.el6.

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: I290f985bb6cbcc6c46bc1821c7d87819479eb1be
Reviewed-on: http://review.whamcloud.com/2623
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-814 test: automated NFS over lustre testing
Minh Diep [Fri, 20 Apr 2012 18:57:37 +0000 (11:57 -0700)]
LU-814 test: automated NFS over lustre testing

Provide NFS setup within the auster framework.
Note: this change includes LU-1134, LU-1213

Signed-off-by: Minh Diep <mdiep@whamcloud.com>
Change-Id: If28a237a23cd448c7d8b9a772a4b8951d94697ef
Reviewed-on: http://review.whamcloud.com/2593
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Bob Glossman <bogl@whamcloud.com>
Tested-by: Bob Glossman <bogl@whamcloud.com>
Reviewed-by: Yu Jian <yujian@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-1312 kernel: crash at boot time in isci driver
yangsheng [Tue, 24 Apr 2012 19:50:34 +0000 (03:50 +0800)]
LU-1312 kernel: crash at boot time in isci driver

Restore SG_ALL to its default value to avoid a crash in the isci driver.

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: I855ba8c7669b749fded51f3b0316f115d18e0fcd
Reviewed-on: http://review.whamcloud.com/2595
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
12 years agoLU-1335 build: include lustre srpm in build
Minh Diep [Thu, 19 Apr 2012 00:18:46 +0000 (17:18 -0700)]
LU-1335 build: include lustre srpm in build

Add support for building lustre-*.src.rpm.

Signed-off-by: Minh Diep <mdiep@whamcloud.com>
Change-Id: I301915c107e50ea5d1a3275ecc631f07aa2b78be
Reviewed-on: http://review.whamcloud.com/2576
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Brian J. Murrell <brian@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-447 add lctl --net XXX push
Wally Wang [Tue, 30 Aug 2011 23:43:53 +0000 (16:43 -0700)]
LU-447 add lctl --net XXX push

In order to clear out peer/conn data in the gnilnd for testing after
adding lnet_notify (see LU-446), we need an alternative to
lctl --net gni del_peer, as this nukes the peer and results in false
lnd_query failures.

Change-Id: Ie8029953a2881c0d6e3ac250101d2d4374bbf3c1
Signed-off-by: Wally Wang <wang@cray.com>
Reviewed-on: http://review.whamcloud.com/1311
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Liang Zhen <liang@whamcloud.com>
Reviewed-by: Lai Siyao <laisiyao@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Reviewed-by: James Simmons <uja.ornl@gmail.com>
12 years agoLU-1340 release: bump version to 1.8.7.81-wc1 v1_8_7_81_WC1
Johann Lombardi [Fri, 20 Apr 2012 13:55:14 +0000 (15:55 +0200)]
LU-1340 release: bump version to 1.8.7.81-wc1

Yet another build which brings us closer to 1.8.8-wc1 RC1.

Signed-off-by: Johann Lombardi <johann@whamcloud.com>
Change-Id: I7bac34ab4d29dc459171265319807c221a564c1a
Reviewed-on: http://review.whamcloud.com/2590

12 years agoLU-995 utils: make lfs getstripe directory output consistent.
Hongchao Zhang [Thu, 15 Mar 2012 04:03:07 +0000 (12:03 +0800)]
LU-995 utils: make lfs getstripe directory output consistent.

"lfs getstripe" should report the global default for any fields with
a value that means "use the default". This patch introduces the
following functionality:

1. If "lfs getstripe" is called on a directory and finds that striping
   EA is not set, the filesystem's defaults are looked up and printed.
2. If the striping EA is set, but the striping count and/or striping
   size has a value that means "use the default" (count = 0 and/or
   size = 0), the filesystem's default for that specific striping
   attribute is looked up and printed.
3. A new option to "lfs getstripe" is introduced; the "--raw" or
   "-R" option. If this option is specified, the previous two checks
   are skipped. In other words, if the striping EA is not set, 0, 0,
   -1, will be printed for the striping count, size, and offset
   respectively. Also, if the striping EA is set, the values will be
   printed without first converting them into their respective
   defaults.

This patch relies on the /proc filesystem to determine each
filesystem's default striping attributes, and a cache is maintained
which holds the default values for the last filesystem queried.
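
The three rules above can be sketched as follows. This is a minimal Python illustration under stated assumptions, not the lfs implementation: the Striping type, resolve_striping() and the example default values are all hypothetical, and the /proc lookup and cache are replaced by a plain fs_default argument.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Striping:
    count: int
    size: int
    offset: int


def resolve_striping(ea: Optional[Striping], fs_default: Striping,
                     raw: bool = False) -> Striping:
    """Return the striping values to print for a directory (hypothetical)."""
    if ea is None:
        # Rule 1: no striping EA. With --raw, print 0, 0, -1 instead of
        # looking up the filesystem defaults.
        return Striping(0, 0, -1) if raw else fs_default
    if raw:
        # Rule 3: print the stored values without converting them.
        return ea
    # Rule 2: a count or size of 0 means "use the default" for that field.
    return Striping(
        count=ea.count or fs_default.count,
        size=ea.size or fs_default.size,
        offset=ea.offset,
    )


# Example default values are assumptions chosen for illustration.
defaults = Striping(count=1, size=1048576, offset=-1)
print(resolve_striping(Striping(0, 4194304, 2), defaults))
print(resolve_striping(None, defaults, raw=True))
```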

See Also:
Bugzilla #23802, https://bugzilla.lustre.org/show_bug.cgi?id=23802

Signed-off-by: Prakash Surya <surya1@llnl.gov>
Signed-off-by: Hongchao Zhang <hongchao.zhang@whamcloud.com>
Change-Id: Ic5b616dd83775c0a15be5c060b11cfbba05c4fbb
Reviewed-on: http://review.whamcloud.com/2117
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-891 test: waiting import state for next step.
yangsheng [Fri, 2 Mar 2012 14:46:47 +0000 (22:46 +0800)]
LU-891 test: waiting import state for next step.

There is still a rare chance that a request meets an invalid import
after recovery, so we should wait for the import to be restored to a
certain state before doing the next operation.

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: I20bed347a16755ccaf102d4c67b0a7e87b1318a1
Reviewed-on: http://review.whamcloud.com/2248
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-734 tests: add sub-tests into recovery-*-scale tests
Yu Jian [Wed, 11 Apr 2012 06:41:49 +0000 (14:41 +0800)]
LU-734 tests: add sub-tests into recovery-*-scale tests

This patch adds sub-tests into the recovery-*-scale tests
so that test results and logs could be gathered properly
and uploaded to Maloo.

The patch also does some cleanup work on the test scripts
and moves some common functions into test-framework.sh.

Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: I514143e1fa29aad289d215174dbc68d2740da73b
Reviewed-on: http://review.whamcloud.com/2508
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Minh Diep <mdiep@whamcloud.com>
Reviewed-by: Cliff White <cliffw@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-577 tests: FAIL replay-single test_70b rundbench load
James Simmons [Wed, 18 Apr 2012 14:09:14 +0000 (10:09 -0400)]
LU-577 tests: FAIL replay-single test_70b rundbench load

Test 70b for replay-single assumes that Lustre is mounted on
/mnt/lustre, which is not the case for us. This patch passes
the proper MOUNT. The test also did not use the standard
DIR/tdir setup, so generated data files were not being
cleaned up. Increase the sleep period to match dbench's
warm-up period. This gives dbench a chance to start up when
using many clients.

Signed-off-by: James Simmons <uja.ornl@gmail.com>
Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: I3a793db35aa21d57220d0de1a9e92486e65ae21a
Reviewed-on: http://review.whamcloud.com/2518
Tested-by: Hudson
Reviewed-by: Wei Liu <sarah@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-1062 tests: incorrect path to configuration file
Iurii Golovach [Tue, 20 Mar 2012 00:53:27 +0000 (17:53 -0700)]
LU-1062 tests: incorrect path to configuration file

This is a slightly modified port of
http://review.whamcloud.com/#change,1877
(author Andreas Dilger)

Xyratex-bug-id: MRP-480

Reviewed-by: Sergey Glushchenko <Sergey_Glushchenko@xyratex.com>
Reviewed-by: Andriy Skulysh <andriy_skulysh@xyratex.com>
Signed-off-by: Iurii Golovach <iurii_golovach@xyratex.com>
Change-Id: I10b42de2c2d453e23142c01290742153dff7262a
Reviewed-on: http://review.whamcloud.com/2419
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-427 test: Test failure on test suite lfsck
Yu Jian [Tue, 10 Apr 2012 10:23:02 +0000 (18:23 +0800)]
LU-427 test: Test failure on test suite lfsck

- Reset $MDSDB & $OSTDB in generate_db(). Otherwise they will
  be stale if the user redefines $SHARED_DIRECTORY.
- Add a function check_shared_dir() to ensure
  $SHARED_DIRECTORY is shared among test nodes.
- Fix check_logdir() and check_write_access() to avoid using
  node.$(hostname).yml files which should not be deleted.

Signed-off-by: Yang Sheng <ys@whamcloud.com>
Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: Ie7d1d08c0d2c701fa9fb74ef8b252fa8b31bf111
Reviewed-on: http://review.whamcloud.com/2498
Tested-by: Hudson
Reviewed-by: Wei Liu <sarah@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-313 tests: re-enable lfsck test to run by default
Andreas Dilger [Tue, 10 Apr 2012 08:53:29 +0000 (16:53 +0800)]
LU-313 tests: re-enable lfsck test to run by default

Due to bug 13698, the lfsck part of the lfsck.sh test script was
disabled by default. After the fixes in LU-113 were landed, lfsck
should work again. Remove SKIP_LFSCK checks so lfsck.sh actually runs
lfsck instead of silently skipping it unless SKIP_LFSCK=no is set.

Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: I430f7398b2d21db0d0755726fdcb6053f25b4b10
Reviewed-on: http://review.whamcloud.com/2497
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Wei Liu <sarah@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-1052 kernel: Kernel update [RHEL5.8 2.6.18-308.1.1.el5]
yangsheng [Wed, 22 Feb 2012 10:27:42 +0000 (18:27 +0800)]
LU-1052 kernel: Kernel update [RHEL5.8 2.6.18-308.1.1.el5]

Update RHEL5.8 kernel to 2.6.18-308.1.1.el5.

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: Id8394bc1cb57d9555e4cda1312200a23b0915c84
Reviewed-on: http://review.whamcloud.com/2175
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Tested-by: Hudson
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: James Simmons <uja.ornl@gmail.com>
12 years agoLU-935 quota: break early when b/i_unit_sz exceeded upper limit
Niu Yawei [Mon, 19 Dec 2011 04:12:55 +0000 (20:12 -0800)]
LU-935 quota: break early when b/i_unit_sz exceeded upper limit

While expanding b/i_unit_sz in dquot_create_oqaq(), break the loop
early when b/i_unit_sz exceeds the upper limit; otherwise qaq_b/iunit_sz
could overflow and result in an endless loop.

Signed-off-by: Niu Yawei <niu@whamcloud.com>
Change-Id: I6046c8b86cad00c8c2fa2d24d0e21821a3c02209
Reviewed-on: http://review.whamcloud.com/1887
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-1241 kernel: Kernel update [RHEL6.2 2.6.32-220.7.1.el6]
yangsheng [Wed, 28 Mar 2012 20:28:59 +0000 (04:28 +0800)]
LU-1241 kernel: Kernel update [RHEL6.2 2.6.32-220.7.1.el6]

Update RHEL6.2 kernel to 2.6.32-220.7.1.el6 (patchless client).

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: I0933095b4e412dbb4ff826f45196d3b6aa80aecc
Reviewed-on: http://review.whamcloud.com/2401
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-847 quota: client retrieve quota usage directly
Niu Yawei [Fri, 21 Oct 2011 08:30:46 +0000 (01:30 -0700)]
LU-847 quota: client retrieve quota usage directly

Currently 'lfs quota' sends a getquota RPC to the MDS, and the MDS is
responsible for retrieving disk usage from all targets. This scheme is
changed so that the client retrieves disk usage from all targets directly.

This patch addresses the compatibility issue as well: if the getquota
reply from the MDS has QIF_SPACE set, the client simply trusts the disk
usage returned by the MDS; otherwise, the client has to issue RPCs to
collect disk usage by itself.
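
The compatibility check can be pictured roughly as below. This is a hedged Python sketch, not the client code: disk_usage() and query_targets are hypothetical names, the numeric value of QIF_SPACE is an assumption taken from common Linux quota headers, and summing the per-target results is assumed for illustration.

```python
from typing import Callable, Iterable

QIF_SPACE = 2  # assumption: bit marking the space usage field as valid


def disk_usage(reply_valid: int, mds_usage: int,
               query_targets: Callable[[], Iterable[int]]) -> int:
    """Pick the source of the disk usage figure (hypothetical helper)."""
    if reply_valid & QIF_SPACE:
        # The MDS already filled in the usage: trust it.
        return mds_usage
    # Otherwise the client collects usage from the targets itself.
    # query_targets stands in for the per-target RPCs.
    return sum(query_targets())


print(disk_usage(QIF_SPACE, 100, lambda: [1, 2]))
print(disk_usage(0, 0, lambda: [40, 60]))
```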

Signed-off-by: Niu Yawei <niu@whamcloud.com>
Signed-off-by: Johann Lombardi <johann@whamcloud.com>
Change-Id: I35526c453033809d2f23b1806b4783f5011e3fa2
Reviewed-on: http://review.whamcloud.com/1570
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
12 years agoLU-974 protocol: change OBD_CONNECT_GRANT_PARAM
Andreas Dilger [Tue, 13 Mar 2012 20:31:38 +0000 (14:31 -0600)]
LU-974 protocol: change OBD_CONNECT_GRANT_PARAM

Change the OBD_CONNECT_GRANT_PARAM flag value to avoid conflict
with the OBD_CONNECT_UMASK flag from LU-974.  While that patch is
not yet landed to our release tree, it is in use in production at
some customers.  While the risk of conflict is currently low, it
is easier to change the GRANT_PARAM value since it is only in use on
the orion branch, and isn't even handled by the client there yet.

Add (hopefully) clear comments for OBD_CONNECT and obd_connect_data
to ensure that they are not modified in some incompatible way across
branches.

Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: I503892c3b595c0272b0941fa58a16a496318cab0
Reviewed-on: http://review.whamcloud.com/2298
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-649 io: DIO doesn't need lock i_mutex
Johann Lombardi [Wed, 28 Mar 2012 12:10:35 +0000 (14:10 +0200)]
LU-649 io: DIO doesn't need lock i_mutex

Revert "LU-649 io: DIO doesn't need lock i_mutex"
This reverts commit 4ebec99e309fcafd1be8a0b10673d69a083ae87f.

There is a concern that this patch is wrong, so I'm reverting it until
the situation is clarified.

Signed-off-by: Johann Lombardi <johann@whamcloud.com>
Change-Id: Ia7e12eaeeb87ef30d073e28391a25ffcd8ad146f
Reviewed-on: http://review.whamcloud.com/2398

12 years agoLU-249 mds: Increase the maximum number of threads on MDS
James Simmons [Tue, 27 Mar 2012 12:46:56 +0000 (08:46 -0400)]
LU-249 mds: Increase the maximum number of threads on MDS

Currently the largest number of threads that can exist on
the MDS is 512. On very large systems those threads can
easily be exhausted, which ends up backlogging the system. This
patch allows the user to define the upper limit of how many
threads are created on the MDS. If no limit is given, the default
of 512 is kept.

Change-Id: Ie48eb00f48ddcd98cc518178ba5e1da9ea1e40eb
Signed-off-by: James Simmons <uja.ornl@gmail.com>
Reviewed-on: http://review.whamcloud.com/2354
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-1018 test: Reduce compilebench run time
Minh Diep [Tue, 24 Jan 2012 17:30:32 +0000 (09:30 -0800)]
LU-1018 test: Reduce compilebench run time

We decided to reduce the run time for compilebench
so it would run under 1h on VM nodes. This is similar
to master

Signed-off-by: Minh Diep <mdiep@whamcloud.com>
Change-Id: Ie664ad131e4fbce9443eb78d06cbe36d6002b881
Reviewed-on: http://review.whamcloud.com/2005
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Cliff White <cliffw@whamcloud.com>
Reviewed-by: Wei Liu <sarah@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-554 LNet: add gnilnd awareness to LNet
Chris Horn [Tue, 31 Jan 2012 18:31:44 +0000 (12:31 -0600)]
LU-554 LNet: add gnilnd awareness to LNet

This allows servers on any network to talk to gnilnd routers.
See Oracle 23884 attachment 31892

Change-Id: I6989c987b2bd0d60058a290567f8ef0bee56420b
Signed-off-by: Chris Horn <hornc@cray.com>
Reviewed-on: http://review.whamcloud.com/1179
Reviewed-by: James Simmons <uja.ornl@gmail.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-1221 ldiskfs: Replace sysname with nodename in MMP
Niu Yawei [Fri, 16 Mar 2012 03:22:59 +0000 (20:22 -0700)]
LU-1221 ldiskfs: Replace sysname with nodename in MMP

sysname holds "Linux" by default, i.e. what appears when
doing a "uname -s"; nodename should be used to print the
machine's hostname, i.e. what is returned when doing a
"uname -n" or "hostname", and what
gethostname(2)/sethostname(2) manipulate, in order to
notify the administrator of the node which is contending
to mount the filesystem.
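
The difference between the two fields is easy to see from user space. The snippet below is only an illustration in Python (the fix itself is in kernel C code); contending_node_name() is a hypothetical helper, and os.uname() is available on Unix-like systems only.

```python
import os


def contending_node_name() -> str:
    """Name to report for the contending node (hypothetical helper)."""
    return os.uname().nodename


info = os.uname()
print("sysname :", info.sysname)   # kernel name, as printed by "uname -s"
print("nodename:", info.nodename)  # hostname, as printed by "uname -n"
```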

Andreas says this was introduced when porting the MMP
patches from RHEL5 to RHEL6, and then also pushed upstream
to ext4; a patch for upstream ext4 has already been
submitted.

MRP-222
Signed-off-by: Nikitas Angelinas <nikitas_angelinas@xyratex.com>
Signed-off-by: Niu Yawei <niu@whamcloud.com>
Change-Id: Ie5c8e99ae536c2239134140589d678d3a4f92639
Reviewed-on: http://review.whamcloud.com/2337
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-1198 idl: move FID VER to DLM resource name[1]
Andreas Dilger [Thu, 8 Mar 2012 14:22:59 +0000 (22:22 +0800)]
LU-1198 idl: move FID VER to DLM resource name[1]

Until Lustre 1.8.7/2.1.1 the FID version was packed into name[2].

However, this leaves very little room in the LDLM resource name
for other uses.  The upcoming quota code needs to store another
FID into the LDLM resource to allow directory tree quotas to be
managed by the DLM.

The 32-bit VER, which is currently always 0, is moved into the high
bits of name[1] along with the 32-bit OID, to avoid consuming the
name[2] field.  Since future use of the FID version (including
snapshots, pools, etc) will need changes on the client side anyway,
there will never be non-zero VER on an existing client.
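
The new layout of name[1] can be sketched with plain bit operations. This is an illustrative Python sketch of the packing described above, not the Lustre code: pack_ver_oid() and unpack_ver_oid() are hypothetical names, and placing the OID in the low 32 bits is inferred from VER occupying the high bits.

```python
from typing import Tuple

MASK32 = 0xFFFFFFFF


def pack_ver_oid(ver: int, oid: int) -> int:
    """Build name[1]: VER in the high 32 bits, OID in the low 32 bits."""
    return ((ver & MASK32) << 32) | (oid & MASK32)


def unpack_ver_oid(name1: int) -> Tuple[int, int]:
    """Split name[1] back into (ver, oid)."""
    return (name1 >> 32) & MASK32, name1 & MASK32


# With VER always 0 today, name[1] is numerically equal to the OID.
print(hex(pack_ver_oid(0, 0x1234)))
print(unpack_ver_oid(pack_ver_oid(1, 2)))
```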

Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: If1e500cfb277dfc25bc056bb0c5763e48e7d500c
Reviewed-on: http://review.whamcloud.com/2275
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-164 Update testframework to restart sessions at next suite
Chris Gearing [Thu, 12 May 2011 09:42:54 +0000 (10:42 +0100)]
LU-164 Update testframework to restart sessions at next suite

Update the test framework so that it can append to yaml files rather
than begin a new series and allow auster to begin at a selected
test_suite.

auster has a new switch '-S suite-name' which allows the caller to
start a test group part way through.

The idea behind all of this is that a test session that fails at a
test suite can be restarted at the next suite.

Change-Id: Ifc84184506cff5ee11e5dc570ce17baab4d439a4
Signed-off-by: Chris Gearing <chris@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/538
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Robert Read <rread@whamcloud.com>
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
12 years agoLU-983 llite: align readahead to 1M after ra_max adjustment
wangdi [Thu, 19 Jan 2012 05:43:00 +0000 (21:43 -0800)]
LU-983 llite: align readahead to 1M after ra_max adjustment

Align the readahead pages only if ria_start != 0, otherwise the
readahead pages will be cut to zero. And it can be checked by
checking (ret > ((ria->ria_start + ret) % PTLRPC_MAX_BRW_PAGES))
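
The check quoted above can be read as follows. This is a hedged Python sketch of the arithmetic only: align_readahead() is a hypothetical name, and the value 256 for PTLRPC_MAX_BRW_PAGES assumes 4 KiB pages and 1 MiB RPCs.

```python
PTLRPC_MAX_BRW_PAGES = 256  # assumption: 1 MiB RPC / 4 KiB pages


def align_readahead(ria_start: int, ret: int) -> int:
    """Trim ret pages so that ria_start + ret ends on an RPC boundary.

    The trim is skipped when it would remove the whole window, which is
    what happens for a small window starting at page 0.
    """
    beyond = (ria_start + ret) % PTLRPC_MAX_BRW_PAGES
    if ret > beyond:
        ret -= beyond
    return ret


print(align_readahead(10, 300))  # window now ends at page 256
print(align_readahead(0, 100))   # left alone instead of being cut to zero
```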

Add test_101e to check small reads on small files.

Put 101a on the exception list, because it does random small
reads (64K) on the file and then checks the pages being discarded.
In the current implementation, small reads are always extended
to 1M during read-ahead, which might cause too many unused
pages to be read and then discarded later.

Remove the big RPC check in 101c, because the test does random small
(64K) reads, so it only needs to make sure there are no small
RPCs during the read. Besides, the original big RPC check is wrong,
since it cannot ensure that all of the RPCs exist in this case.

Signed-off-by: Wang Di <di.wang@whamcloud.com>
Change-Id: If86975274f36050b9252304c447f057e99f0c5ea
Reviewed-on: http://review.whamcloud.com/1983
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-432 multiop test needs $(PTHREAD_LIBS)
Christos Triantafyllidis [Sun, 19 Jun 2011 22:31:43 +0000 (01:31 +0300)]
LU-432 multiop test needs $(PTHREAD_LIBS)

Added $(PTHREAD_LIBS) to multiop_LDADD for multiop test.

Change-Id: I95ce209fc033122ac89bb4ee74f8e095e5727b17
Signed-off-by: Christos Triantafyllidis <ctria@grid.auth.gr>
Reviewed-on: http://review.whamcloud.com/978
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Tested-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-1116 kernel: Support RHEL6.2 patchless client.
yangsheng [Tue, 21 Feb 2012 01:46:52 +0000 (09:46 +0800)]
LU-1116 kernel: Support RHEL6.2 patchless client.

Update RHEL6.2 kernel to 2.6.32-220.4.2(client).

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: I654d8d56b14196acf8d27b31a5af8c34812d8b99
Reviewed-on: http://review.whamcloud.com/2171
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-987 build: Fail to create ldisk rpms
Minh Diep [Tue, 7 Feb 2012 17:56:06 +0000 (09:56 -0800)]
LU-987 build: Fail to create ldisk rpms

The autoMakefile.am in ldiskfs does not define the
BUILD_SERVER flag, so "make rpms" fails. This patch
simply sets the flag to true, since ldiskfs will most
likely be used only on servers.

Signed-off-by: Minh Diep <mdiep@whamcloud.com>
Change-Id: Ic42eb543658f97bbff9c2f360d8693da840a8a36
Reviewed-on: http://review.whamcloud.com/2113
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-746 obdfilter-survey FAIL: test_1b ost4: hndls expected > 8, have 2
Lai Siyao [Tue, 18 Oct 2011 05:08:47 +0000 (22:08 -0700)]
LU-746 obdfilter-survey FAIL: test_1b ost4: hndls expected > 8, have 2

obdfilter-survey.sh checks jbd proc stats after survey, but it
doesn't take obd cleanup time into account, so the stats
collected may not reflect the data of survey test. Save run time
of survey test, and collect stats of (run_time/4) ago to fix this.

Change-Id: Ib0b55693d13da172ca476ca79013a8a2e9dd81ca
Signed-off-by: Lai Siyao <laisiyao@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/1534
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Liang Zhen <liang@whamcloud.com>
12 years agoLU-322 tests: Pool test fixes for large number of OSTS
Minh Diep [Tue, 13 Dec 2011 01:41:04 +0000 (17:41 -0800)]
LU-322 tests: Pool test fixes for large number of OSTS

This patch fixes issues with large numbers of OSTs in the
ost-pools test. We need to use hexadecimal numbers for
the pool args since the OST UUIDs are named with hex numbers.
Currently the test does pass, but errors can be seen in the logs
when more than 9 OSTs exist.
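
The mismatch only shows up from index 10 onwards, as the following sketch illustrates. It is a Python illustration rather than the test script itself: ost_uuid() is a hypothetical helper, and the "<fsname>-OST<4 hex digits>_UUID" naming is an assumption about the target name format.

```python
def ost_uuid(fsname: str, index: int) -> str:
    """Format an OST UUID with a 4-digit hexadecimal index (assumed format)."""
    return f"{fsname}-OST{index:04x}_UUID"


for index in (9, 10, 16):
    decimal_form = f"{index:04d}"
    hex_form = f"{index:04x}"
    # Decimal and hex agree up to 9 and diverge afterwards.
    print(index, decimal_form, hex_form, ost_uuid("lustre", index))
```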

Signed-off-by: Minh Diep <mdiep@whamcloud.com>
Change-Id: I5db7540ee710c528af33d768a105a55bd09229b3
Reviewed-on: http://review.whamcloud.com/1841
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Yu Jian <yujian@whamcloud.com>
Reviewed-by: James Simmons <uja.ornl@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-952 quota: follow locking order of quota code
Niu Yawei [Thu, 5 Jan 2012 05:30:54 +0000 (21:30 -0800)]
LU-952 quota: follow locking order of quota code

The locking order of quota code is: i_mutex > dqonoff_sem >
journal_lock > dqptr_sem > dquot->dq_lock > dqio_mutex, so we
should call ll_vfs_dq_init() after the journal is started to avoid
deadlock.
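
A lock order like this can be checked mechanically. The sketch below is a Python illustration only: follows_lock_order() is a hypothetical helper, and the example sequences assume, purely for illustration, that quota initialization takes dqptr_sem.

```python
from typing import Sequence

# Order taken from the commit message, outermost lock first.
QUOTA_LOCK_ORDER = [
    "i_mutex",
    "dqonoff_sem",
    "journal_lock",
    "dqptr_sem",
    "dquot->dq_lock",
    "dqio_mutex",
]


def follows_lock_order(acquired: Sequence[str]) -> bool:
    """Return True if locks are taken strictly in the documented order."""
    ranks = [QUOTA_LOCK_ORDER.index(name) for name in acquired]
    return all(a < b for a, b in zip(ranks, ranks[1:]))


# Journal first, then the quota lock: consistent with the order.
print(follows_lock_order(["i_mutex", "journal_lock", "dqptr_sem"]))
# Quota lock before the journal: violates the order.
print(follows_lock_order(["i_mutex", "dqptr_sem", "journal_lock"]))
```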

Signed-off-by: Niu Yawei <niu@whamcloud.com>
Change-Id: I7f805abc3105e338dd60bc632d860be5389d87ec
Reviewed-on: http://review.whamcloud.com/1919
Tested-by: Hudson
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
12 years agoLU-358 Add the branch and commit-id to the yml.
Chris Gearing [Thu, 26 May 2011 10:45:21 +0000 (11:45 +0100)]
LU-358 Add the branch and commit-id to the yml.

yaml.sh is updated to provide the information requested in the JIRA
ticket. The change is simple and requires that LUSTRE_BRANCH and
LUSTRE_REVISION are defined; if they are not defined, the elements are
left blank.

Signed-off-by: Chris Gearing <chris@whamcloud.com>
Change-Id: Id18f3d5c68d3703af3b76628d7d43289c57bf485
Reviewed-on: http://review.whamcloud.com/1400
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Yu Jian <yujian@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-534 test: nfsread_orphan_file test
Bobi Jam [Tue, 20 Dec 2011 01:19:52 +0000 (09:19 +0800)]
LU-534 test: nfsread_orphan_file test

Reproducer of bug17764.

In order to hit the assertion the lustre file has to be open twice
from lustre clients, and once from nfs mount.  Then unlink and close
opened lustre file and read the file opened on nfs.

Bugzilla: 17764
Author: Elena Gryanova <elena.gryaznova@oracle.com>
Signed-off-by: Elena Gryanova <elena.gryaznova@oracle.com>
Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Change-Id: I05d8f04a2bd7c64864e1b82a07ef26a170887714
Reviewed-on: http://review.whamcloud.com/1895
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-534 mds: correct assertion
Vladimir Saveliev [Thu, 24 Nov 2011 18:50:04 +0000 (21:50 +0300)]
LU-534 mds: correct assertion

An orphan inode can be reached in mds_open when opening by FID, which
takes place when accessing files via NFS. Correct the assertion
accordingly.

Bugzilla: 17764
Author: Vladimir Saveliev <vladimir.saveliev@oracle.com>
Signed-off-by: Vladimir Saveliev <vladimir.saveliev@oracle.com>
Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Change-Id: Id6cde949003c722b56fe7ccec43b2d8b8a0667bb
Reviewed-on: http://review.whamcloud.com/1894
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-955 build: fix bad lustre-backend-fs dependency
Minh Diep [Mon, 9 Jan 2012 16:51:37 +0000 (08:51 -0800)]
LU-955 build: fix bad lustre-backend-fs dependency

Fix an incorrect RPM package dependency when Lustre RPMs are built
with "make rpms" for the client only.

Signed-off-by: Minh Diep <mdiep@whamcloud.com>
Change-Id: I1e00a2d2710ec96036cbad0a3539f704251f880a
Reviewed-on: http://review.whamcloud.com/1931
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-805 quota: lfs quota doesn't print grace time correctly
Niu Yawei [Tue, 8 Nov 2011 10:01:05 +0000 (02:01 -0800)]
LU-805 quota: lfs quota doesn't print grace time correctly

Lustre always triggers the grace time when the allocated qunit exceeds
the softlimit; however, the user tool 'lfs quota' only prints the grace
time when the total usage is greater than the softlimit, so sometimes the
user can't tell from the 'lfs quota' output whether the softlimit is
already exceeded.

This patch changes 'lfs quota' to use the data obtained from the kernel
instead of comparing usage with the softlimit.

Signed-off-by: Niu Yawei <niu@whamcloud.com>
Change-Id: I320cd0fafcb92e8b91f688b2c8bb50bc44442a09
Reviewed-on: http://review.whamcloud.com/1667
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-649 io: DIO doesn't need lock i_mutex
Lai Siyao [Mon, 12 Dec 2011 15:12:42 +0000 (07:12 -0800)]
LU-649 io: DIO doesn't need lock i_mutex

There is a deadlock if a file is written with normal IO and
read with DIO at the same time:
regular write: extent lock -> i_mutex
DIO read: i_mutex -> server take extent lock

i_mutex taken at DIO read is redundant, because server will take
extent lock to serialize with write and truncate. Removing it can
avoid the deadlock.

Change-Id: I280559cebae5d04e4fd943bf3fbe82d67e07834b
Signed-off-by: Lai Siyao <laisiyao@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/1829
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Niu Yawei <niu@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-882 quota: Quota code compares unsigned < 0
Niu Yawei [Tue, 29 Nov 2011 04:59:38 +0000 (20:59 -0800)]
LU-882 quota: Quota code compares unsigned < 0

Port from b23858.

check_cur_qunit() checks "if (limit + record < 0)"; however,
limit is unsigned, so this check is always false, and when
limit is smaller than -record, the following "limit += record" makes
limit an unreasonably large value.
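
The wraparound is easy to reproduce. The sketch below mimics the unsigned arithmetic in Python under stated assumptions: a 64-bit limit is assumed, the helper names are hypothetical, and add_checked() shows one possible guard (clamping at zero), which is not necessarily what the patch does.

```python
U64_MASK = (1 << 64) - 1  # assumption: limit is an unsigned 64-bit value


def add_wrapping_u64(limit: int, record: int) -> int:
    """Mimic C's "limit += record" for an unsigned 64-bit limit."""
    return (limit + record) & U64_MASK


def add_checked(limit: int, record: int) -> int:
    """Compare before adding, so a too-negative record cannot wrap."""
    if record < 0 and limit < -record:
        return 0  # illustrative choice: clamp instead of wrapping
    return (limit + record) & U64_MASK


wrapped = add_wrapping_u64(5, -10)
# An unsigned result is never negative, so a "< 0" test cannot fire.
print(wrapped < 0, wrapped)
print(add_checked(5, -10))
```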

This patch also fixed a similar defect in dqacq_handler().

Signed-off-by: Vladimir Saveliev <vladimir.saveliev@oracle.com>
Signed-off-by: Niu Yawei <niu@whamcloud.com>
Change-Id: Ibe1d643ea3b310b2e55c05a8c200ba5e0137ee27
Reviewed-on: http://review.whamcloud.com/1750
Tested-by: Hudson
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-620 llite: add delete/remove_from_page_cache check
Bobi Jam [Fri, 4 Nov 2011 09:18:19 +0000 (17:18 +0800)]
LU-620 llite: add delete/remove_from_page_cache check

Later 2.6.32 kernels use the memory cgroup feature and do not export
truncate_complete_page, but export delete_from_page_cache or
remove_from_page_cache instead. We need to use them properly in the
patchless client code.

Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Change-Id: I4b95da2b4cac7f2c2f63b69896dd91c3544a5473
Reviewed-on: http://review.whamcloud.com/1649
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
12 years agoLU-792 lbuild-rhel5 should use redhat's SRPM repo
Michael MacDonald [Tue, 25 Oct 2011 14:23:28 +0000 (10:23 -0400)]
LU-792 lbuild-rhel5 should use redhat's SRPM repo

Download EL5 kernel .src.rpm packages from redhat's repo instead
of trying to use CentOS's often-outdated repo.

Change-Id: I45c59894c649f3290a8ce1f0d3f1f033395567a4
Signed-off-by: Michael MacDonald <mjmac@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/1595
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Brian J. Murrell <brian@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Yu Jian [Sat, 31 Dec 2011 09:04:41 +0000 (17:04 +0800)]
LU-424 tests: conf-sanity test 55, 56, 58 failure fixes

The MGS service was not started in conf-sanity tests 55, 56 and 58
with separate MGS and MDT configuration. This patch fixes the issue.

Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: Id82121f645d1903934bcf00328627947e943c45c
Reviewed-on: http://review.whamcloud.com/955
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-by: Minh Diep <mdiep@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Minh Diep [Mon, 7 Nov 2011 18:11:38 +0000 (10:11 -0800)]
LU-780 test: improve parallel-scale to support hyperion run

We need to add support for srun/slurm, and a few tests
from the hyperion-sanity script that has been used for hyperion
testing.

Signed-off-by: Minh Diep <mdiep@whamcloud.com>
Change-Id: I30d5c9c84fed6f6533ec766e6278a9c86046e4a9
Reviewed-on: http://review.whamcloud.com/1662
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Yu Jian <yujian@whamcloud.com>
Reviewed-by: Cliff White <cliffw@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
hongchao.zhang [Fri, 19 Aug 2011 09:00:18 +0000 (17:00 +0800)]
LU-604 open non-exist object should return ENOENT

When ll_intent_file_open() is called against a nonexistent object, we cannot
obtain the RPC request refcount after the call, so subsequent operations should
not use this internal RPC request, to avoid accessing freed memory.

Change-Id: Ic8cd2a7798d2d454cbe5931eb93b92b1a44cca4b
Signed-off-by: Hongchao Zhang <hongchao.zhang@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/1271
Tested-by: Hudson
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Matt Ezell [Thu, 13 Oct 2011 20:39:04 +0000 (16:39 -0400)]
LU-715 lov: fix procfs reporting for qos values

When writing to
/proc/fs/lustre/lov/<fsname>-mdtlov/{qos_prio_free,qos_threshold_rr},
the values read back are often one less than the values written.
This happens because internally the value is stored as a number from
0-255 but accessed by the user as 0-100. Integer truncation in the
storage and retrieval stages often causes the value read back to be
lower. Adding 255 in an internal step causes the bit-shift to "round up".
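
The rounding effect can be illustrated with a small sketch. The helper names and the exact formulas below are assumptions for illustration (a percentage scaled by 256/100 and converted back with an 8-bit shift); they are not taken from the lov source.

```c
/* Hypothetical helpers; names and exact formulas are assumptions. */

/* Store: convert a user percentage (0-100) to the internal scale. */
static unsigned int pct_to_internal(unsigned int pct)
{
	return (pct << 8) / 100;
}

/* Read back, truncating: a second truncation can lose one unit. */
static unsigned int internal_to_pct_truncating(unsigned int val)
{
	return (val * 100) >> 8;
}

/* Read back, adding 255 before the shift so the result rounds up. */
static unsigned int internal_to_pct_rounding(unsigned int val)
{
	return (val * 100 + 255) >> 8;
}
```

Under these assumptions, writing 91 stores 232; the truncating read returns 90, while the rounding read returns 91.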

Signed-off-by: Matt Ezell <ezell@nics.utk.edu>
Change-Id: I3bb1bb4b048d40eb3882c7cb22799320f90496c7
Reviewed-on: http://review.whamcloud.com/1522
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Minh Diep [Wed, 7 Dec 2011 23:10:56 +0000 (15:10 -0800)]
LU-841 tests: sanity.sh 27q does not create a testing directory

sanity.sh 27q does not create a testing directory, which causes
ENOENT errors from "ONLY=27q bash sanity.sh".

Signed-off-by: Minh Diep <mdiep@whamcloud.com>
Change-Id: Ied574286fe8ee9c1859da05cc7737fb325c5ef28
Reviewed-on: http://review.whamcloud.com/1807
Tested-by: Hudson
Reviewed-by: Yu Jian <yujian@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Wei Liu <sarah@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
wangdi [Mon, 28 Nov 2011 04:41:47 +0000 (20:41 -0800)]
LU-880 mds: Fix mds_open counter.

Remove duplicate mds_open counter.

Signed-off-by: Wang Di <di.wang@whamcloud.com>
Change-Id: I229929e9c1635704a668364f72cf17428ed70ef1
Reviewed-on: http://review.whamcloud.com/1747
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
yangsheng [Tue, 8 Nov 2011 17:48:26 +0000 (01:48 +0800)]
LU-824 corrupted ldiskfs after md rebuild (bz24264)

Pick up a patch from upstream to fix an md bug that may
cause corruption after a rebuild.

Change-Id: I764a7c5ac4595aa73d821fc8e66f55598d1ff099
Signed-off-by: Yang Sheng <ys@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/1672
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Andreas Dilger [Sat, 12 Nov 2011 19:23:49 +0000 (12:23 -0700)]
LU-553 build: improve checks for commit-msg

Improve the checks done by the commit-msg script.  It now ensures
that all the parts of the commit message are present.
- validate that the Change-Id: generated from 'git hash-object' is
  not empty, since this can happen if git is unhappy with the options
- check for only one Change-Id: line (multiple Signed-off-by: OK)
- describe the "component:" field better, with examples

If there was an error committing the message, save a copy to a
temporary file, so that it can be edited and re-used, instead of
having to recreate it each time, or fetch it from .git/COMMIT_MSG.

Add a simple regression test with good & bad commit messages, so
it is easier to verify that any changes made to the script will
continue to both detect errors, and pass valid commit messages.

Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: I9f954cd142a4b5cd9d596c7221cd3d6f73d8cf17
Merge-from: I15cb3690560400a591598997424cf79dee3a039d
Reviewed-on: http://review.whamcloud.com/1689
Tested-by: Hudson
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Minh Diep [Mon, 7 Nov 2011 17:23:38 +0000 (09:23 -0800)]
LU-737 utils: check device name for digit

We need to check the whole string for digits,
not only the first character.
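
As a rough illustration of the check described above, the sketch below tests every character rather than just the first. The function name is_all_digits() is hypothetical and is not taken from the Lustre utilities; treating an empty string as non-numeric is also an assumption.

```c
#include <ctype.h>
#include <stdbool.h>
#include <stddef.h>

/*
 * Hypothetical helper: return true only if every character of the
 * string is a decimal digit. An empty or NULL string is rejected.
 */
static bool is_all_digits(const char *s)
{
	if (s == NULL || *s == '\0')
		return false;

	for (; *s != '\0'; s++) {
		/* Cast avoids undefined behavior for negative char values. */
		if (!isdigit((unsigned char)*s))
			return false;
	}

	return true;
}
```

A name such as "1abc" passes a first-character test but is rejected here, while "12" is accepted.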

Signed-off-by: Minh Diep <mdiep@whamcloud.com>
Change-Id: Ib3002f2f8dd9c657d7bcfcfa9166488c080152c2
Reviewed-on: http://review.whamcloud.com/1660
Reviewed-by: wangdi <di.wang@whamcloud.com>
Reviewed-by: Yu Jian <yujian@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Bobi Jam [Wed, 28 Sep 2011 10:24:12 +0000 (18:24 +0800)]
LU-645 Avoid unnecessary dentry rehashing

In the patchless case, the sequence __d_drop() then d_rehash_cond() creates
a race window if the dentry is already hashed; in this case rehashing should
be avoided.

Change-Id: Ib891e7d2e5ae7efe19ef2ac0abb111e9f9681b7b
Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/1435
Tested-by: Hudson
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
yangsheng [Sun, 25 Sep 2011 13:45:19 +0000 (21:45 +0800)]
LU-695 clean up for OBD_FAIL_CHECK_ONCE

Clean up for OBD_FAIL_CHECK_ONCE. Just use OBD_FAIL_CHECK.

Signed-off-by: Chris Horn <hornc@cray.com>
Signed-off-by: Yang Sheng <ys@whamcloud.com>
Change-Id: I86bb314a052d74da6f3be1d4b85aa287f706ed36
Reviewed-on: http://review.whamcloud.com/1424
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Yu Jian [Fri, 30 Sep 2011 05:41:26 +0000 (13:41 +0800)]
LU-691 fix OST index errors in test suite - sanity 133c defect

Several tests run do_facet ost, which assumes ost is ost0, which does not exist
according to the way the test suite works. This patch addresses several areas
where the wrong OST index is used. This patch also introduces some shorthand
functions to get OST properties from the index. Those functions ensure ost1,
ost10 and ost100 are seen as different, which is not always the case.

The following patch was also ported:
b=24504 sanity test_133* and check_stats() fix
i=Vladimir.Saveliev

Signed-off-by: James Simmons <uja.ornl@gmail.com>
Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: Ia7016e3c152acd8aa75cda6f838275bee316f639
Reviewed-on: http://review.whamcloud.com/1450
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Minh Diep <mdiep@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Johann Lombardi [Fri, 4 Nov 2011 09:57:41 +0000 (10:57 +0100)]
Change version to 1.8.7.80 for 1.8.8-wc1

Signed-off-by: Johann Lombardi <johann@whamcloud.com>
Change-Id: I64aa0df0c2f40f884b3ccd5db38d8bbbf022bf86
Reviewed-on: http://review.whamcloud.com/1648