Whamcloud - gitweb
10 years agoLU-184 Keep orphan on failover umount
Niu Yawei [Thu, 14 Apr 2011 02:50:37 +0000 (19:50 -0700)]
LU-184 Keep orphan on failover umount

- On failover umount, the orphan should not be cleared on last close.
- In mdt_mfd_close(), ma_valid should not be set to zero; otherwise, the
  MDS will never issue the object destroy to the OSS.

Signed-off-by: Niu Yawei <niu@whamcloud.com>
Change-Id: If0a894655d0ea42ad0d07e725174b73a137b7adb
Reviewed-on: http://review.whamcloud.com/426
Tested-by: Hudson
Reviewed-by: Mikhail Pershin <tappro@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoLU-25: Use LDLM_ASYNC with ldlm_cancel_lru to avoid blocking.
Christopher J. Morrone [Thu, 6 Jan 2011 01:07:49 +0000 (17:07 -0800)]
LU-25: Use LDLM_ASYNC with ldlm_cancel_lru to avoid blocking.

Have ldlm_cli_pool_shrink() call ldlm_cancel_lru() with the LDLM_ASYNC
option to have the blocking callback handled in another thread.  This
will avoid randomly blocking for excessively long times when the shrinker
is called.

Change-Id: Ide705df2640a2a96be554ffc452930a2e180fc20
Signed-off-by: Christopher J. Morrone <morrone2@llnl.gov>
Reviewed-on: http://review.whamcloud.com/277
Tested-by: Hudson
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoLU-140 Remove leftovers of iopen
Johann Lombardi [Sat, 9 Apr 2011 20:46:34 +0000 (13:46 -0700)]
LU-140 Remove leftovers of iopen

iopen has already been removed from the patch series, but it is still referenced in many places.

Change-Id: I512ce8ceef11d99f812f1348a5b53552b3ac1a42
Signed-off-by: Johann Lombardi <johann@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/417
Tested-by: Hudson
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoLU-183 replace osd with osd* in proc paths
Yu Jian [Wed, 6 Apr 2011 14:10:33 +0000 (22:10 +0800)]
LU-183 replace osd with osd* in proc paths

Fix the test scripts to use "osd*" instead of "osd" so that
it works with any OSD type.

Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: Ibc403ff146a7f429acb16c35449ec3026f6a3c7b
Reviewed-on: http://review.whamcloud.com/405
Tested-by: Hudson
Reviewed-by: Brian J. Murrell <brian@whamcloud.com>
Reviewed-by: Alex Zhuravlev <bzzz@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoLU-73 add a kernel config for rhel6/i686
Brian J. Murrell [Tue, 5 Apr 2011 02:49:34 +0000 (22:49 -0400)]
LU-73 add a kernel config for rhel6/i686

And a small patch to lbuild-rhel to display the diff of what got
built and what's in our tree for a .config.

Signed-off-by: Brian J. Murrell <brian@whamcloud.com>
Change-Id: Ic847c92bbef5d6293cd54e26be242d37b5eb13c5
Reviewed-on: http://review.whamcloud.com/400
Tested-by: Hudson
Reviewed-by: Michael MacDonald <mjmac@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoLU-98 Fix defects in bug 14949 implementation
Brian J. Murrell [Thu, 3 Mar 2011 13:49:43 +0000 (08:49 -0500)]
LU-98 Fix defects in bug 14949 implementation

Some of the details of the implementation of bug 14949 "integrate
lustre-iokit into normal build process" were incorrect.  This was leading
to a problem when trying to build with "make rpms" after configuring without
the iokit.

Signed-off-by: Brian J. Murrell <brian@whamcloud.com>
Change-Id: Id064c688d63e3e70a90b725e49831b9c319229f0
Reviewed-on: http://review.whamcloud.com/294
Tested-by: Hudson
Reviewed-by: Robert Read <rread@whamcloud.com>
Reviewed-by: Michael MacDonald <mjmac@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoLU-115 get some verbose patching output
Brian J. Murrell [Wed, 2 Mar 2011 18:10:55 +0000 (13:10 -0500)]
LU-115 get some verbose patching output

Signed-off-by: Brian J. Murrell <brian@whamcloud.com>
Change-Id: I548a3a26b61241ed8d81551779bd78950dccdd54
Reviewed-on: http://review.whamcloud.com/298
Tested-by: Hudson
Reviewed-by: Robert Read <rread@whamcloud.com>
Reviewed-by: Michael MacDonald <mjmac@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoLU-170 oscc_grow_count will never grow
Liang Zhen [Mon, 28 Mar 2011 16:05:16 +0000 (00:05 +0800)]
LU-170 oscc_grow_count will never grow

We use req::rq_async_args.space[0] to store the original value of
oscc_grow_count and req::rq_async_args.pointer_arg[0] to store oscc.
However, ptlrpc_async_args is a union, which means that
req::rq_async_args.space[0] is always overwritten by the oscc pointer.
As a result, osc_interpret_create always evaluates the condition
"if (diff < (int) req->rq_async_args.space[0])" as true, resets
oscc_grow_count to OST_MIN_PRECREATE, and sets OSCC_FLAG_LOW.

Because it is very unsafe to use the raw scratchpad directly, I also
cleaned up all uses of the raw scratchpad in this patch.
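
The aliasing problem described above can be reproduced in a small user-space sketch. The union below is a simplified stand-in for ptlrpc_async_args (the member sizes are assumptions), and struct create_args, args_store(), and args_load() are hypothetical names used only for illustration; they are not taken from the patch.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Simplified stand-in for ptlrpc_async_args: both members share storage. */
union async_args {
	void     *pointer_arg[4];
	uint64_t  space[4];
};

/* Hypothetical typed view of the scratchpad: one field per value. */
struct create_args {
	void *oscc;        /* stand-in for the creator object pointer */
	int   grow_count;  /* saved copy of the grow count */
};

/* Returns 1 if storing a pointer in slot 0 replaced the bytes of space[0]. */
static int raw_slots_alias(void)
{
	union async_args aa;
	int dummy = 0;
	void *p = &dummy;

	memset(&aa, 0, sizeof(aa));
	aa.space[0] = 32;       /* save the counter first ... */
	aa.pointer_arg[0] = p;  /* ... then write the pointer over the same bytes */
	return memcmp(&aa, &p, sizeof(p)) == 0;
}

/* Copy the typed arguments into the scratchpad. */
static void args_store(union async_args *aa, const struct create_args *ca)
{
	assert(sizeof(*ca) <= sizeof(*aa));
	memcpy(aa, ca, sizeof(*ca));
}

/* Copy the typed arguments back out of the scratchpad. */
static struct create_args args_load(const union async_args *aa)
{
	struct create_args ca;

	memcpy(&ca, aa, sizeof(ca));
	return ca;
}

/* Returns the counter read back after a store/load round trip. */
static int typed_slots_keep_count(int count)
{
	union async_args aa;
	struct create_args in = { .oscc = &aa, .grow_count = count };

	args_store(&aa, &in);
	return args_load(&aa).grow_count;
}
```

With a typed structure, the pointer and the counter occupy separate fields, so neither write can silently replace the other.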

Change-Id: I56348c2ebaf27acb493185db73f3992a17610d98
Signed-off-by: Liang Zhen <liang@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/371
Tested-by: Hudson
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
10 years agoLU-189 RHEL6 MMP patch misses fix from bug 22117
Bobi Jam [Sun, 3 Apr 2011 04:14:38 +0000 (12:14 +0800)]
LU-189 RHEL6 MMP patch misses fix from bug 22117

Change-Id: I61010892c69f744e10c9561a202573fd08543d99
Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/393
Tested-by: Hudson
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoLU-144 xattr fixes
Alexey Lyashkov [Fri, 18 Mar 2011 11:55:56 +0000 (14:55 +0300)]
LU-144 xattr fixes

- generate trusted.lov locally instead of asking the MDT for it
- take a conflicting LCK_EX lock to properly flush ACLs on update from client

Change-Id: Ic78f21fffb5174f83400e70ab368c80531322663
Signed-off-by: Alexey Lyashkov <Alexey_Lyashkov@xyratex.com>
Signed-off-by: Vitaly Fertman <vitaly_fertman@xyratex.com>
Reviewed-on: http://review.whamcloud.com/341
Tested-by: Hudson
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
10 years agoLU-13 updated patch to fix at_min issue
ZhangHongChao [Tue, 31 Aug 2010 20:38:48 +0000 (13:38 -0700)]
LU-13 updated patch to fix at_min issue

Also Bugzilla bug 23352, attachment 31479
This is already landed for 1.8.6.

Change-Id: I87db1f40401e1da9337eace5d3bd992bc64871a4
Signed-off-by: Christopher J. Morrone <morrone2@llnl.gov>
Reviewed-on: http://review.whamcloud.com/306
Tested-by: Hudson
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Mikhail Pershin <tappro@whamcloud.com>
10 years agoLU-182 Add ldiskfs 64bit-name-hash patch.
Bobi Jam [Fri, 1 Apr 2011 01:37:18 +0000 (09:37 +0800)]
LU-182 Add ldiskfs 64bit-name-hash patch.

Change-Id: Ic00e17dbba630c8b1367ddfb6f6c350111b045b5
Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/388
Tested-by: Hudson
Reviewed-by: Andrew Perepechko <andrew_perepechko@xyratex.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoLU-22 ldlm_resource::lr_lvb_data is protected by wrong lock
Bobi Jam [Thu, 31 Mar 2011 07:55:23 +0000 (15:55 +0800)]
LU-22 ldlm_resource::lr_lvb_data is protected by wrong lock

Use lr_lock to protect lr_lvbo_data while lr_lvbo_sem is only for
lvbo_init serialization.

Bugzilla: 24336
Change-Id: Ic3e77e99d8b3a3ca277adbc6548c254969e9761a
Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/379
Tested-by: Hudson
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoLU-73 Make register_shrinker argument more visible. 2.0.59-llnl3-base
Bobi Jam [Thu, 31 Mar 2011 04:06:14 +0000 (12:06 +0800)]
LU-73 Make register_shrinker argument more visible.

Change-Id: I0e1e9331b250bb31ae1e545e5705243e3a26c75a
Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/377
Tested-by: Hudson
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: wangdi <di.wang@whamcloud.com>
10 years agoLU-168 Claim MDS_INODELOCK_LOOKUP lock when revalidate root object
nasf [Tue, 29 Mar 2011 06:53:38 +0000 (14:53 +0800)]
LU-168 Claim MDS_INODELOCK_LOOKUP lock when revalidate root object

Change-Id: I6854627ad1a168817d7551aee880652c873a663a
Signed-off-by: nasf <yong.fan@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/370
Tested-by: Hudson
Reviewed-by: Mikhail Pershin <tappro@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoLU-177 lbuild autonomy fixes
Brian J. Murrell [Fri, 25 Mar 2011 04:48:57 +0000 (00:48 -0400)]
LU-177 lbuild autonomy fixes

Fixes to allow lbuild to work without unneeded help from its caller:
- kernel srpm location for rhel5
- CentOS is a "rhel" distro
Direct output to the output FD.
Refactor find_linux_rpm-rhel{5,6}.

Signed-off-by: Brian J. Murrell <brian@whamcloud.com>
Change-Id: I714b90b53b14ac4ef4cd6bfe638a029656a151b0
Reviewed-on: http://review.whamcloud.com/361
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Tested-by: Hudson
Reviewed-by: Michael MacDonald <mjmac@whamcloud.com>
10 years agoLU-155 cleanup some dead code
Johann Lombardi [Thu, 24 Mar 2011 17:01:04 +0000 (18:01 +0100)]
LU-155 cleanup some dead code

- 2.x does not support any kernels with vfs intents, so we can drop the related lustre code
 (as a side note, HAVE_VFS_INTENT_PATCHES seems to be gone already)
- remove ll_pin/unpin since it won't be used any more
- add a comment to ll_ddelete()

Change-Id: I3ed16d84c8618071ea66d9c2cc9d2810b23f1830
Signed-off-by: Johann Lombardi <johann@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/359
Tested-by: Hudson
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Mikhail Pershin <tappro@whamcloud.com>
10 years agoLU-167 Return "-ENOENT" instead of "-ESTALE" when getattr against non-existent object
nasf [Tue, 29 Mar 2011 03:13:28 +0000 (11:13 +0800)]
LU-167 Return "-ENOENT" instead of "-ESTALE" when getattr against non-existent object

Change-Id: I17148a361e711c92f31f6a9318512267a866c724
Signed-off-by: nasf <yong.fan@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/369
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: wangdi <di.wang@whamcloud.com>
Tested-by: Hudson
10 years agoLU-84 Allow building against kernel devel headers with quotas
Ned Bass [Fri, 11 Feb 2011 20:12:19 +0000 (12:12 -0800)]
LU-84 Allow building against kernel devel headers with quotas

The quota format definitions are normally found in private kernel
headers.  However, some sites build Lustre against kernel development
headers rather than full kernel source. To accommodate this we add
the minimal set of required definitions to
lustre/include/lustre_quota.h which will take effect only if
quotaio_v*.h are not located at configure time.  The upstream version
of the 64-bit quota format patch in RHEL6 adds QFMT_VFS_V1 to
include/linux/quota.h, so configure can check for that if the
quotaio_v*.h headers aren't found.

Signed-off-by: Ned Bass <bass6@llnl.gov>
Change-Id: Id627f21604f1e35f550adb972c7f0e0ca9edf115
Reviewed-on: http://review.whamcloud.com/295
Tested-by: Hudson
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
10 years agoORNL-2 MDS only processes child lock for getattr-by-fid
nasf [Fri, 25 Mar 2011 05:17:49 +0000 (13:17 +0800)]
ORNL-2 MDS only processes child lock for getattr-by-fid

Change-Id: I4fd46761702f13173f43c83e1d76ff4b0478e786
Signed-off-by: nasf <yong.fan@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/365
Reviewed-by: wangdi <di.wang@whamcloud.com>
Tested-by: Hudson
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoLU-73 RHEL6 support.
Bobi Jam [Thu, 24 Mar 2011 17:06:06 +0000 (01:06 +0800)]
LU-73 RHEL6 support.

Add kernel jbd2-commit-timer-no-jiffies-rounding-rhel6.patch.

Change-Id: Ic0f49962ecb72b974a3e5d6225c2995e481c6783
Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/358
Tested-by: Hudson
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoLU-162 Export ext4_discard_preallocations for ldiskfs
Bobi Jam [Thu, 24 Mar 2011 16:57:31 +0000 (00:57 +0800)]
LU-162 Export ext4_discard_preallocations for ldiskfs

Change-Id: I75d5553293fe5c4e409557895c48d02c65180409
Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/360
Tested-by: Hudson
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoLU-91 Fix quota format problem with RHEL6 and kernels >= 2.6.33
Johann Lombardi [Mon, 28 Feb 2011 14:49:21 +0000 (15:49 +0100)]
LU-91 Fix quota format problem with RHEL6 and kernels >= 2.6.33

Since kernel commit 869835dfad3eb6f7d90c3255a24b084fea82f30d "quota: Improve
checking of quota file header", quota users now have to specify what
format they want to use and an error is returned if it does not match the
one used on disk.
Since we only support the 64-bit quota format with lustre 2.x, we should
just use QFMT_VFS_V1 instead of QFMT_VFS_V0 (which should still be used
for older kernels like RHEL5).
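
A minimal sketch of the selection described above, assuming the build system detects whether the kernel validates the quota file header. The numeric IDs are written from memory of include/linux/quota.h and should be treated as assumptions; lustre_quota_format() and its parameter are hypothetical names, not the code from this patch.

```c
/* Quota format IDs, assumed to match include/linux/quota.h. */
#define QFMT_VFS_V0 2
#define QFMT_VFS_V1 4

/*
 * Pick the format ID to request from the kernel.
 * kernel_checks_header is nonzero for kernels that verify the on-disk
 * quota header against the requested format (RHEL6, >= 2.6.33).
 */
static int lustre_quota_format(int kernel_checks_header)
{
	return kernel_checks_header ? QFMT_VFS_V1 : QFMT_VFS_V0;
}
```

A plain kernel version comparison would not be enough here, because RHEL6 carries the header check as a backport; a configure-time feature test is the safer trigger.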

Change-Id: I9022073385c76dafbf698fbdf2df58de38535d88
Signed-off-by: Johann Lombardi <johann@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/268
Reviewed-by: Brian J. Murrell <brian@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Tested-by: Hudson
10 years agoLU-73 RHEL6 support.
Bobi Jam [Tue, 15 Mar 2011 01:19:12 +0000 (09:19 +0800)]
LU-73 RHEL6 support.

Include client, ldiskfs, kernel patches.

Change-Id: Ice16b8bf40c2e37df9af9f399316917097e8ee8f
Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/307
Tested-by: Hudson
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Brian J. Murrell <brian@whamcloud.com>
10 years agoLU-139 move DCACHE_LUSTRE_INVALID to higher bit to prevent conflict with other dentry...
Johann Lombardi [Thu, 17 Mar 2011 21:17:49 +0000 (22:17 +0100)]
LU-139 move DCACHE_LUSTRE_INVALID to higher bit to prevent conflict with other dentry flags

2.6.38 comes with a bunch of new dentry flags and one of them

Change-Id: I11de689058d12b07b2f00191f2d175d48a90ff8e
Signed-off-by: Johann Lombardi <johann@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/339
Tested-by: Hudson
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoLU-39 ASSERTION(atomic_read(&client_stat->nid_exp_ref_count) == 0)
Bobi Jam [Tue, 22 Mar 2011 03:01:52 +0000 (11:01 +0800)]
LU-39 ASSERTION(atomic_read(&client_stat->nid_exp_ref_count) == 0)

In lprocfs_exp_setup(), we need to release the old stats in all cases.

Bugzilla: 23499
Change-Id: Ida40ffe82a12e213779e344da92f5e3e437f8c8b
Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/350
Tested-by: Hudson
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoLU-73 lbuild and friends for RHEL6
Brian J. Murrell [Wed, 9 Mar 2011 21:11:07 +0000 (16:11 -0500)]
LU-73 lbuild and friends for RHEL6

This adds the lbuild support needed to build RHEL6 servers.
Included is a re-org of the RHEL5 lbuild code to maximize code
reuse and minimize code replication.
The kernel config here is the stock config from an x86_64 RHEL6
kernel build.  It might need some tuning for Lustre.

Signed-off-by: Brian J. Murrell <brian@whamcloud.com>
Change-Id: If83ee846d6d84a456444fb54feba0ad7810a586a
Reviewed-on: http://review.whamcloud.com/332
Tested-by: Hudson
Reviewed-by: Michael MacDonald <mjmac@whamcloud.com>
Reviewed-by: Robert Read <rread@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoLU-148 ll_readpage has to unlock vmpage by any means
Jinshan Xiong [Sat, 19 Mar 2011 19:44:03 +0000 (12:44 -0700)]
LU-148 ll_readpage has to unlock vmpage by any means

Unlock vmpage in case ll_cl_init fails.

Change-Id: Id7d92df20c7057b324f21067e5b1cd3311cf9f41
Signed-off-by: Jinshan Xiong <jay@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/349
Tested-by: Hudson
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: wangdi <di.wang@whamcloud.com>
10 years agob=24226 SUID/SGID related processing
nasf [Tue, 1 Mar 2011 09:17:29 +0000 (17:17 +0800)]
b=24226 SUID/SGID related processing

1) remove SUID/SGID when a file is written or truncated.
2) keep SUID/SGID for a normal chmod that leaves file data unchanged.
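
The two rules can be sketched as a pair of mode helpers. This is an illustration only: the function names are hypothetical, and dropping SGID only when group-execute is set follows the usual Linux VFS convention, which is an assumption about what the patch implements.

```c
#include <sys/types.h>
#include <sys/stat.h>

/* Mode bits after file data changes (write or truncate). */
static mode_t mode_after_data_change(mode_t mode)
{
	/* SUID is always dropped when the data changes. */
	mode &= ~(mode_t)S_ISUID;

	/*
	 * Assumption: SGID is dropped only together with group-execute;
	 * SGID without group-execute is left alone.
	 */
	if ((mode & (S_ISGID | S_IXGRP)) == (S_ISGID | S_IXGRP))
		mode &= ~(mode_t)S_ISGID;

	return mode;
}

/* A plain chmod does not touch file data, so the requested bits are kept. */
static mode_t mode_after_chmod(mode_t requested)
{
	return requested & 07777;
}
```

For example, a file with mode 04755 ends up as 0755 after a write, while chmod to 06755 keeps both special bits.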

Issue: LU-65
Change-Id: I664f16c9bace1b0c011abcc7e2d103432886350e
Signed-off-by: nasf <yong.fan@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/238
Tested-by: Hudson
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoLU-114 use ll_lookup_one_len() instead of lookup_one_len() in sptlrpc_target_local_co...
Johann Lombardi [Mon, 7 Mar 2011 16:55:23 +0000 (17:55 +0100)]
LU-114 use ll_lookup_one_len() instead of lookup_one_len() in sptlrpc_target_local_copy_conf() should lock the parent dir when doing lookup

Recent kernels print a warning when lookup_one_len() is called w/o the parent dir locked.
sptlrpc_target_local_copy_conf() should not call lookup_one_len() directly, but ll_lookup_one_len()
instead which locks the parent dir for you and also checks for bad inode.

Change-Id: I4d020c55382238bbbc6df6017f54b5b585f53c48
Signed-off-by: Johann Lombardi <johann@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/303
Tested-by: Hudson
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agob=20581 MDS returns full hash for readdir to decrease hash collision 2.0.59-llnl2-base
nasf [Tue, 1 Mar 2011 07:59:40 +0000 (15:59 +0800)]
b=20581 MDS returns full hash for readdir to decrease hash collision

1) MDS returns full hash (both major and minor, 64bit) for readdir to decrease
   hash collision.
2) Synchronization control between traversing thread and statahead thread when
Issues: LU-71
Change-Id: I043784d58607d474a501111c9690fdab89ce8a4a
Signed-off-by: nasf <yong.fan@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/281
Tested-by: Hudson
Reviewed-by: Cliff White <cliffw@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoLU-68 Fix a race between lock cancel and write
Oleg Drokin [Mon, 14 Mar 2011 22:46:17 +0000 (18:46 -0400)]
LU-68 Fix a race between lock cancel and write

A race in osc_lock_detach due to disconnect between reading
kms from lock resource and write updating kms in the resource
could lead to loss of the update.
Lock the object across entire kms access to avert the issue.

Change-Id: I4b15e611b91b7a4a6b69f5c9fa9b6bd0a1f8cac9
Signed-off-by: Oleg Drokin <green@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/327
Tested-by: Hudson
Reviewed-by: wangdi <di.wang@whamcloud.com>
Reviewed-by: Jinshan Xiong <jay@whamcloud.com>
10 years agoLU-82 Remove useless clio locks
Jinshan Xiong [Mon, 7 Mar 2011 18:56:17 +0000 (10:56 -0800)]
LU-82 Remove useless clio locks

A CLS_NEW toplock must be deleted also if the last sublock is being deleted.

Signed-off-by: Jinshan Xiong <jay@whamcloud.com>
Change-Id: I9831ab534ae222fe74ebaf34faafe00bf3d18cb7
Reviewed-on: http://review.whamcloud.com/305
Tested-by: Hudson
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoLU-83 Kernel BUG in ext4 if wrong mount option is given
Oleg Drokin [Fri, 11 Mar 2011 01:30:17 +0000 (20:30 -0500)]
LU-83 Kernel BUG in ext4 if wrong mount option is given

The original attempt at landing a fix for this bug landed a hopelessly
broken patch.
Replace the patch with proper code.

Change-Id: Iddc38c4254b1f981b10e272b671fad232e20fd70
Signed-off-by: Oleg Drokin <green@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/314
Tested-by: Hudson
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
10 years agob=24375 Fix lvb updating race in enqueue vs completion case
Oleg Drokin [Fri, 11 Feb 2011 03:28:15 +0000 (22:28 -0500)]
b=24375 Fix lvb updating race in enqueue vs completion case

ldlm_enqueue_tail checked for lock mode and updated lvb without
taking appropriate locks.
Take res and lock around mode check and lvb updating.

Issue: LU-67
Change-Id: I8fb764313326b8bf8f813e7cf77c050dd68afe45
Reviewed-on: http://review.whamcloud.com/229
Tested-by: Hudson
Reviewed-by: Liang Zhen <liang@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Alex Zhuravlev <bzzz@whamcloud.com>
10 years agob=22980 improve obdfilter-survey performance on multi-core system
Niu Yawei [Fri, 18 Feb 2011 03:46:53 +0000 (19:46 -0800)]
b=22980 improve obdfilter-survey performance on multi-core system

1) Add unlock_ioctl support.
2) Land OBD_ALLOC_LARGE() patch from b4424 to avoid vmalloc in ioctl path.
3) Minor fix in class_handle_ioctl() to avoid double free.

Issue: LU-66
Change-Id: Id965cd13fdaeb63197c30e557708afbfa9ee9f39
Signed-off-by: Niu Yawei <niu@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/247
Tested-by: Hudson
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
10 years agoLU-116 fix mmp test 9 and test 10
Yu Jian [Mon, 7 Mar 2011 10:44:02 +0000 (18:44 +0800)]
LU-116 fix mmp test 9 and test 10

1) fix the sleep time in test 9 to abort e2fsck before it's complete
2) fix test 10 to allow e2fsck to run on a mounted filesystem with the -n option

Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: I2605d75f47e44db659ced2e5435e3b51cc8897c6
Reviewed-on: http://review.whamcloud.com/301
Tested-by: Hudson
Reviewed-by: Brian J. Murrell <brian@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agob=22703 no error when close if application has known write failure already
nasf [Wed, 2 Mar 2011 10:23:08 +0000 (18:23 +0800)]
b=22703 no error when close if application has known write failure already

Do not return an error on close if the application already knows about the write failure.

Issues: LU-65
Change-Id: I25b9628fcdd3bf5b7db9d3f8460e06c18226b2e0
Signed-off-by: nasf <yong.fan@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/286
Tested-by: Hudson
Reviewed-by: Jinshan Xiong <jay@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoMake llite_lloop.ko build conditional
Christopher J. Morrone [Thu, 24 Feb 2011 22:11:45 +0000 (14:11 -0800)]
Make llite_lloop.ko build conditional

The llite_lloop.ko module only works with kernel page sizes smaller
than 64K, so test for the kernel page size and only build the module

Issue: LU-94
Change-Id: I78e416832767bbe29529d152e45addec7a999506
Signed-off-by: Christopher J. Morrone <morrone2@llnl.gov>
Reviewed-on: http://review.whamcloud.com/266
Tested-by: Hudson
Reviewed-by: Brian J. Murrell <brian@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoORNL-2 Enable OBD_CONNECT_ATTRFID supporting on lustre-2.x
nasf [Mon, 7 Mar 2011 15:23:42 +0000 (23:23 +0800)]
ORNL-2 Enable OBD_CONNECT_ATTRFID supporting on lustre-2.x

Drop the redundant MDS-side RPC for root object revalidation when
stat is called on each item while traversing a directory by absolute path.

Signed-off-by: nasf <yong.fan@whamcloud.com>
Change-Id: I5f7c7755b4142ac1f23c66517e5919c1398e15b8
Reviewed-on: http://review.whamcloud.com/302
Tested-by: Hudson
Reviewed-by: Lai Siyao <laisiyao@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agob=24417 Update Build-Depends
Brian J. Murrell [Fri, 21 Jan 2011 22:47:58 +0000 (17:47 -0500)]
b=24417 Update Build-Depends

- remove texlive-latex-recommended as a build requirement
- add missing "| automake1.7 | automake1.8 | automake1.9" to

Issue: LU-52
Signed-off-by: Brian J. Murrell <brian@whamcloud.com>
Change-Id: Ief22c678014836e8372a18dd3ed48a95631177b5
Reviewed-on: http://review.whamcloud.com/212
Tested-by: Hudson
Reviewed-by: Robert Read <rread@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoLU-120 the perfect storm
Brian J. Murrell [Mon, 7 Mar 2011 22:37:44 +0000 (17:37 -0500)]
LU-120 the perfect storm

There is a corner case in the debian patching/unpatching process
where a patch converts the use of a given file to a file.in that is
to be substed.  The problem is that between the configure and the
make debs, a file, which a debian patch wants to remove already
exists (it was created by the configure) and dpkg-buildpackage's
unpatch step fails because it wants to try to create a file which
configure already created.

Signed-off-by: Brian J. Murrell <brian@whamcloud.com>
Change-Id: I185e496b82687d21ea3d744e939536c106b702f9
Reviewed-on: http://review.whamcloud.com/308
Tested-by: Hudson
Reviewed-by: Robert Read <rread@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoLU-97 lustre-modules should depend on ldiskfs.
Brian J. Murrell [Thu, 24 Feb 2011 18:06:24 +0000 (13:06 -0500)]
LU-97 lustre-modules should depend on ldiskfs.

Only for the server of course.
But depend on a generically named tag, and have ldiskfs provide
that tag so that any future backing stores can be used as a
replacement for ldiskfs.

Signed-off-by: Brian J. Murrell <brian@whamcloud.com>
Change-Id: Ibef2b8c2b2a8c7339143dc32d32b59ace94f00fd
Reviewed-on: http://review.whamcloud.com/265
Reviewed-by: Robert Read <rread@whamcloud.com>
Tested-by: Hudson
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoLU-103 Add support for test groups.
Robert Read [Tue, 1 Mar 2011 00:46:16 +0000 (16:46 -0800)]
LU-103 Add support for test groups.

Added a -g GROUP_FILE option to auster. The tests listed in the
group file will be used instead of any passed on the command line. I also added
a couple of test-groups to get things started, although
I'm sure we'll want to refine these.

Took care of a few cleanups in run_suite as well.

Signed-off-by: Robert Read <rread@whamcloud.com>
Change-Id: I5000c2b95a6eb1a49f5305637d4b55de5edc6d5c
Reviewed-on: http://review.whamcloud.com/278
Tested-by: Hudson
Reviewed-by: Chris Gearing <chris@whamcloud.com>
Reviewed-by: Wei Liu <sarah@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoLU-119 Remove code added by a conflict resolution
Brian J. Murrell [Mon, 7 Mar 2011 17:47:05 +0000 (12:47 -0500)]
LU-119 Remove code added by a conflict resolution

I erroneously re-added some code that was removed in
62345145fa7cae1846f96078bac43ddf58cea640 while resolving
a conflict in LU-85's landing of commit

Signed-off-by: Brian J. Murrell <brian@whamcloud.com>
Change-Id: Id2489a3aa5560cdf4e7cf4755ddc16437558a2fc
Reviewed-on: http://review.whamcloud.com/304
Tested-by: Hudson
Reviewed-by: Christopher J. Morrone <morrone2@llnl.gov>
Reviewed-by: Robert Read <rread@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoNo bugzilla for this. One of the test failing was config-sanity.sh
James Simmons [Fri, 18 Feb 2011 18:58:41 +0000 (13:58 -0500)]
No bugzilla for this. One of the test failing was config-sanity.sh
test number 37, so I tracked down why. The reason was that the mounts
were failing. Upon investigation I discovered that the test caught an error
reported back by the mount. I tracked it down to the section of code in
the patch. Basically, mount_lustre.c attempts to set the stripe_cache_size
for /dev/mdX devices. The problem is that this extra feature is present only
for certain classes of md devices. Our test bed uses one of the classes
that lack this enhancement. So instead of failing to mount in that case,
we just print a warning that the special param is not present
but still let the mount succeed.
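
The warn-and-continue behaviour described above could look roughly like the following user-space sketch. set_optional_param() is a hypothetical helper, not the function in mount_lustre.c, and the exact sysfs path is left to the caller.

```c
#include <errno.h>
#include <stdio.h>

/*
 * Try to write a tuning value to a parameter file.
 * A missing file is reported as a warning and treated as success;
 * any other failure is returned as a negative errno value.
 */
static int set_optional_param(const char *path, unsigned int value)
{
	FILE *f = fopen(path, "w");

	if (f == NULL) {
		if (errno == ENOENT) {
			fprintf(stderr, "warning: %s not present, skipping\n",
				path);
			return 0;
		}
		return -errno;
	}

	if (fprintf(f, "%u\n", value) < 0) {
		fclose(f);
		return -EIO;
	}

	if (fclose(f) != 0)
		return -errno;

	return 0;
}
```

Only the "parameter does not exist" case is downgraded to a warning; permission problems and write errors still fail, so a real misconfiguration is not hidden.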

Change-Id: Ib8a743e77b5e6994e2640c51b781f7d0d345d5a7
Signed-off-by: James Simmons <uja.ornl@gmail.com>
Reviewed-on: http://review.whamcloud.com/254
Tested-by: Hudson
Reviewed-by: Brian J. Murrell <brian@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
10 years agoLU-85 Update some URLs and text to reflect our releases. 60/260/6
Brian J. Murrell [Wed, 23 Feb 2011 21:38:40 +0000 (16:38 -0500)]
LU-85 Update some URLs and text to reflect our releases.

The text and URLs in the debian packaging metadata were old
(circa Sun Microsystems), so update them and take the opportunity
to inform about the source of the release being installed.

Signed-off-by: Brian J. Murrell <brian@whamcloud.com>
Change-Id: I7860e8ada7757d8a991a472af293749fd4c89f5a

10 years agoMerge "b=21581 too long file / path names for old tar"
Oleg Drokin [Fri, 4 Mar 2011 23:51:32 +0000 (15:51 -0800)]
Merge "b=21581 too long file / path names for old tar"

10 years agoLU-111 Fix force options parsing 97/297/2
Oleg Drokin [Fri, 4 Mar 2011 16:15:41 +0000 (11:15 -0500)]
LU-111 Fix force options parsing

parse_options had an incorrect check for the force option that matched
all force options, such as force_over_16tb, and as a result those did
not get propagated down.
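
A sketch of how such a check can go wrong, assuming the original code compared only a prefix; the actual test in parse_options is not shown in this log. Both helper names are hypothetical.

```c
#include <string.h>

/* Over-broad pattern: a prefix comparison also accepts "force_over_16tb". */
static int is_force_prefix(const char *opt)
{
	return strncmp(opt, "force", 5) == 0;
}

/* Stricter pattern: accept only the exact token "force". */
static int is_force_exact(const char *opt)
{
	return strcmp(opt, "force") == 0;
}
```

With the exact comparison, "force_over_16tb" is no longer consumed as "force" and can be passed down unchanged.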

Change-Id: I46d07a6d505acc9150e2655a63be55fb7ef6954d
Signed-off-by: Oleg Drokin <green@whamcloud.com>
10 years agoMerge "Remove rpm spec code and message related to "alias lustre llite""
Oleg Drokin [Fri, 4 Mar 2011 22:37:12 +0000 (14:37 -0800)]
Merge "Remove rpm spec code and message related to "alias lustre llite""

10 years agoMerge "Remove obsolete nodelustre script"
Oleg Drokin [Fri, 4 Mar 2011 04:42:13 +0000 (20:42 -0800)]
Merge "Remove obsolete nodelustre script"

10 years agoAdd -D logdir option to Auster so that the whole logdir can be specified 88/288/2
Chris [Thu, 3 Mar 2011 00:38:21 +0000 (00:38 +0000)]
Add -D logdir option to Auster so that the whole logdir can be specified

Added a -D logdir option so that Auster does not append a time-based suffix to the dir.
This means that the location of the logs can be known by the parent of Auster.

Use Case: The job manager needs to place console logs into the logs directory
and is a parent of Auster; therefore the job manager creates a directory and passes
it to Auster using the -D option.

Issue: LU-105
Change-Id: Idbd258677b9c6624fdac43aa6223fe6f516bc989
Signed-off-by: Chris Gearing <chris@whamcloud.com>
10 years agoRemove obsolete nodelustre script 91/291/1
Christopher J. Morrone [Thu, 3 Mar 2011 01:05:17 +0000 (17:05 -0800)]
Remove obsolete nodelustre script

Change-Id: I1cf7e933225b773e95210f0bc5088ecbd7ab0037
Signed-off-by: Christopher J. Morrone <morrone2@llnl.gov>
10 years agoMerge "Remove EXPORT_SYMBOL for static function."
Oleg Drokin [Tue, 1 Mar 2011 04:07:59 +0000 (20:07 -0800)]
Merge "Remove EXPORT_SYMBOL for static function."

10 years agob=21581 too long file / path names for old tar 22/222/5
Brian J. Murrell [Mon, 24 Jan 2011 20:53:53 +0000 (15:53 -0500)]
b=21581 too long file / path names for old tar

Instruct automake to use tar's ustar format to prevent errors when
pathnames are longer than 99 chars.
- this requires automake >= 1.9, so adjust accordingly
  - including dealing with multiple versions of automake installed
Per andreas' inspection, provide a hint to the user on how to install
a missing automake or autoconf.
Per mjmac's inspection, give Debian users help on how to install
- note that I am *assuming* that Debian's lsb distribution id is "Debian"


Issue: LU-47
Signed-off-by: Brian J. Murrell <brian@whamcloud.com>
Change-Id: Ica6419196d2a6910824e8f2326537d9b46612c15

10 years agoAdd location argument to $upload_script call 69/269/1
John Spray [Mon, 28 Feb 2011 16:33:08 +0000 (16:33 +0000)]
Add location argument to $upload_script call

Issue: LU-100
Change-Id: Idbe262f15f0915ef9ffe8f6f05256eaf324d8a51
Signed-off-by: John Spray <john@whamcloud.com>
10 years agoMerge "b=24413 fix for automake > 1.9.6"
Oleg Drokin [Sat, 26 Feb 2011 01:55:40 +0000 (17:55 -0800)]
Merge "b=24413 fix for automake > 1.9.6"

10 years agob=24413 fix for automake > 1.9.6 07/207/5
Brian J. Murrell [Fri, 21 Jan 2011 18:54:06 +0000 (13:54 -0500)]
b=24413 fix for automake > 1.9.6

We seem to be using a Makefile variable that does not exist in more
recent versions of automake.  This fixes that problem.

Issue: LU-48
Signed-off-by: Brian J. Murrell <brian@whamcloud.com>
Change-Id: I84d0656e9bee0878c7ac25183476e39dca63366f

10 years agoRemove rpm spec code and message related to "alias lustre llite" 64/264/1
Christopher J. Morrone [Sat, 19 Feb 2011 01:52:43 +0000 (17:52 -0800)]
Remove rpm spec code and message related to "alias lustre llite"

It has been quite some time since we used the module alias:

  alias lustre llite

Remove conversion code from the spec file.
Remove the confusing error message that makes reference to the alias.

Change-Id: I412ce738243e3d9824b4a94f8a4fd95811cfae0d
Signed-off-by: Christopher J. Morrone <morrone2@llnl.gov>
10 years agoRemove EXPORT_SYMBOL for static function. 62/262/1
Christopher J. Morrone [Wed, 16 Feb 2011 21:13:17 +0000 (13:13 -0800)]
Remove EXPORT_SYMBOL for static function.

gcc 4.4.4 in RHEL6 on ppc64 does not allow EXPORT_SYMBOL for
a static function (which seems reasonable).  The following error
is seen on the EXPORT_SYMBOL line:

error: __ksymtab_obd_device_alloc causes a section type conflict

Change-Id: I6ad5ae13346b82bd723e05545d1e2a765b7460a8
Signed-off-by: Christopher J. Morrone <morrone2@llnl.gov>
10 years agoDon't upload to maloo by default 44/244/2
Robert Read [Thu, 17 Feb 2011 06:43:17 +0000 (22:43 -0800)]
Don't upload to maloo by default

- Main change is to invert meaning of -l flag. It now turns *on* uploading
  logs to maloo, instead of disabling it.

- Also added check to find the absolute path for maloo_upload.sh, just like
  we do for the test scripts we run.

- The verbose option now dumps the output of the script to the console (and
  also saves it to the log file).

- The -h (help) option will now page the help text if needed.

Issue: LU-86
Change-Id: Ieccb3f431b87d39007cd4f267b6e3e61a41a024a
Signed-off-by: Robert Read <rread@whamcloud.com>
10 years agoUpdates to test scripts: 23/223/4
John Spray [Tue, 8 Feb 2011 15:25:06 +0000 (15:25 +0000)]
Updates to test scripts:
 * Add auster test-runner script
 * Add integration with maloo test DB

Change-Id: Ifb4903ffad3fafc989e3261d72d41051b171a90e

10 years agoProperly pick module parameters from modules.conf 02/202/4
Oleg Drokin [Thu, 20 Jan 2011 02:55:00 +0000 (21:55 -0500)]
Properly pick module parameters from modules.conf

Fix incorrect regexp that assumes "options <module> params" format
instead of proper "options module params".
Also use basename to extract just the name of the module.
Now depends on GNU grep for -P (perl) regexps so that the '+' quantifier works.

Issue: LU-46
Change-Id: Id4d939bf2d9a412d1d5f3f2e7fcbf466ce436e28

10 years agob=24416 debian packaging fixes 13/213/6
Brian J. Murrell [Sat, 22 Jan 2011 11:16:55 +0000 (06:16 -0500)]
b=24416 debian packaging fixes

- don't make a patch out of anything in /debian
- exclude noise files from the debian built source tarball
- fake debian/patche{s,d} for make dist
- a few more reasons to run autogen.sh
- figure out if dist tarball needs autogen.sh and include it if so
- look for and run autogen.sh in the build subdir
- make debdiff as part of make dist
- add a debian/source/format file
- mv the orig tarball and the debdiff to the debs dir
- don't try to dist /debian for non-dpkg-using build targets

Issue: LU-51
Change-Id: I041aaef217e107def86ce808d0e96fc6891e1dcd

10 years agob=24378 wrong error code conversion in mdd_path_current() 2.0.59-llnl-base
Vitaly Fertman [Tue, 28 Dec 2010 20:52:23 +0000 (23:52 +0300)]
b=24378 wrong error code conversion in mdd_path_current()


10 years agob=24357 thread_sanity() fix
Elena Gryaznova [Fri, 24 Dec 2010 17:22:20 +0000 (20:22 +0300)]
b=24357 thread_sanity() fix


10 years agob=24037 Remove iopen ldiskfs patch for rhel5.
yangsheng [Fri, 24 Dec 2010 03:53:48 +0000 (11:53 +0800)]
b=24037 Remove iopen ldiskfs patch for rhel5.

10 years agob=23368 MMP: Watchdog detected LOCKUP on CPU 1 RIP: __list_add+0x6/0x68
Rahul Deshmukh [Mon, 27 Dec 2010 07:29:52 +0000 (12:59 +0530)]
b=23368 MMP: Watchdog detected LOCKUP on CPU 1 RIP: __list_add+0x6/0x68

10 years agob=24294 test_pios: take the ost-s sizes into account
Elena Gryaznova [Thu, 23 Dec 2010 23:39:13 +0000 (02:39 +0300)]
b=24294 test_pios: take the ost-s sizes into account


10 years agob=24255 move seq_client_alloc_seq out of lcs_sem
Vitaly Fertman [Thu, 23 Dec 2010 22:42:46 +0000 (01:42 +0300)]
b=24255 move seq_client_alloc_seq out of lcs_sem


Do not send an RPC under seq->lcs_sem, because during recovery
the recovery thread takes it too and deadlocks.
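
The idea of keeping the blocking call outside the lock can be sketched in user space as below. This is only an illustration under stated assumptions: `struct seq_client`, `fake_alloc_rpc()` and `seq_alloc()` are hypothetical names, a pthread mutex stands in for lcs_sem, and none of this is the actual Lustre code.

```c
#include <assert.h>
#include <pthread.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for the client sequence state. */
struct seq_client {
    pthread_mutex_t lock;                 /* plays the role of lcs_sem */
    uint64_t        cur_seq;              /* last sequence handed out */
    int             lock_free_during_rpc; /* set by the fake RPC probe */
};

/*
 * Stand-in for the blocking RPC. It probes the lock the way a
 * recovery thread would need it while the RPC is in flight.
 */
static uint64_t fake_alloc_rpc(struct seq_client *seq, uint64_t prev)
{
    if (pthread_mutex_trylock(&seq->lock) == 0) {
        seq->lock_free_during_rpc = 1;
        pthread_mutex_unlock(&seq->lock);
    } else {
        seq->lock_free_during_rpc = 0;
    }
    return prev + 1;
}

/* Allocate the next sequence without holding the lock across the RPC. */
static uint64_t seq_alloc(struct seq_client *seq)
{
    uint64_t prev, next;

    assert(seq != NULL);

    /* Read the state we need under the lock, then drop it. */
    pthread_mutex_lock(&seq->lock);
    prev = seq->cur_seq;
    pthread_mutex_unlock(&seq->lock);

    /* The blocking call runs with the lock released. */
    next = fake_alloc_rpc(seq, prev);

    /* Retake the lock only to publish the result. */
    pthread_mutex_lock(&seq->lock);
    if (next > seq->cur_seq)
        seq->cur_seq = next;
    next = seq->cur_seq;
    pthread_mutex_unlock(&seq->lock);

    return next;
}
```

Because the lock is released before the call, another thread that needs the same lock while the RPC is outstanding is not blocked behind it.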

10 years agob=23793 MOUNTOPT "-o" cleanup
Elena Gryaznova [Thu, 23 Dec 2010 21:23:27 +0000 (00:23 +0300)]
b=23793 MOUNTOPT "-o" cleanup


10 years agob=21122 fix a race between page fault and lock cancel.
Eric Mei [Thu, 23 Dec 2010 01:48:57 +0000 (18:48 -0700)]
b=21122 fix a race between page fault and lock cancel.


10 years agob=16359 _debug_req fix
cliff.white@oracle.com [Thu, 23 Dec 2010 22:38:39 +0000 (01:38 +0300)]
b=16359 _debug_req fix


10 years agob=23409 add -i to the setstripe usage and man page
Andreas Dilger [Tue, 10 Aug 2010 08:43:49 +0000 (04:43 -0400)]
b=23409 add -i to the setstripe usage and man page

Add the "-i" option to the "lfs setstripe" usage and man page.
Fix nroff formatting in the "lfs setstripe" and "lfs getstripe".


10 years agoRevert "b=20288 kiblnd_check_conns can deadlock"
Vitaly Fertman [Thu, 23 Dec 2010 22:49:21 +0000 (01:49 +0300)]
Revert "b=20288 kiblnd_check_conns can deadlock"

This reverts commit 9887737999bb2db2bc2f10b77854dee7f471ba62.

10 years agob=24291 Ignore PARAM_NETWORK in mgs_write_log_param()
Zhiqi Tao [Thu, 16 Dec 2010 10:28:29 +0000 (21:28 +1100)]
b=24291 Ignore PARAM_NETWORK in mgs_write_log_param()


10 years agob=24320 do not fork a new thread in mem pressure
Vitaly Fertman [Tue, 21 Dec 2010 13:07:45 +0000 (16:07 +0300)]
b=24320 do not fork a new thread in mem pressure


we already check for PF_MEMALLOC in ldlm shrinker and pass this flag
to the blocking thread, but a new thread start was still done with
no check for this flag.

10 years agob=24128 mgs_write_log_ost() should not update client's log
Vitaly Fertman [Tue, 21 Dec 2010 13:04:22 +0000 (16:04 +0300)]
b=24128 mgs_write_log_ost() should not update client's log

o=Lai Siyao

When an OST is set to an existing index, don't update client logs;
otherwise the client will have duplicate OSC logs for the same OST.

10 years agob=24017 make llverfs more robust against errors during testing
Jian Yu [Mon, 20 Dec 2010 10:33:40 +0000 (18:33 +0800)]
b=24017 make llverfs more robust against errors during testing

Make llverfs more robust about continuing in the face of errors.

o=Andreas Dilger

10 years agob=23051 improve summary of acc-sm to include test times
Elena Gryaznova [Mon, 20 Dec 2010 18:41:36 +0000 (21:41 +0300)]
b=23051 improve summary of acc-sm to include test times


10 years agob=22771 Patch to disable mbcache
Girish Shilamkar [Sun, 19 Dec 2010 12:25:43 +0000 (17:55 +0530)]
b=22771 Patch to disable mbcache


10 years agob=16890 llapi_quotactl man page update
Andrew Perepechko [Mon, 20 Dec 2010 21:37:32 +0000 (00:37 +0300)]
b=16890 llapi_quotactl man page update

additional explanation of igrace/bgrace/itime/btime

10 years agob24336 ldlm_resource::lr_lvb_data is protected by wrong lock
Liang Zhen [Tue, 21 Dec 2010 01:26:08 +0000 (09:26 +0800)]
b24336 ldlm_resource::lr_lvb_data is protected by wrong lock

- ldlm_resource::lr_lvb_data should always be protected by lr_lvb_sem
- cleanup some unnecessary lock dance


10 years agob=22117 Patch to limit mmp interval
Girish Shilamkar [Sun, 19 Dec 2010 12:28:51 +0000 (17:58 +0530)]
b=22117 Patch to limit mmp interval


10 years agoRevert "b=22176 Add .sync_fs super block handler"
Vitaly Fertman [Wed, 22 Dec 2010 00:58:11 +0000 (03:58 +0300)]
Revert "b=22176 Add .sync_fs super block handler"

This reverts commit 860404143dafbf696b4d38bb9135eabfd471cc36.

10 years agoUpdated for 2.1.0 build 10. v2_0_59_0
Terry Rutledge [Sat, 18 Dec 2010 00:30:56 +0000 (16:30 -0800)]
Updated for 2.1.0 build 10.

10 years agob=24324 no new lock on disconnected export
Vitaly Fertman [Thu, 16 Dec 2010 00:19:35 +0000 (03:19 +0300)]
b=24324 no new lock on disconnected export


check if export is being disconnected and do not add new locks to it

10 years agob=24218 fix race on ksock_tx_t::tx_resid
Vitaly Fertman [Thu, 16 Dec 2010 00:19:37 +0000 (03:19 +0300)]
b=24218 fix race on ksock_tx_t::tx_resid


If a connection is closed before ksocknal_transmit() returns to
ksocknal_process_transmit(), then nobody holds a refcount on conn::ksnc_sock
and all pending ZC requests will be finalized by ksocknal_connsock_decref->
ksocknal_finalize_zcreq. ksocknal_finalize_zcreq will mark each not-acked ZC
request as an error by setting tx::tx_resid = -1.
This is a race because ksocknal_process_transmit() checks tx::tx_resid
right after calling ksocknal_transmit(), so it can see tx->tx_resid != 0
and rc == 0, and then hit the later LASSERT(rc < 0).

10 years agob=24078 replay barrier checks sequence allocation
Mikhail Pershin [Wed, 15 Dec 2010 15:07:12 +0000 (18:07 +0300)]
b=24078 replay barrier checks sequence allocation


10 years agob=23289 revert patch on 21828
Liang Zhen [Wed, 15 Dec 2010 15:04:30 +0000 (23:04 +0800)]
b=23289 revert patch on 21828

This patch reverts the change for bug 21828, because svc::srv_n_active_reqs
is now dropped by ptlrpc_server_finish_request(), whereas it used to be
dropped by the last call of ptlrpc_server_drop_request(), so the recovery
thread holds no refcount on svc::srv_n_active_reqs and we should just
remove these refcount changes.
This should also resolve the issue of bug 21140, because logic in
ptlrpc_server_request_get() can be totally screwed up by
10 years agob=23289 don't start more than minimum MDT threads
Andreas Dilger [Tue, 14 Dec 2010 07:33:11 +0000 (00:33 -0700)]
b=23289 don't start more than minimum MDT threads

Because there are many different portals running on the MDS node (MDT,
readpage, setattr, SEQ, SEQ server, FLD, DLM), starting more than the
minimum number of threads can result in thousands of threads running
on large MDS nodes, even if many of those threads are not needed.

Start only the minimum number of threads at mount time, and start more
as required by the load.


10 years agob=23072 avoid duplicate cfs_size_round declarations
Andreas Dilger [Tue, 14 Dec 2010 06:54:49 +0000 (23:54 -0700)]
b=23072 avoid duplicate cfs_size_round declarations

Use HAVE_CFS_SIZE_ROUND to determine if libcfs_private.h or lustre_user.h has
declared the cfs_size_round() helper function.  This avoids conflicts if both
of these headers are included in different orders.
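
A minimal sketch of the guard pattern described above, with both "headers" inlined into one file. The function body (rounding up to a multiple of 8) and the self-check helper are assumptions made for illustration, not copied from the patch.

```c
#include <assert.h>

/* --- as it might appear in the first header (e.g. libcfs_private.h) --- */
#ifndef HAVE_CFS_SIZE_ROUND
#define HAVE_CFS_SIZE_ROUND
/* Round val up to the next multiple of 8 (assumed behaviour). */
static inline int cfs_size_round(int val)
{
    return (val + 7) & ~0x7;
}
#endif

/* --- as it might appear in the second header (e.g. lustre_user.h) --- */
#ifndef HAVE_CFS_SIZE_ROUND
#define HAVE_CFS_SIZE_ROUND
/* Skipped here: the macro was already defined by the first header. */
static inline int cfs_size_round(int val)
{
    return (val + 7) & ~0x7;
}
#endif

/* Hypothetical self-check showing that exactly one definition is in effect. */
static inline void cfs_size_round_selfcheck(void)
{
    assert(cfs_size_round(9) == 16);
}
```

Whichever header is included first provides the definition and sets the macro; the other one is skipped, so the include order no longer matters.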

10 years agob=22376 accmode fix
Vitaly Fertman [Thu, 16 Dec 2010 00:19:36 +0000 (03:19 +0300)]
b=22376 accmode fix


accmode is fixed so that other flags cannot be set together with MAY_EXEC.
A test is added.

10 years agob=21244 add man page for ll_decode_filter_fid tool
Andreas Dilger [Fri, 17 Dec 2010 00:17:22 +0000 (17:17 -0700)]
b=21244 add man page for ll_decode_filter_fid tool

Add man page for new ll_decode_filter_fid(8) tool.
Add references to lustre(7) man page.

10 years agob=20288 kiblnd_check_conns can deadlock
Isaac Huang [Wed, 15 Dec 2010 15:35:21 +0000 (08:35 -0700)]
b=20288 kiblnd_check_conns can deadlock

Avoid dropping global lock in kiblnd_check_conns


10 years agob=16605 verify object group before using it
Andreas Dilger [Wed, 15 Dec 2010 06:44:39 +0000 (23:44 -0700)]
b=16605 verify object group before using it

Avoid trying to access an invalid/unknown group sent from the client.
Don't return -EINVAL, since it is possible that the problem is the OST
code is old and this object actually exists.  Instead, return -EBADF
so that we can more easily diagnose this problem.
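
The validate-before-use pattern can be sketched roughly as follows. `MAX_KNOWN_GROUP`, `check_object_group()` and `group_lookup()` are hypothetical names for illustration only; the real OST code and its group table differ.

```c
#include <assert.h>
#include <errno.h>
#include <stdint.h>

/* Hypothetical number of groups this server knows about. */
#define MAX_KNOWN_GROUP 4

/*
 * Reject a group value sent by the client before it is used as an index.
 * -EBADF (rather than -EINVAL) makes this case easy to tell apart.
 */
static int check_object_group(uint64_t group)
{
    if (group >= MAX_KNOWN_GROUP)
        return -EBADF;
    return 0;
}

/* Look up a per-group value only after the group has been verified. */
static int group_lookup(const int *table, uint64_t group, int *out)
{
    int rc = check_object_group(group);

    if (rc != 0)
        return rc;

    assert(group < MAX_KNOWN_GROUP);
    *out = table[group];
    return 0;
}
```

An unknown group leaves the output untouched and returns -EBADF, so the caller never indexes the table with a value it has not checked.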

10 years agob=24048 set body->easize in mdc_getattr_pack()
Mikhail Pershin [Wed, 15 Dec 2010 14:51:41 +0000 (17:51 +0300)]
b=24048 set body->easize in mdc_getattr_pack()


This allows the client to supply the server with enough info about reply buffers.
The server may use this to allocate buffers instead of guessing with max_mdsize.

10 years agob=24171 check the init() iterator results properly
Mikhail Pershin [Wed, 15 Dec 2010 14:50:39 +0000 (17:50 +0300)]
b=24171 check the init() iterator results properly


10 years agob=24300 Don't run autogen.sh in the spl and zfs repos.
Ricardo M. Correia [Mon, 13 Dec 2010 15:56:14 +0000 (15:56 +0000)]
b=24300 Don't run autogen.sh in the spl and zfs repos.

The SPL and ZFS repositories already come with the configure script ready to
run, therefore it's not necessary to run autogen.sh on those directories.

Furthermore, in SPL version 0.5.0 this was causing a version conflict when
regenerating the files with the autotools/libtool that come with RHEL5.

10 years agob=24266 increase replay-single test_70d dbench duration for HARD failure mode
Elena Gryaznova [Mon, 13 Dec 2010 17:37:49 +0000 (20:37 +0300)]
b=24266 increase replay-single test_70d dbench duration for HARD failure mode