Whamcloud - gitweb
fs/lustre-release.git
11 years agoLU-1548 ldiskfs: repostion some items in ldiskfs_inode_info
wangdi [Tue, 7 Aug 2012 21:55:28 +0000 (14:55 -0700)]
LU-1548 ldiskfs: repostion some items in ldiskfs_inode_info

Reposition a few items of ldiskfs_inode_info to remove
the alignment hole in the structure, i.e. reduce the size
of this structure, then each slab page can hold more
objects(ldiskfs_inode_info).

Signed-off-by: Wang di <di.wang@whamcloud.com>
Change-Id: I2a4ff44cd98366029dfbee7c4f750e13df8bf7c3
Reviewed-on: http://review.whamcloud.com/3562
Tested-by: Hudson
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
11 years agoLU-1581 osd: zfs osd to take svname from lsi
Alex Zhuravlev [Mon, 30 Jul 2012 14:00:21 +0000 (18:00 +0400)]
LU-1581 osd: zfs osd to take svname from lsi

as the label is marked with ':' at the beginning

Signed-off-by: Alex Zhuravlev <bzzz@whamcloud.com>
Change-Id: I54b651392834278bf1b0d72807576970887ca2b3
Reviewed-on: http://review.whamcloud.com/3610
Tested-by: Hudson
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Jinshan Xiong <jinshan.xiong@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
11 years agoLU-708 utils: convert_hostnames() should not change its arg
Bobi Jam [Tue, 14 Aug 2012 01:39:59 +0000 (09:39 +0800)]
LU-708 utils: convert_hostnames() should not change its arg

Fix an omission.

Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Change-Id: I29b6179ef7d36751d1a9bd28119ab14145bf2f61
Reviewed-on: http://review.whamcloud.com/3628
Tested-by: Hudson
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
11 years agoLU-1689 tests: fix mount during e2fsck test
Minh Diep [Tue, 14 Aug 2012 07:44:02 +0000 (15:44 +0800)]
LU-1689 tests: fix mount during e2fsck test

The current mmp test 8 (mount during e2fsck) has two time issues:
1) the mount operation may start before e2fsck
2) the e2fsck operation may stop before mount

This patch fixes the above issues by providing enough time for e2fsck
operation to be started before mount operation, and setting the
superblock free_blocks_count field with 0 to force e2fsck checking
the Lustre server target device, which provides enough time for
the mount operation to be started during the e2fsck operation.

Test-Parameters: testlist=mmp
Signed-off-by: Minh Diep <mdiep@whamcloud.com>
Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: I37e07ffdb6669dfabceaab5ef242460abe204ba0
Reviewed-on: http://review.whamcloud.com/3569
Reviewed-by: Li Wei <liwei@whamcloud.com>
Reviewed-by: Chris Gearing <chris.gearing@intel.com>
Tested-by: Hudson
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
11 years agoLU-1053 build: add test for commit-msg with diff
Andreas Dilger [Mon, 9 Jul 2012 23:16:45 +0000 (17:16 -0600)]
LU-1053 build: add test for commit-msg with diff

Add a test case for the build/commit-msg script with "diff" at the
start of an intermediate line, which should not be ignored.  This
needs to be distinguished from the start of "git commit -av" output
which has a full "git diff" of the output inline with the commit.

Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: Icf2399568437de7f9e23419386bf3b1b99ebbfd1
Reviewed-on: http://review.whamcloud.com/3363
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Bruce Korb <bruce_korb@xyratex.com>
Reviewed-by: Yu Jian <yujian@whamcloud.com>
11 years agoLU-474 build: document the build release versions
Andreas Dilger [Tue, 6 Dec 2011 09:04:24 +0000 (02:04 -0700)]
LU-474 build: document the build release versions

Update ancient document to better describe the build versions
and Git tags that are being used today.

Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: I804f10bf486745ddd3b23b89e959dfd58558cab0
Reviewed-on: http://review.whamcloud.com/3625
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Brian J. Murrell <brian@whamcloud.com>
Reviewed-by: Yu Jian <yujian@whamcloud.com>
11 years agoLU-1625 test: reduce test duration for nfs mode
Keith Mannthey [Mon, 13 Aug 2012 19:55:55 +0000 (12:55 -0700)]
LU-1625 test: reduce test duration for nfs mode

There isn't much value to run long duration in
nfs mode.  Based on original work by Minh Diep.

Test-Parameters: testgroup=full
Signed-off-by: Keith Mannthey <keith@whamcloud.com>
Change-Id: I635d388e4dba5192199602b29ccaae843e9a1346
Reviewed-on: http://review.whamcloud.com/3596
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1657 lov: LBUG at lov_fini_raid0()
Jinshan Xiong [Wed, 1 Aug 2012 22:31:05 +0000 (15:31 -0700)]
LU-1657 lov: LBUG at lov_fini_raid0()

The refcount of lsm could be greater than 1 if the object was failed
at initialization because llite held another reference.

Signed-off-by: Jinshan Xiong <jinshan.xiong@intel.com>
Change-Id: I86884eb36f256123e835fdc9e0e585a984f7713a
Reviewed-on: http://review.whamcloud.com/3513
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Reviewed-by: jacques-Charles Lafoucriere <jacques-charles.lafoucriere@cea.fr>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
11 years agoLU-1683 agl: increase lock cll_holds for AGL upcall
Fan Yong [Thu, 2 Aug 2012 01:30:45 +0000 (09:30 +0800)]
LU-1683 agl: increase lock cll_holds for AGL upcall

If without additional cll_holds held for AGL upcall, the AGL lock
may be cancelled/deleted by the AGL thread if the AGL lock cannot
be granted at once.

The osc_lock_wait() will re-trigger lock enqueue for non-granted
AGL lock and return 'CLO_REENQUEUED' to the caller. Original lov
lock enqueue logic ignored such case for lov_lock_enqueue_one(),
so may cause unexpected LASSERT when checking the result.

Signed-off-by: Fan Yong <yong.fan@whamcloud.com>
Change-Id: I37305aba9d9f9ad525decc20badac4afbe7aedb0
Reviewed-on: http://review.whamcloud.com/3249
Reviewed-by: Jinshan Xiong <jinshan.xiong@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: James Simmons <uja.ornl@gmail.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1634 test: Check client only for version matching
Jinshan Xiong [Wed, 1 Aug 2012 20:18:48 +0000 (13:18 -0700)]
LU-1634 test: Check client only for version matching

For reliability, we should check nidtbl versions on the client only
to make sure the client is not notified if IR on the MGS is disabled.
Otherwise, if the MGS can't respond while OST is restarting, the
nidtbl version won't be updated and this will fail the test.

Signed-off-by: Jinshan Xiong <jinshan.xiong@intel.com>
Change-Id: I903add4261f5fb133ae4ac170b10c0783c226e29
Reviewed-on: http://review.whamcloud.com/3511
Tested-by: Hudson
Reviewed-by: Yu Jian <yujian@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1548 osd: move i_htree_lock to iam container
wangdi [Tue, 7 Aug 2012 21:40:45 +0000 (14:40 -0700)]
LU-1548 osd: move i_htree_lock to iam container

Move i_tree_lock from ldiskfs_inode to iam_container,
so to reduce ldiskfs_inode_info size.

Signed-off-by: wang di <di.wang@whamcloud.com>
Change-Id: I4277e73347380bcb3fd34e3d76f66b1d6ec881f5
Reviewed-on: http://review.whamcloud.com/3561
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
11 years agoLU-1538 tests: cleanup test output, acc-small
Andreas Dilger [Tue, 19 Jun 2012 02:21:05 +0000 (20:21 -0600)]
LU-1538 tests: cleanup test output, acc-small

There are a number of issues observed when running test scripts
on a single-node system.  Several minor cleanups to the test
scripts are also possible (coding style, more clear output, etc):

- sanity.sh test_51ba() should clean up subdirectories even if
  test_51b() failed to create all of the files
- sanity.sh test_61ic() shouldn't print all files to stdout
- running fsx accepts $FSXOPT to allow adding extra test options
- implement acceptance-small.sh via auster, so that we don't have to
  maintain two separate test wrappers, and it would be possible for
  users running acc-sm to upload results to Maloo if they wanted.

Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: I8aae9c1af508d8173135f0938e0a26d9731253df
Reviewed-on: http://review.whamcloud.com/3132
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Li Wei <liwei@whamcloud.com>
Reviewed-by: Keith Mannthey <keith@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1095: debug: Clarify a IR message about support.
James Simmons [Tue, 31 Jul 2012 19:36:07 +0000 (15:36 -0400)]
LU-1095: debug: Clarify a IR message about support.

During testing of Imperative Recovery we turned off IR but in the
logs we saw a IR:ON message which is confusing since we set it to
disable on the MGS. The IR:ON message is only to make aware that
IR functionality exist. A better message would be IR: [DIS|EN]ABLED.

Signed-off-by: James Simmons <uja.ornl@gmail.com>
Change-Id: I5990b26b326ce912faf6c5d556cb92334748322c
Reviewed-on: http://review.whamcloud.com/2853
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Jinshan Xiong <jinshan.xiong@whamcloud.com>
11 years agoLU-657 obdfilter: commit pending journals if -ENOSPC
Hongchao Zhang [Tue, 7 Aug 2012 00:43:11 +0000 (08:43 +0800)]
LU-657 obdfilter: commit pending journals if -ENOSPC

in filter_preprw_write, if there is no enough space for this
write operation, then commit the pending journals to get some
more disk space and retry it.

Signed-off-by: Hongchao Zhang <hongchao.zhang@whamcloud.com>
Change-Id: I46106b26443bb203eee6f01a0795b47be09170a6
Reviewed-on: http://review.whamcloud.com/3446
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-359 llite: no close error if application has known failure
Fan Yong [Thu, 2 Aug 2012 04:29:25 +0000 (12:29 +0800)]
LU-359 llite: no close error if application has known failure

Don't return error again when close if the application has known
former write failure.

Rename the per-inode based 'lli_write_rc' as per file-structure
based 'fd_last_write' to track the last write/fsync failure.
When 'sys_close()' is called against the 'file' structure,
we know whether the caller has already known former write/fsync
failure or not, then avoid potenical rdundant error handling,
like confused error message.

Miss to set 'lli_async_rc' if ll_writepage() failed.

Signed-off-by: Fan Yong <yong.fan@whamcloud.com>
Change-Id: I62d9cd83fc03fad22c994f2a77774ca113a6c057
Reviewed-on: http://review.whamcloud.com/1497
Tested-by: Hudson
Reviewed-by: Niu Yawei <niu@whamcloud.com>
Reviewed-by: Jinshan Xiong <jinshan.xiong@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1648 llog: prevent journal to restart for llog cancel
Fan Yong [Sun, 5 Aug 2012 15:59:38 +0000 (23:59 +0800)]
LU-1648 llog: prevent journal to restart for llog cancel

Journal credit for llog cancel should include the blocks for
deleting the inode. Otherwise the journal handle may need to
be restarted during the transaction for more credit. Then it
may be blocked with holding llog_handle::lgh_lock, which may
cause deadlock with other llog operations those have started
journal handle but waiting for the llog_handle::lgh_lock.

Signed-off-by: Fan Yong <yong.fan@whamcloud.com>
Change-Id: I157093cbe9e625715ec033f59dde325e81ec6b71
Reviewed-on: http://review.whamcloud.com/3463
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Niu Yawei <niu@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1095 debug: Improve RPC completed message
Jinshan Xiong [Mon, 23 Jul 2012 09:09:16 +0000 (17:09 +0800)]
LU-1095 debug: Improve RPC completed message

Do not print RPC completed messages for fake requests.

Signed-off-by: Jinshan Xiong <jinshan.xiong@whamcloud.com>
Change-Id: I7156c22c94210f265d01c8621e2be9da0c4a913d
Reviewed-on: http://review.whamcloud.com/3445
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-812 compat: SPIN/RW_LOCK_UNLOCKED no longer exist
James Simmons [Tue, 7 Aug 2012 13:55:49 +0000 (09:55 -0400)]
LU-812 compat: SPIN/RW_LOCK_UNLOCKED no longer exist

SPIN_LOCK_UNLOCKED/RW_LOCK_UNLOCKED were deprecated and no longer
exist after 2.6.39. Use DEFINE_SPIN_LOCK/DEFINE_RW_LOCK instead.

Signed-off-by: James Simmons <uja.ornl@gmail.com>
Signed-off-by: Wally Wang <wang@cray.com>
Change-Id: Ie294e817c850c35e561987f6ab46c6d12d6d018d
Reviewed-on: http://review.whamcloud.com/1942
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1617 build: skip generated files in .gitignore
Andreas Dilger [Mon, 9 Jul 2012 23:08:26 +0000 (17:08 -0600)]
LU-1617 build: skip generated files in .gitignore

Ignore automatically generated files via .gitignore so that a large
number of spurious files are not listed at "git commit" time.  Also
consolidate many duplicate ignored files in the top-level .gitignore
file, since we don't need to list the exceptions in every directory
like the original .cvsignore files that they were generated from.

Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: I5f9c60dc498f98abcd8c91042fc72408a915500c
Reviewed-on: http://review.whamcloud.com/3364
Reviewed-by: Yu Jian <yujian@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-812 replace get_sb with mount in struct file_system_type
James Simmons [Tue, 24 Jul 2012 14:39:22 +0000 (10:39 -0400)]
LU-812 replace get_sb with mount in struct file_system_type

get_sb() has been removed from struct file_system_type since
2.6.39 and switched to mount().

Signed-off-by: James Simmons <uja.ornl@gmail.com>
Signed-off-by: Wally Wang <wang@cray.com>
Change-Id: I44fa30eb51c4444d9ef8e7b545f2b80088f127cc
Reviewed-on: http://review.whamcloud.com/1951
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Bob Glossman <bogl@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1548 mdssurvey: fix file_count in mds_survey
wangdi [Mon, 6 Aug 2012 21:49:26 +0000 (14:49 -0700)]
LU-1548 mdssurvey: fix file_count in mds_survey

It should use file_count_per_thread in lctl command,
instead of file_count directly.

Signed-off-by: Wang di <di.wang@whamcloud.com>
Change-Id: I606a988db373c1065544385fdd1dfd42821eef2b
Reviewed-on: http://review.whamcloud.com/3549
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Minh Diep <mdiep@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1458 test: dump changelog for lustre-rsync-test
Bobi Jam [Tue, 7 Aug 2012 05:57:33 +0000 (13:57 +0800)]
LU-1458 test: dump changelog for lustre-rsync-test

Dump plain text format changelog records for failed lustre-rsync-test
test case to help debugging.

Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Change-Id: I2041e8262dd3b5307e4d5eac4619a3fbcfe5abb4
Reviewed-on: http://review.whamcloud.com/3551
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1685 test: fix FAIL_ON_ERROR=false parameter
Minh Diep [Wed, 1 Aug 2012 16:18:21 +0000 (09:18 -0700)]
LU-1685 test: fix FAIL_ON_ERROR=false parameter

The previous fix on LU-1528 which was supposed to
fix the overall status to FAIL when a subtest FAIL
didn't work. This commit will provide the proper
fix. The issue is grep FAIL: in exit_status() should
not include ":"

Signed-off-by: Minh Diep <mdiep@whamcloud.com>
Change-Id: I7eaf14351ac8a2c3d21f530e43c75871b4b01f6a
Reviewed-on: http://review.whamcloud.com/3507
Tested-by: Hudson
Reviewed-by: Yu Jian <yujian@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Cliff White <cliffw@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1697 mdt: create OST-objects for open by fid
Fan Yong [Sat, 4 Aug 2012 03:07:48 +0000 (11:07 +0800)]
LU-1697 mdt: create OST-objects for open by fid

Original prcessing for open by fidi gnored the case of OST-objects
to be created because of re-exporting by NFS or O_LOV_DELAY_CREATE
flags, then MDD failed to find useful stripe EA to create releated
OST-objects.

Rename mdt_open_anon_by_fid() to mdt_open_by_fid_lock() to make it
more match the real use cases.

Signed-off-by: Fan Yong <yong.fan@whamcloud.com>
Change-Id: I3c0e4791f6f74fa6b365d41a8e550dba518b39cc
Reviewed-on: http://review.whamcloud.com/3537
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Lai Siyao <laisiyao@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1696 test: Test failure on test suite sanity test_49
yangsheng [Mon, 6 Aug 2012 19:12:56 +0000 (03:12 +0800)]
LU-1696 test: Test failure on test suite sanity test_49

Use 'grep -w' for pid search to avoid a partial match.

Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: I809a9c3553b3a9d736891c7399d2786eb43c4d3b
Reviewed-on: http://review.whamcloud.com/3544
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Yu Jian <yujian@whamcloud.com>
Reviewed-by: Jinshan Xiong <jinshan.xiong@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-718 mdt: rename mdt threads parameters to mds
wangdi [Wed, 11 Jul 2012 09:34:11 +0000 (02:34 -0700)]
LU-718 mdt: rename mdt threads parameters to mds

rename mdt threads parameters to mds, so the following
mds layer change does not need to deal with the compatiblity
issue.

Signed-off-by: Wang Di <di.wang@whamcloud.com>
Change-Id: I3cb095812488b9459e4a3e878757d40410ac4191
Reviewed-on: http://review.whamcloud.com/3377
Tested-by: Hudson
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Liang Zhen <liang@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1698 test: Test failure on test suite lfsck
yangsheng [Fri, 3 Aug 2012 13:17:19 +0000 (21:17 +0800)]
LU-1698 test: Test failure on test suite lfsck

Fix the wrong redirect in get_obdfilter_param().

Test-Parameters: testlist=lfsck
Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: I665bd5b25287b453f5c6fc3fc12405064d02b696
Reviewed-on: http://review.whamcloud.com/3523
Reviewed-by: Yu Jian <yujian@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Wei Liu <sarah@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1710 general: add LVB type connect flag
Jinshan Xiong [Fri, 3 Aug 2012 19:18:03 +0000 (12:18 -0700)]
LU-1710 general: add LVB type connect flag

LVB type will be used for nanoseconds and layout lock. Reserve this
connect flag to prevent collision.

Signed-off-by: Jinshan Xiong <jinshan.xiong@intel.com>
Change-Id: I76f6b8fc8be2d98b8b80f15e69b36ea50b9d7a7d
Reviewed-on: http://review.whamcloud.com/3528
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years agoLU-1626 lov: fix lov request set finish check race
Bobi Jam [Mon, 16 Jul 2012 11:07:10 +0000 (19:07 +0800)]
LU-1626 lov: fix lov request set finish check race

When several lov_request callbacks are called, if one of them is
the last lov_request in the set, lov_finished_set() checks for
all of them will return true, while the following action is supposed
be called only once for the set, in this case the assumption is broke
and the lov request set's refcount is wrong.

This patch fixed another glitch, in qos_remedy_create(), when we use
OST pool, the ost_idx value does not initialied correctly.

Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Change-Id: Id3ff1777b2146630b2d693e046038fcc6f465309
Reviewed-on: http://review.whamcloud.com/3401
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
Reviewed-by: wangdi <di.wang@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1581 obdclass: take fsname from svname
Alex Zhuravlev [Mon, 9 Jul 2012 06:02:04 +0000 (10:02 +0400)]
LU-1581 obdclass: take fsname from svname

lsi_prepare() to take correct fsname from svname

Signed-off-by: Alex Zhuravlev <bzzz@whamcloud.com>
Change-Id: I60127273633b53f906fa8ff4b94dc370a5b58fcf
Reviewed-on: http://review.whamcloud.com/3484
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Li Wei <liwei@whamcloud.com>
11 years agoLU-1629 ldlm: Fix recovery messages when denying new clients
Li Wei [Fri, 27 Jul 2012 04:18:02 +0000 (12:18 +0800)]
LU-1629 ldlm: Fix recovery messages when denying new clients

Console messages printed when denying new client connections during
target recovery show misleading client statistics.  For example, in
this case:

  Lustre: lustre-MDT0000: Denying connection for new client
  192.168.117.50@o2ib1 (at 11e711ab-a329-f07a-8312-6a40af7fc5a4),
  waiting for 0 clients in recovery for 2:38
  [...]
  Lustre: lustre-MDT0000: Recovery over after 5:00, of 112 clients 0
  recovered and 112 were evicted.

The MDT was actually waiting for all the 112 known clients to recover.
None had connected, however.  In addition, the client NID and UUID
seem to be in the wrong order.  This patch changes the first console
message to look like this:

  Lustre: lustre-MDT0000: Denying connection for new client
  939243e4-2f54-a96f-3cbb-9fcf55426e2e (at 0@lo), waiting for all 2
  known clients (1 recovered, 0 in progress, and 1 unseen) to
  recover in 0:05

Hopefully this new format will be a little bit more useful to users,
although the counters are (still) read in a racy way.

Change-Id: Iefda085602de7967d66892b8f3567561962078ab
Signed-off-by: Li Wei <liwei@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/3485
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Jinshan Xiong <jinshan.xiong@whamcloud.com>
Reviewed-by: Alex Zhuravlev <bzzz@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-812 vfs: address_space_operations.migratepage has 4 args
James Simmons [Thu, 12 Jul 2012 15:05:06 +0000 (11:05 -0400)]
LU-812 vfs: address_space_operations.migratepage has 4 args

Linux 3.3 address_space_operations.migratepage has 4 args.
Add detection in build configure and condition in code for
the new argument. Before Linux 3.3 the migration_mode values
were located in migrate.h instead of migrate_mode.h

Signed-off-by: James Simmons <uja.ornl@gmail.com>
Signed-off-by: Wally Wang <wang@cray.com>
Change-Id: I5be5de9c6693b09ddbbc92e729b79bfa1d99ee3a
Reviewed-on: http://review.whamcloud.com/2279
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
11 years agoLU-1422: debug: remove obsolete Cray Catamount support
James Simmons [Thu, 12 Jul 2012 13:03:16 +0000 (09:03 -0400)]
LU-1422: debug: remove obsolete Cray Catamount support

The lustre debug layer also had special hooks for Catamount
support. Now that Catamount support has been removed we can
remove this code as well.

Signed-off-by: James Simmons <uja.ornl@gmail.com>
Change-Id: I838deb6ec7f1e9f43cb2d8121bf27bff5dbad68e
Reviewed-on: http://review.whamcloud.com/3385
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1057 quota: speed up lookup in osc_quota_chkdq
Hongchao Zhang [Thu, 12 Jul 2012 07:23:18 +0000 (15:23 +0800)]
LU-1057 quota: speed up lookup in osc_quota_chkdq

This patch replace the global hash table used to store uid/gid
about to run out of quota space with a per-OSC cfs_hash.

Signed-off-by: Hongchao Zhang <hongchao.zhang@whamcloud.com>
Change-Id: Ibf0785a60b007f33a8660298159abcc387dd8507
Reviewed-on: http://review.whamcloud.com/2074
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Reviewed-by: Niu Yawei <niu@whamcloud.com>
Reviewed-by: Jinshan Xiong <jinshan.xiong@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
11 years agoLU-1522 recovery: rework LU-1166 patch in different way
Mikhail Pershin [Sun, 17 Jun 2012 11:08:36 +0000 (15:08 +0400)]
LU-1522 recovery: rework LU-1166 patch in different way

Dropping recovery counters upon last export put caused LU-1522 issue,
return class_export_recovery_cleanup() back to the
class_export_disconnect() and use exp_failed flag to avoid race
between target_handle_connect() and class_disconnect_stale_exports()

Signed-off-by: Mikhail Pershin <tappro@whamcloud.com>
Change-Id: I78c19a8d49786877d2de27c82bf40ebec494f044
Reviewed-on: http://review.whamcloud.com/3122
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: wangdi <di.wang@whamcloud.com>
Reviewed-by: Alex Zhuravlev <bzzz@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1158 general: added nanosecond OBD connect flag
Isami Romanowski [Wed, 11 Jul 2012 20:56:50 +0000 (15:56 -0500)]
LU-1158 general: added nanosecond OBD connect flag

To prevent collisions with any future flags needed in features written
against this branch.

Signed-off-by: Isami Romanowski <isami@whamcloud.com>
Change-Id: I965a0ce0120f253014a75c80fe0941ba92f1c45e
Reviewed-on: http://review.whamcloud.com/3380
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1337 kernel: NETIF_F_NO_CSUM and LOOKUP_CONTINUE removed
Liu Xuezhao [Tue, 26 Jun 2012 02:53:06 +0000 (10:53 +0800)]
LU-1337 kernel: NETIF_F_NO_CSUM and LOOKUP_CONTINUE removed

1. kernel: 3.3 removes NETIF_F_NO_CSUM feature bit
   (kernel commit 34324dc2bf27c1773045fea63cb11f7e2a6ad2b9)
   The only one place uses NETIF_F_NO_CSUM is to check
   "(caps & (NETIF_F_IP_CSUM | NETIF_F_NO_CSUM | NETIF_F_HW_CSUM))".
   Replace (NETIF_F_IP_CSUM | NETIF_F_NO_CSUM | NETIF_F_HW_CSUM)
   as NETIF_F_ALL_CSUM which was added since kernel 2.6.17
   (kernel commit 8648b3053bff39a7ee4c711d74268079c928a657)
2. Kernel 3.1 kills LOOKUP_CONTINUE, LOOKUP_PARENT is equivalent
   to it since that.
   See kernel commit 49084c3bb2055c401f3493c13edae14d49128ca0

Another minor change is adding "#include <linux/module.h>" to
libcfs/linux/linux-crypto-adler.c, linux-crypto-crc32.c.

Signed-off-by: Liu Xuezhao <xuezhao.liu@emc.com>
Change-Id: I4f3621fcb921c866b33881027a4a0a491d676640
Reviewed-on: http://review.whamcloud.com/3398
Tested-by: Hudson
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Yang Sheng <ys@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: James Simmons <uja.ornl@gmail.com>
11 years agoLU-1337 vfs: generic_permission has 2 args
Liu Xuezhao [Mon, 25 Jun 2012 09:12:05 +0000 (17:12 +0800)]
LU-1337 vfs: generic_permission has 2 args

generic_permission has 2 args since kernel 3.1
(kernel commit 2830ba7f34ebb27c4e5b8b6ef408cd6d74860890).

Signed-off-by: Liu Xuezhao <xuezhao.liu@emc.com>
Change-Id: I1ee2d127af3dfdc3c24f1466d9e183918c3a652c
Reviewed-on: http://review.whamcloud.com/3397
Tested-by: Hudson
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Yang Sheng <ys@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: James Simmons <uja.ornl@gmail.com>
11 years agoLU-1337 build: make configure process not break for 3.4 kernel
Liu Xuezhao [Tue, 19 Jun 2012 02:47:12 +0000 (10:47 +0800)]
LU-1337 build: make configure process not break for 3.4 kernel

1. remove "ifeq ($(PATCHLEVEL),4)" from build/Makefile, Rules.in
   "ifeq ($(PATCHLEVEL),4)" is for old 2.4 kernel, it breaks
   configuration process for 3.4 kernel.
2. change the "-include" file from "include/$AUTOCONF_HDIR/autoconf.h"
   to "include/linux/kconfig.h", as kconfig.h includes autoconf.h and
   defines some macros(IS_ENABLED etc.) which is needed for some
   header files.
3. Fix a typo from "blkkdev.h" to "blkdev.h" in lustre-core.m4

Signed-off-by: Liu Xuezhao <xuezhao.liu@emc.com>
Change-Id: If1c712c194ba76a2fe46c31afad28c37f0b465c6
Reviewed-on: http://review.whamcloud.com/3396
Tested-by: Hudson
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Yang Sheng <ys@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: James Simmons <uja.ornl@gmail.com>
11 years agoLU-1477 kernel: Kernel update [RHEL6.3 2.6.32-279.2.1.el6]
yangsheng [Tue, 10 Jul 2012 16:03:00 +0000 (00:03 +0800)]
LU-1477 kernel: Kernel update [RHEL6.3 2.6.32-279.2.1.el6]

Add support for RHEL6.3 kernel 2.6.32-279.2.1.el6.

Signed-off-by: James Simmons <uja.ornl@gmail.com>
Signed-off-by: yang sheng <ys@whamcloud.com>
Change-Id: I792b174c77d635100cd02dbbddef568b550d6ed1
Reviewed-on: http://review.whamcloud.com/3033
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1682 ptlrpc: LBUG in ldlm_cli_cancel_local
Hiroya Nozaki [Fri, 27 Jul 2012 09:04:46 +0000 (18:04 +0900)]
LU-1682 ptlrpc: LBUG in ldlm_cli_cancel_local

When the case ldlm_cli_enqueue() created a new lock, and then, fails
to allocate memory for a new request, it could cause the LBUG in
ldlm_cli_cancel_local() because the lock->l_conn_export is NULL,
nevertheless the lock is a client-side lock.
This patch fills in some lock fields before the memory allocation
for a new request.

Signed-off-by: Hiroya Nozaki <nozaki.hiroya@jp.fujitsu.com>
Change-Id: I4ba4b6f1bc5dfbb51853261a0f247289304a6abd
Reviewed-on: http://review.whamcloud.com/3488
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Reviewed-by: Jinshan Xiong <jinshan.xiong@whamcloud.com>
11 years agoNew version: 2.2.92 2.2.92 v2_2_92 v2_2_92_0
Oleg Drokin [Mon, 30 Jul 2012 22:46:46 +0000 (18:46 -0400)]
New version: 2.2.92

Change-Id: I34be6bd03d8620c228f29b32eefbbaeeb0da6dcc
Signed-off-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1436 debug: return the original type in RETURN
Hongchao Zhang [Wed, 11 Jul 2012 07:20:20 +0000 (15:20 +0800)]
LU-1436 debug: return the original type in RETURN

in RETURN, the bit length of rc could be 32 or 64, and it also could
be pointer type, the conversion betweens pointer and integer type
will trigger compile warning related to type casting, then define an
static variable of typeof(rc), and avoid the stack usage ATM.

Signed-off-by: Hongchao Zhang <hongchao.zhang@whamcloud.com>
Change-Id: I7f3a54438dfe1f294168691718c4db8116f68de4
Reviewed-on: http://review.whamcloud.com/3072
Tested-by: Hudson
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Alex Zhuravlev <bzzz@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1461 libcfs: Fix crypto not to use x86 pcmul for non x86 arch
Oleg Drokin [Mon, 9 Jul 2012 18:26:12 +0000 (14:26 -0400)]
LU-1461 libcfs: Fix crypto not to use x86 pcmul for non x86 arch

Also fix x86 detection in configure.

This fixes a build failure on ppc64:

In file included from user-crypto.c:34:
/var/lib/jenkins/workspace/lustre-ppc-builder/arch/ppc64/
build_type/client/distro/el6/ib_stack/inkernel/BUILD/BUILD/
lustre-2.2.60/libcfs/include/libcfs/user-crypto.h:30:
error: 'regparm' attribute directive ignored

Signed-off-by: Oleg Drokin <green@whamcloud.com>
Change-Id: Icb229a9db0826420ecc431bc7f7f5e45ecbdbd37
Reviewed-on: http://review.whamcloud.com/3361
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
11 years agoLU-1597 obdfilter: fix some capa code for OST
Bobi Jam [Wed, 11 Jul 2012 03:53:06 +0000 (11:53 +0800)]
LU-1597 obdfilter: fix some capa code for OST

* A capability should be set for filter_sync(), and when the operation
  is come from OSS itself, the capability check can be passed.

* filter_capa_fixoa() need check whether filter enabled capability.

Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Change-Id: Ie27497d2b61654a7f24c06f6975e8bfaf5e9ae13
Reviewed-on: http://review.whamcloud.com/3372
Reviewed-by: Lai Siyao <laisiyao@whamcloud.com>
Tested-by: Hudson
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Prakash Surya <surya1@llnl.gov>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1562 test: recovery-small tests 2,3,5 failed
Kyrylo Shatskyy [Mon, 9 Jul 2012 09:23:17 +0000 (12:23 +0300)]
LU-1562 test: recovery-small tests 2,3,5 failed

1. The code that executed between tests has been cleaned up
and moved to corresponding tests which use it to avoid its
execution all the time, even if it is not required when
running using ONLY.
2. The test_2, and test_3 has been joined with test_1,
and the test_7 has been joined with test_6, because
they are in fact single scenario.
3. Random and zero file creation code moved to corresponding
utility functions in the test-framework.sh
4. Added bash stack trace printing code to error_noexit that
make it easier to analyze logs.

Xyratex-bug-id: MRP-411
Signed-off-by: Kyrylo Shatskyy <kyrylo_shatskyy@xyratex.com>
Reviewed-by: Andrew Perepechko <andrew_perepechko@xyratex.com>
Reviewed-by: Alexander Zarochentsev <alexander_zarochentsev@xyratex.com>
Change-Id: Iea417c99edb001e4f5ad0d0980c7c10012e2514a
Reviewed-on: http://review.whamcloud.com/3182
Tested-by: Hudson
Reviewed-by: Li Wei <liwei@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Yu Jian <yujian@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
11 years agoLU-1442 llite: cleanup if a page failed to add into cache
Jinshan Xiong [Mon, 23 Jul 2012 14:09:51 +0000 (22:09 +0800)]
LU-1442 llite: cleanup if a page failed to add into cache

In lustre, we assume that a dirty page must be queued in osc cache
for writing. However, in vvp_io_commit_write(), if a page failed to
add into cache, page dirty flag isn't cleared this will cause the
page will never be added into cache again.

Signed-off-by: Jinshan Xiong <jinshan.xiong@whamcloud.com>
Change-Id: I1c132c6f1d4f5845682e51850eb895b292fc5f0d
Reviewed-on: http://review.whamcloud.com/3447
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Niu Yawei <niu@whamcloud.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
11 years agoLU-1347 obdclass: makes EXPORT_SYMBOL follows function body
Liu Xuezhao [Tue, 10 Jul 2012 13:13:26 +0000 (21:13 +0800)]
LU-1347 obdclass: makes EXPORT_SYMBOL follows function body

Makes EXPORT_SYMBOL macros immediately follow the function body,
to follow normal Linux kernel coding style.

Signed-off-by: Liu Xuezhao <xuezhao.liu@emc.com>
Change-Id: I3d1b3ebb244e3d7a069d5be13f893ffee7b5f3dd
Reviewed-on: http://review.whamcloud.com/2839
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
11 years agoLU-220 open: try open_by_fid for the known object
Fan Yong [Wed, 25 Jul 2012 05:20:06 +0000 (13:20 +0800)]
LU-220 open: try open_by_fid for the known object

If the current path for thread1 on client1 is renamed by thread2
on client2, then when thread1 tries to open its current path, it
should use the fid, but not the name. Otherwise, the open on MDS
will hit unexpected "-ENOENT".

Signed-off-by: Fan Yong <yong.fan@whamcloud.com>
Change-Id: I905113167ebd6fe5f0185007d0e9c77083342680
Reviewed-on: http://review.whamcloud.com/2493
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1670 tests: Load in-tree osd-zfs when necessary
Li Wei [Wed, 25 Jul 2012 04:18:38 +0000 (12:18 +0800)]
LU-1670 tests: Load in-tree osd-zfs when necessary

In a developer's environment, Test Framework should try loading osd-zfs
modules from Lustre build trees when ZFS-based targets are used.

Change-Id: I47646738810401c0a48ecc5ca1459952ff811dce
Signed-off-by: Li Wei <liwei@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/3458
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Niu Yawei <niu@whamcloud.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
11 years agoLU-1630 scrub: adjust expected OI scrub speed for test
Fan Yong [Thu, 26 Jul 2012 06:10:38 +0000 (14:10 +0800)]
LU-1630 scrub: adjust expected OI scrub speed for test

Because of time error and schedule delay, the real OI scrub speed
may be not as accurate as original expectation. So need to adjust
the expectation to make them match.

Signed-off-by: Fan Yong <yong.fan@whamcloud.com>
Change-Id: I8ce95fef7ca39364a44418aabc39521f47bc150c
Reviewed-on: http://review.whamcloud.com/3457
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1497 client: SLES11 SP0 client build is broken.
James Simmons [Fri, 13 Jul 2012 13:22:55 +0000 (09:22 -0400)]
LU-1497 client: SLES11 SP0 client build is broken.

With the addition of client improvements new linux kernel
apis are exposed to the llite layer. Those apis can be
missing or have a different format in earlier kernels.
This is the case for SLES11 SP0 clients for the case
of the vma handling and the function inode_newsize_ok
that is missing. This patch fixes those issues. Final
bug is a fix to a configure test for a function from
the linux crypto api.

Signed-off-by: James Simmons <uja.ornl@gmail.com>
Change-Id: I699c67baec030e61133ff0a44e0ab03261b2f2d0
Reviewed-on: http://review.whamcloud.com/3063
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Yang Sheng <ys@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1347 ptlrpc: makes EXPORT_SYMBOL follows function body
Liu Xuezhao [Sun, 15 Jul 2012 14:49:48 +0000 (22:49 +0800)]
LU-1347 ptlrpc: makes EXPORT_SYMBOL follows function body

Makes EXPORT_SYMBOL macros immediately follow the function body,
to follow normal Linux kernel coding style.

Signed-off-by: Liu Xuezhao <xuezhao.liu@emc.com>
Change-Id: Ifbebd942033d65382f443edeb9b6fb5100096a47
Reviewed-on: http://review.whamcloud.com/2670
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Peng Tao <bergwolf@gmail.com>
Reviewed-by: Yang Sheng <ys@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-709 build: cleanup LC_PAGE_CHECKED and LC_INODE_BLKSIZE
Peng Tao [Mon, 23 Jan 2012 06:32:44 +0000 (14:32 +0800)]
LU-709 build: cleanup LC_PAGE_CHECKED and LC_INODE_BLKSIZE

1. kernel has PageChecked() and SetPageChecked() since v2.6.18
2. struct inode doesn't have i_blksize since v2.6.18

Signed-off-by: Peng Tao <tao.peng@emc.com>
Change-Id: Ic50941948ae409ea526108f43e1337db2141ae3d
Reviewed-on: http://review.whamcloud.com/2323
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Yang Sheng <ys@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-709 build: cleanup LC_FLUSH_OWNER_ID/INVALIDATE_MAPPING
Peng Tao [Tue, 15 May 2012 06:46:25 +0000 (14:46 +0800)]
LU-709 build: cleanup LC_FLUSH_OWNER_ID/INVALIDATE_MAPPING

1. file_opeartions.flush takes fl_owner_t argument since v2.6.18
2. HAVE_INVALIDATE_MAPPING_PAGES and HAVE_INVALIDATE_INODE_PAGES
   are not used in Lustre

Signed-off-by: Peng Tao <tao.peng@emc.com>
Change-Id: I3674b032470973ee6b2be5bd05bb7ecc1311d107
Reviewed-on: http://review.whamcloud.com/2322
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Yang Sheng <ys@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-709 build: clean up LC_NR_PAGECACHE/LC_STATFS_DENTRY_PARAM
Peng Tao [Tue, 15 May 2012 06:43:58 +0000 (14:43 +0800)]
LU-709 build: clean up LC_NR_PAGECACHE/LC_STATFS_DENTRY_PARAM

1. nr_pagecache is not exported from 2.6.18 to latest
2. super_operations->statfs takes dentry instead of superblock
   as first parameter since v2.6.18

Signed-off-by: Peng Tao <tao.peng@emc.com>
Change-Id: I93fd5cdd81b83f0896a1305f5b625fa0c82ef515
Reviewed-on: http://review.whamcloud.com/2319
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Yang Sheng <ys@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1329 ptlrpc: resend request on -EINPROGRESS
Johann Lombardi [Tue, 3 Jul 2012 10:54:54 +0000 (12:54 +0200)]
LU-1329 ptlrpc: resend request on -EINPROGRESS

It seems the EINPROGRESS is going to be used for many different
purposes (e.g. on statfs see review 3198). As a result, it sounds
like ptlrpc is the right place to resend requests on EINPROGRESS.
Upper layers can still decide to handle EINPROGRESS by themselves by
setting rq_no_retry_einprogress to 1.

Signed-off-by: Johann Lombardi <johann@whamcloud.com>
Change-Id: Iae2a5976666e66be4f6e71f82c5653e5636ba07d
Reviewed-on: http://review.whamcloud.com/3262
Tested-by: Hudson
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Niu Yawei <niu@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1662 fid: missing unlock in seq_server_proc_write_width
wangdi [Mon, 23 Jul 2012 15:19:18 +0000 (08:19 -0700)]
LU-1662 fid: missing unlock in seq_server_proc_write_width

cfs_mutex_unlock(&seq->lcs_mutex) is missing in the error
handler of seq_server_proc_write_width.

Signed-off-by: wang di <di.wang@whamcloud.com>
Change-Id: I7b3c9ebceb40e06362a4f7c0b8e7fe5654c830ec
Reviewed-on: http://review.whamcloud.com/3451
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Hudson
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
11 years agoLU-1484 lprocfs: refine LC_PROCFS_USERS check
Bobi Jam [Tue, 24 Jul 2012 08:40:31 +0000 (16:40 +0800)]
LU-1484 lprocfs: refine LC_PROCFS_USERS check

In some RHEL patched 2.6.18 kernels, pde_users member is added in
another struct proc_dir_entry_aux instead of in struct proc_dir_entry
in later kernel version of 2.6.23.

Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Change-Id: Icee65893b2fbf4d0c3b3e957cb038be99aaf6eb8
Reviewed-on: http://review.whamcloud.com/3455
Tested-by: Hudson
Reviewed-by: Niu Yawei <niu@whamcloud.com>
Reviewed-by: Lai Siyao <laisiyao@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1564 test: sanity test_200d fails
Kyrylo Shatskyy [Wed, 13 Jun 2012 00:53:46 +0000 (03:53 +0300)]
LU-1564 test: sanity test_200d fails

Because sub-tests in test_200 group and test_201 group are
the parts of one scenario and designed to be run sequentially
one-by-one, so each of which depends on previous.
Correspondingly, tests from 200a to 200h and from 201a to 201c
have been refactored into single test_200 test.

Xyratex-bug-id: MRP-346
Signed-off-by: Kyrylo Shatskyy <kyrylo_shatskyy@xyratex.com>
Reviewed-by: Bruce Korb <bruce_korb@xyratex.com>
Reviewed-by: Alexander Zarochentsev <alexander_zarochentsev@xyratex.com>
Change-Id: I9f0b5b76bdb0a060cf6062199d8acf7ea355f7ce
Reviewed-on: http://review.whamcloud.com/3184
Tested-by: Hudson
Reviewed-by: Yu Jian <yujian@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1606 api: Move defines from lustre_idl.h to lustre_user.h
Christopher J. Morrone [Wed, 18 Jul 2012 01:29:31 +0000 (18:29 -0700)]
LU-1606 api: Move defines from lustre_idl.h to lustre_user.h

Move some defines that user's require to be able to use
the api's get_stripe functions.  Since lustre_user.h is the
place designated in lustre_idl.h as the place for definitions
needed both in the IDL and in user space, and since lustre_idl.h
will not compile in user space, it seems logical to relocate
these definitions there.

Signed-off-by: Christopher J. Morrone <morrone2@llnl.gov>
Change-Id: Ib0133553f902d68222c16c5fc5b99cd5a9cd033d
Reviewed-on: http://review.whamcloud.com/3426
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Li Wei <liwei@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1606 api: Include stddef.h in ll_fiemap.h
Christopher J. Morrone [Wed, 18 Jul 2012 00:40:10 +0000 (17:40 -0700)]
LU-1606 api: Include stddef.h in ll_fiemap.h

In user space only, include stddef.h in ll_fiemap.h to define
such things as size_t.

Signed-off-by: Christopher J. Morrone <morrone2@llnl.gov>
Change-Id: Ia5efa7758d27952009a50c479b3e02d4b6b7c9b6
Reviewed-on: http://review.whamcloud.com/3425
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Li Wei <liwei@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1620 lnet: Make asym router failure parameters tunable
Joseph Herring [Wed, 12 May 2010 23:49:47 +0000 (16:49 -0700)]
LU-1620 lnet: Make asym router failure parameters tunable

Make the asymmetric router failure parameters tunable.

Change-Id: Ie36f79d01c35d4c11c4532187abdeb9473ea60b4
Signed-off-by: Christopher J. Morrone <morrone2@llnl.gov>
Reviewed-on: http://review.whamcloud.com/3371
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Lai Siyao <laisiyao@whamcloud.com>
Reviewed-by: Liang Zhen <liang@whamcloud.com>
Reviewed-by: Doug Oucharek <doug@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-812 build: fix "make rpms" failure
Liu Xuezhao [Tue, 24 Jul 2012 02:29:05 +0000 (10:29 +0800)]
LU-812 build: fix "make rpms" failure

In FC15 (2.6.40-4) the generated kernel headers have moved
from "include/linux" to "include/generated".
Update lustre.spec.in to fix the break of "make rpms".

Signed-off-by: Liu Xuezhao <xuezhao.liu@emc.com>
Change-Id: I5059b5f9bb4f77aa3fc05ced9612b0a46e5f2386
Reviewed-on: http://review.whamcloud.com/3444
Tested-by: Hudson
Reviewed-by: Yang Sheng <ys@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
11 years agoLU-1582 tests: do not force sync before failover
Andreas Dilger [Fri, 29 Jun 2012 05:22:42 +0000 (23:22 -0600)]
LU-1582 tests: do not force sync before failover

Commit 17a69cf25ed0991e04d85c259f4294dc59734e1e forced sync on every
target on a facet before it was failed, in order to fix failures in
recovery-small.sh test_105() due to initial client connect failing.
This was causing local-node testing to hang in replay-ost-single.sh
test_8b() because the client mountpoint was being sync'd and would
never finish due to a fail_loc preventing any RPCs from completing.

Syncing the targets before failover also defeats the purpose of many
recovery tests.  Instead, recovery-small test_105 creates a file on
the to-be-failed OST and writes to it, to trigger the exp_need_sync
and commit the new client export, without incorrectly forcing every
OST to be sync'd for every test that is using facet_failover().

Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: Ied5f2ecf1f6523d4163916b24e5c4281902d500c
Reviewed-on: http://review.whamcloud.com/3239
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Jinshan Xiong <jinshan.xiong@whamcloud.com>
Reviewed-by: Niu Yawei <niu@whamcloud.com>
11 years agoLU-1664 build: Feed spl/zfs Module.symvers to osd-zfs
Li Wei [Tue, 24 Jul 2012 08:28:36 +0000 (16:28 +0800)]
LU-1664 build: Feed spl/zfs Module.symvers to osd-zfs

This hunk, from Orion, should also be included to make sure that osd-zfs
can access spl/zfs's Module.symvars files.

Change-Id: Ic4e514c1f1f8f026353c4a8c686c0e4c389663a1
Signed-off-by: Li Wei <liwei@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/3454
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Alex Zhuravlev <bzzz@whamcloud.com>
Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-723 fsfilt: Remove EXT4_EXT_HAS_NO_TREE macro
Prakash Surya [Mon, 11 Jun 2012 16:48:25 +0000 (09:48 -0700)]
LU-723 fsfilt: Remove EXT4_EXT_HAS_NO_TREE macro

This change removes the EXT4_EXT_HAS_NO_TREE macro and any logic which
is based on the definition of this macro. From now on, this macro is
assumed to be defined.

Signed-off-by: Prakash Surya <surya1@llnl.gov>
Change-Id: I13af4c44d7cafef53fe45cf5e143053054d6ee50
Reviewed-on: http://review.whamcloud.com/3080
Tested-by: Hudson
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: James Simmons <uja.ornl@gmail.com>
11 years agoLU-1235 osd: debug patch for transaction commit callback
Bobi Jam [Tue, 12 Jun 2012 09:31:12 +0000 (17:31 +0800)]
LU-1235 osd: debug patch for transaction commit callback

During transaction commit phase, osd_trans_commit_cb accesses an
invalid memeroy, this patch insert some assertions to find out the
possible faulty point.

Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Change-Id: Idb39a6946fbd5d8a96ee6915f828f557ccebadb6
Reviewed-on: http://review.whamcloud.com/2394
Tested-by: Hudson
Reviewed-by: Mike Pershin <tappro@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Liang Zhen <liang@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1337 vfs: kernel 3.3 hides vfsmount guts
Liu Xuezhao [Mon, 9 Jul 2012 08:54:09 +0000 (16:54 +0800)]
LU-1337 vfs: kernel 3.3 hides vfsmount guts

3.3 starts hiding vfsmount guts series (move from
include/linux/mount.h to fs/mount.h, kernel commit
7d6fec45a5131918b51dcd76da52f2ec86a85be6).

Add HAVE_HIDE_VFSMOUNT_GUTS macro to differentiate it.

Signed-off-by: Liu Xuezhao <xuezhao.liu@emc.com>
Change-Id: If69b4f75e74a64d2c07c082053e639b75585bfc9
Reviewed-on: http://review.whamcloud.com/3399
Tested-by: Hudson
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Yang Sheng <ys@whamcloud.com>
11 years agoLU-1471 tests: check rpcidmapd service in setup-nfs.sh
Yu Jian [Mon, 16 Jul 2012 03:40:05 +0000 (11:40 +0800)]
LU-1471 tests: check rpcidmapd service in setup-nfs.sh

The rpcidmapd system service is not in SLES11 distro, which
caused "service: no such service rpcidmapd" error while running
setup-nfs.sh. This patch fixes the above issue by checking the
service before restarting or stopping it.

Test-Parameters: testlist=parallel-scale-nfsv3,parallel-scale-nfsv4
Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: I0571d9ce3850fda246c1cd43b6c9019de30398e1
Reviewed-on: http://review.whamcloud.com/3400
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Wei Liu <sarah@whamcloud.com>
Reviewed-by: Minh Diep <mdiep@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1418 osc: remove DEADLOCK error messages
Alexander.Boyko [Thu, 17 May 2012 12:48:09 +0000 (16:48 +0400)]
LU-1418 osc: remove DEADLOCK error messages

Deadlock is impossible for the current code, and the check
exist from some previous version. It can be removed.

Signed-off-by: Alexander Boyko <alexander_boyko@xyratex.com>
Xyratex-bug-id: MRP-497
Change-Id: Ifbd4270739894c946553952d86ff931c4c707791
Reviewed-on: http://review.whamcloud.com/2825
Tested-by: Hudson
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Jinshan Xiong <jinshan.xiong@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-709 compat: remove all reference to LINUX_VERSION_CODE
Peng Tao [Mon, 23 Jul 2012 05:26:48 +0000 (13:26 +0800)]
LU-709 compat: remove all reference to LINUX_VERSION_CODE

All LINUX_VERSION_CODE refereces are used to note kernel version older
than 2.6.18. So we can drop them all.

Affected variables are inlined as much as possible to minimize compat
code definition.

Signed-off-by: Peng Tao <tao.peng@emc.com>
Change-Id: If3b94ceb5e21db8f96b11526399e9cd8f174fd6f
Reviewed-on: http://review.whamcloud.com/3442
Tested-by: Hudson
Reviewed-by: Yang Sheng <ys@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Prakash Surya <surya1@llnl.gov>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
11 years agoLU-1622 lnet: hash MEs on wildcard portal
Liang Zhen [Wed, 11 Jul 2012 16:47:01 +0000 (00:47 +0800)]
LU-1622 lnet: hash MEs on wildcard portal

This patch addressed a few more issues in BZ21619.

One issue is, we should always attach ME with ignore-bits on a list
instead of hash-table, no matter it's wildcard portal or unique
portal, because message could match buffers with various match-bits
if they also had ignore-bits, which means if user set both match-bits
and ignore-bits for MEs on unique portal, incoming message could
never be able to find them because they only search MEs hashed by
match-bits even those MEs with different match-bits can also fit
them after ignoring some bits.

the reason that nobody complained about it is because Lustre doesn't
have such use-case(posting ME with ignore-bits on unique portal).

The second issue fixed by this patch is, If multiple services share
one portal but use match-bits to identify different buffer types,
we still have performance issue because all buffers are attached
on one list and searching for typed buffer could be long and
expensive. So we should hash buffers on wildcard portal as well.

This is the reason that LNet selftest performance is not so good
if we mixed PING test and BRW test, because PING and BRW are sharing
on portal, so request buffers for them are attached on the same list,
BRW requests are processed much slower than PING requests, so there
will be a long search for request buffer of PING.

Signed-off-by: Liang Zhen <liang@whamcloud.com>
Change-Id: I0d2c6330dd231d369e2a86ced2b8374c0c96dbf9
Reviewed-on: http://review.whamcloud.com/3376
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-by: Doug Oucharek <doug@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoNew version 2.2.91 2.2.91 v2_2_91 v2_2_91_0
Oleg Drokin [Thu, 19 Jul 2012 16:02:57 +0000 (12:02 -0400)]
New version 2.2.91

Change-Id: I4da67fd6e61749478a4eeecf9667116f4a64d077
Signed-off-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1415 tests: Explain refresh_disk()
Li Wei [Wed, 18 Jul 2012 10:04:29 +0000 (18:04 +0800)]
LU-1415 tests: Explain refresh_disk()

Following Brian's suggestion in http://review.whamcloud.com/2981, this
patch adds a brief comment on refresh_disk(), which could be a bit
tricky to understand at first sight.

Change-Id: I83d583b5149c7648be4c68e31b6ff31f57410d9f
Signed-off-by: Li Wei <liwei@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/3430
Tested-by: Hudson
Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Mike Pershin <tappro@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-709 build: clean up i_mutex/i_private/quota_info macros
Peng Tao [Tue, 15 May 2012 06:28:48 +0000 (14:28 +0800)]
LU-709 build: clean up i_mutex/i_private/quota_info macros

1. struct inode has i_mutex instead of i_sem since v2.6.16
2. struct inode has i_private since v2.6.17
3. struct quota_info uses mutex instead semaphore since v2.6.17
4. move LC_SECURITY_PLUG to under v2.6.27

Signed-off-by: Peng Tao <tao.peng@emc.com>
Change-Id: I587d18a8d63927aa0a668268d252f71e7f224b50
Reviewed-on: http://review.whamcloud.com/2318
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Yang Sheng <ys@whamcloud.com>
Reviewed-by: Niu Yawei <niu@whamcloud.com>
11 years agoLU-709 build: clean up percpu_counter and sb.s_time_gran
Peng Tao [Tue, 15 May 2012 05:55:03 +0000 (13:55 +0800)]
LU-709 build: clean up percpu_counter and sb.s_time_gran

1. linux/percpu_counter.h exists since v2.6.12,
   percpu_counter_init() takes two arguments since v2.6.17,
   and percpu_counter is not used in lustre
2. superblock.s_time_gran exists since v2.6.12
3. get_sb_time_gran() doesn't exist since v2.6.12

Signed-off-by: Peng Tao <tao.peng@emc.com>
Change-Id: Id80b0bfc702f509218fcf29d01334894d122b9c8
Reviewed-on: http://review.whamcloud.com/2317
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Yang Sheng <ys@whamcloud.com>
11 years agoLU-709 build: clean up quota_read, follow_link and RCU
Peng Tao [Thu, 14 Jun 2012 07:08:42 +0000 (15:08 +0800)]
LU-709 build: clean up quota_read, follow_link and RCU

1. super_operations.quota_read exists since v2.6.12
2. inode_operations->follow_link returns a cookie since v2.6.13
3. RCU exists since v2.6.12

Signed-off-by: Peng Tao <tao.peng@emc.com>
Change-Id: Id79df5dcb0e3db9eb424337fd71d896ed8fefcd6
Reviewed-on: http://review.whamcloud.com/2316
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Yang Sheng <ys@whamcloud.com>
Reviewed-by: Niu Yawei <niu@whamcloud.com>
11 years agoLU-1432 ptlrpc: LBUG in lprocfs_free_client_stats()
Lai Siyao [Fri, 29 Jun 2012 09:15:41 +0000 (17:15 +0800)]
LU-1432 ptlrpc: LBUG in lprocfs_free_client_stats()

* serialize connect and target obd cleanup to avoid connect
  accessing unexisted data structure.
* connect export refcounting cleanup.

Signed-off-by: Hiroya Nozaki <nozaki.hiroya@jp.fujitsu.com>
Signed-off-by: Lai Siyao <laisiyao@whamcloud.com>
Change-Id: I0a9e8a58ecdc1212565a478f4a758755a1b95f99
Reviewed-on: http://review.whamcloud.com/3244
Tested-by: Hudson
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1614 ptlrpc: fix for AT proc read
Liang Zhen [Tue, 10 Jul 2012 04:01:01 +0000 (12:01 +0800)]
LU-1614 ptlrpc: fix for AT proc read

This patch fixed a couple of issues in AT proc read:
- calculate of buffer offset is wrong
- current lustre proc read must be less than one page

Signed-off-by: Liang Zhen <liang@whamcloud.com>
Change-Id: Ieef8ad458ca70a4146e0b61fbe8a32b35577ccdf
Reviewed-on: http://review.whamcloud.com/3365
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-by: wangdi <di.wang@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1600 lnet: another race in lnet_nid2peer_locked
Liang Zhen [Tue, 10 Jul 2012 16:02:10 +0000 (00:02 +0800)]
LU-1600 lnet: another race in lnet_nid2peer_locked

We fixed a race for the case that LNet is shutdown while the second
thread is in progress of creating a peer, but there is another race
if LNet is shutdown _before_ lnet_nid2peer_locked is called by the
second thread, for example: called from lnet_parse() by LND thread,
in this case, we should return error w/o trying to create peer,
otherwise the shuttingdown thread could think peer-table is empty
and move to the next step with leaving a peer on deathrow list.

This patch also passed error code into lnet_finalize() when it's
called from lnet_drop_delayed_msg_list, otherwise lnet_finalize()
will treat the message as a complete message and hit assertion
because the message is actually incomplet.

We also did some coding style cleanup in this patch.

Signed-off-by: Liang Zhen <liang@whamcloud.com>
Change-Id: I445298d639173840687412b11da41006ddc20c10
Reviewed-on: http://review.whamcloud.com/3369
Tested-by: Hudson
Reviewed-by: Bobi Jam <bobijam@whamcloud.com>
Reviewed-by: Doug Oucharek <doug@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1129 obdfilter: handle race condition of recreating objects
Yu Jian [Fri, 13 Jul 2012 09:45:53 +0000 (17:45 +0800)]
LU-1129 obdfilter: handle race condition of recreating objects

During OST recovery, a race can happen while handling replayed
OST_WRITE request during the MDS->OST orphan recovery period to
recreate missing objects, which can trigger ASSERTION(diff >= 0)
failure.

This patch handles the above issue by adding obd->obd_recovering
into the assertion to check whether the OST is in recovery or not.
If it's in recovery and diff < 0, then no assertion failure occurs,
the object has been recreated. If the OST is not in recovery and
diff < 0, then the assertion failure occurs.

Signed-off-by: Yu Jian <yujian@whamcloud.com>
Change-Id: If486bc8221cf4b9d53b6de53e8ec14a4f3174b45
Reviewed-on: http://review.whamcloud.com/3391
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Mike Pershin <tappro@whamcloud.com>
Reviewed-by: Jinshan Xiong <jinshan.xiong@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1627 build: Change lbuild to use https://github.com/zfsonlinux
Chris Gearing [Thu, 12 Jul 2012 15:18:09 +0000 (08:18 -0700)]
LU-1627 build: Change lbuild to use https://github.com/zfsonlinux

lbuild fetches zfs/spl source using git@github.com:zfsonlinux which
requires a ssh keypair with git hub. If we change this to
https://github.com/zfsonlinux then we will not need local keys for
github as part of the builds.

The option to change the default by using SPLZFSGITREPO should be kept.

So change
git clone ${SPLZFSGITREPO:-"git@github.com:zfsonlinux"}/$pkg.git $pkg
to
git clone ${SPLZFSGITREPO:-"https://github.com/zfsonlinux"}/$pkg.git $pkg

Signed-off-by: Chris Gearing <chris@whamcloud.com>
Change-Id: I34c870b9aea98001115ff43291a6f95c5d268016
Reviewed-on: http://review.whamcloud.com/3387
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Joshua Kugler <joshua@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1415 tests: Add ZFS support to replay_barrier()
Li Wei [Fri, 18 May 2012 03:09:55 +0000 (11:09 +0800)]
LU-1415 tests: Add ZFS support to replay_barrier()

This patch makes sure that ZFS pool states changed by replay_barrier()
calls are restored upon subsequent failovers.

This was originally developed by Mikhail Pershin under ORI-156.  See
http://review.whamcloud.com/1490.

Change-Id: Ie8d26bc8a6d6103be3c504e0af80e8499096df58
Whamcloud-bug-id: ORI-156
Signed-off-by: Li Wei <liwei@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/2981
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov>
Reviewed-by: Mike Pershin <tappro@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1415 tests: Adapt oos to the new grant and osd-zfs behavior
Li Wei [Wed, 30 May 2012 13:49:09 +0000 (21:49 +0800)]
LU-1415 tests: Adapt oos to the new grant and osd-zfs behavior

This was originally developed by Johann Lombardi under ORI-151 and
ORI-398.  See http://review.whamcloud.com/1545 and
http://review.whamcloud.com/1665.

Change-Id: If778df1d672e2adb182cb801036aaef950d80af4
Whamcloud-bug-id: ORI-151
Whamcloud-bug-id: ORI-398
Signed-off-by: Li Wei <liwei@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/2978
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Johann Lombardi <johann@whamcloud.com>
Reviewed-by: Niu Yawei <niu@whamcloud.com>
11 years agoLU-1415 tests: Cope with ZFS's delayed space updates
Li Wei [Tue, 22 May 2012 04:22:27 +0000 (12:22 +0800)]
LU-1415 tests: Cope with ZFS's delayed space updates

Space-related updates, such as changes of dn_used and of
ds_used_bytes, are invisible to opertions like dt_attr_get() and
dt_statfs() until after corresponding transaction groups start to be
synchronized.  This patch makes sure the tests do a dt_sync() for each
OST before checking space usage.

Change-Id: Icff5106c0c4c493ef981bc14a0aabac8da16764d
Signed-off-by: Li Wei <liwei@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/2909
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Alex Zhuravlev <bzzz@whamcloud.com>
Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-812 kernel: AUTOCONF_INCLUDED removed
Wally Wang [Wed, 11 Jul 2012 20:14:20 +0000 (13:14 -0700)]
LU-812 kernel: AUTOCONF_INCLUDED removed

AUTOCONF_INCLUDED has been removed since 2.6.37 and no longer there
has <linux/config.h>.

Change-Id: I68611638f61012b2becddc4e0dec613ed2b17c99
Signed-off-by: Wally Wang <wang@cray.com>
Reviewed-on: http://review.whamcloud.com/2146
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: James Simmons <uja.ornl@gmail.com>
Reviewed-by: Peng Tao <tao.peng@emc.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
11 years agoLU-812 kernel: remove smp_lock.h
James Simmons [Fri, 13 Jul 2012 13:35:06 +0000 (09:35 -0400)]
LU-812 kernel: remove smp_lock.h

smp_lock.h has been removed from kernel since 2.6.39.

Signed-off-by: James Simmons <uja.ornl@gmail.com>
Signed-off-by: Wally Wang <wang@cray.com>
Change-Id: Ia15eb8befe4d7c8c1ab29b5860c7ec5bfffbf563
Reviewed-on: http://review.whamcloud.com/2000
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Peng Tao <tao.peng@emc.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
11 years agoLU-1415 tests: Handle OFD procfs changes
Li Wei [Wed, 20 Jun 2012 02:41:34 +0000 (10:41 +0800)]
LU-1415 tests: Handle OFD procfs changes

This patch handles these procfs entry changes in OFD:

  - obdfilter.*.mntdev -> osd-*.*.mntdev
  - obdfilter.*.<cache_related> -> osd-*.*.<cache_related>

Change-Id: Ie9fac69446b5c183bc8eed44a86338ef62197529
Signed-off-by: Li Wei <liwei@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/2934
Reviewed-by: Yu Jian <yujian@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
11 years agoLU-1481 build: quiet warnings due to := assignment
Andreas Dilger [Tue, 5 Jun 2012 18:53:31 +0000 (12:53 -0600)]
LU-1481 build: quiet warnings due to := assignment

When running automake to generate Makefiles, newer compilers will
warn about non-portable assignments:

  lvfs/autoMakefile.am:52: `:='-style assignments are not portable

The use of ":=" vs. "=" determines when conditional expressions
are evaluated, but have absolutely no use for constant assignments,
so there is no point in using them at all in that case.

Convert all ":=" to "=" for constant assignments, leaving the
decision to use ":=" for conditional assignments until later.

Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: I76caedbc1368a0d2fe02137c46b46c36fc3a500c
Reviewed-on: http://review.whamcloud.com/3129
Tested-by: Hudson
Reviewed-by: Brian J. Murrell <brian@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1209 tests: get dir size after rename in sanity 133d
wangdi [Sat, 7 Jul 2012 11:22:01 +0000 (04:22 -0700)]
LU-1209 tests: get dir size after rename in sanity 133d

Because rename will change the directory size, so we
should get dir size after rename.

Signed-off-by: wang di <di.wang@whamcloud.com>
Change-Id: I1fee9f75cbc0163d10e9de7f04fcf00a4d5c1d9d
Reviewed-on: http://review.whamcloud.com/3298
Reviewed-by: Yu Jian <yujian@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
11 years agoLU-1604 osd: don't call md_ucred() for OFD
Johann Lombardi [Thu, 5 Jul 2012 20:34:23 +0000 (22:34 +0200)]
LU-1604 osd: don't call md_ucred() for OFD

There are several places where md_ucred() is called in osd-ldiskfs.
However, this function is specific to the metadata stack (it ends
up calling a callback in mdd layer) and should not be called with OFD.

osd_{push,pop}_ctxt() have thus been modified to be no-op when the
parent is of DT type. __osd_oi_insert() has also been changed to call
md_ucred() only when dealing with a MDT device.

This hack is required to keep compatibility with current MDT stack,
but should be removed as soon as the LOD/OSP is landed.

Signed-off-by: Johann Lombardi <johann@whamcloud.com>
Change-Id: I931f0376a78cb2aff8b00a99b1c20618d4093e7d
Reviewed-on: http://review.whamcloud.com/3286
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Alex Zhuravlev <bzzz@whamcloud.com>
Reviewed-by: Li Wei <liwei@whamcloud.com>
Reviewed-by: Mike Pershin <tappro@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1342 test: sanity-quota test_29
Bobi Jam [Wed, 25 Apr 2012 13:44:09 +0000 (21:44 +0800)]
LU-1342 test: sanity-quota test_29

sanity-quota test_29() needs sleep enough seconds to take into account
both service time and network latency.

Signed-off-by: Bobi Jam <bobijam@whamcloud.com>
Change-Id: Ide2b321fbd104eede17eee109998d40d98fb2d30
Reviewed-on: http://review.whamcloud.com/2601
Tested-by: Hudson
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Yu Jian <yujian@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Jay J Lan <jay.j.lan@nasa.gov>
11 years agoLU-549 llite: Improve statfs performance if selinux is disabled
Keith [Thu, 28 Jun 2012 20:24:54 +0000 (13:24 -0700)]
LU-549 llite: Improve statfs performance if selinux is disabled

Don't disable all security xattrs when selinux is disabled just
disable security.selinux. Disabling all security xattrs was
overreaching and causing issues at a customer site.

This patch is built ontop of the original patch:
commit 7cc542fd4c26ccb117ceb13a47ac8ced3107b9b3.

Signed-off-by: Keith Mannthey <keith@whamcloud.com>
Change-Id: I73482d2ff1f166a48dceb3737df0e9e1b187dc85
Reviewed-on: http://review.whamcloud.com/2955
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Hudson
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1370 obdecho: few fixes about md_echo clients
wangdi [Mon, 14 May 2012 23:45:07 +0000 (16:45 -0700)]
LU-1370 obdecho: few fixes about md_echo clients

1. Remove LU_NOREF flag from echo client ctxt, because it
conflicts with cl ctxt(which has no NOREF flag). So if
echo_client and normal lustre client share the same context
cache, this flag will cause echo module refcount not being
dropped.

2. Ma_lmm_size in ma will be changed during mdd_xxx,
so we can not use that to free the ma_lmm.

3. Remove lu_device from linkage on lu_site during cleanup
 phase.

Signed-off-by: Di Wang <di.wang@whamcloud.com>
Change-Id: I08fec689ef9c8b64284f7400ab0fc4dc362ebf87
Reviewed-on: http://review.whamcloud.com/2784
Reviewed-by: Jinshan Xiong <jinshan.xiong@whamcloud.com>
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1544 llog: remove lfs catinfo command
Gregoire Pichon [Wed, 20 Jun 2012 11:14:53 +0000 (13:14 +0200)]
LU-1544 llog: remove lfs catinfo command

This patch removes the llog catinfo code, both client and server
side, since it has not been maintained and may crash the client
system in some cases.

Signed-off-by: Gregoire Pichon <gregoire.pichon@bull.net>
Change-Id: Ifa627e61d73a2a416bb548fcc510fb9ca3f36e79
Reviewed-on: http://review.whamcloud.com/3151
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Bob Glossman <bogl@whamcloud.com>
Reviewed-by: Liu Xuezhao <xuezhao.liu@emc.com>
Reviewed-by: Mike Pershin <tappro@whamcloud.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1415 utils: Process all {get,set}_param arguments
Andreas Dilger [Fri, 29 Jun 2012 09:04:27 +0000 (03:04 -0600)]
LU-1415 utils: Process all {get,set}_param arguments

Process all arguments to get_param and set_param before returning an
error, to allow accessing one of multiple parameters that have changed
during code reorganization.

Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Signed-off-by: Li Wei <liwei@whamcloud.com>
Change-Id: Icb3019c8bd6ed83ffbfeae1fa54084f34df00c1e
Reviewed-on: http://review.whamcloud.com/3245
Tested-by: Hudson
Reviewed-by: Yu Jian <yujian@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
11 years agoLU-1603 ofd: Fix null pointer dereferences in ldiskfs_statfs()
Li Wei [Fri, 6 Jul 2012 13:12:38 +0000 (21:12 +0800)]
LU-1603 ofd: Fix null pointer dereferences in ldiskfs_statfs()

On an OSS running the OFD-OSD stack, unmounting all OSTs makes all OFD
and OSD keys (i.e., lu_context_key) quiescent.  Next time the OSTs are
remounted, ofd_stack_init() will try to refill the quiescent osd_key
into the context, resulting in a null pointer dereference in
ldiskfs_statfs().  Before the problem motivating LCT_QUIESCENT and
LCT_REMEMBER can be revisited, the solution is to refill contexts
after new device types have been "revived" via allocations of their
first device instances.

Change-Id: I4835afb88fae460c21501df95765afd29e570860
Signed-off-by: Li Wei <liwei@whamcloud.com>
Reviewed-on: http://review.whamcloud.com/3353
Tested-by: Hudson
Reviewed-by: Jinshan Xiong <jinshan.xiong@whamcloud.com>
Reviewed-by: Fan Yong <yong.fan@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1595 build: add 70-column commit comment wrap
Andreas Dilger [Wed, 20 Jun 2012 18:59:47 +0000 (12:59 -0600)]
LU-1595 build: add 70-column commit comment wrap

Add a Vim modeline "textwidth=70" to force commit comments to wrap
at 70 characters, to match standard Git commit style guidelines.

Reduce the commit summary length to 62 characters, which is the
maximum that can fit after "Subject: [PATCH] " when formatting a
patch for email.  This is still above the recommended 50 character
summary limit, but with the leading ticket number and component,
there wouldn't be very much room left for a meaningful comment.

Signed-off-by: Andreas Dilger <adilger@whamcloud.com>
Change-Id: I04f8e0278a18a869b78acbf9dd1b55adf354500c
Reviewed-on: http://review.whamcloud.com/3284
Tested-by: Hudson
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Li Wei <liwei@whamcloud.com>
Reviewed-by: Bruce Korb <bruce_korb@xyratex.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
11 years agoLU-1339 crc: remove #warning
Alexander.Boyko [Mon, 9 Jul 2012 08:16:11 +0000 (12:16 +0400)]
LU-1339 crc: remove #warning

Patch removes #warning from 32bit code path.

Signed-off-by: Alexander Boyko <alexander_boyko@xyratex.com>
Change-Id: I9183caf6bc1a98da0c78086ecd300e7cf6880eb0
Reviewed-on: http://review.whamcloud.com/3354
Tested-by: Hudson
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Tested-by: Maloo <whamcloud.maloo@gmail.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>