Whamcloud - gitweb
fs/lustre-release.git
14 years agobump version to 1.8.2.52 v1_8_2_52
Johann Lombardi [Fri, 12 Mar 2010 14:47:12 +0000 (15:47 +0100)]
bump version to 1.8.2.52

14 years agob=22241 call sync instead of fsync on local cancel to reduce stack usage
Johann Lombardi [Fri, 12 Mar 2010 14:28:41 +0000 (15:28 +0100)]
b=22241 call sync instead of fsync on local cancel to reduce stack usage

i=oleg
i=andreas

sync_on_lock_cancel is needed for recovery when async journal is enabled,
but we actually just need to make sure that metadata blocks have hit the
journal, so doing a fs sync should be enough and should consume less
stack (just create an empty handle and commmit it).

14 years agob=21686 simplify client disconnect code on server side
Johann Lombardi [Fri, 12 Mar 2010 14:17:49 +0000 (15:17 +0100)]
b=21686 simplify client disconnect code on server side

o=liang
i=johann
i=shadow

attach 25564
This patch was reverted because we were chasing some regression.
It is now safe to re-apply.

14 years agob=20837 incomplete test output for ost-pools
Manoj Joseph [Thu, 11 Mar 2010 22:39:56 +0000 (15:39 -0700)]
b=20837 incomplete test output for ost-pools

i=nathan.rutman
i=grev

Instead of creating many 10M files to fill the OST, create 9 files
of size OST_SIZE/10 each.

14 years agob=19917 Repeated atomic allocation failures
Dmitry Zogin [Thu, 11 Mar 2010 17:20:31 +0000 (12:20 -0500)]
b=19917 Repeated atomic allocation failures

 Comment change.

 o=he.huang
 i=dmitry.zogin

14 years agob=22035 workaround patch
hongchao.zhang [Mon, 8 Mar 2010 10:15:12 +0000 (18:15 +0800)]
b=22035 workaround patch

disable the per-thread data (current->journal_info)
containing the lock info during I/O to work around
the issue for short tem

i=hongchao.zhang@sun.com

14 years agob=21927 recovery-small, replay-single 18 <-> 20 interop fix
Elena Gryaznova [Wed, 10 Mar 2010 19:12:26 +0000 (22:12 +0300)]
b=21927 recovery-small, replay-single 18 <-> 20 interop fix

i=Mikhail.Pershin

14 years agob=21927 replay-vbr 18 <-> 20 interop fix
Elena Gryaznova [Wed, 10 Mar 2010 19:03:54 +0000 (22:03 +0300)]
b=21927 replay-vbr 18 <-> 20 interop fix

i=Mikhail.Pershin

14 years agob=22194 Print a dash in empty lfs quota grace columns
Andrew Perepechko [Wed, 10 Mar 2010 18:59:50 +0000 (21:59 +0300)]
b=22194 Print a dash in empty lfs quota grace columns

Polish lfs quota output for easier processing with awk/sed

o=Christopher Morrone (LLNL)
i=Andrew Perepechko
i=ZhiYong Tian

14 years agob=21927 t-f: use the global variables for facets mount points
Elena Gryaznova [Wed, 10 Mar 2010 18:49:36 +0000 (21:49 +0300)]
b=21927 t-f: use the global variables for facets mount points

i=Andreas.Dilger

14 years agob=21938 rq_invalid_rqset should be a bitfield
Johann Lombardi [Wed, 10 Mar 2010 22:50:44 +0000 (23:50 +0100)]
b=21938 rq_invalid_rqset should be a bitfield

14 years agob=21815 Test case for clear stale nid-stats hash.
yangsheng [Wed, 10 Mar 2010 16:44:32 +0000 (00:44 +0800)]
b=21815 Test case for clear stale nid-stats hash.

i=johann
i=wang.yibin

14 years agob=19933 control DCACHE_LUSTRE_INVALID flag with MDS_INODELOCK_LOOKUP lock
Fan Yong [Wed, 10 Mar 2010 16:11:51 +0000 (00:11 +0800)]
b=19933 control DCACHE_LUSTRE_INVALID flag with MDS_INODELOCK_LOOKUP lock

"DCACHE_LUSTRE_INVALID" is controlled by "MDS_INODELOCK_LOOKUP" lock which is corresponding to "IT_LOOKUP", do not skip invalidate for other intent.

i=robert.read
i=johann

14 years agob=20997 Cannot send after transport shutdown
Dmitry Zogin [Wed, 10 Mar 2010 15:13:45 +0000 (10:13 -0500)]
b=20997 Cannot send after transport shutdown

 Clear imp_vbr_failed flag upon eviction

 i=robert.read
 i=alexander.zarochentsev

14 years agob=21938 use req->rq_set itself during recovery
hongchao.zhang [Sun, 7 Mar 2010 06:52:55 +0000 (14:52 +0800)]
b=21938 use req->rq_set itself during recovery

during recovery, uses req->rq_set itself to replay the request
instead of ptlrpcd_recovery_pc

i=tappro@sun.com
i=johann@sun.com

14 years agob=22069 introduce server major version for b1_8 and b2_0 quota utils interoperability
Fan Yong [Wed, 10 Mar 2010 02:51:44 +0000 (10:51 +0800)]
b=22069 introduce server major version for b1_8 and b2_0 quota utils interoperability

Introduce server major version for b1_8 and b2_0 quota utils interoperability.

i=andrew.perepechko
i=robert.read

14 years agob=21983 Use CFS_ALLOC_IO instead of _STD in llap_from_page_with_lockh
Dmitry Zogin [Tue, 9 Mar 2010 14:48:23 +0000 (09:48 -0500)]
b=21983 Use CFS_ALLOC_IO instead of _STD in llap_from_page_with_lockh

During an ll_readahead under ll_readpage, we have seen the the
OBD_SLAB_ALLOC hang under ldlm_pools_shrink when trying to lock
a page that is already locked by the readahead code.

Using CFS_ALLOC_IO instead of CFS_ALLOC_STD will prevent
ldlm_pools_shrink from actually freeing slab, so the call path
that blocks indefinitely can never happen.

 i=adilger
 i=dmitry.zogin
 i=johann

14 years agob=22177 inc nlink by 2 instead of 1 in mds_orphan_add_link()
Johann Lombardi [Fri, 5 Mar 2010 22:17:53 +0000 (23:17 +0100)]
b=22177 inc nlink by 2 instead of 1 in mds_orphan_add_link()

i=adilger
i=dmitry

Fix regression introduced by 19640.
ext3_inc_count() can reset nlink to 1 when the directory is indexed and
inode->i_nlink == 2. Work around the problem by incrementing nlink by 2
instead of 1.

14 years agob=17591 sanity-benchmark s/MOUNT/DIR/ cleanup
Elena Gryaznova [Fri, 5 Mar 2010 20:07:53 +0000 (23:07 +0300)]
b=17591 sanity-benchmark s/MOUNT/DIR/ cleanup

i=Robert.Read

14 years agob=22169 t-f:start_client_loads () wait the background threads to start
Elena Gryaznova [Fri, 5 Mar 2010 19:54:25 +0000 (22:54 +0300)]
b=22169 t-f:start_client_loads () wait the background threads to start

i=Robert.Read

14 years agob=22169 t-f cleanup: new do_nodev (), do_nodesv () functions
Elena Gryaznova [Fri, 5 Mar 2010 19:51:38 +0000 (22:51 +0300)]
b=22169 t-f cleanup: new do_nodev (), do_nodesv () functions

i=Robert.Read

14 years agob=22095 MDS operations hang when issued with lfs setstripe on a degraded OST
Dmitry Zogin [Thu, 4 Mar 2010 18:03:14 +0000 (13:03 -0500)]
b=22095 MDS operations hang when issued with lfs setstripe on a degraded OST

 Change the locking order in mds_lookup()

 o=jfilizetti@sms-fed.com
 i=johann
 i=adilger

14 years agob=21900 ost-pools test_25: FAIL
Dmitry Zogin [Thu, 4 Mar 2010 16:16:05 +0000 (11:16 -0500)]
b=21900 ost-pools test_25: FAIL

 Make ost-pools test_25 more robust

 i=manoj.joseph

14 years agob=22127 lustre 1.8.2 lfs permissions Patch corrects cfs_curproc_euid() logic.
Dmitry Zogin [Thu, 4 Mar 2010 02:59:50 +0000 (21:59 -0500)]
b=22127 lustre 1.8.2 lfs permissions Patch corrects cfs_curproc_euid() logic.

 o=bschubert@ddn.com
 i=oleg.drokin
 i=johann

14 years agob=21066 ost-pools test_14 should not assert that files are from a specific OST
Manoj Joseph [Wed, 3 Mar 2010 22:32:21 +0000 (15:32 -0700)]
b=21066 ost-pools test_14 should not assert that files are from a specific OST

Round-robin allocation test should not assert that files are allocated
in strict round-robin fashion.

i=nathan.rutman
i=grev

14 years agob=17258 fix error with make rpms after configure --disable-tests
Brian J. Murrell [Wed, 3 Mar 2010 16:51:40 +0000 (11:51 -0500)]
b=17258 fix error with make rpms after configure --disable-tests

If one configures lustre with "--disable-tests" a subsequent "make rpms"
will fail as it would still try to package up the lustre-tests RPM.
Fixing this provided the opportunity to fix another wart, that being the
subst'ing the configure arguments into the lustre.spec.  Now they are
passed as value with "--define 'configure_args ...'" when calling rpmbuild.

14 years agob=21726 stop waitting for next replay transno if shutdown
hongchao.zhang [Sun, 28 Feb 2010 23:30:04 +0000 (07:30 +0800)]
b=21726 stop waitting for next replay transno if shutdown

if the system is shutting down, wake up service thread blocked
to wait for next replay transno during recovery, then all the
references held by queued requests can be dropped and device
can be stopped.

i=hongchao.zhang@sun.com
i=tappro@sun.com

14 years agob=20101 lfs getstripe -d test for sanity 27w
yangsheng [Tue, 2 Mar 2010 15:32:26 +0000 (23:32 +0800)]
b=20101 lfs getstripe -d test for sanity 27w

i=adilger
i=robert

14 years agob=19873 sanity: Memory leaks detected, FAILed to clean up
Dmitry Zogin [Tue, 2 Mar 2010 13:58:41 +0000 (08:58 -0500)]
b=19873 sanity: Memory leaks detected, FAILed to clean up

 Patch backport from bz 20650, attachment 26416 - introduce lprocfs counter on IRQs
 The lc_sum_irq counter is used to calculate memory freed on the interrupt.

 i=adilger
 i=andrew.perepechko

14 years agobump version to 1.8.2.51
Johann Lombardi [Mon, 1 Mar 2010 23:03:59 +0000 (00:03 +0100)]
bump version to 1.8.2.51

14 years agob=17197 fix typo for OBD_CALC_STRIPE_RPC_END_ALIGN fix typo in OBD_CALC_STRIPE_RPC_EN...
Wang Di [Mon, 1 Mar 2010 16:56:20 +0000 (11:56 -0500)]
b=17197 fix typo for OBD_CALC_STRIPE_RPC_END_ALIGN fix typo in OBD_CALC_STRIPE_RPC_END_ALIGN and do not aligned 1M for stride readahead.

i=ericm
i=johann

14 years agob=21816 return approximate block/inode usage when OSTs are down
Andrew Perepechko [Mon, 1 Mar 2010 16:27:44 +0000 (19:27 +0300)]
b=21816 return approximate block/inode usage when OSTs are down

Really return approximate block/inode usage when OSTs are down.
The old version erroneously skipped oqctl copying on error which
prevented this from working properly.

i=Johann Lombardi
i=ZhiYong Tian

14 years agob=20989 lov_merge_lvb()) ASSERTION(spin_is_locked(&lsm->lsm_lock)) failed
Dmitry Zogin [Mon, 1 Mar 2010 13:39:47 +0000 (08:39 -0500)]
b=20989 lov_merge_lvb()) ASSERTION(spin_is_locked(&lsm->lsm_lock)) failed

 Protect lli->lli_smd pointer updates with lli->lli_lock.

 o=oleg.drokin
 i=johann
 i=dmitry.zogin

14 years agob=21815 Avoid operating lustre-hash internal structures directly.
yangsheng [Mon, 1 Mar 2010 13:24:33 +0000 (21:24 +0800)]
b=21815 Avoid operating lustre-hash internal structures directly.

i=johann
i=nathan

14 years agob=22097 mount.lustre fails to pass some options to mount()
Johann Lombardi [Fri, 26 Feb 2010 21:38:02 +0000 (22:38 +0100)]
b=22097 mount.lustre fails to pass some options to mount()

i=yangsheng
i=dmitry

14 years agob=18649 set wait_recovery_complete() MAX value to max recovery time estimated
Elena Gryaznova [Fri, 26 Feb 2010 19:12:28 +0000 (22:12 +0300)]
b=18649 set wait_recovery_complete() MAX value to max recovery time estimated

i=Mikhail.Pershin

14 years agob=21992 sanity-quota interop: proc path fix for 2.0 servers
Elena Gryaznova [Fri, 26 Feb 2010 16:19:30 +0000 (19:19 +0300)]
b=21992 sanity-quota interop: proc path fix for 2.0 servers

i=Johann.Lombardi

14 years agob=21255 parallel-scale statahead test fix
Elena Gryaznova [Fri, 26 Feb 2010 16:06:28 +0000 (19:06 +0300)]
b=21255 parallel-scale statahead test fix

i=Vladimir.Saveliev
i=Andrew.Perepechko

use mpi for create/delete files instead of createmany and rm

14 years agob=21380 make dist seems to exclude the "darwin" bits
Brian J. Murrell [Thu, 25 Feb 2010 17:59:50 +0000 (12:59 -0500)]
b=21380 make dist seems to exclude the "darwin" bits

Include all of the darwin bits in the distribution tarball created with
make dist.

i=adilger

14 years agob=21911 fix for double release of ibc_lock in o2iblnd
Maxim Patlasov [Thu, 25 Feb 2010 12:39:51 +0000 (15:39 +0300)]
b=21911 fix for double release of ibc_lock in o2iblnd

i=liang
Re-acquire ibc_lock in kiblnd_post_tx_locked(). Add extra reference to conn before calling kiblnd_post_tx_locked() to avoid scenario when conn disappears inside kiblnd_post_tx_locked().

14 years agob=17952 allow relative pathnames
Brian J. Murrell [Wed, 24 Feb 2010 18:53:21 +0000 (13:53 -0500)]
b=17952 allow relative pathnames

This patch allows one to give relative pathnames to --with-linux and
friends.

i=adilger
i=dmitry.zogin

14 years agob=19336 post landing cleanups
Brian J. Murrell [Wed, 24 Feb 2010 18:53:20 +0000 (13:53 -0500)]
b=19336 post landing cleanups

Remove generic find_linux_devel_paths()
- now that both the rhel5 and sles method files have their own particular
  version of this method, remove this hacky-trying-to-work-for-both versions
  from lbuild.
Remove a block of what is now redundant code.
Remove the comments from the target files describing what happened with
this bug.
Align the sles10 and sles11 target files:
- include the rpmfix specifier in the sles10 file
- remove the EXTRA_VERSION_DELIMETER from the sles10 file
- change the TARGET_DELIMETER to FLAVOR_DELIMETER in the sles11 file
Some whitespace cleanups.

i=wangyb
i=yangsheng

14 years agob=21968 conf-sanity changes for config mgs and mds are not combined
Elena Gryaznova [Wed, 24 Feb 2010 15:53:14 +0000 (18:53 +0300)]
b=21968 conf-sanity changes for config mgs and mds are not combined

i=Nathan.Rutman

includes t-f:combined_mgs_mds ()

14 years agob=21097 quote the passing on of it's own arguments through $@
Jian Yu [Tue, 23 Feb 2010 07:14:00 +0000 (15:14 +0800)]
b=21097 quote the passing on of it's own arguments through $@

Fix do_nodes() in acc-sm test-framework to quote the passing on
of it's own arguments through $@ to do_node().

i=jian.yu
i=edg

14 years agob=21869 improve filter_fid2dentry() error message
Landen [Mon, 22 Feb 2010 03:19:38 +0000 (11:19 +0800)]
b=21869 improve filter_fid2dentry() error message

i=landen
i=girish

14 years agob=20433 decrease the usage of memory on clients.
Landen [Mon, 22 Feb 2010 03:08:46 +0000 (11:08 +0800)]
b=20433 decrease the usage of memory on clients.

1. On clients, recycle dentries and inodes unused.
2. Delete the code related to ll_deathrow(att 6215 in bug 1443). It
   is useless now.

i=adilger
i=oleg

14 years agob=21137 ext4 extent allocation is slower than in ext3
dzogin [Fri, 19 Feb 2010 16:19:17 +0000 (11:19 -0500)]
b=21137 ext4 extent allocation is slower than in ext3

 Increase the default value of MB_DEFAULT_ORDER2_REQS to 8, enlarge ext4
 preallocation table for 2048 4K blocks extents creation.

 i=alex.zhuravlev
 i=andrew.perepechko

14 years agoUpdated with Oracle copyright. v1_8_2_50
Terry Rutledge [Fri, 19 Feb 2010 17:12:09 +0000 (10:12 -0700)]
Updated with Oracle copyright.
2010-01-29  Sun Microsystems, Inc.
        * version 3.1.1

-------------------------------------------------------------------------------

14 years agob=22074 incorrect triggering of synchronous IO
Brian Behlendorf [Wed, 17 Feb 2010 17:49:23 +0000 (12:49 -0500)]
b=22074 incorrect triggering of synchronous IO

The OSC can mistakenly fall back to synchronous IO when the max_dirty_mb
limit is reached and no write requests have yet been issued.  This can
occur when the dirty pages are spread over many files all of which are
below the optimal request size.

o=behlendorf1@llnl.gov (Brian Behlendorf)
i=adilger
i=tom.wang

14 years agob=20383 fix errant m4 "dnl" usage
Brian J. Murrell [Wed, 17 Feb 2010 16:51:46 +0000 (11:51 -0500)]
b=20383 fix errant m4 "dnl" usage

Some dnl() usage seems to have been causing some errors in the resulting
configure script.

o=tappro
i=brian
i=adilger

14 years agob=21913 conf-sanity test_35b FAIL due to client reconnection has not been triggered
Dmitry Zogin [Fri, 12 Feb 2010 22:27:32 +0000 (17:27 -0500)]
b=21913 conf-sanity test_35b FAIL due to client reconnection has not been triggered

 Improve the robustness of conf-sanity test_35b.

 i=johann

14 years agoRevert "b=20989 fix race in nfs export code"
Johann Lombardi [Wed, 17 Feb 2010 11:49:30 +0000 (12:49 +0100)]
Revert "b=20989 fix race in nfs export code"

This reverts commit 09f7487326e23dfed472bfbf752b5ee58c225855.

Conflicts:

lustre/ChangeLog

14 years agob=21829 fix broken llobdstat and add a counter parameter
Brian J. Murrell [Fri, 12 Feb 2010 20:52:12 +0000 (15:52 -0500)]
b=21829 fix broken llobdstat and add a counter parameter

Need to make sure we limit the search for OBD stats files to the obdfilter
subdirectory of "/proc/fs/lustre".
Add a counter argument to limit the number of items returned when using the
interval parameter.
Fix lots of whitespace atrocities as well as better format some of the code.

i=cliff
i=wangyb

14 years agob=13520 PTLRPC_PAUSE_REQ checking should ignore PING.
Eric Mei [Fri, 12 Feb 2010 18:48:08 +0000 (11:48 -0700)]
b=13520 PTLRPC_PAUSE_REQ checking should ignore PING.

r=nathan
r=wangdi

14 years agob=20355 Add $(PTHREAD_LIBS) to lctl and lfs build
Cliff White [Wed, 3 Feb 2010 19:41:54 +0000 (11:41 -0800)]
b=20355 Add $(PTHREAD_LIBS) to lctl and lfs build

i=johann

$(PTHREAD_LIBS) is needed to compile lctl and lfs for BG/P

14 years agob=20918 t-f max recovery time estimation
Elena Gryaznova [Thu, 11 Feb 2010 20:51:33 +0000 (23:51 +0300)]
b=20918 t-f max recovery time estimation

i=Nathan.Rutman
i=Brian.Murrell

inform user about the estimated maximum recovery time value
to help him to set the server FAILOVER period properly

14 years agob=21932 disable some tests on NFSCLIENT
Elena Gryaznova [Thu, 11 Feb 2010 20:29:40 +0000 (23:29 +0300)]
b=21932 disable some tests on NFSCLIENT

i=Oleg.Drokin

disable simul,  write_disjoint, write_append_truncate for NFSCLIENT mode;
setstripe on lustre client (nfsserver) instead of nfs client for ior test

14 years agob=21919 Optimize quota_ctl operations by sending requests in parallel
Andrew Perepechko [Thu, 11 Feb 2010 19:48:35 +0000 (22:48 +0300)]
b=21919 Optimize quota_ctl operations by sending requests in parallel

i=Johann Lombardi
i=ZhiYong Tian

Based on a patch from Joseph Herring (LLNL).
Send MDS->OST quota_ctl requests in parallel, do not resend.
Compiled from two attachments in the ticket.

14 years agob=18030 dealock fix
Alexander.Zarochentev [Thu, 11 Feb 2010 17:28:08 +0000 (20:28 +0300)]
b=18030 dealock fix

start the transaction earlier in llog_lvfs_destroy to get transaction start
and inode mutex lock nested properly.

i=adilger
i=alex.zhuravlev

14 years agob=21264 workaround dd bus error
Alexander.Zarochentev [Thu, 11 Feb 2010 15:59:25 +0000 (18:59 +0300)]
b=21264 workaround dd bus error

A buggy coreutils/gettext combination workaround.  Suppressing dd xfer
statistic makes dd do not call gnu gettext library and avoid crashing.

i=robert.read

14 years agob=15057 fix file ownerships in lustre-modules RPM
Brian J. Murrell [Wed, 10 Feb 2010 18:47:45 +0000 (13:47 -0500)]
b=15057 fix file ownerships in lustre-modules RPM

The files in the lustre-modules RPM were not being set with a correct
owner and were therefore just using what was on the filesystem.

i=yangsheng
i=wangyb

14 years agob=21665 a small fix for "lfs osts"
Andrew Perepechko [Wed, 10 Feb 2010 16:34:14 +0000 (19:34 +0300)]
b=21665 a small fix for "lfs osts"

a=Joseph Herring
i=Andreas Dilger
i=Andrew Perepechko

Actually, we don't want to traverse the directory tree,
so return a positive value from sem_init to terminate
the traversal before it starts.

14 years agob=21882 handle SLV==1 on client side
Vitaly Fertman [Wed, 10 Feb 2010 16:12:16 +0000 (19:12 +0300)]
b=21882 handle SLV==1 on client side

Initialize ldlm pool SLV to 0 on client side to handle SLV==1 obtained from server correctly

i=rreed
i=adilger

14 years agob=21882 lru resize SLV can get stuck
Vitaly Fertman [Wed, 10 Feb 2010 16:11:32 +0000 (19:11 +0300)]
b=21882 lru resize SLV can get stuck

calculate SLV with a greater precision to not lose small changes due to interger math truncation;
round up SLV only if the amount of granted locks less than the limit to not get stuck with this SLV

i-adilger
i=green

14 years agob=21666 prevent use of OFED source dir instead of headers
Brian J. Murrell [Mon, 8 Feb 2010 19:41:49 +0000 (14:41 -0500)]
b=21666 prevent use of OFED source dir instead of headers

Try to determine if the user is pointing configure at the OFED source
directory intead of the devel/headers directory and error out of
configure if so and display an informative warning.

i=panda
i=whitebear

14 years agob=19553 Ignore broken cancel_dirty_page() in OFED 1.4.1
Brian J. Murrell [Mon, 8 Feb 2010 19:34:16 +0000 (14:34 -0500)]
b=19553 Ignore broken cancel_dirty_page() in OFED 1.4.1

OFED 1.4.1 had a broken implementation of cancel_dirty_page for SLES10.
This patch detects that and ignores the function if found.

i=johann
i=yangsheng

14 years agob=19336 Get rid of the EXTRA_VERSION_DELIMETER shenanigans
Brian J. Murrell [Mon, 8 Feb 2010 17:54:41 +0000 (12:54 -0500)]
b=19336 Get rid of the EXTRA_VERSION_DELIMETER shenanigans

We used to carry around a bunch of baggage in order to specify what kind
of delimeter to put between the version and "extra version".  The truth of
the matter is that this should always be "-".

This patch includes some support for a build system developer to force
an uncached rebuild of all products.

i=wangyb
i=yangsheng

14 years agob=21882 lru-resize comment cleanups i=adilger
Vitaly Fertman [Thu, 4 Feb 2010 22:39:32 +0000 (01:39 +0300)]
b=21882 lru-resize comment cleanups i=adilger

14 years agob=20718 sanity-quota test_14b: improve error message
Elena Gryaznova [Wed, 3 Feb 2010 19:25:32 +0000 (22:25 +0300)]
b=20718 sanity-quota test_14b: improve error message

i=Andrew.Perepechko

make check_runas_id_ret () more verbose

14 years agob=21961 add changelog entry
Johann Lombardi [Thu, 4 Feb 2010 22:24:44 +0000 (23:24 +0100)]
b=21961 add changelog entry

14 years agob=21961 (17914) ignore trailing -mdc when determining index number
Nathan Rutman [Thu, 4 Feb 2010 21:58:08 +0000 (13:58 -0800)]
b=21961 (17914) ignore trailing -mdc when determining index number

a=jinshan.xiong
i=nathan
i=h.huang

14 years agob=21990 add parallel-scale EXCEPT list
Elena Gryaznova [Thu, 4 Feb 2010 16:30:52 +0000 (19:30 +0300)]
b=21990 add parallel-scale EXCEPT list

i=Minh.Diep

14 years agob=21966 avoid divide-by-zero in lprocfs_rd_import()
Andreas Dilger [Thu, 4 Feb 2010 12:37:29 +0000 (13:37 +0100)]
b=21966 avoid divide-by-zero in lprocfs_rd_import()
i=johann

14 years agob=16909 simplify MDT/OST service start message
Johann Lombardi [Thu, 4 Feb 2010 08:15:55 +0000 (09:15 +0100)]
b=16909 simplify MDT/OST service start message

i=nathan
i=adilger

14 years agob=21686 add changelog entry
Johann Lombardi [Thu, 4 Feb 2010 08:12:44 +0000 (09:12 +0100)]
b=21686 add changelog entry

14 years agob=16909 Simplify MDT/OST service start message
Landen [Thu, 4 Feb 2010 07:57:49 +0000 (15:57 +0800)]
b=16909 Simplify MDT/OST service start message

i=nathan
i=adilger

14 years agob=18489 test_116, test_118k cleanup
Elena Gryaznova [Wed, 3 Feb 2010 21:58:09 +0000 (00:58 +0300)]
b=18489 test_116, test_118k cleanup

i=Andrew.Perepechko (panda)
i=Andreas.Dilger

14 years agob=21953 use separate failover counter for each facet
Elena Gryaznova [Wed, 3 Feb 2010 21:37:08 +0000 (00:37 +0300)]
b=21953 use separate failover counter for each facet

i=Mikhail.Pershin (tappro)

14 years agoadd 1.8.3 section in the lnet changelog
Johann Lombardi [Wed, 3 Feb 2010 18:46:33 +0000 (19:46 +0100)]
add 1.8.3 section in the lnet changelog

14 years agob=21147 call build_lqs only from generic_quota_on
Andrew Perepechko [Wed, 3 Feb 2010 17:13:59 +0000 (20:13 +0300)]
b=21147 call build_lqs only from generic_quota_on

i=Johann Lombardi
i=ZhiYong Tian

14 years agob=21259 "lfs check" is only allowed for root.
Dmitry Zogin [Wed, 3 Feb 2010 16:54:11 +0000 (11:54 -0500)]
b=21259 "lfs check" is only allowed for root.

 Code cleanup around obd_class_*() functions and sanity test for non-root lfs check

 i=adilger
 i=andrew.perepechko

14 years agob=21632 Kernel update to OEL5.4 2.6.18-164.11.1.0.1.el5.
yangsheng [Tue, 2 Feb 2010 15:15:37 +0000 (23:15 +0800)]
b=21632 Kernel update to OEL5.4 2.6.18-164.11.1.0.1.el5.

i=johann

14 years agobump version to 1.8.2.50
Johann Lombardi [Wed, 3 Feb 2010 18:35:51 +0000 (19:35 +0100)]
bump version to 1.8.2.50

14 years agoadd changelog section for 1.8.3
Johann Lombardi [Wed, 3 Feb 2010 18:34:07 +0000 (19:34 +0100)]
add changelog section for 1.8.3

14 years agoRevert debug patch from b=21364
Johann Lombardi [Tue, 2 Feb 2010 10:49:29 +0000 (11:49 +0100)]
Revert debug patch from b=21364

This reverts commit 818de83d3200ae48dae7096500ba0118b8f95976.
I inadvertently committed my debug patch.

14 years agob=20970 need add an additional barrier for write_disjoint
Landen [Fri, 29 Jan 2010 07:13:42 +0000 (15:13 +0800)]
b=20970 need add an additional barrier for write_disjoint

i=rread
i=grev

14 years agob=21948 skip parallel grouplock test for NFSCLIENT mode
Elena Gryaznova [Thu, 28 Jan 2010 15:06:18 +0000 (18:06 +0300)]
b=21948 skip parallel grouplock test for NFSCLIENT mode

i=Johann

14 years agob=21900 ost-pools test_25: FAIL: /mnt/lustre/d0.ost-pools/d25/file1 not allocated...
Dmitry Zogin [Wed, 27 Jan 2010 18:04:27 +0000 (13:04 -0500)]
b=21900 ost-pools test_25: FAIL: /mnt/lustre/d0.ost-pools/d25/file1 not allocated from OSTs 0.

 Modify ost-pools test_25 to wait for MDS-OST connection to re-establish.

 i=johann

14 years agodelete test_12 in sanity-quota.sh
Landen [Wed, 27 Jan 2010 07:48:03 +0000 (15:48 +0800)]
delete test_12 in sanity-quota.sh

14 years agob=21686 fail the request if its obd_device stopping
root [Sat, 30 Jan 2010 07:02:05 +0000 (15:02 +0800)]
b=21686 fail the request if its obd_device stopping

in ldlm_handle_enqueue, the request should be failed
if its obd_device had been marked as "fail"(obd_fail=1),
which will be set during umount.

i=johann@sun.com
i=oleg.drokin@sun.com

14 years agoMerge branch 'b1_8' of git@git.lustre.org:prime/lustre into b1_8
Johann Lombardi [Tue, 2 Feb 2010 10:09:15 +0000 (11:09 +0100)]
Merge branch 'b1_8' of git@git.lustre.org:prime/lustre into b1_8

14 years agoForgot to update these when I updated the lustre/ChangeLog. Added date
Terry Rutledge [Sat, 30 Jan 2010 20:57:52 +0000 (13:57 -0700)]
Forgot to update these when I updated the lustre/ChangeLog.  Added date
and another supported version of OFED.

14 years agob=21364 debug patch
Johann Lombardi [Fri, 29 Jan 2010 00:53:20 +0000 (01:53 +0100)]
b=21364 debug patch

14 years agob=21815 lustre_hash_rehash_key() should use lh_read_unlock()
Johann Lombardi [Wed, 27 Jan 2010 14:39:49 +0000 (15:39 +0100)]
b=21815 lustre_hash_rehash_key() should use lh_read_unlock()

lh_read_lock() is no-op if rehash is disabled, so we should
use lh_read_unlock() in this function.
This should not have any consequence, but better to fix it.

14 years agob=21815 move assertion under write lock
Johann Lombardi [Wed, 27 Jan 2010 14:27:07 +0000 (15:27 +0100)]
b=21815 move assertion under write lock

14 years agob=21815 print more debug info in lustre_hash_exit when assertion fails
Johann Lombardi [Wed, 27 Jan 2010 12:49:37 +0000 (13:49 +0100)]
b=21815 print more debug info in lustre_hash_exit when assertion fails

14 years agob=19405 do not flag a request as rq_replay for non replayable imports
Vladimir V. Saveliev [Tue, 26 Jan 2010 16:12:40 +0000 (17:12 +0100)]
b=19405  do not flag a request as rq_replay for non replayable imports

i=ericm
i=robert

14 years agob=21906 LBUG doesn't print stack trace on sles9 because show_stack not exported
Johann Lombardi [Tue, 26 Jan 2010 16:04:06 +0000 (17:04 +0100)]
b=21906 LBUG doesn't print stack trace on sles9 because show_stack not exported

14 years agoRevert "b=21097 fix md5sum error in metadata-updates.sh" 1.8.2 v1_8_2_RC4
Johann Lombardi [Sat, 23 Jan 2010 00:36:25 +0000 (01:36 +0100)]
Revert "b=21097 fix md5sum error in metadata-updates.sh"

This reverts commit 89b5d6f0e40b35bcc93d6830568e823d67e8f364.

14 years agob=17682 fix time unit in message
Johann Lombardi [Fri, 22 Jan 2010 23:03:01 +0000 (00:03 +0100)]
b=17682 fix time unit in message