Whamcloud - gitweb
fs/lustre-release.git
9 years agob=24037 Changes of 2.6.32 kernel.
yangsheng [Fri, 3 Dec 2010 14:24:49 +0000 (22:24 +0800)]
b=24037 Changes of 2.6.32 kernel.

--request_queue new member and new macro;
--quota interface name changes;
--capa_* relate changes;

i=adilger
i=vitaly

9 years agob=19706 porting patch for master.
yangsheng [Fri, 3 Dec 2010 14:24:17 +0000 (22:24 +0800)]
b=19706 porting patch for master.

Since 2.6.32, kernel add pde_user in proc_dir_entry to protect
directory entry from deleted. So we don't need extra lock in
lprocfs_remove().

i=hongchao.zhang
i=johann

9 years agob=23575 O2iblnd credit deadlock regression
Isaac Huang [Mon, 29 Nov 2010 07:08:24 +0000 (00:08 -0700)]
b=23575 O2iblnd credit deadlock regression

This fixed a regression of bug 14425.

i=liang

9 years agob=23289 cleanup assertion on atomic
Jian Yu [Mon, 29 Nov 2010 13:37:56 +0000 (21:37 +0800)]
b=23289 cleanup assertion on atomic

Remove assertion on atomic from addref functions, which is less helpful,
also, add macros to reduce atomic_read() in assertion.

o=Liang Zhen
i=andreas.dilger
i=mikhail.pershin
i=vitaly.fertman

9 years agob=24228 fix test duration check to be more accurate
elena.gryaznova@oracle.com [Fri, 26 Nov 2010 22:31:09 +0000 (01:31 +0300)]
b=24228 fix test duration check to be more accurate

i=vitaly

9 years agob=24202 Fix broken PATHS in autoconf file.
Wang Yibin [Wed, 24 Nov 2010 03:06:02 +0000 (11:06 +0800)]
b=24202 Fix broken PATHS in autoconf file.

There should be a space between paths.

i=sheng.yang
i=brian.murrell

9 years agob=24188 rehash patch makes oops in quota_search_lqs
Landen [Thu, 25 Nov 2010 09:03:37 +0000 (17:03 +0800)]
b=24188 rehash patch makes oops in quota_search_lqs

Problem is: we should do "cfs_hash_getref(qctxt->lqc_lqs_hash)"
when we search and create a new quota hash entry.
This patch does some optimization for lquota code so that lquota
can take one less reference for lqs.

i=landen
i=vitaly.fertman

9 years agob=24051 sanity test_76 fix
Andrew Perepechko [Tue, 23 Nov 2010 23:38:37 +0000 (02:38 +0300)]
b=24051 sanity test_76 fix

Due to per-cpu slab caches, "active" stat from /proc/slabinfo
is not always accurate.

i=ZhiYong Tian

9 years agob=24020 lustre doesn't start with ext4 based ldiskfs.
Elena Gryaznova [Fri, 26 Nov 2010 08:08:35 +0000 (11:08 +0300)]
b=24020 lustre doesn't start with ext4 based ldiskfs.

i=Andrew.Perepechko

load jbd2 module in load modules

9 years agob=23996 grep lfs getstripe output for pools correctly
Vladimir Saveliev [Fri, 26 Nov 2010 06:51:07 +0000 (09:51 +0300)]
b=23996 grep lfs getstripe output for pools correctly

i=zam

9 years agob=23842 some issues with recent changes to build/nn-final-symbol-list.txt
Aleksandr Guzovskiy [Wed, 24 Nov 2010 23:01:26 +0000 (18:01 -0500)]
b=23842 some issues with recent changes to build/nn-final-symbol-list.txt

i=adilger

9 years agob=19808 fix d_obtain_alias() misuse due to compat macro.
yangsheng [Wed, 24 Nov 2010 14:09:01 +0000 (22:09 +0800)]
b=19808 fix d_obtain_alias() misuse due to compat macro.

o=Adreas.Dilger
i=yangsheng
i=Oleg.Drokin

ll_iget_for_nfs() get a change from d_alloc_anon() to d_obtain_alias().
The former would NOT release the inode reference if there was an error, while
the latter does.  Hide this detail in the d_obtain_alias() compat macro, for
kernels that do not have this function, and remove it from the error handling
code at the caller, where it would drop the inode reference twice in rare error
cases on all kernels WITH d_obtain_alias() support (RHEL5 and newer).

9 years agob=15599 remove OBD_CONNECT_LAYOUTLOCK from supported bits
Andreas Dilger [Wed, 24 Nov 2010 23:34:33 +0000 (16:34 -0700)]
b=15599 remove OBD_CONNECT_LAYOUTLOCK from supported bits

Until the rest of the layout lock support is landed, it doesn't
make sense for the MDT to advertize the OBD_CONNECT_LAYOUTLOCK flag
at connection time.

Remove the flag from MDS_CONNECT_SUPPORTED until the actual
MDS_INODEBITS_LAYOUT ibits lock is landed and functional.

9 years agob=22598
Vitaly Fertman [Wed, 24 Nov 2010 01:30:44 +0000 (04:30 +0300)]
b=22598

disable thandle debug usage statistics

9 years agob=22965 enable lockless truncate
Vitaly Fertman [Tue, 23 Nov 2010 21:44:50 +0000 (00:44 +0300)]
b=22965 enable lockless truncate

i=zam
i=adilger

9 years agob=23170 lockless enqueue lbug
Vitaly Fertman [Tue, 23 Nov 2010 21:44:49 +0000 (00:44 +0300)]
b=23170 lockless enqueue lbug

lockless enqueue still could be called if first enqueue attempt set
lockless ops but found a conflict and therefore tried to enqueue again
later when the confict was resolved.

i=green
i=ericm

9 years agob=14836 Fail if "lfs setstripe" has positional parameters
Andreas Dilger [Tue, 23 Nov 2010 21:17:32 +0000 (14:17 -0700)]
b=14836 Fail if "lfs setstripe" has positional parameters

Since 1.6.1 it has been possible to use named options -c/-i/-s to
specify the layout parameters for "lfs setstripe".

Since 1.8.0 the use of the old positional parameters has generated
a warning that they are deprecated.

The code was set to fail building at 2.1.0.0, so this needs to be
resolved before moving to a release build.  Instead of simply removing
the conditional code, change "lfs setstripe" to return an error if
positional parameters are used.  That would avoid the confusion of
creating files with names of the old {size}, {offset}, and {count}
parameters, instead of returning a meaningful error to the user.

9 years agob=24022 don't return NULL in lprocfs_alloc_stats to satisfy userspace error checker
Jian Yu [Sat, 20 Nov 2010 12:15:18 +0000 (20:15 +0800)]
b=24022 don't return NULL in lprocfs_alloc_stats to satisfy userspace error checker

o=Liang Zhen
i=vitaly.fertman

9 years agob=23994 hide EMFILE error in ptlrpc_start_threads().
Eric Mei [Mon, 22 Nov 2010 15:47:59 +0000 (08:47 -0700)]
b=23994 hide EMFILE error in ptlrpc_start_threads().

i=vitaly.fertman

9 years agob=23985 Fix upstream jbd2 issue.
yangsheng [Mon, 22 Nov 2010 16:01:50 +0000 (00:01 +0800)]
b=23985 Fix upstream jbd2 issue.

The jbd2_stats_proc_init() was placed on wrong location in
jbd2_journal_init_dev().  This may cause /proc/fs/jdb2/<dev>/*
cannot be created when using external journal device.

i=adilger
i-johann

9 years agob=23535 sgpdd-survey.sh should check for sg_map
Elena Gryaznova [Fri, 19 Nov 2010 18:02:00 +0000 (21:02 +0300)]
b=23535 sgpdd-survey.sh should check for sg_map

i=Brian.Murrell

check that iokit sgpdd-survey and sg_map are installed

9 years agob=23533 ldiskfs to depend on latest e2fsprogs
Brian J. Murrell [Fri, 19 Nov 2010 17:22:14 +0000 (01:22 +0800)]
b=23533 ldiskfs to depend on latest e2fsprogs

ldiskfs should depend on the latest e2fsprogs package so that users
are required to keep their e2fsprogs RPMs up to date.

i=mjmac
i=adilger

9 years agob=23460 remove expensive memory check
Vitaly Fertman [Tue, 23 Nov 2010 21:03:08 +0000 (00:03 +0300)]
b=23460 remove expensive memory check

i=green

remove cfs_mem_in_cache check when checking validity of pages and locks

9 years agob=23460 make page state changes tracking optinal
Vitaly Fertman [Tue, 23 Nov 2010 21:03:09 +0000 (00:03 +0300)]
b=23460 make page state changes tracking optinal

i=green

The stats are unused by anybody, yet they take away several percents of read
speed. Let's make them optional.

9 years agob=23289 no need to always start transaction for attr_set(atime) on close
Jian Yu [Tue, 23 Nov 2010 05:24:11 +0000 (13:24 +0800)]
b=23289 no need to always start transaction for attr_set(atime) on close

To reduce unnecessary transaction overhead on file closing.

o=Liang Zhen
i=mikhail.pershin
i=di.wang

9 years agob=21525 debug enhancements + minor comment updates
minhdiep [Mon, 22 Nov 2010 20:29:15 +0000 (13:29 -0700)]
b=21525 debug enhancements + minor comment updates

a=Issac
i=Liang
i=Maxim

- when a message is dropped, the lnet counters should be increased;
- /proc/sys/lnet/peers should contain some data for debugging, like
  the last_alive time stamp.

9 years agob=18551 adapt ldlm_namespace to new cfs_hash
Liang Zhen [Fri, 19 Nov 2010 14:44:02 +0000 (22:44 +0800)]
b=18551 adapt ldlm_namespace to new cfs_hash

Use new cfs_hash to implement ldlm_namespace

o=Liang Zhen
i=Andreas Dilger
i=Oleg Drokin
i=Vitaly Fertman

9 years agoUpdated for 2.1.0 build 08. 2.0.56.0 v2_0_56_0
Terry Rutledge [Fri, 19 Nov 2010 21:33:04 +0000 (13:33 -0800)]
Updated for 2.1.0 build 08.

9 years agob=24170 checking len of debug string before matching
minhdiep [Thu, 18 Nov 2010 22:04:54 +0000 (15:04 -0700)]
b=24170 checking len of debug string before matching

i=vitaly
i=rahul

We need to check the len before matching the string
to prevent something like net matches neterror

9 years agob=23289 grained lock for ptlrpc service
Jian Yu [Wed, 17 Nov 2010 02:15:26 +0000 (10:15 +0800)]
b=23289 grained lock for ptlrpc service

- cacheline optimization for struct ptlrpc_service
- split ptlrpc_service::srv_lock to three locks:
  . ptlrpc_service::srv_lock
    serialize operations on rqbd and requests queued on srv_req_in_queue
  . ptlrpc_service::srv_rq_lock
    serialized operations on active requests
  . ptlrpc_service::srv_rs_lock
    serialize operations on RS
- cleanup logic in ptlrpc_main()
- remove unused atomic srv_outstanding_replies

o=liang
i=andreas.dilger
i=mikhail.pershin

9 years agob=20153 add IB bonding failover support to o2iblnd
Isaac Huang [Wed, 17 Nov 2010 14:14:46 +0000 (07:14 -0700)]
b=20153 add IB bonding failover support to o2iblnd

O2iblnd changes to support failover events from an IB
bonding IPoIB interface. Mostly to recreate device
specific resources, e.g. listener CMID.

i=isaac
i=liang

9 years agob=19890 fix mgs conf for mds and client for nosvc option
Mikhail Pershin [Thu, 18 Nov 2010 07:14:58 +0000 (10:14 +0300)]
b=19890 fix mgs conf for mds and client for nosvc option

i=zam
i=ericm

9 years agob=24087 reverse order of $LINUX{,_OBJ}/include
Brian J. Murrell [Mon, 15 Nov 2010 21:15:29 +0000 (05:15 +0800)]
b=24087 reverse order of $LINUX{,_OBJ}/include

It is important that /usr/src/linux-...-obj/include is searched for
includes before /usr/src/linux-.../include so that the inclusion of
"include/linux/autoconf.h" picks up the one for the kernel we are
trying to build against, and not the one for the currently running
kernel, which is what is in /usr/src/linux-.../ copy is.

i=wangyb
i=cliff

9 years agob=24095 don't really need configure --with-linux
Brian J. Murrell [Mon, 15 Nov 2010 21:15:28 +0000 (05:15 +0800)]
b=24095 don't really need configure --with-linux

Don't really need to have run configure --with-linux for version-tag.pl
to work.

i=wangyb
i=cliff

9 years agob=24093 not all build files/scripts being distributed
Brian J. Murrell [Mon, 15 Nov 2010 21:15:27 +0000 (05:15 +0800)]
b=24093 not all build files/scripts being distributed

Some files that need to be are not being included in the tarball
when make dist is being run.

i=cliff
i=minh

9 years agob=24091 find_linux_rpms utility
Brian J. Murrell [Wed, 17 Nov 2010 18:27:19 +0000 (02:27 +0800)]
b=24091 find_linux_rpms utility

Write a utility to find the the kernel "devel" package needed.
Restructure some code to facilitate reuse of existing code to
use in the utility.

i=cliff
i=minh

9 years agob=24090 distro and target autodetection
Brian J. Murrell [Wed, 17 Nov 2010 18:27:18 +0000 (02:27 +0800)]
b=24090 distro and target autodetection

Automatically detect the target and distro.  It's in fact a little
crazy that these are two separate options.  They should be merged.

i=cliff
i=minh

9 years agob=24089 avoid reuse cache storage collisions
Brian J. Murrell [Wed, 17 Nov 2010 18:27:17 +0000 (02:27 +0800)]
b=24089 avoid reuse cache storage collisions

This is a hack.  Basically, all nodes do their build and the last one
to store in the cache wins.
A better solution would be for nodes to announce their intention to
build for a given hash and any other nodes that will do the same wait
for the first node to finish and store it.  Subsequent nodes then just
use the first node's stored result.

i=wangyb
i=cliff

9 years agob=24117 params_tree code for YangtzeI
LiuYing [Tue, 16 Nov 2010 01:38:14 +0000 (09:38 +0800)]
b=24117 params_tree code for YangtzeI

part of params_tree code for YangtzeI
-add safe cfs_{v}snprintf
-move linux proc related stuff to params_tree.h

i=andreas
i=alexg

9 years agob=24017 fix the performance issue of lu_cache_shrink
Jian Yu [Tue, 16 Nov 2010 15:38:24 +0000 (23:38 +0800)]
b=24017 fix the performance issue of lu_cache_shrink

- increase busy counter in lu_obj_hop_get, so that we don't have to hack out
  this in llite
- make some changes to cfs_hash:
  . do not need to return anything by cfs_hash_get/put, they are not used by
    anyone, cfs_hash_object also can do the same thing
  . pass cfs_hash to cfs_hash_get/put

o=liang
i=vitaly.fertman

9 years agob=24003 Shows stripe_count: 65535 but should be '-1'.
yangsheng [Tue, 16 Nov 2010 06:04:24 +0000 (14:04 +0800)]
b=24003 Shows stripe_count: 65535 but should be '-1'.

i=johann

9 years agob=23954 MGS device has stopped when we try to start the second mgs
Elena Gryaznova [Tue, 16 Nov 2010 11:18:17 +0000 (14:18 +0300)]
b=23954 MGS device has stopped when we try to start the second mgs

i=Mikhail.Pershin

add test_24b to ALWAYS_EXCEPT list for configuration mgs/mds are not combined

9 years agob=23289 remove global atomic from lustre handle
Jian Yu [Tue, 16 Nov 2010 01:21:43 +0000 (09:21 +0800)]
b=23289 remove global atomic from lustre handle

o=liang
i=andreas.dilger
i=mikhail.pershin

9 years agob=23289 less global lock & atomic for ldlm pool
Jian Yu [Tue, 16 Nov 2010 01:11:37 +0000 (09:11 +0800)]
b=23289 less global lock & atomic for ldlm pool

o=liang
i=oleg.drokin
i=andreas.dilger

9 years agob=22544 delete module_setup.sh
Elena Gryaznova [Tue, 16 Nov 2010 10:53:03 +0000 (13:53 +0300)]
b=22544 delete module_setup.sh

i=Brian.Murrell

9 years agob=20744 Update to latest SLES10 SP3 kernel 2.6.16.60-0.69.1(patchless client).
yangsheng [Tue, 16 Nov 2010 05:57:45 +0000 (13:57 +0800)]
b=20744 Update to latest SLES10 SP3 kernel 2.6.16.60-0.69.1(patchless client).

i=brian

9 years agob=18717 make "lfs check" output consistent on stdout
Elena Gryaznova [Tue, 16 Nov 2010 11:30:43 +0000 (14:30 +0300)]
b=18717 make "lfs check" output consistent on stdout

o=Andreas.Dilger
i=Brian.Murrell
i=Elena.Gryaznova

9 years agob=24086 separate kernel and userspace __u64 checking
Brian J. Murrell [Thu, 11 Nov 2010 19:14:56 +0000 (03:14 +0800)]
b=24086 separate kernel and userspace __u64 checking

Even if we don't configure lustre --with-linux= we still need to determine
what type __u64 is for userspace otherwise the LP*64 macros won't be
properly defined.

i=whitebear
i=panda

9 years agob=23956 change conf-sanity test_37 to be functional on remote setup
Elena Gryaznova [Fri, 12 Nov 2010 17:05:53 +0000 (20:05 +0300)]
b=23956 change conf-sanity test_37 to be functional on remote setup

i=Mikhail.Pershin

fix test_37 to not be skipped on remote setup;
use the existing mdt device instead of create a new one

9 years agob=23289 remove extra debugging from commonly-called ldlm pool functions
Jian Yu [Thu, 11 Nov 2010 14:24:00 +0000 (22:24 +0800)]
b=23289 remove extra debugging from commonly-called ldlm pool functions

o=andreas
i=liang
i=jian.yu

9 years agob=23289 don't need transaction on mdt_close if it's not orphan
Jian Yu [Thu, 11 Nov 2010 14:16:14 +0000 (22:16 +0800)]
b=23289 don't need transaction on mdt_close if it's not orphan

o=liang
i=mikhail.pershin
i=alex.zhuravlev

9 years agob=23049 canonicalize disk names
Elena Gryaznova [Thu, 11 Nov 2010 12:26:42 +0000 (15:26 +0300)]
b=23049 canonicalize disk names

o=Brian.Murrell
i=grev

9 years agob=22180 fix the incorrect MDSDEV check
Elena Gryaznova [Thu, 11 Nov 2010 13:49:35 +0000 (16:49 +0300)]
b=22180 fix the incorrect MDSDEV check

i=Nathan.Rutman

new t-f is_blkdev ()
check MDSDEV on mds instead of local client
test_17, test_18 changes for config mgs and mds are not combined

9 years agob=22157 combined mgs/mds fix for single node setup
Elena Gryaznova [Thu, 11 Nov 2010 23:45:55 +0000 (02:45 +0300)]
b=22157 combined mgs/mds fix for single node setup

i=Mikhail.Pershin

for configuration combined mgs/mds on single node setup
we do not need to unload the modules because conf-sanity keeps
the mgs mounted during all tests

9 years agob=21420 fix aggregate mdt stats
LiuYing [Fri, 12 Nov 2010 01:06:06 +0000 (09:06 +0800)]
b=21420 fix aggregate mdt stats

mdt_counter_incr() should act upon obd->md_stats instead of
obd->obd_stats, the former is for recording md ops, while
the later for obd ops.

o=bobijam
i=andreas
i=liuying

9 years agob=18233 fix read+truncate livelock
Andrew Perepechko [Thu, 11 Nov 2010 13:47:30 +0000 (16:47 +0300)]
b=18233 fix read+truncate livelock

The race fix for ll_readpage which creates new (just truncated) pages
and truncate which truncates these pages.

i=Vitaly Fertman
i=Mike Pershin

9 years agob=24001 fix conf-sanity 53.
Eric Mei [Mon, 8 Nov 2010 17:32:11 +0000 (10:32 -0700)]
b=24001 fix conf-sanity 53.

i=jian.yu
i=grev

threads_max and threads_min could be the same.

9 years agob=23289 lprocfs_counter cleanup
Jian Yu [Mon, 8 Nov 2010 10:09:36 +0000 (18:09 +0800)]
b=23289 lprocfs_counter cleanup

Eliminate unnecessary atomic operations in lprocfs_counter.

o=liang
i=andreas.dilger
i=maxim.patlasov

9 years agob=23049 various t-f.sh patches
Elena Gryaznova [Mon, 8 Nov 2010 16:09:06 +0000 (19:09 +0300)]
b=23049 various t-f.sh patches

o=Brian.Murrell
i=grev

rundbench is a bash script;
obdfilter-survey is a bash script;
don't su if MPI_USER == "";

9 years agob=23049 set path to truncate
Elena Gryaznova [Mon, 8 Nov 2010 15:57:40 +0000 (18:57 +0300)]
b=23049 set path to truncate

i=Brian.Murrell

9 years agob=20101 a fix for lfs getstripe --pool
Andreas Dilger [Mon, 8 Nov 2010 09:21:34 +0000 (12:21 +0300)]
b=20101 a fix for lfs getstripe --pool

a=Jacques-Charles Lafoucriere (CEA)
a=Andreas Dilger
i=Andreas Dilger
i=Andrew Perepechko

9 years agob=24094 add kver; update kdir, kobjdir
Brian J. Murrell [Mon, 8 Nov 2010 11:20:51 +0000 (19:20 +0800)]
b=24094 add kver; update kdir, kobjdir

Add a new macro to allow the caller to simply define which kernel
he wants to build for.
Refactor the kdir and kobjdir to function more properly.
Remove krequires.  Nothing is using it anymore.

i=wangyb
i=cliff

9 years agob=24088 Quiet unused "path" variable warning.
adilger [Mon, 8 Nov 2010 11:20:50 +0000 (19:20 +0800)]
b=24088 Quiet unused "path" variable warning.

o=adilger
i=brian
i=panda

9 years agob=24092 build src.rpm for lustre if requested
Brian J. Murrell [Mon, 8 Nov 2010 11:20:49 +0000 (19:20 +0800)]
b=24092 build src.rpm for lustre if requested

Fix a bug where we were not building the lustre SRPM even if
requested.

i=wangyb
i=cliff

9 years agob=19669 update LOV object version only on data change
Vladimir Saveliev [Fri, 5 Nov 2010 15:22:14 +0000 (18:22 +0300)]
b=19669 update LOV object version only on data change

LOV object version is changed only when LOV object content is changed, either
on write or size change (truncate). No more on metadata change like
chmod/chown/chgrp. Needed for HSM.

a=aurelien.degremont
i=mikhail.pershin

9 years agob=20631 grouplock use from MDT for HSM
Vladimir Saveliev [Fri, 5 Nov 2010 15:22:01 +0000 (18:22 +0300)]
b=20631 grouplock use from MDT for HSM

https://bugzilla.lustre.org/attachment.cgi?id=31976

a=jc.lafoucriere
i=oleg.drokin

9 years agob=15599 hsm infrastructure
Vladimir Saveliev [Fri, 5 Nov 2010 15:21:44 +0000 (18:21 +0300)]
b=15599 hsm infrastructure

https://bugzilla.lustre.org/attachment.cgi?id=31961
and wiretest update (https://bugzilla.lustre.org/attachment.cgi?id=31991)

a=jc.lafoucriere
i=andreas.dilger
i=nathan.rutman
i=vladimir.saveliev

9 years agoUpdated for build 07. 2.0.55.0 v2_0_55_0
Terry Rutledge [Fri, 5 Nov 2010 17:11:20 +0000 (10:11 -0700)]
Updated for build 07.

9 years agob=23064 Adopt for new backing_dev_info interface.
Vitaly Fertman [Thu, 4 Nov 2010 20:53:32 +0000 (23:53 +0300)]
b=23064 Adopt for new backing_dev_info interface.

o=yangsheng
i=dmitry

9 years agob=23402 mmp_fini () multiple oss fix
Elena Gryaznova [Tue, 2 Nov 2010 13:55:52 +0000 (16:55 +0300)]
b=23402 mmp_fini () multiple oss fix

i=Yu.Jian

9 years agob=23399 Deadlock probably due to statahead
LiuYing [Tue, 2 Nov 2010 01:18:04 +0000 (09:18 +0800)]
b=23399 Deadlock probably due to statahead

partly revoke patch (id=28310)

o=fanyong
i=tappro
i=wangdi

9 years agob=23289 cleanup unnecessary spinlock dance in ldlm
Jian Yu [Tue, 2 Nov 2010 14:05:34 +0000 (22:05 +0800)]
b=23289 cleanup unnecessary spinlock dance in ldlm

o=liang
i=andreas.dilger
i=oleg.drokin

9 years agob=22281 refactor packaging and versioning
Brian J. Murrell [Tue, 2 Nov 2010 20:14:44 +0000 (04:14 +0800)]
b=22281 refactor packaging and versioning

Configure "--with-dist" switch to optimize configure to do just
enough to enable "make dist" to work.  There is likely more optimizations
that could be done

Provide reasonable defaults for the --with-linux and --with-linux-obj
configure arguments.

The LB_BUILDID macro determines the BUILDID to be used in the resulting
packages.

The LB_DOWNSTREAM_RELEASE macro implements the --with-downstream-release
configure option for downstream packages to provide their own versioning
details

Many changes to support allowing packagers to define a value in the release
string.
- allow callers to tell rpm where to find the kernel source/headers and
  objects.
  - default to /lib/modules/$(uname -r)/build if not specified
    - XXX: this should probably be handled completely in configure
- allow callers to tell rpm what the release string should be set to
  - the spec will default to the value that configure uses currently if
    not specified
  - this allows both the release string used in Lustre and the Release:
    string used in the RPM to be in sync
- in the make rpms target, strip the configure arguments for --with-linux
  and --with-linux-obj and give them to rpmbuild via it's new switches
- also, pass the release defined by the main configure to the RPM spec for
  further consistency
  - XXX: it is questionable whether this in fact has any value
- put the "downstream release" right into the release as it is defined by
  by configure
- bring the ldiskfs spec more into sync with the lustre spec file

version_tag.pl/make_META.pl needed better support for git.
It also needed better support for transporting the information
available in a source code system into the tarball so that when
unpacked, where no sourcecode system was in place, the origins
of the code could still be determined.
Example output:
git: v1_8_3_RC1-g751b802-CHANGED-2.6.18-128.1.1.el5_lustre.1.8.0.50.20090302160821smp
cvs: b1_8-20100429144101-CHANGED-2.6.18-128.1.1.el5_lustre.1.8.0.50.20090302160821smp
When run in a tree from a tarball, the output will be composed
of the same tag and build id (hash or datestamp) as the source the
tarball came from, but will refelect whatever kernel the unpacked
source was configured with.
Rename the tree_status file to META.

There are times when the kernel should be rebuilt for all distros, but
other times only for certain distros.  Make the "all distros" case
easier than having to increment the version in all of the
build/lbuild-$distro files.

i=wangyb
i=mjmac

When the lustre-fix is .0, when a tag is made, it includes the .0 so
just drop it.

Only append the .$BUILDID to the $EXTRA_VERSION when $BUILDID is
actually not empty.  Otherwise we get a trailing ".".

It seems that the regex(7) used in bash's [[ ]] is not terribly
portable across bash versions.  SLES10 expects (, ), and | (at least)
escaped with \.  RHEL5 will accept them either escaped or not.
Ubuntu fails to match if they are escaped.

We need a more consistent test for being a subdir of Lustre.
Differences in how automake passes dirnames made the existing code
unpredictable.

When the tag format is vW_X_Y_Z and lustre-fix (Z) is _0 when a tag
is made, it includes the _0 so just drop it.

i=yangsheng
i=wangyb

The return code from git status seems to be not as one would expect
and it can return both 1 or 0 in a git repository.  So change up and
use git branch instead.
Simplify the logic around this determination as well.

Due to some unknown issue with using rpmbuild with the ldiskfs tarball
causing symbol mismatches, do the right thing instead and build ldiskfs
first and then build lustre with the external ldiskfs.

Right now the method for doing this is a bit hacky since we point lustre
at the ldiskfs rpm BUILD product.  The right fix is to make an
ldiskfs-devel RPM, which is a future enhancement of this work.

This also fixes two bugs with ldiskfs' external module support building.
One is that the external ldiskfs tree check was using a file that only
exists in ext3 and not ext4.  The second is that ldiskfs' configure was
not properly initializing ldiskfs_is_ext4 and instead depending on the
rpmbuild/configure caller to pass flags to tell it to build with ext4.

i=wangyb
i=minh

9 years agob=21610 Changes for 2.6.32 kernel udpate.
yangsheng [Wed, 3 Nov 2010 06:55:01 +0000 (14:55 +0800)]
b=21610 Changes for 2.6.32 kernel udpate.

i=Andreas.Dilger
i=Mikhail.Pershin

9 years agob=20805 rate limit D_NETERR messages
Cliff White [Tue, 2 Nov 2010 17:02:23 +0000 (10:02 -0700)]
b=20805 rate limit D_NETERR messages

i=isaac
i=liang

change CDEBUG(D_NETERROR) to CNETERR in various places. Port of 1_8 to master.
Cleans up klnds, libcfs

9 years agob=18551 remove ENTRY & EXIT from hot hash functions
Jian Yu [Wed, 3 Nov 2010 01:18:07 +0000 (09:18 +0800)]
b=18551 remove ENTRY & EXIT from hot hash functions

o=liang
i=rahul.deshmukh
i=jian.yu

9 years agob=23869 HARD failure mode fixes
Elena Gryaznova [Mon, 1 Nov 2010 16:00:12 +0000 (19:00 +0300)]
b=23869 HARD failure mode fixes

i=Brian.Murrel
i=Wei.G.Li

* facet_failover() has to restart only those affected facets which were UP
  before the node failure
* replay-single tests which use shutdown_facet() && reboot_facet() instead of
  facet_failover() have to take care about the affected facets

9 years agob=24010 lfs fid2path doesn't work for symlinks
Rahul Deshmukh [Mon, 1 Nov 2010 13:23:16 +0000 (18:53 +0530)]
b=24010 lfs fid2path doesn't work for symlinks

o=thomas
i=andreas
i=rahul

9 years agob=24007 obj mode should be covered by MA_INODE.
Eric Mei [Mon, 1 Nov 2010 16:15:02 +0000 (10:15 -0600)]
b=24007 obj mode should be covered by MA_INODE.

r=di.wang
r=vitaly.fertman

9 years agob=23924 LBUG: ASSERTION(hs == exp->exp_lock_hash) failed
LiuYing [Fri, 22 Oct 2010 09:29:37 +0000 (17:29 +0800)]
b=23924 LBUG: ASSERTION(hs == exp->exp_lock_hash) failed

fix the wrong assertion in lprocfs_exp_print_hash()

i=andreas
i=eric.mei

9 years agob=23913 fix "ASSERTION(!cfs_list_empty(&dquot->dq_hash)) failed"
Landen [Tue, 2 Nov 2010 07:02:52 +0000 (15:02 +0800)]
b=23913 fix "ASSERTION(!cfs_list_empty(&dquot->dq_hash)) failed"

1. In an error handling, free_dquot() should be used.
2. Check if quota is on in dqacq_handler()

i=johann
i=panda

9 years agob=23868 fix "sanity-quota test_18c: @@@@@@ FAIL: quotaon failed!"
Landen [Tue, 2 Nov 2010 07:24:48 +0000 (15:24 +0800)]
b=23868 fix "sanity-quota test_18c: @@@@@@ FAIL: quotaon failed!"

i=johann
i=mikhail.pershin

9 years agob=23289 skip big loop in mds_lov_dump_objids if D_INFO is not set
Jian Yu [Tue, 2 Nov 2010 06:57:41 +0000 (14:57 +0800)]
b=23289 skip big loop in mds_lov_dump_objids if D_INFO is not set

o=liang
i=andreas.dilger
i=vitaly.fertman

9 years agob=23289 remove mc_waitq
Jian Yu [Tue, 2 Nov 2010 07:05:29 +0000 (15:05 +0800)]
b=23289 remove mc_waitq

Remove legacy.

o=liang
i=mikhail.pershin
i=di.wang

9 years agob=23289 don't need call mdd_links_get for the new created file
Jian Yu [Tue, 2 Nov 2010 07:12:41 +0000 (15:12 +0800)]
b=23289 don't need call mdd_links_get for the new created file

We know it's the first link for new created file, so we don't need to
call mdd_links_get, which is an xattr_get and overhead, while creating file.

o=liang
i=mikhail.pershin
i=di.wang

9 years agob=23289 disable pdo locks in MDD
Jian Yu [Tue, 2 Nov 2010 06:46:15 +0000 (14:46 +0800)]
b=23289 disable pdo locks in MDD

MDT has already taken care of locks.

o=liang
i=mikhail.pershin
i=alex.zhuravlev

9 years agob=22546 fix errors in test_18c
Landen [Tue, 2 Nov 2010 07:29:03 +0000 (15:29 +0800)]
b=22546 fix errors in test_18c

i=johann
i=panda

9 years agob=20997 passthrough obd_force from lmv to mdc
Mikhail Pershin [Fri, 22 Oct 2010 14:32:23 +0000 (18:32 +0400)]
b=20997 passthrough obd_force from lmv to mdc

i=zam,vs

9 years agoUpdated for 2.1.0 build 06. 2.0.54.0 v2_0_54_0
Terry Rutledge [Fri, 22 Oct 2010 21:13:48 +0000 (14:13 -0700)]
Updated for 2.1.0 build 06.

9 years agob=20744 return ofed version to 1.5.1
Brian J. Murrell [Fri, 22 Oct 2010 11:31:43 +0000 (07:31 -0400)]
b=20744 return ofed version to 1.5.1

Since the patch from bug 20744 that updates the client to the
2.6.16.60-0.69.1 kernel has not landed yet, continue to build the
OFA OFED 1.5.1 until such time as the new kernel support has landed.

9 years agob=22386 disallow wrong conf_param options
Andrew Perepechko [Fri, 22 Oct 2010 00:23:19 +0000 (04:23 +0400)]
b=22386 disallow wrong conf_param options

i=Johann Lombardi
i=Nathan Rutman

9 years agob=18551 adapt lu_site to new cfs_hash
Jian Yu [Thu, 21 Oct 2010 05:33:16 +0000 (13:33 +0800)]
b=18551 adapt lu_site to new cfs_hash

o=zhen.liang
i=eric.mei
i=mikhail.pershin

9 years agob=22514 Update RHEL5.5 & OEL5.5 to latest kernel.
yangsheng [Thu, 21 Oct 2010 08:23:42 +0000 (16:23 +0800)]
b=22514 Update RHEL5.5 & OEL5.5 to latest kernel.

--RHEL5 2.6.18-194.17.1.el5.
  --OEL5  2.6.18-194.17.1.0.1.el5.
  --Switch using 'inkernel' OFED stack.
  --Build fixes for ppc64 & ia64.

9 years agob=19486 fix a typo in previous patch.
Eric Mei [Wed, 20 Oct 2010 23:57:12 +0000 (17:57 -0600)]
b=19486 fix a typo in previous patch.

r=di.wang

9 years agob=22417 remove obd arg from ptlrpc_start_thread
Alexander.Zarochentsev [Tue, 19 Oct 2010 16:34:07 +0000 (20:34 +0400)]
b=22417 remove obd arg from ptlrpc_start_thread

i=Mikhail.Pershin
i=Andew.Perepechko

9 years agob=20744 add changelog entry
Johann Lombardi [Tue, 19 Oct 2010 02:01:36 +0000 (22:01 -0400)]
b=20744 add changelog entry

o=johann
i=brian

9 years agob=20744 set arch correctly for ofed-devel
Brian J. Murrell [Tue, 19 Oct 2010 02:01:35 +0000 (22:01 -0400)]
b=20744 set arch correctly for ofed-devel

Oh boy, more fallout from our not building i586 kernels for sles.
If we are building lustre servers and NOT rebuilding the Suse ofed
RPM, then we need to look for the i586 ofed-devel package.

i=yangsheng
i=yibin.wang

9 years agob=20744 remedial fix for inkernel ofed handling
Vitaly Fertman [Thu, 21 Oct 2010 23:27:19 +0000 (03:27 +0400)]
b=20744 remedial fix for inkernel ofed handling

o=brian
i=yangsheng
i=yibin.wang

It seems that the patch to add inkernel handling for SLES10's OFED introduced a
bug for other inkernel uses.  This fixes that bug.

9 years agob=20744 hacks for Novell bug 639581
Brian J. Murrell [Tue, 19 Oct 2010 02:01:33 +0000 (22:01 -0400)]
b=20744 hacks for Novell bug 639581

i=mjmac
i=wangyb

9 years agob=20744 hacks for Novell bug 578796
Brian J. Murrell [Tue, 19 Oct 2010 02:01:32 +0000 (22:01 -0400)]
b=20744 hacks for Novell bug 578796

i=mjmac
i=wangyb

These hacks are needed to deal with Novell bug 578796.