Whamcloud - gitweb
fs/lustre-release.git
16 years agoUpdated ChangeLog for the next release, 1.6.8.
stinson1947 [Fri, 12 Dec 2008 01:06:14 +0000 (01:06 +0000)]
Updated ChangeLog for the next release, 1.6.8.

16 years agoBranch b1_6
anserper [Thu, 11 Dec 2008 17:47:26 +0000 (17:47 +0000)]
Branch b1_6
b=17938
i=Vitaly Fertman
i=Johann Lombardi

fdatasync crash test

16 years agoBranch b1_6
johann [Thu, 11 Dec 2008 14:51:58 +0000 (14:51 +0000)]
Branch b1_6
b=16522
i=adilger
i=wangdi

enable OBD_CONNECT_MDT flag when connecting from the MDS so that
the OSTs know that the MDS "UUID" can be reused for the same export
from a different NID, so we do not need to wait for the export to be
evicted.

16 years agoBranch b1_6
johann [Thu, 11 Dec 2008 14:08:02 +0000 (14:08 +0000)]
Branch b1_6
b=12596
i=grev
i=rread

fix file path in compare_stripe_info

16 years agoBranch b1_6
tianzy [Thu, 11 Dec 2008 08:11:43 +0000 (08:11 +0000)]
Branch b1_6
add ll_recover_lost_found_objs to lustre manual
b=16911

16 years agoclient_disconnect_export vs connect request race.
shadow [Thu, 11 Dec 2008 05:52:47 +0000 (05:52 +0000)]
client_disconnect_export vs connect request race.
if client will evicted at this time - we start invalidate
thread without reference to import and import can be freed
at same time.

Branch b1_6
b=17802
i=nathan
i=johann

16 years agoSet MDS_BFLAG_EXT_FLAGS on MDS in open call path.
girish [Wed, 10 Dec 2008 17:59:24 +0000 (17:59 +0000)]
Set MDS_BFLAG_EXT_FLAGS on MDS in open call path.
b=17465
i=adilger
i=johann

16 years agoFix lost unlinked ev in reply_in_callback (Isaac)
girish [Wed, 10 Dec 2008 17:12:59 +0000 (17:12 +0000)]
Fix lost unlinked ev in reply_in_callback (Isaac)
b=17052
i=nathan.rutman
i=eric.mei

16 years agob=17827
grev [Wed, 10 Dec 2008 09:36:24 +0000 (09:36 +0000)]
b=17827
i=H.Huang
run lsof fix

16 years agob=17495
zhanghc [Tue, 9 Dec 2008 09:05:11 +0000 (09:05 +0000)]
b=17495

in osc_precreated, check OSCC_FLAG_NOSPC before
checking last_id/next_id, remove unnecessary
spin_lock on oscc_lock at the same time.

i=Andreas
i=Nathan.Rutman

16 years agofix incorrect detecting readv/writev.
shadow [Tue, 9 Dec 2008 07:15:29 +0000 (07:15 +0000)]
fix incorrect detecting readv/writev.

Branch b1_6
b=17918
i=green

16 years agob=17917
anserper [Tue, 9 Dec 2008 03:07:13 +0000 (03:07 +0000)]
b=17917
i=Alexander Zarochentsev
i=Johann Lombardi

add a trailing zero to avoid oops from proc entry creation routines

16 years agobranch: b1_6
ericm [Mon, 8 Dec 2008 19:20:38 +0000 (19:20 +0000)]
branch: b1_6
fix calculation of sleeping time.
b=13659
r=nathan

16 years agoBranch b1_6
johann [Mon, 8 Dec 2008 15:51:28 +0000 (15:51 +0000)]
Branch b1_6
b=17379

Print value of symlink depth before checking it.

16 years agoBranch b1_6
fanyong [Sat, 6 Dec 2008 03:29:47 +0000 (03:29 +0000)]
Branch b1_6
b=15691
i=h.huang
i=johann

If there are too many locks on client-side, then some locks taken by
statahead maybe dropped automatically before the real "revalidate"
using them. Under such case the statahead should fail.

16 years agoBranch b1_6
huanghua [Sat, 6 Dec 2008 01:01:01 +0000 (01:01 +0000)]
Branch b1_6

i=tappro
i=yury.umanets
i=adilger

wait a bit before client umount if vfsmnt still has references.

16 years agob=17661
grev [Sat, 6 Dec 2008 00:56:52 +0000 (00:56 +0000)]
b=17661
i=Brian
run mpi tests as MPI_USER

16 years agob=17758
yury [Fri, 5 Dec 2008 11:43:13 +0000 (11:43 +0000)]
b=17758
r=shadow,johann

- Do NOT call server_deregister_mount() here. This leads to
    inability cleanup cleanly and free lsi and other stuff when
    mgs calls server_put_mount() in error handling case.

16 years agoBranch b1_6
adilger [Thu, 4 Dec 2008 18:43:06 +0000 (18:43 +0000)]
Branch b1_6
Changing build version.

16 years agob=17747
grev [Thu, 4 Dec 2008 17:48:11 +0000 (17:48 +0000)]
b=17747
i=Nathan
run_one: print PASS/FAIL depending on new TEST_FAILED var

16 years agoBranch b1_6
vitaly [Wed, 3 Dec 2008 18:44:59 +0000 (18:44 +0000)]
Branch b1_6
b=17644
i=green
i=adilger

send 1 extra rpc in flight if this is a high priority request

16 years agob=17853
grev [Wed, 3 Dec 2008 18:37:17 +0000 (18:37 +0000)]
b=17853
i=Adilger
check_config fix for NETTYPE=ptl

16 years agoBranch b1_6
vitaly [Wed, 3 Dec 2008 17:18:08 +0000 (17:18 +0000)]
Branch b1_6
b=17748
i=grev
i=adilger

the sanityN test issue is fixed

16 years agoBranch b1_6
anserper [Wed, 3 Dec 2008 16:55:24 +0000 (16:55 +0000)]
Branch b1_6
b=17770
i=Elena Gryaznova

move the cleanup/setup test to the end of the series

16 years agoBranch b1_6
anserper [Wed, 3 Dec 2008 16:55:22 +0000 (16:55 +0000)]
Branch b1_6
i=Elena Gryaznova
avoid using quota_usr/quota_2usr groups

16 years agoBranch b1_6
anserper [Wed, 3 Dec 2008 16:33:25 +0000 (16:33 +0000)]
Branch b1_6
b=17371
i=Andreas Dilger

move the cleanup/setup test to the end of the series

16 years agob=16432
zhanghc [Wed, 3 Dec 2008 15:50:10 +0000 (15:50 +0000)]
b=16432

fix mgs_setparam, which will return -EINVAL when param
related to llite(PARAM_LLITE) is set by MDT or OST

i=johann
i=nathan.rutman

16 years agob=13584
grev [Wed, 3 Dec 2008 15:48:19 +0000 (15:48 +0000)]
b=13584
i=Scjody
test_99a fix: use $TMP as working dir

16 years agob=16432
zhanghc [Wed, 3 Dec 2008 15:39:17 +0000 (15:39 +0000)]
b=16432

fix mgs_setparam, which will return -EINVAL when param
related to llite is set by MDT or OST

i=johann
i=nathan.rutman

16 years agoshrink LOV EAs before replying
shadow [Wed, 3 Dec 2008 07:26:58 +0000 (07:26 +0000)]
shrink LOV EAs before replying

Branch b1_6
b=16693
i=shadow
i=johann

16 years agodon't resend llog cancels,
shadow [Wed, 3 Dec 2008 04:33:49 +0000 (04:33 +0000)]
don't resend llog cancels,
fix resend requests for ldlm imports.

Branch b1_6
b=17695
i=umka
i=tappro

16 years ago b=17813
yury [Tue, 2 Dec 2008 12:20:22 +0000 (12:20 +0000)]
 b=17813
  r=shadow
  - take into account the limit on the host because the higher limit the longer it takes to kill some locks.

16 years agob=16897
grev [Mon, 1 Dec 2008 21:27:37 +0000 (21:27 +0000)]
b=16897
i=Adilger
i=Tappro
test_70b fix: use do_nodes instead of loop to run rundbench

16 years agob=17696
grev [Mon, 1 Dec 2008 09:30:44 +0000 (09:30 +0000)]
b=17696
i=Adilger
skip liblustre for different client/mds versions

16 years agoBranch b1_6
tianzy [Mon, 1 Dec 2008 03:52:08 +0000 (03:52 +0000)]
Branch b1_6
fix an error in the test_18b of sanity-quota.sh
b=17832
i=tianzy

16 years agobranch=b1_6
zhanghc [Sun, 30 Nov 2008 15:44:59 +0000 (15:44 +0000)]
branch=b1_6

b=17031

during refreshing locks waiting its I/O to complete,
take current service time into account, not only using
the timeout gotten by ldlm_get_enqueue_timeout

i=Andreas
i=Nathan.Rutman

16 years agob=17761
grev [Fri, 28 Nov 2008 18:42:53 +0000 (18:42 +0000)]
b=17761
i=Adilger
i=Johann
test_6 fix

16 years agoBranch b1_6
anserper [Fri, 28 Nov 2008 10:17:29 +0000 (10:17 +0000)]
Branch b1_6
b=17772
i=Johann Lombardi
i=ZhiYong Tian
lov_quota_check shall return an error when a target goes offline

16 years agoBranch b1_6
tianzy [Fri, 28 Nov 2008 07:01:06 +0000 (07:01 +0000)]
Branch b1_6
let dqacq_in_flight() hold lock for qunit
b=16890
i=johann
i=panda

16 years agoBranch b1_6
johann [Thu, 27 Nov 2008 10:56:24 +0000 (10:56 +0000)]
Branch b1_6
b=12596
i=grev
i=adilger

check striping after setstripe in recovery-small test 18*.
move get_stripe_info() to t-f.

16 years agoBranch b1_6
adilger [Thu, 27 Nov 2008 05:33:11 +0000 (05:33 +0000)]
Branch b1_6
If an expected error is returned from llapi_ping() for an inactive device
print a more useful message.
b=16208

16 years agoBranch b1_6
yangsheng [Wed, 26 Nov 2008 11:10:01 +0000 (11:10 +0000)]
Branch b1_6
b=17153
i=johann, adilger

Remove 2.4 compatibility.

16 years agoBranch b1_6
tianzy [Wed, 26 Nov 2008 08:23:29 +0000 (08:23 +0000)]
Branch b1_6
fix the false qunit_put in qctxt_wait_pending_dqacq()
b=17794
i=tianzy
i=panda

16 years agoRestore the changes I made yesterday which broke build because I used a stale kernel.
jxiong [Wed, 26 Nov 2008 07:15:54 +0000 (07:15 +0000)]
Restore the changes I made yesterday which broke build because I used a stale kernel.

16 years agoBranch b1_6
anserper [Tue, 25 Nov 2008 23:04:43 +0000 (23:04 +0000)]
Branch b1_6
b=17371
i=Elena
fail each time cleanup/setup went wrong

16 years agoCleanup ChangeLog comments.
adilger [Tue, 25 Nov 2008 21:42:38 +0000 (21:42 +0000)]
Cleanup ChangeLog comments.

16 years agob=17326
grev [Tue, 25 Nov 2008 20:13:26 +0000 (20:13 +0000)]
b=17326
i=Brian
load_modules fn modprobe.d fix

16 years agob=17477
grev [Tue, 25 Nov 2008 19:27:18 +0000 (19:27 +0000)]
b=17477
i=Huang Hua
check_config fn default network type fix

16 years agoBranch b1_6
yangsheng [Tue, 25 Nov 2008 12:31:56 +0000 (12:31 +0000)]
Branch b1_6
b=17630

Add a comment to explain the change.

16 years agoBranch b1_666666
vs [Tue, 25 Nov 2008 11:24:04 +0000 (11:24 +0000)]
Branch b1_666666
b=17359
i=adilger,bzzz
use time obtained from a client to update inode timestamps on mds
mds_reint_link, mds_reint_unlink and mds_reint_rename updated
inode timestamps with local server time

16 years agoBranch b1_6
johann [Tue, 25 Nov 2008 09:53:41 +0000 (09:53 +0000)]
Branch b1_6
i=umka
i=panda
b=17611

don't override lcm->lcm_name

16 years ago- make margin 10 sec more in 124a to let client chance to kill some locks
yury [Tue, 25 Nov 2008 08:38:36 +0000 (08:38 +0000)]
- make margin 10 sec more in 124a to let client chance to kill some locks

16 years agorevert on chunk from patch, due startup race.
shadow [Tue, 25 Nov 2008 07:54:03 +0000 (07:54 +0000)]
revert on chunk from patch, due startup race.

Branch b1_6
b=16492

16 years agoBranch b1_6
tianzy [Tue, 25 Nov 2008 05:59:05 +0000 (05:59 +0000)]
Branch b1_6
fix "should take longer" problem of test_18 of sanity-quota.sh
b=17773
i=johann
i=panda

16 years agoBranch b1_6
tianzy [Tue, 25 Nov 2008 05:52:22 +0000 (05:52 +0000)]
Branch b1_6
change target_handle_dqacq_callback() error handling
b=16890
i=johann
i=panda

16 years agoFixed the raid5 patches.
jxiong [Tue, 25 Nov 2008 03:53:38 +0000 (03:53 +0000)]
Fixed the raid5 patches.
- rebuild policy for rhel5 .21 kernel
- soft lockups fixed

b=17084
r=adilger,jay

16 years agoBranch b1_6
yangsheng [Tue, 25 Nov 2008 03:11:08 +0000 (03:11 +0000)]
Branch b1_6
b=17786

i=adilger, huanghua

Initialize the request.

16 years agob=17747
grev [Mon, 24 Nov 2008 22:22:27 +0000 (22:22 +0000)]
b=17747
i=Tappro
FAIL_ON_ERROR=false fix: force suits exit 1 if some tests failed

16 years agob=17631
yury [Mon, 24 Nov 2008 15:13:11 +0000 (15:13 +0000)]
b=17631

- fix previous wrong commit in part related to changes in ptlrpc_abort_bulk()

16 years agoDrop slow OSCs if we can, but not for requested start idx.
shadow [Mon, 24 Nov 2008 12:20:11 +0000 (12:20 +0000)]
Drop slow OSCs if we can, but not for requested start idx.
This means "if OSC is slow and it is not the requested
start OST, then it can be skipped, otherwise skip it only
if it is inactive/recovering/out-of-space.

Branch b1_6
b=16081
i=shadow
i=green

16 years agoFixes related to mount failure path cleanup
deshmukh [Mon, 24 Nov 2008 07:28:21 +0000 (07:28 +0000)]
Fixes related to mount failure path cleanup

b=17752
i=umka
i=shadow

16 years agoBranch b1_6
yangsheng [Mon, 24 Nov 2008 03:51:49 +0000 (03:51 +0000)]
Branch b1_6
b=17630

i=green, adilger

Disable NFS export when the THREAD_SIZE < 8192.

16 years agob=17631
yury [Sun, 23 Nov 2008 20:38:31 +0000 (20:38 +0000)]
b=17631
r=panda,shadow

- fixes possible sync long bulk unlink in ptlrpcd which would lead to assertion in forced umount time. Basically the fix is identical to 17310 where we move req to special phase UNREGISTERING and go processing other rpcs until bulk unlink is done;

- in sync bulk and reply unlink we check for wakeup condition every 1 sec to act quckly if unlink come instead of doing it every 20 sec as before.

16 years agob=17750
yury [Sun, 23 Nov 2008 12:40:34 +0000 (12:40 +0000)]
b=17750
r=shadow,deen

- fixes writing cookie beyond of llcd boundaries.

16 years ago- commit missed bit from previous commit.
yury [Sun, 23 Nov 2008 12:32:38 +0000 (12:32 +0000)]
- commit missed bit from previous commit.

16 years agob=17690
yury [Sun, 23 Nov 2008 12:14:50 +0000 (12:14 +0000)]
b=17690
r=shadow

- fixes in replay-single.sh test_59b

16 years agob=17751
yury [Sun, 23 Nov 2008 11:54:00 +0000 (11:54 +0000)]
b=17751
r=grev

- fixes and cleanups in test_124a from sanity.sh

16 years agob=17735
grev [Fri, 21 Nov 2008 21:54:16 +0000 (21:54 +0000)]
b=17735
i=Yury.Umanets
check_mem_leak fn fix: use echo instead of log fn

16 years agoBranch b1_6
yangsheng [Fri, 21 Nov 2008 16:20:31 +0000 (16:20 +0000)]
Branch b1_6
b=17201

i=shadow, bobijam

Update to RHEL5 kernel-2.6.18-92.1.17.el5.

16 years agoBranch b1_6
yangsheng [Fri, 21 Nov 2008 15:27:42 +0000 (15:27 +0000)]
Branch b1_6
b=16208

i=adilger, johann

Add utility for showing mounted hosts

16 years agoBranch b1_6
vs [Thu, 20 Nov 2008 23:07:38 +0000 (23:07 +0000)]
Branch b1_6
b=17132
i=adilger

Use raid5/6 rhel5 improvements

16 years agoBranch b1_6
vitaly [Thu, 20 Nov 2008 21:26:57 +0000 (21:26 +0000)]
Branch b1_6
b=16129
i=adilger
i=green

- a high priority request list is added into service;
- once a lock is canceled, all the IO requests, including coming
ones, under this lock, are moved into this list;
- PING is also added into this list;
- once a lock cancel timeout occurs, the timeout is prolonged
if there is an IO rpc under this lock;
- another request list is added into the export, used to speed up
the rpc-lock matching.

16 years agoBranch b1_6
fanyong [Thu, 20 Nov 2008 05:55:07 +0000 (05:55 +0000)]
Branch b1_6
b=16947
i=h.huang
i=yury.umanets

Hold lli_lock when access lli_sai to prevent NULL pointer.

16 years agoBranch b1_6
anserper [Thu, 20 Nov 2008 01:00:20 +0000 (01:00 +0000)]
Branch b1_6
b=17371
i=Johann Lombardi

testcase for 17371

16 years agob=17477
grev [Wed, 19 Nov 2008 19:00:07 +0000 (19:00 +0000)]
b=17477
i=Yury.Umanets
run acc-sm:formatall() only if forced

16 years agob=17477
grev [Wed, 19 Nov 2008 18:02:08 +0000 (18:02 +0000)]
b=17477
i=Adilger
force replay-dual to check and mount MOUNT2

16 years agob=17686
yury [Wed, 19 Nov 2008 09:14:20 +0000 (09:14 +0000)]
b=17686
r=panda,shadow

- fixes race in ptlrpcd which leads to busy import and obd;
- cleanups and debugs in llcd code.

16 years agob=17653
grev [Wed, 19 Nov 2008 08:52:15 +0000 (08:52 +0000)]
b=17653
i=Adilger
test_21c fix: restore config

16 years agofix handle ost additional correctly
shadow [Wed, 19 Nov 2008 06:02:20 +0000 (06:02 +0000)]
fix handle ost additional correctly

Branch b1_6
b=16492
i=umka
i=tappro

16 years agoBranch b1_6
bobijam [Wed, 19 Nov 2008 01:39:07 +0000 (01:39 +0000)]
Branch b1_6
b=16992
o=johann
i=oleg.drokin (green)
i=zhenyu.xu (bobijam)

During ll_intent_lock(), server looks up parent and child, lock them, between these events parent could be deleted, then vfs_create may_access() fails with -ENOENT.

Then client intent disposition got DISP_OPEN_CREATE | DISP_LOOKUP_NEG | DISP_LOOKUP_EXECD | DISP_IT_EXECD, and the request got double free.

Solution: Clear DISP_ENQ_COMPLETE when we are going to release the intent (request cannot be reused anyway)

16 years agoBranch b1_6
anserper [Tue, 18 Nov 2008 03:43:31 +0000 (03:43 +0000)]
Branch b1_6
b=12433
i=Oleg Drokin
i=Yury Umanets

fix the message about imp_inval

16 years agoBranch b1_6
tianzy [Mon, 17 Nov 2008 06:47:30 +0000 (06:47 +0000)]
Branch b1_6
decay qos ost/oss penalties if MDS is not creating objects
i=nathan
i=johann

16 years agoBranch b1_6
tianzy [Mon, 17 Nov 2008 06:27:29 +0000 (06:27 +0000)]
Branch b1_6
fix lov_brw_check() calls lov_stripe_intersects() with incorrect parameter.
written by nikita

16 years agoBranch b1_6
tianzy [Mon, 17 Nov 2008 06:19:11 +0000 (06:19 +0000)]
Branch b1_6
fix the error handling on quota slaves
i=johann
i=panda

16 years agoBranch b1_6
adilger [Sat, 15 Nov 2008 08:34:09 +0000 (08:34 +0000)]
Branch b1_6
Remove trailing whitespace.

16 years agob=16488
grev [Fri, 14 Nov 2008 18:51:39 +0000 (18:51 +0000)]
b=16488
i=Oleg.Drokin
RACER acc-sm test suit

16 years agob=17122
grev [Fri, 14 Nov 2008 10:30:09 +0000 (10:30 +0000)]
b=17122
i=Adilger
skip sanity test_100 for NETTYPE != tcp

16 years agob=17479
yury [Thu, 13 Nov 2008 09:04:45 +0000 (09:04 +0000)]
b=17479
r=adilger,behlendorf1

- avoid div/mod in lustre_hash code

16 years agoBranch b1_6
tianzy [Thu, 13 Nov 2008 08:09:26 +0000 (08:09 +0000)]
Branch b1_6
fix lquota.ko fails to install with --disable-liblustre used
b=17620
i=johann
i=brian

16 years agob=16823
green [Thu, 13 Nov 2008 03:12:23 +0000 (03:12 +0000)]
b=16823
r=shadow,adilger

Lift 4G limit on stripe_size*stripe_count
4G limit on stripe_size remains in place, though.

16 years agob=17634
grev [Wed, 12 Nov 2008 21:08:20 +0000 (21:08 +0000)]
b=17634
i=Yury.Umanets
insanity cleanup (remove dup fn, sync with HEAD t-f)

16 years agob=17310
yury [Wed, 12 Nov 2008 18:43:33 +0000 (18:43 +0000)]
b=17310
r=shadow,vitaly
- correct check for phase in ptlrpc_expired_set() and couple of other places.

16 years agob=16488
grev [Wed, 12 Nov 2008 16:08:27 +0000 (16:08 +0000)]
b=16488
i=Oleg.Drokin
new runracer script

16 years agob=17037
yury [Wed, 12 Nov 2008 15:40:17 +0000 (15:40 +0000)]
b=17037
r=tappro,wangdi

- fixes ost cleanup issue due to missed llcd_put() in the case ost does not receive disconnect from mds;

  - do not sleep on hanging llcd. Instead assert on it _after_ stopping recov_thread's ptlrpcd which should kill any remeining llcds;

  - fixes and cleanups, comments.

16 years agob=17555
grev [Wed, 12 Nov 2008 11:11:20 +0000 (11:11 +0000)]
b=17555
i=Adilger
use current config instead of reformat fs to have a single ost

16 years agobranch b1_6
zhanghc [Wed, 12 Nov 2008 02:57:47 +0000 (02:57 +0000)]
branch b1_6

b=17505

remove "mfd" from "cloing_list" for the "mfd" will be freed in mds_mfd_close

i=robert.read

16 years agoBranch b1_6
huanghua [Wed, 12 Nov 2008 02:52:18 +0000 (02:52 +0000)]
Branch b1_6
b=17602
i=yury.umanets
i=tappro

use 1.8/2.0 compatible MDT config for 1.6 mds, easy to upgrade.

16 years agob=16551
grev [Tue, 11 Nov 2008 21:54:05 +0000 (21:54 +0000)]
b=16551
i=Adilger
conf-sanity test_32* fix to not be skipped for remote setup

16 years agoBranch b1_6
yangsheng [Tue, 11 Nov 2008 06:12:49 +0000 (06:12 +0000)]
Branch b1_6
b=17374
i=shadow, bobijam

kernel update for sles9 2.6.5-7.314.

16 years agobranch b1_6
zhanghc [Tue, 11 Nov 2008 06:11:35 +0000 (06:11 +0000)]
branch b1_6

b=17176

fixed a bug in 14774 patch -- compare peer's nid instead of self's nid
in ptlrpc_connection during select failover MDS/OST nodes

i=deen