Whamcloud - gitweb
pschwan [Sun, 6 Oct 2002 00:24:44 +0000 (00:24 +0000)]
b=618962
- Temporary fix for 618962 by handling OPEN intents in revalidate
adilger [Sat, 5 Oct 2002 14:11:23 +0000 (14:11 +0000)]
Move debugging message outside spinlock.
adilger [Sat, 5 Oct 2002 13:56:27 +0000 (13:56 +0000)]
Dump debug log at cleanup time, in case of errors.
adilger [Sat, 5 Oct 2002 13:54:58 +0000 (13:54 +0000)]
Document the --dump option properly.
adilger [Sat, 5 Oct 2002 13:54:25 +0000 (13:54 +0000)]
Dump debug log at end of test in case of error.
adilger [Sat, 5 Oct 2002 13:42:39 +0000 (13:42 +0000)]
Two fixed:
- refcounts on MDS module fixed, and problems with bogus exports
- unsafe list walking on the open file list of the MDS could cause
random memory problems
- added real handles for open files on the MDS to ensure we don't
dereference bogus pointers - mike you still need to clean up open
files on the MDS for disconnect
adilger [Sat, 5 Oct 2002 09:45:20 +0000 (09:45 +0000)]
Comment out unused variables to quiet compiler warnings.
Remove dead code (it can always be had from CVS again).
Minor formatting.
adilger [Sat, 5 Oct 2002 04:31:45 +0000 (04:31 +0000)]
Fix breakage in lconf.
I guess when you are doing recovery, you never shut down cleanly...
shaver [Sat, 5 Oct 2002 00:56:45 +0000 (00:56 +0000)]
Unlock a lock that exists, fixing the build. (Thanks to Hari for pointing this
out; I don't know how I built before.)
adilger [Fri, 4 Oct 2002 23:42:33 +0000 (23:42 +0000)]
Add verbosity flag.
shaver [Fri, 4 Oct 2002 23:07:07 +0000 (23:07 +0000)]
- Add support for umount -f: it invalidates all in-flight and delayed requests
and marks the connection(s) as invalid, so that future operations fail.
- Make the NEWCONN ioctl clear the CONN_INVALID flag.
- Remove a handful of unused members from ptlrpc_request.
- More informative and uniform req-failure (timeout, recovery, umount -f)
reporting.
- Update runfailure-net a little; more work needed here.
- Add --force / -f to lconf for "umount -f" umounting.
adilger [Fri, 4 Oct 2002 23:03:27 +0000 (23:03 +0000)]
Add openclose into the mix, to test EA creation and removal on the MDS.
Seems to have payed off rather quickly.
Increment thread counts by 5 each time, and exit at < 196 threads (which
caused a problem for shmem somehow.
pschwan [Fri, 4 Oct 2002 21:32:56 +0000 (21:32 +0000)]
- Make lustre_lite.h userspace-include-safe
- Make openclose.c do an ioctl a-la IOR for more testing
eeb [Fri, 4 Oct 2002 19:24:28 +0000 (19:24 +0000)]
. request buffering parameters as checked out on MCR
. #warning + reduced strength assertion on unlined_me
. fixed dynamically allocated IOVs getting freed too early
adilger [Fri, 4 Oct 2002 17:20:42 +0000 (17:20 +0000)]
Add BKL for obdfilter also.
pschwan [Fri, 4 Oct 2002 14:29:59 +0000 (14:29 +0000)]
Make ldlm_cli_cancel_unused not return an error for invalid resource. This is
a very common case, where a file is opened and then closed without locks having
been taken.
adilger [Fri, 4 Oct 2002 10:36:34 +0000 (10:36 +0000)]
Minor cleanups.
adilger [Fri, 4 Oct 2002 09:54:32 +0000 (09:54 +0000)]
Update journal_{start,stop} to have the BKL around them. This fixes a
number of serious bugs that were seen in the MDS.
adilger [Fri, 4 Oct 2002 08:45:36 +0000 (08:45 +0000)]
A script along the lines of runregression-net.sh and runregression-brw.sh,
which is rather simple-minded in its operations, but does lots of them.
adilger [Fri, 4 Oct 2002 08:00:09 +0000 (08:00 +0000)]
If we are passed a bad connection handle don't oops.
adilger [Fri, 4 Oct 2002 07:59:24 +0000 (07:59 +0000)]
Don't try to disconnect connections we don't have.
adilger [Fri, 4 Oct 2002 05:20:52 +0000 (05:20 +0000)]
Program to create/destroy many files.
adilger [Fri, 4 Oct 2002 05:00:46 +0000 (05:00 +0000)]
Add a "threads" parameter too openclose, so you can easily launch lots of
threads.
thantry [Fri, 4 Oct 2002 00:05:45 +0000 (00:05 +0000)]
Removed strtok, replaced with strsep(threadsafe) for 2.5 kernel
adilger [Thu, 3 Oct 2002 22:45:55 +0000 (22:45 +0000)]
Remove all of the single-threaded-single-page read verifications.
adilger [Thu, 3 Oct 2002 22:29:57 +0000 (22:29 +0000)]
Pack inode data into reply for mds_open().
thantry [Thu, 3 Oct 2002 22:25:54 +0000 (22:25 +0000)]
Added more debug to LProc cleanup
adilger [Thu, 3 Oct 2002 22:15:56 +0000 (22:15 +0000)]
Helper functions for OST file handles.
thantry [Thu, 3 Oct 2002 21:49:14 +0000 (21:49 +0000)]
Updated version of the header file for SNMP first code drop
thantry [Thu, 3 Oct 2002 21:48:26 +0000 (21:48 +0000)]
Added SNMP namespace minus the hierarchical info
thantry [Thu, 3 Oct 2002 21:01:43 +0000 (21:01 +0000)]
First set of changes for SNMP, allows reading of UUID
pschwan [Thu, 3 Oct 2002 20:00:29 +0000 (20:00 +0000)]
- rewrap lustre_lib, so it's readable in an 80-column emacs
- add a touch more debugging to extent locking
shaver [Thu, 3 Oct 2002 19:45:04 +0000 (19:45 +0000)]
- Fix dumb iloop bug in dump_connection_list by properly using the list_head
API. Sorry.
adilger [Thu, 3 Oct 2002 19:35:25 +0000 (19:35 +0000)]
Don't hold i_sem for invalidate_inode_pages() in ll_lock_callback() (I
could have sworn I committed this yesterday).
Don't put obdos on the stack if we can avoid it.
adilger [Thu, 3 Oct 2002 19:26:02 +0000 (19:26 +0000)]
Handle error case when mdc_readpage() fails.
adilger [Thu, 3 Oct 2002 19:09:28 +0000 (19:09 +0000)]
Add client.c and target.c in place of l_net.c
pschwan [Thu, 3 Oct 2002 19:06:32 +0000 (19:06 +0000)]
b=617434
shaver's fix for the recovery NULL deref
adilger [Thu, 3 Oct 2002 17:55:15 +0000 (17:55 +0000)]
More makefile fixes.
adilger [Thu, 3 Oct 2002 17:37:05 +0000 (17:37 +0000)]
Don't use relative paths for files as it breaks automake.
pschwan [Thu, 3 Oct 2002 17:27:28 +0000 (17:27 +0000)]
- Add extent information to LDLM_DEBUG
- When we receive a completion AST for an extent lock, copy the new extent
out of the lock_desc
- fix unbalanced ENTRY
shaver [Thu, 3 Oct 2002 17:02:19 +0000 (17:02 +0000)]
- Add some more verbose logging of the cases that get clients into recovery.
- l_wait_event returns -EINTR, not -ERESTARTSYS.
shaver [Thu, 3 Oct 2002 16:45:43 +0000 (16:45 +0000)]
- Add mount-epoch checking in most (all?) inode-using operations, in preparation
for inode poisoning during failed recovery.
- Chain superblocks off the associated connections, because they were the only
things we weren't yet keeping in a list or three. (Also, so we can find the
right superblocks to poison during recovery.)
- Split l_net.c into client.c and target.c to avoid OST/MDS dependencies on
llite, and soothe my soul.
- Build client.c and target.c as sources from other directories as needed,
instead of symlinking them. (Someone could do this with the rest of the
symlinked stuff, or I'll get to it later.)
adilger [Thu, 3 Oct 2002 05:37:28 +0000 (05:37 +0000)]
Fix problems if ext3 and extN have different sb_info structs.
pschwan [Thu, 3 Oct 2002 02:34:05 +0000 (02:34 +0000)]
b=617434
Add some bulletproofing to sync_io_timeout, to try to fix this.
gord-fig [Thu, 3 Oct 2002 01:34:59 +0000 (01:34 +0000)]
BuildArch: noarch is still broken.
adilger [Wed, 2 Oct 2002 23:52:47 +0000 (23:52 +0000)]
Always fill in the full body on reply to keep ll_lookup2 happy.
adilger [Wed, 2 Oct 2002 21:33:03 +0000 (21:33 +0000)]
Fix breakage caused earlier this week by over-optimization.
adilger [Wed, 2 Oct 2002 20:18:23 +0000 (20:18 +0000)]
Set the ino earlier - it appears that the second RPC reply does not set it.
adilger [Wed, 2 Oct 2002 20:12:42 +0000 (20:12 +0000)]
Formatting.
adilger [Wed, 2 Oct 2002 20:07:16 +0000 (20:07 +0000)]
Formatting.
adilger [Wed, 2 Oct 2002 20:06:51 +0000 (20:06 +0000)]
Fix circutious xattr_set.
Add commented-out (un)lock_kernel for transactions, if we decide to go there.
adilger [Wed, 2 Oct 2002 20:02:41 +0000 (20:02 +0000)]
Minor updates for readability.
adilger [Wed, 2 Oct 2002 19:54:11 +0000 (19:54 +0000)]
Helper routine for passing OBD handles around.
pschwan [Wed, 2 Oct 2002 19:53:50 +0000 (19:53 +0000)]
- Change an LDLM LBUG into just a warning, now that it can happen with local
locks. The warning will eventually be removed, but I want to know where it's
happening before I do that.
adilger [Wed, 2 Oct 2002 19:51:53 +0000 (19:51 +0000)]
Minor updates to 0.5.13 release notes.
pschwan [Wed, 2 Oct 2002 18:56:27 +0000 (18:56 +0000)]
- Comment out put_connection until we resolve the dep issue
- Can c ever be NULL in ptlrpc_put_connection? I don't know, but I want to
find out.
adilger [Wed, 2 Oct 2002 17:58:30 +0000 (17:58 +0000)]
Comment about expected compile warning (to disappear in the future).
pschwan [Wed, 2 Oct 2002 17:05:16 +0000 (17:05 +0000)]
- Do proper refcounting on export->exp_connection -- maybe this will fix the
leaking conns we see?
- add some unbalanced ENTRYs and some LASSERTs to help track down a bug
adilger [Wed, 2 Oct 2002 16:38:19 +0000 (16:38 +0000)]
mds_open:
- Don't leak a journal handle, dentry, mntget on mds_open failure.
- Wrap all filesystem operations inside push_ctxt.
- Don't lose the return status from mds_update_last_rcvd() when stopping
the journal handle.
mds_close:
- Don't do a fid2dentry, since we don't use the result anyways.
- Do all of the filesystem close operations before possibly hitting an
RPC error to avoid filesystem leaks (shouldn't happen anyways, since
we are allocating zero bytes).
pschwan [Tue, 1 Oct 2002 22:32:47 +0000 (22:32 +0000)]
- If a non-CONNECT request is made with an invalid export, we need to
instantiate a temporary lustre_connection with which to send a reply. This fix
is untested, but Mike says that he can reproduce this scenario easily.
- make ptlrpc_get_connection happier about receiving a NULL uuid
- avoid a possible connection reference leak in target_handle_connect
adilger [Tue, 1 Oct 2002 22:10:45 +0000 (22:10 +0000)]
Fix problem with duplicate inodes being created on the client. This should
fix the single-client open-unlink issue, and potentially other issues with
locking (the intent lock handle is also attached to the inode).
shaver [Tue, 1 Oct 2002 21:53:52 +0000 (21:53 +0000)]
Add support for FAILCONN ioctl, for recovery testing and (possibly) adminstrative
use.
shaver [Tue, 1 Oct 2002 18:55:37 +0000 (18:55 +0000)]
- Add RD_TROUBLED state for items that need to start recovery, and rename
the rd_state values to RD_*, to avoid namespace conflict with the recovd_state
values.
- Gobs of debugging and NULL-checking in expired_completion_wait, trying to track
down a can't-happen OOPS that phil is seeing.
- Better error reporting in the case where we can't run the recovery upcall.
- Disable the silly immediate restart of a failed client recovery, because it
causes a nice tight loop in the case that we can't run the recovery upcall.
- Mild .cvsignore fixes.
pschwan [Tue, 1 Oct 2002 17:50:22 +0000 (17:50 +0000)]
- ChangeLog update for 0.5.13
- Small formatting and wrapping changes that litter my tree
- Fixed unbalanced ENTRY/EXIT in my last symlink checkin
- Add a little more debugging info, to help track down an MDS LBUG
adilger [Tue, 1 Oct 2002 07:01:52 +0000 (07:01 +0000)]
Add header removed from obd_filter.h.
adilger [Tue, 1 Oct 2002 06:44:47 +0000 (06:44 +0000)]
Add export from obdfilter to hold open file handles.
Add dentry data so we can track destroy-while-file-open.
adilger [Tue, 1 Oct 2002 05:41:00 +0000 (05:41 +0000)]
Add OST handle to llite open file data.
adilger [Tue, 1 Oct 2002 05:40:20 +0000 (05:40 +0000)]
TODO comment for mds_body.
adilger [Tue, 1 Oct 2002 05:39:13 +0000 (05:39 +0000)]
Add LOV export handle to reference open handles.
Make a union out of MDS/LOV/obdfilter export data, not sure if LDLM data
can share the same export of not.
adilger [Tue, 1 Oct 2002 01:58:30 +0000 (01:58 +0000)]
Rename "export" with "exp" so it doesn't screw up my syntax highlighting
(export is a reserved C keyword).
adilger [Tue, 1 Oct 2002 01:56:12 +0000 (01:56 +0000)]
Add handle for exports from obdfilter.
Add handle for LOV open.
adilger [Tue, 1 Oct 2002 01:54:02 +0000 (01:54 +0000)]
Add OBD_MD_FLHANDLE
adilger [Tue, 1 Oct 2002 01:51:12 +0000 (01:51 +0000)]
Don't index unused directory indexes.
pschwan [Mon, 30 Sep 2002 03:01:43 +0000 (03:01 +0000)]
b=614622
All filesystems need to implement their own follow_link method, and we weren't.
So we never followed a symlink. Ever.
This file doesn't bear even the slightest relation to ext2/symlink.c anymore,
so I removed the old copyright notice.
gord-fig [Sat, 28 Sep 2002 20:01:40 +0000 (20:01 +0000)]
Do some patch math.
pschwan [Fri, 27 Sep 2002 22:34:24 +0000 (22:34 +0000)]
b=611892
Avoid conversion deadlocks by acquiring new locks instead of trying to
convert PR locks to PW.
adilger [Fri, 27 Sep 2002 21:26:49 +0000 (21:26 +0000)]
Use types for journal_callback_set() as in RH 12.5 rawhide and as submitted
for the 2.4.20 kernel. Will cause a compile warning on chaos22 kernels.
adilger [Fri, 27 Sep 2002 21:11:53 +0000 (21:11 +0000)]
Mask off the type bits and not the permission bits.
adilger [Fri, 27 Sep 2002 00:12:52 +0000 (00:12 +0000)]
Silence bogus compiler warning by moving code closer together.
adilger [Fri, 27 Sep 2002 00:03:08 +0000 (00:03 +0000)]
Add update for v0_5_12
adilger [Thu, 26 Sep 2002 23:59:40 +0000 (23:59 +0000)]
Remove unused variable.
pschwan [Thu, 26 Sep 2002 20:56:00 +0000 (20:56 +0000)]
b=611336
Until we really fix 611336 to never send larger IOV requests than the server
can handle, we should at least avoid _crashing_ the server.
pschwan [Thu, 26 Sep 2002 17:55:55 +0000 (17:55 +0000)]
I don't much like crashing
pschwan [Thu, 26 Sep 2002 17:45:30 +0000 (17:45 +0000)]
b=614867
- Remove replies from lock callbacks; Monish verified that this would not harm
the BlueArcs.
- Fixup an overlooked instrumentation detail
- Add an LBUG() to a lock conversion case that I don't think we should ever hit
yet.
pschwan [Thu, 26 Sep 2002 16:37:17 +0000 (16:37 +0000)]
b=613941
Previously, if a filesystem user called readlink() without enough buffer space,
Lustre would allocate a reply packet that's too small and we'd drop the packet
and go into recovery.
Now Lustre sends the maximum size to the server, which will truncate its own
readlink and reply appropriately.
gord-fig [Thu, 26 Sep 2002 03:29:58 +0000 (03:29 +0000)]
Don't forget to distribute the ext3 fixes patch.
shaver [Wed, 25 Sep 2002 23:32:33 +0000 (23:32 +0000)]
- #include good, implicit-decl warning bad.
shaver [Wed, 25 Sep 2002 22:58:09 +0000 (22:58 +0000)]
- Use l_wait_event in ldlm_completion_ast to both trigger recovery and make
hangs there due to dead MDSs/OSTs interruptible.
adilger [Wed, 25 Sep 2002 22:20:17 +0000 (22:20 +0000)]
Remove "helpful" error message, because it screws things up.
adilger [Wed, 25 Sep 2002 21:43:12 +0000 (21:43 +0000)]
Add more obvious error message after a patch failure, instead of before.
adilger [Wed, 25 Sep 2002 21:27:19 +0000 (21:27 +0000)]
Reverse order of i_version and extN_mark_inode_dirty tests, for stock kernels.
List the chaos22 patch as an alternative.
gord-fig [Wed, 25 Sep 2002 20:49:08 +0000 (20:49 +0000)]
Automatically detect when to apply the fixes patch.
gord-fig [Wed, 25 Sep 2002 20:23:49 +0000 (20:23 +0000)]
Update UML fire-putting-out patch.
adilger [Wed, 25 Sep 2002 19:59:54 +0000 (19:59 +0000)]
Don't automatically add patches to CVS.
Don't run with set -vx, but echo some status.
Remove "EXTRA" files before trying to apply existing patches.
Formatting.
shaver [Wed, 25 Sep 2002 19:43:33 +0000 (19:43 +0000)]
- Remove cancelled locks from the waiting list. Revoking locks from a dead
client to service another client now seems to work quite well. (The dead
client can even reconnect successfully.)
gord-fig [Wed, 25 Sep 2002 17:52:29 +0000 (17:52 +0000)]
Make patch generation more accurate with new files.
thantry [Wed, 25 Sep 2002 16:23:00 +0000 (16:23 +0000)]
Fixed Bug #614157, not uncommentig the registration/deregistration part
gord-fig [Wed, 25 Sep 2002 04:06:49 +0000 (04:06 +0000)]
First cut at uml40 extN patch.
braam [Wed, 25 Sep 2002 03:20:04 +0000 (03:20 +0000)]
- add ucred structure. MDS now sets the fsuid and fsgid to that of
the caller to enforce permissions, with a new argument to push_ctxt.
shaver [Wed, 25 Sep 2002 02:25:26 +0000 (02:25 +0000)]
- Cancel any and all outstanding locks when an export is disconnected.
- Remove icky and unused ldlm_destroy_export and mds_destroy_export hooks.
- Trigger recovery from timed-out lock callbacks.
- Support for replyless requests:
- add 1 to initial request refcount, balanced in request_out_callback
- don't set up reply portal buffer if replen is 0
- Ignore reply to DLM blocking/completed ASTs. (Note: we still very much care
about cancellation in response to blocking ASTs.)
- Server-side recovery now "simply" forces a disconnect of every export using
the failed connection.
- Handle (better, not perfectly) the case where we signal failure on a connection
that is already undergoing recovery. We need to do more here, but this will
keep us from going too deeply insane for now.