Whamcloud - gitweb
fs/lustre-release.git
thantry [Wed, 23 Oct 2002 03:03:16 +0000 (03:03 +0000)]
File containing read/write fptr implementations for MDS. To be added to make path after review

thantry [Wed, 23 Oct 2002 03:02:49 +0000 (03:02 +0000)]
File containing read/write fptr implementations for MDC. To be added to make path after review

thantry [Wed, 23 Oct 2002 03:02:20 +0000 (03:02 +0000)]
File containing read/write fptr implementations for LOV. To be added to make path after review

thantry [Wed, 23 Oct 2002 03:01:45 +0000 (03:01 +0000)]
File containing read/write fptr implementations for mount_point. To be added to make path after review

thantry [Wed, 23 Oct 2002 02:59:02 +0000 (02:59 +0000)]
Adding to the source repository. Will be linked in to the makefile after
review.

adilger [Tue, 22 Oct 2002 22:36:40 +0000 (22:36 +0000)]
Pass ioctls on LOV device through to all underlying OSCs if unknown.

adilger [Tue, 22 Oct 2002 21:22:58 +0000 (21:22 +0000)]
Fix LOV striping code:
- don't set default stripe count at lov_connect() time if it is zero, defer
  it to lov_create() time in case the number of OSTs has changed
- validate the "stripe width < 4GB" requirement at lov_create() time, in
  case bad user-specified striping patterns are given or #OSTs has increased
- don't override lsm_stripe_count if it is already set
- choose from all OSTs as the starting OST, instead of only [0-active]
- return -ENOTTY for unimplemented ioctls
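
The deferred default and the "stripe width < 4GB" check described above can be sketched roughly as follows. This is a minimal illustration, not the actual lov_create() code: `struct stripe_desc`, its fields, and `stripe_desc_validate()` are hypothetical names, and treating a zero stripe count as "use every OST currently known" is an assumption made for the example.

```c
#include <errno.h>
#include <stdint.h>

/* Hypothetical, simplified striping descriptor; the field names are
 * illustrative and are not taken from the Lustre headers. */
struct stripe_desc {
    uint32_t stripe_count;  /* 0 means "not set yet, pick a default" */
    uint64_t stripe_size;   /* bytes written to one OST before moving on */
};

/* Resolve a zero stripe count against the OST count known at create time
 * (assumed default: all OSTs), then require stripe_size * stripe_count to
 * stay below 4 GiB. Returns 0 on success or -EINVAL. */
int stripe_desc_validate(struct stripe_desc *d, uint32_t ost_count)
{
    const uint64_t limit = 1ULL << 32;  /* 4 GiB */

    if (ost_count == 0 || d->stripe_size == 0 || d->stripe_size >= limit)
        return -EINVAL;

    /* Only fill in the default; a count that is already set is kept. */
    if (d->stripe_count == 0)
        d->stripe_count = ost_count;

    /* stripe_size < 2^32 and stripe_count < 2^32, so the 64-bit
     * product cannot wrap. */
    if (d->stripe_size * (uint64_t)d->stripe_count >= limit)
        return -EINVAL;

    return 0;
}
```

Running the check at create time rather than connect time means it sees the OST count in effect when the file is actually created, which is the case the commit message calls out.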

adilger [Tue, 22 Oct 2002 20:41:47 +0000 (20:41 +0000)]
Minor tweak to match RH 2.4.18-14 function name in kernel.

adilger [Tue, 22 Oct 2002 20:33:28 +0000 (20:33 +0000)]
Remove old -12.5 kernel patch; the -14 patch is exactly the same, and up to date.

pschwan [Tue, 22 Oct 2002 06:55:14 +0000 (06:55 +0000)]
- teeny fix to ll_revalidate2; this will be trumped by my fix for 626244
  in ways that can scarcely be described by mere words
- multiple CONNECTs to the LOV would do more CONNECTs to the OSCs, each
  clobbering the last connection and preventing cleanup; fixed.
- LOV targets now have an 'active' bit; inactive targets will be skipped at
  CREATE and DISCONNECT time
- added locking for the above, along with a function to toggle the bit given an
  LOV and a UUID
- added a LOV_SET_OSC_ACTIVE ioctl to the lov driver and lctl
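
The 'active' bit and the UUID-based toggle described above might look something like this in outline. It is a sketch under stated assumptions: `struct target_table`, `target_set_active()`, and `target_next_active()` are hypothetical names, the table is a fixed-size array, and the locking mentioned in the commit is left out for brevity.

```c
#include <errno.h>
#include <string.h>

#define MAX_TARGETS 8

/* Hypothetical stand-ins for a LOV target list; not the real structures. */
struct target {
    char uuid[40];
    int  active;    /* 1 = usable, 0 = skipped */
};

struct target_table {
    struct target tgts[MAX_TARGETS];
    int count;
};

/* Toggle the active bit of the target with the given UUID.
 * Returns 0 on success, -EINVAL if no target has that UUID.
 * A real implementation would hold a lock around the update. */
int target_set_active(struct target_table *t, const char *uuid, int active)
{
    for (int i = 0; i < t->count; i++) {
        if (strcmp(t->tgts[i].uuid, uuid) == 0) {
            t->tgts[i].active = !!active;
            return 0;
        }
    }
    return -EINVAL;
}

/* Return the index of the first active target at or after 'start',
 * wrapping around once; -ENODEV if every target is inactive. */
int target_next_active(const struct target_table *t, int start)
{
    if (t->count <= 0 || start < 0)
        return -ENODEV;
    for (int n = 0; n < t->count; n++) {
        int i = (start + n) % t->count;
        if (t->tgts[i].active)
            return i;
    }
    return -ENODEV;
}
```

A create path would call something like `target_next_active()` when choosing where to place an object, so a target marked inactive is passed over without any other code needing to know about it.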

braam [Mon, 21 Oct 2002 23:37:39 +0000 (23:37 +0000)]
- fixes to osc_brw path to address fsx failures. We in fact uncovered
  serious problems with writes of less than one page.
- minor fixes for 2.5

adilger [Mon, 21 Oct 2002 22:05:33 +0000 (22:05 +0000)]
Commit new intent patch for 2.4.18.

adilger [Mon, 21 Oct 2002 22:05:11 +0000 (22:05 +0000)]
Remove sig from end of patch.

adilger [Mon, 21 Oct 2002 21:45:15 +0000 (21:45 +0000)]
The curse of copy-n-paste.  Fix compile warnings and maybe code too.

adilger [Mon, 21 Oct 2002 21:32:53 +0000 (21:32 +0000)]
Getting ready to land OBD open file handles, and OSC/LOV local lock handle
allocations.

adilger [Mon, 21 Oct 2002 21:31:04 +0000 (21:31 +0000)]
A little debugging I've been running in my tree for a long time.
It caught a couple of minor bugs, and should prevent more in the future.

adilger [Mon, 21 Oct 2002 21:25:05 +0000 (21:25 +0000)]
Add lov_getconfig to lctl docs.

adilger [Mon, 21 Oct 2002 21:16:46 +0000 (21:16 +0000)]
Change lctl lovconfig command into lov_getconfig, and add lov_setconfig.
Update all scripts/docs which refer to the lovconfig command.

lov_getconfig compiles OK, and does not affect normal operation, but it
is not working yet (user-space segfault) and I need to run it on a real
system (can't run GDB under UML).

adilger [Mon, 21 Oct 2002 18:26:53 +0000 (18:26 +0000)]
Add permission bits to open(O_CREAT) (b=624321).
Add a verifiable data pattern so that we can confirm if data is going to the
correct stripe.

eeb [Mon, 21 Oct 2002 17:29:53 +0000 (17:29 +0000)]
removed demos/Makefile from configure.in

pschwan [Mon, 21 Oct 2002 07:17:26 +0000 (07:17 +0000)]
Oops, missed a hunk.

pschwan [Mon, 21 Oct 2002 07:16:15 +0000 (07:16 +0000)]
Move AM_CONDITIONAL into configure.in

adilger [Mon, 21 Oct 2002 07:05:15 +0000 (07:05 +0000)]
Include version.h so we can check version.

braam [Mon, 21 Oct 2002 07:03:25 +0000 (07:03 +0000)]
- fix to deal with other versions of automake.... :(

braam [Mon, 21 Oct 2002 06:47:07 +0000 (06:47 +0000)]
- compile fixes for 2.5 -- somewhat tentative here and there.

braam [Mon, 21 Oct 2002 06:35:22 +0000 (06:35 +0000)]
2.4 fixes

braam [Mon, 21 Oct 2002 05:58:41 +0000 (05:58 +0000)]
- 2.5 fixes for MDS

braam [Mon, 21 Oct 2002 04:25:20 +0000 (04:25 +0000)]
- compile fixes for 2.5.44
- echo works again

braam [Sun, 20 Oct 2002 22:40:11 +0000 (22:40 +0000)]
- put obdfilter before extN + mds to facilitate collab between peter +
  andreas on 2.5

pschwan [Sun, 20 Oct 2002 19:18:38 +0000 (19:18 +0000)]
2.4 build fixes

braam [Sun, 20 Oct 2002 19:09:24 +0000 (19:09 +0000)]
- Lustre Lite at least superficially ready for 2.5

pschwan [Sun, 20 Oct 2002 00:16:14 +0000 (00:16 +0000)]
- Back out the ldlm_everything_lock patch, which causes a deadlock on cleanup
- Move backslashes in lustre_lite.h macros to fit in 80 columns
- Added a couple of structure fields that support my LOV work

braam [Sat, 19 Oct 2002 22:43:14 +0000 (22:43 +0000)]
- buffer checks are different for non-intent
- remove a it = ->d_it assignment
- set the intent at the end of lookup2

braam [Sat, 19 Oct 2002 19:53:09 +0000 (19:53 +0000)]
- fix rdev stuff
- assign d_fsdata in lookup2

braam [Sat, 19 Oct 2002 16:43:42 +0000 (16:43 +0000)]
- keep extN before MDS

braam [Sat, 19 Oct 2002 16:29:00 +0000 (16:29 +0000)]
Restore Phil's header fix that I whacked.

braam [Sat, 19 Oct 2002 16:09:43 +0000 (16:09 +0000)]
ONLY UPDATE IF YOU NEED THIS (i.e. Andreas probably will)

- more fixes for 2.5 builds, much overlap with Andreas' check in from
  yesterday.

- add vfs_inode to Lustre inode info: for 2.5 this makes this the
  whole inode.

- two versions of intent_release: Robert's and the new one for 2.5

- Robert's patch - so this may break some stuff, we'll sort that out.

- convert device numbers

- small fixes to make thread handling compatible with 2.5

- always compile super25.c and super.c; their contents are conditional on
  the version

pschwan [Sat, 19 Oct 2002 11:33:33 +0000 (11:33 +0000)]
- Replace per-namespace recursive lock with an ldlm-global lock, to close the
  race between ldlm_handle2lock and the lock destroy portion of ldlm_lock_put
- Add a special interface for ldlm_handle2lock to resolve the following
  conflict between our invariants:

  1. ldlm_handle2lock should never give a reference to a "dying" lock
  2. ldlm_lock_decref needs to use ldlm_handle2lock to get a reference to a lock
     so that it can decrease the readers/writers refcount
  3. The lock can't finish dying until readers/writers hits zero.

  That should be fixed now.
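
The commit does not say what the special interface looks like, so the following is only one possible shape for resolving the three invariants: a lookup that refuses dying locks by default, plus an opt-in flag used solely by the decref path. All names here (`struct lock`, `lock_lookup()`, `lock_put()`, `lock_decref()`) are hypothetical, the "handle" is simplified to a plain pointer, and no locking is shown.

```c
#include <stddef.h>

/* Hypothetical, simplified lock object; not the real ldlm_lock. */
struct lock {
    int refcount;  /* references held on the structure itself */
    int users;     /* readers + writers still using the lock */
    int dying;     /* set once destruction has started */
};

/* Look up a lock and take a reference. Ordinary callers pass
 * allow_dying = 0 and never see a dying lock (invariant 1). */
struct lock *lock_lookup(struct lock *handle, int allow_dying)
{
    if (handle == NULL)
        return NULL;
    if (handle->dying && !allow_dying)
        return NULL;
    handle->refcount++;
    return handle;
}

/* Drop a reference taken by lock_lookup(). */
void lock_put(struct lock *l)
{
    l->refcount--;
}

/* Drop one reader/writer. This path must be able to reach a dying lock
 * (invariant 2), because the lock cannot finish dying until the user
 * count reaches zero (invariant 3). Returns the remaining user count,
 * or -1 if the handle is invalid. */
int lock_decref(struct lock *handle)
{
    struct lock *l = lock_lookup(handle, 1);
    int remaining;

    if (l == NULL)
        return -1;
    if (l->users > 0)
        l->users--;
    remaining = l->users;
    lock_put(l);
    return remaining;
}
```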

shaver [Sat, 19 Oct 2002 11:19:14 +0000 (11:19 +0000)]
- Split import reconnection and replay (OSC only needs to reconnect).
- Restore recovd-management of server-side connections (why did I take that out
  in the first place?).
- Abort inflight, uncommitted and waiting requests for imports that are
  invalidated by reconnection to OST.  (Generates -EIO, or should!)
- Mete out harsh, harsh justice to locks held by recovery-invalidated imports.
- Remove LBUG()s now that some error returns from ldlm_cli_cancel are "normal".
- Cancel locks on disconnection from OST (really filter).
- Ignore replies that come in after we've started recovery on a given connection.
  The socknal's implicit retransmission can otherwise cause us worlds and worlds
  of hurt.
- Wake up when we get aborted, and return -EIO.
- Remove connections from the recovd's care when they go unused.  (Which might
  actually happen, once Phil lands his refcounting fixes!)

shaver [Sat, 19 Oct 2002 11:09:16 +0000 (11:09 +0000)]
- Rename ldlm_namespace_cleanup's "local" parameter to "local_only" for clarity.
- Remove spurious LBUG() or two.
- Export the cleanup and dump symbols we use in recovery.

pschwan [Sat, 19 Oct 2002 09:12:24 +0000 (09:12 +0000)]
Fix the connection refcount leaks.  There were actually 3 separate bugs.

adilger [Sat, 19 Oct 2002 06:21:46 +0000 (06:21 +0000)]
UnlockPage->unlock_page.  The two are identical, except that the former
does not exist in 2.5.

adilger [Fri, 18 Oct 2002 23:43:17 +0000 (23:43 +0000)]
Hiding will not save you.

adilger [Fri, 18 Oct 2002 23:39:18 +0000 (23:39 +0000)]
Don't refer to junk which doesn't exist.

adilger [Fri, 18 Oct 2002 23:38:54 +0000 (23:38 +0000)]
Yet more junk.

adilger [Fri, 18 Oct 2002 23:37:55 +0000 (23:37 +0000)]
More cruftectomy.

adilger [Fri, 18 Oct 2002 23:35:27 +0000 (23:35 +0000)]
Remove demos directory entirely.

adilger [Fri, 18 Oct 2002 23:34:45 +0000 (23:34 +0000)]
Remove ancient and unworking snapshot demo scripts.

adilger [Fri, 18 Oct 2002 23:27:46 +0000 (23:27 +0000)]
One more straggler in obdfs.

adilger [Fri, 18 Oct 2002 23:20:52 +0000 (23:20 +0000)]
Remove ancient and entirely non-working obdfs code from the HEAD.

pschwan [Fri, 18 Oct 2002 22:32:45 +0000 (22:32 +0000)]
More 2.5-SMP build fixes

pschwan [Fri, 18 Oct 2002 22:20:54 +0000 (22:20 +0000)]
Fix build on 2.4SMP

I don't have a working 2.5 llite directory yet, so can you test these
changes, Andreas?

pschwan [Fri, 18 Oct 2002 22:06:16 +0000 (22:06 +0000)]
I really broke the tree last night.  I guess nobody has been using it much
today.

- Remove a bogus assertion in ldlm_completion_ast
- Add a class 9 hack in ldlm_destroy_lock to get things working
- LINUX_VERSION_CODE requires version.h

adilger [Fri, 18 Oct 2002 21:45:48 +0000 (21:45 +0000)]
Fix breakage from missing wait_on_buffer() inline.

shaver [Fri, 18 Oct 2002 21:42:03 +0000 (21:42 +0000)]
automake 1.6.x doesn't like the redefinition, and I bet that's not what we meant
anyway.

adilger [Fri, 18 Oct 2002 21:19:57 +0000 (21:19 +0000)]
Header changes needed to compile under 2.5 (compiled also with 2.4 to verify).

rread [Fri, 18 Oct 2002 19:39:03 +0000 (19:39 +0000)]
* allow lmc and lconf paths to be overridden in installed scripts

rread [Fri, 18 Oct 2002 19:29:06 +0000 (19:29 +0000)]
* fix braino

adilger [Fri, 18 Oct 2002 19:25:48 +0000 (19:25 +0000)]
More merging from the 2.5 tree.

radhikav [Fri, 18 Oct 2002 19:21:51 +0000 (19:21 +0000)]
Replace the man page for lmc to describe the current options for lmc. Added a new file which describes the proposed new implementation for lmc.

adilger [Fri, 18 Oct 2002 19:13:03 +0000 (19:13 +0000)]
Another minor "make the current code closer to intent code" change.

adilger [Fri, 18 Oct 2002 19:02:00 +0000 (19:02 +0000)]
Merge some of the no-op intent changes to the core tree before branching.

rread [Fri, 18 Oct 2002 19:01:50 +0000 (19:01 +0000)]
* change --with-portalslib to require the full path and also use the
  --with-portals arg as the base dir for the default
* remove old config scripts and put examples in /usr/lib/lustre/examples

adilger [Fri, 18 Oct 2002 17:53:15 +0000 (17:53 +0000)]
Add missing symlink hunk.

adilger [Fri, 18 Oct 2002 17:51:54 +0000 (17:51 +0000)]
Minor cleanup.

pschwan [Fri, 18 Oct 2002 17:40:06 +0000 (17:40 +0000)]
bdevname changed in 2.5

radhikav [Fri, 18 Oct 2002 16:30:15 +0000 (16:30 +0000)]
Updated the lctl manpage and Lustre How-To.

adilger [Fri, 18 Oct 2002 07:40:21 +0000 (07:40 +0000)]
Show the size that we were truncating to.

adilger [Fri, 18 Oct 2002 07:36:44 +0000 (07:36 +0000)]
Add some variables to the config scripts so they are easier to customize.

adilger [Fri, 18 Oct 2002 07:34:23 +0000 (07:34 +0000)]
Add a minor check to the punch call.

adilger [Fri, 18 Oct 2002 06:34:08 +0000 (06:34 +0000)]
Give a valid return code for the non-intent ll_unlink and ll_rmdir.
-ENOENT is what ext2_unlink uses, so it must be correct...

braam [Fri, 18 Oct 2002 05:11:14 +0000 (05:11 +0000)]
Amrut's patch for building extN outside the source

braam [Fri, 18 Oct 2002 05:10:48 +0000 (05:10 +0000)]
Amrut's patch for building extN outside the source.

braam [Fri, 18 Oct 2002 05:00:26 +0000 (05:00 +0000)]
I believe this makefile change fixes the problems in the build outside
the source tree.  Interestingly I did NOT need Amrut's patches, but only
tested this for UML.

braam [Fri, 18 Oct 2002 04:32:25 +0000 (04:32 +0000)]
First check in of the new intent patch that protects d_it better.
Not much tested.
This one works against Red Hat 8.0 2.4.18-14 kernel sources.

pschwan [Fri, 18 Oct 2002 01:50:19 +0000 (01:50 +0000)]
- Modify ldlm_completion_ast to wake up when locks are cancelled as well as
  granted
- Wake sleepers in ldlm_lock_destroy

adilger [Thu, 17 Oct 2002 23:51:54 +0000 (23:51 +0000)]
- Add a slab for allocating OSC lock handles - 16-byte handles are a waste
  of 32-byte slab kmallocs.
- Rename a couple of the slabs to have an ll_ prefix for easy /proc/slabinfo
  grepping.

adilger [Thu, 17 Oct 2002 23:30:51 +0000 (23:30 +0000)]
Don't store LOV EA data on mds_reint_create() for non-intent case - we
always do deferred object allocation now.

adilger [Thu, 17 Oct 2002 23:07:46 +0000 (23:07 +0000)]
Duh, repeat after me - compile before you commit.

adilger [Thu, 17 Oct 2002 23:04:58 +0000 (23:04 +0000)]
Fix merge error.

pschwan [Thu, 17 Oct 2002 22:45:02 +0000 (22:45 +0000)]
More debugging related to the portals/rpc weirdness.

pschwan [Thu, 17 Oct 2002 22:43:52 +0000 (22:43 +0000)]
Split ldlm_namespace_free into ldlm_namespace_free and _cleanup.  Cleanup
will just discard all locks and resources.

This is not quite finished, because it needs to also wake up people who
were sleeping on these locks (and cause EIO or similar) before it can free
them.

pschwan [Thu, 17 Oct 2002 22:39:07 +0000 (22:39 +0000)]
Miscellaneous fixes:
- READLINK wasn't well represented in ll_lookup2, so failed readlinks would have
  gone very poorly indeed.
- Remove #if 0s from ll_common_unlink; these were a temporary fix to
  the "rmdir `pwd`" problem, which should no longer be necessary.

adilger [Thu, 17 Oct 2002 09:14:48 +0000 (09:14 +0000)]
- improved handling of errors returned from MDS intent operations.
- remove bogus setting of MDS reply body fields at client in ll_create_node
- always do deferred object creation with ll_create() for non-intent ops

adilger [Thu, 17 Oct 2002 09:09:25 +0000 (09:09 +0000)]
Minor optimization.

adilger [Thu, 17 Oct 2002 09:06:24 +0000 (09:06 +0000)]
Ensure that we get a valid rootino back to avoid polluting the inode hash.

adilger [Thu, 17 Oct 2002 08:35:22 +0000 (08:35 +0000)]
More pre-emptive Changelog entries.

adilger [Thu, 17 Oct 2002 08:31:32 +0000 (08:31 +0000)]
Add permission bits when we open with O_CREAT (b=624321).

adilger [Thu, 17 Oct 2002 07:16:25 +0000 (07:16 +0000)]
Minor edits to remove apparently unfinished sentences.

adilger [Wed, 16 Oct 2002 20:13:39 +0000 (20:13 +0000)]
- helper lov_stripe_size() calculates apparent file size from each stripe
- helper lov_merge_attrs() uses lov_stripe_size() and also merges other attrs
- lov_getattr() and lov_open() use lov_merge_attrs() to return size+blocks
- fix lov_stripe_offset() to work properly for truncate up operations
- don't allocate obdos on the stack if we can avoid it
- lov_setattr() is not being used - add LBUG() to ensure we notice when/if
  it does start being used and do a code audit for correctness
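
As a rough illustration of how a file size can be derived from per-stripe object sizes, the sketch below assumes a plain round-robin (RAID-0 style) layout; the commit does not spell out the layout, and the real lov_stripe_size() and lov_merge_attrs() may differ. The names `stripe_to_file_size()` and `merged_file_size()` are hypothetical.

```c
#include <stdint.h>

/* Given the size of the object on stripe 'stripe_index', return the file
 * size implied by that object alone, assuming a round-robin layout with
 * 'stripe_count' stripes of 'stripe_size' bytes each. */
uint64_t stripe_to_file_size(uint64_t obj_size, uint64_t stripe_size,
                             uint32_t stripe_count, uint32_t stripe_index)
{
    if (obj_size == 0 || stripe_size == 0 || stripe_count == 0)
        return 0;

    uint64_t last = obj_size - 1;        /* last byte within the object */
    uint64_t row  = last / stripe_size;  /* full stripe rows before it */
    uint64_t off  = last % stripe_size;  /* offset inside its chunk */

    /* File offset of that last byte, plus one to turn it into a size. */
    return (row * stripe_count + stripe_index) * stripe_size + off + 1;
}

/* The apparent file size is the largest size implied by any stripe. */
uint64_t merged_file_size(const uint64_t *obj_sizes, uint64_t stripe_size,
                          uint32_t stripe_count)
{
    uint64_t size = 0;

    for (uint32_t i = 0; i < stripe_count; i++) {
        uint64_t s = stripe_to_file_size(obj_sizes[i], stripe_size,
                                         stripe_count, i);
        if (s > size)
            size = s;
    }
    return size;
}
```

For example, with a 4-byte stripe size and two stripes, a 10-byte file puts 6 bytes on stripe 0 and 4 bytes on stripe 1; stripe 0 implies a size of 10, stripe 1 implies 8, and the merged result is 10.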

adilger [Wed, 16 Oct 2002 17:27:18 +0000 (17:27 +0000)]
- Extract the file size and blocks from osc_open() in ll_file_open().  Avoids
  a bug where the file size is incorrectly stored on the MDS (not sure why,
  but then we don't trust the file size on the MDS anyways).
- Fix the unlock half of the boolean locking check I fixed last week.
- Use the common ll_create_objects() helper to create objects, so the BA OST
  workaround is in use for all creates.

adilger [Wed, 16 Oct 2002 17:15:29 +0000 (17:15 +0000)]
Reduce the number of NBUFS slightly for small machines, so that you can
load 2 OSTs into a 64MB UML.  The upper limit is exactly the same as
before.

shaver [Wed, 16 Oct 2002 06:21:34 +0000 (06:21 +0000)]
- Fix the mdc_replay_open fixup so that it updates the handle in the Lustre
  fh, not just in the request.
- Diagnostic for close-releasing-open req balancing.
- Don't allocate new transnos for replayed requests, or all hell _does_ break
  loose with the committing of "old" requests as things get renumbered.
- Set request level to LUSTRE_CONN_RECOVD during replay.
- Only call the replay callback if the replay succeeded (which it had darned
  well better do, but still...)
- Unlink old MEs when we free old reply buffers -- very important, sigh.
- Bullet-proof some resend diagnostics.

adilger [Wed, 16 Oct 2002 06:00:41 +0000 (06:00 +0000)]
Remove bogus hunks from the -12.5 RH kernel patch.

pschwan [Wed, 16 Oct 2002 01:50:56 +0000 (01:50 +0000)]
Bump the sanity-max to 1000, since we have way, way more than 50
connections on MCR.

shaver [Tue, 15 Oct 2002 23:25:14 +0000 (23:25 +0000)]
- Move recovery setup into the (network-using) connect methods, to fix
  LOVs.
- Updated lov.xml file.

shaver [Tue, 15 Oct 2002 20:54:30 +0000 (20:54 +0000)]
- make_bad_inode as part of epoch-check mismatch handling
- cleanup and comment-for-phil in ll_lock
- partial fix for the crash in recovery that I introduced with my patch yesterday
- better diagnostics in lustre_msg_buf
- finished, not free_req, in reconnection

shaver [Tue, 15 Oct 2002 19:05:27 +0000 (19:05 +0000)]
Remove guaranteed-or-your-money-back deadlock from double taking of recovd_lock.
(Sorry, phil.)

gord-fig [Tue, 15 Oct 2002 13:02:04 +0000 (13:02 +0000)]
Downgrade to ps2pdf until changebars work with pdftex.

shaver [Tue, 15 Oct 2002 02:39:34 +0000 (02:39 +0000)]
- Rename the ptlrpc-general reconnection and replay functions, and export them
  for use in other places (such as llite).
- Replace many dangerous ptlrpc_free_req calls with its refcount-friendly
  ptlrpc_req_finished counterpart.
- Remove excess and harmful p_req_finished in ll_file_open.
- Move recovd_conn_manage calls from client_obd_connect to ll_read_super,
  since the recovery function is now llite-specific.
- Resurrect llite/recover.c for said function (coming soon).
- Rationalized the portals-owned refcount (added in ptl_send_rpc, removed in
  request_out_callback).
- Free repmsg from free_req, not req_finished, in case one of the remaining
  ref-holders has plans for it.
- Diagnostics for freed reqs with outstanding refcounts, to help track down
  any remaining cases.
- Initialize c_recovd_data.rd_managed_chain at allocation time, for better karma.
- Marginally nicer dump_connection_list output, and LBUG check for ilooping.
- Add diagnostics for double-managing of connections, which turned out not to be
  the bug I was chasing earlier, but might well be in the future.
- Set rd_phase and rd_next_phase before calling the recovery state-machine hooks,
  so that adjustments within those hooks (such as calls to recovd_conn_fixed)
  don't result in an inconsistent state.
- Failure to run the upcall no longer aborts recovery, so that an administrator
  can manually salvage things.
- More complete diagnostics for request replay and sleeping.
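
The difference between freeing a request outright and the refcount-friendly "finished" call can be illustrated with a generic reference-counting sketch. This is not the ptlrpc code: `struct request`, `request_alloc()`, `request_get()`, and `request_finished()` are hypothetical names chosen for the example.

```c
#include <stdlib.h>

/* Hypothetical request object carrying only a reference count. */
struct request {
    int refcount;
};

/* Allocate a request owned by the caller (one reference). */
struct request *request_alloc(void)
{
    struct request *r = calloc(1, sizeof(*r));

    if (r != NULL)
        r->refcount = 1;
    return r;
}

/* Take an extra reference, e.g. while the network layer still needs
 * the request after the caller is done with it. */
struct request *request_get(struct request *r)
{
    r->refcount++;
    return r;
}

/* Drop one reference and free the request only when the last holder
 * lets go. Returns 1 if the request was freed, 0 otherwise. Calling
 * free() directly instead would pull the memory out from under any
 * remaining holder. */
int request_finished(struct request *r)
{
    if (r == NULL)
        return 0;
    if (--r->refcount > 0)
        return 0;
    free(r);
    return 1;
}
```

With this pattern each holder calls `request_finished()` exactly once for each reference it took, and the order in which holders finish no longer matters.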