Whamcloud - gitweb
eeb [Wed, 11 Jun 2003 13:51:17 +0000 (13:51 +0000)]
* Single event queue per network interface.
* Pre-allocated server requests, individually posted for receives
Servers now configured by XXX_NREQ (# requests) and XXX_MAXREQSIZE
(size of each replies buffer).
* Asynchronous server-side reply completion/notification. struct
ptlrpc_reply_state objects split off reply state so the request can be
reposted immediately.
* exp_outstanding_reply now a list of replies
* bulk descriptors sized so bd_iov can contain all the fragments.
ptlrpc_prep_bulk_page() now returns void (i.e. no errors; may fail
an LASSERT if the bulk desc was sized incorrectly).
* NOT tested saved locks yet; bringing up to date with b_devel first...
adilger [Tue, 10 Jun 2003 23:00:53 +0000 (23:00 +0000)]
Add some comments to the iopen patches to make intended behaviour clear.
pschwan [Tue, 10 Jun 2003 10:48:27 +0000 (10:48 +0000)]
b_devel -> b_orphan merge for 20030610
rread [Sat, 7 Jun 2003 00:07:44 +0000 (00:07 +0000)]
beginning of multinet in lconf
- start the acceptor on gateways
- use mynid for all network types
- many routing code cleanups
- experimental support for cluster id
rread [Thu, 5 Jun 2003 04:12:30 +0000 (04:12 +0000)]
merge from b_devel -> b_ad
DEVEL_AD_PARENT_20030526 -> DEVEL_AD_PARENT_20030605
Large, squirrelly merge. It mounts, and acceptance_small.sh finishes
the test and then fails in cleanup, just like b_devel.
adilger [Wed, 4 Jun 2003 21:34:43 +0000 (21:34 +0000)]
Merge from b_devel -> b_orphan
DEVEL_ORPHAN_UPDATE_PARENT_20030528 -> DEVEL_ORPHAN_UPDATE_PARENT_20030604
This is a non-functional commit, with osc/osc_request.c not compiling because
it is missing the "llog_commit_master" data struct, and does not have any
commit threads started yet. I'm beginning to think the right thing to do
here is to create a single global llog_commit_master struct for the OSC, and
start a single thread at module setup time, and clean it all up at module
unload time, but I could be convinced otherwise.
wangdi [Wed, 4 Jun 2003 01:59:10 +0000 (01:59 +0000)]
fix a flaw in iopen-2.4.18.patch
adilger [Tue, 3 Jun 2003 23:27:02 +0000 (23:27 +0000)]
Remove bogus extra arg from delete thread warning message.
adilger [Tue, 3 Jun 2003 22:52:07 +0000 (22:52 +0000)]
Include the EXT3_MOUNT_ASYNCDEL addition from ext3-delete-thread into the
context of the iopen patch (as it is in the 2.4.20 version), otherwise the
iopen patch fails to apply.
adilger [Tue, 3 Jun 2003 07:38:09 +0000 (07:38 +0000)]
Add THREAD_NAME compat macro, to remove duplication of UML thread-name stuff.
Make list_move() available if not defined (based on presence of
list_for_each_entry(), which is also defined in 2.5).
wangdi [Mon, 2 Jun 2003 09:07:28 +0000 (09:07 +0000)]
add jdike-2.5.69-uml.config
wangdi [Sat, 31 May 2003 09:53:36 +0000 (09:53 +0000)]
add check_get_page in i386 arch
adilger [Wed, 28 May 2003 18:28:35 +0000 (18:28 +0000)]
More from b_devel->b_orphan merge (I guess I didn't commit anything last night).
DEVEL_ORPHAN_UPDATE_PARENT_20030522->DEVEL_ORPHAN_UPDATE_PARENT_20030528
adilger [Wed, 28 May 2003 17:32:11 +0000 (17:32 +0000)]
More from b_devel->b_orphan merge.
DEVEL_ORPHAN_UPDATE_PARENT_20030522->DEVEL_ORPHAN_UPDATE_PARENT_20030528
adilger [Wed, 28 May 2003 17:19:20 +0000 (17:19 +0000)]
More from b_devel->b_orphan merge.
DEVEL_ORPHAN_UPDATE_PARENT_20030522->DEVEL_ORPHAN_UPDATE_PARENT_20030528
adilger [Wed, 28 May 2003 08:50:45 +0000 (08:50 +0000)]
Merge b_devel->b_orphan:
DEVEL_ORPHAN_UPDATE_PARENT_20030522->DEVEL_ORPHAN_UPDATE_PARENT_20030528
meijia [Wed, 28 May 2003 06:58:55 +0000 (06:58 +0000)]
[TCPNAL]: make tcpnal_recv() handle multiple iovectors.
meijia [Wed, 28 May 2003 03:05:09 +0000 (03:05 +0000)]
[TCPNAL]: make tcpnal_send() send large buffer arrays.
adilger [Tue, 27 May 2003 20:59:13 +0000 (20:59 +0000)]
Minor change to noread patch (was missing fix from not-yet-working 2.5 version).
zab [Tue, 27 May 2003 01:25:16 +0000 (01:25 +0000)]
get b_ad compiling after the b_devel merge.
- move the possibly broken waitfor_one_page to one place
- o_mtime is seconds, that's what ll_ts2u64 returns. s/%Lu/LPU64/
- get rid of client_tgt2uuid2obd copies, export it from obdclass
- move client_obd_{setup,cleanup} into obdclass, export, kill dupes
- remove empty ptlrpc_lib
- get rid of random sanobd dupe in lov
- hoist ll_alloc_brw out of a kernel version ifdef
- no, seriously, unsigned long flags.
pschwan [Tue, 27 May 2003 00:43:11 +0000 (00:43 +0000)]
- Commit wangdi's configure macro for detecting 2.4.20-rh (with backported 2.5
structure changes)
- move the signal compat macros to a new portals_compat25.h
- use them everywhere for 2.5.x and 2.4.20-rh
zab [Mon, 26 May 2003 22:50:20 +0000 (22:50 +0000)]
- bring recent b_devel changes into b_ad; only minor corrections needed. The
only significant one was making sure changes in super.c made it to
b_ad's llite_lib.c
wangdi [Mon, 26 May 2003 11:04:05 +0000 (11:04 +0000)]
fix portals/archdep.m4
pschwan [Mon, 26 May 2003 08:18:18 +0000 (08:18 +0000)]
b=1173
Remove all files from lib/, put the functions into various other
modules as appropriate, and remove the bits to create the symlinks.
Tested in UML and lightly on dev. Builds, inserts, mounts.
adilger [Sun, 25 May 2003 18:17:54 +0000 (18:17 +0000)]
Mostly working full cookie create-send-cancel cycle for MDS unlinks.
MDS now establishes a connection (LOV only, getting UUID from LOVDESC pending
bug 1280 resolution).
We cannot clean up as a result of this (MDS->LOV connection). Might be
possible to fix by changing cleanup order so we do llite/mdc/mds cleanup
before lov/osc/ost cleanup.
meijia [Sat, 24 May 2003 08:49:12 +0000 (08:49 +0000)]
fix for build portals in user space
eeb [Fri, 23 May 2003 22:04:03 +0000 (22:04 +0000)]
* Added portals + lustre wire constant and struct size/offset assertions
* Added portals + lustre wirecheck programs to generate these assertions
on the reference platform.
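The idea behind such wire assertions can be sketched in Python with ctypes. This is only an illustration: the WireHeader struct, its field names, and its sizes are hypothetical and are not taken from the portals or lustre headers. The point is that sizes and offsets are pinned to constants recorded on a reference platform, so a layout change fails loudly.

```python
import ctypes


class WireHeader(ctypes.LittleEndianStructure):
    """Hypothetical wire struct; fields are illustrative, not Lustre's."""
    _fields_ = [
        ("magic", ctypes.c_uint32),
        ("version", ctypes.c_uint32),
        ("length", ctypes.c_uint64),
    ]


def check_wire_layout():
    """Assert that the in-memory layout matches the recorded wire layout."""
    # Total size as recorded on the (assumed) reference platform.
    assert ctypes.sizeof(WireHeader) == 16
    # Offset and size of every field; any drift trips an assertion.
    assert WireHeader.magic.offset == 0 and WireHeader.magic.size == 4
    assert WireHeader.version.offset == 4 and WireHeader.version.size == 4
    assert WireHeader.length.offset == 8 and WireHeader.length.size == 8
    return True
```

A wirecheck-style generator would emit the numeric constants above by inspecting the structs on the reference platform; here they are written by hand.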
meijia [Fri, 23 May 2003 10:01:12 +0000 (10:01 +0000)]
merge b_devel to b_eq since there are problems with old b_devel
DEVEL_EQ_UPDATE_PARENT_20030523 -> DEVEL_EQ_UPDATE_CHILD_20030523
adilger [Fri, 23 May 2003 07:03:13 +0000 (07:03 +0000)]
Merge b_devel -> b_orphan.
DEVEL_ORPHAN_UPDATE_PARENT_20030522 -> DEVEL_ORPHAN_UPDATE_PARENT_20030522
meijia [Wed, 21 May 2003 15:43:09 +0000 (15:43 +0000)]
merge b_devel to b_eq:
DEVEL_EQ_UPDATE_PARENT_20030521 -> DEVEL_EQ_UPDATE_CHILD_20030521
adilger [Wed, 21 May 2003 06:25:00 +0000 (06:25 +0000)]
Remove bogus local function prototypes that were causing me grief.
Fix header mess as necessary to be able to compile.
Most of MDS unlink cookie generation complete.
Remove some local variable declarations that are only used in debug messages
that appeared when compiling with CDEBUG() as a no-op (reduce stack usage).
mmex [Mon, 19 May 2003 17:07:44 +0000 (17:07 +0000)]
- Update kernel patches to work on hp4_pnnl6. Note that a new branch of
dev_read_only was made. dev_read_only_hp can no longer be used
because multiple other series (not the hp ones) now use it and regularly
update it in conflicting ways.
meijia [Mon, 19 May 2003 17:04:54 +0000 (17:04 +0000)]
merge b_devel into b_eq:
DEVEL_EQ_UPDATE_PARENT_20030519 -> DEVEL_EQ_UPDATE_CHILD_20030519
only resolves conflicts.
pschwan [Mon, 19 May 2003 06:29:23 +0000 (06:29 +0000)]
merge b_ad (cleaned up a bit) into b_devel; entirely 2.5 compat fixes
pschwan [Mon, 19 May 2003 05:19:54 +0000 (05:19 +0000)]
the chaos kernel can use the iopen-2.4.18.patch
pschwan [Mon, 19 May 2003 03:15:41 +0000 (03:15 +0000)]
land b_proto on b_devel:
- major rework of the rpc and bulk APIs, lots of new assertions and cleanups
- many rpc and bulk bugfixes, including bugs 937, 1057, 1072, 1162
- perform brw and getattr operations to multiple OSTs in parallel
- all bulks are tied to exactly one request; multiple requests can be
tied to a single request_set
pschwan [Sun, 18 May 2003 16:39:34 +0000 (16:39 +0000)]
landing b_open onto b_devel:
- kernel patches for bug 1203
- fixes directory open()s
- re-enables metadata caching on the MDS
- fixes bug in ext3_noread kernel patch
- fixes some symbol export problems in other kernels
- fixes compilation problems when running on Red Hat 2.4.20 kernels
If you update past this merge, you will require kernel version 18
braam [Sat, 17 May 2003 21:18:06 +0000 (21:18 +0000)]
- remainder of the b_devel into b_open merge.
thantry [Fri, 16 May 2003 05:55:52 +0000 (05:55 +0000)]
Added lprocfs counters for obd_ops stats, obdfilter and ptlrpc services. Bugzilla#1107.
adilger [Thu, 15 May 2003 17:01:48 +0000 (17:01 +0000)]
Merge b_devel (ORPHAN_BASE6 -> ORPHAN_BASE7) to
b_orphan (DEVEL_ORPHAN_UPDATE_CHILD_20030514).
pschwan [Wed, 14 May 2003 17:48:31 +0000 (17:48 +0000)]
Update the .pc to match the renamed patch
rread [Wed, 14 May 2003 17:44:08 +0000 (17:44 +0000)]
b=1046
update the config version.
rread [Wed, 14 May 2003 17:42:55 +0000 (17:42 +0000)]
b=1046
- add a new argument to upcalls. The first argument identifies the reason
for the upcall. The upcall script path is now /proc/sys/lustre/upcall, and the
config option has changed from --recovery_upcall to --lustre_upcall.
Current lustre upcall types and arguments:
FAILED_IMPORT tgt_uuid dev_uuid net_uuid
RECOVERY_OVER dev_uuid
- The config tools also support the new --portals_upcall, and both of the
upcalls can be set to the same script with the --upcall option.
There is only one portals upcall type:
LBUG filename function line
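A script shared by both upcalls has to dispatch on the first argument. The following is a minimal sketch of that dispatch, assuming the arguments arrive in the order listed above; the function name parse_upcall and the returned dict shape are hypothetical, not part of the config tools.

```python
# Argument layouts as listed in the commit message above.
UPCALL_LAYOUTS = {
    "FAILED_IMPORT": ("tgt_uuid", "dev_uuid", "net_uuid"),
    "RECOVERY_OVER": ("dev_uuid",),
    "LBUG": ("filename", "function", "line"),
}


def parse_upcall(args):
    """Turn upcall arguments (type first) into a dict of named fields.

    `args` is the argument list without the script name,
    e.g. sys.argv[1:] in a real upcall script.
    """
    if not args:
        raise ValueError("missing upcall type")
    kind, rest = args[0], list(args[1:])
    if kind not in UPCALL_LAYOUTS:
        raise ValueError("unknown upcall type: %s" % kind)
    fields = UPCALL_LAYOUTS[kind]
    if len(rest) != len(fields):
        raise ValueError("%s expects %d arguments, got %d"
                         % (kind, len(fields), len(rest)))
    event = dict(zip(fields, rest))
    event["type"] = kind
    return event
```

An upcall script could call parse_upcall(sys.argv[1:]) and branch on event["type"] to decide between failing out and failing over.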
alex_tomas [Tue, 13 May 2003 19:07:38 +0000 (19:07 +0000)]
- applying uml-2.4.20-4 to vanilla 2.4.20 makes kernel non-compilable for i386
alex_tomas [Tue, 13 May 2003 19:04:36 +0000 (19:04 +0000)]
- core UML patch
rread [Tue, 13 May 2003 00:46:08 +0000 (00:46 +0000)]
- add ptlrpc_dump_connections() for debugging ldlm cleanup problems
- cleanup the import's export ptlrpc_import_connect if the connect fails.
this prevents an obd_refcount clean on the client during a failed mount.
rread [Wed, 7 May 2003 23:39:57 +0000 (23:39 +0000)]
b=1215
r=shaver@clusterfs.com
- add obd_refcount, and use it to ensure the device is not being used
during cleanup. If --force is used, then cleanup will set a stopping
flag on the device, disconnect the exports, and then wait until
the device is unused.
- removed class_disconnect_all and moved/renamed forcibly_detatch_exports
to class_disconnect_exports.
- ptlrpc_import_disconnect now disconnects imp_export
- if --failover, then filter cleanup will not delete files marked
for delete on close.
- manage imp_active flag in ptlrpc_set_import_active
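The forced-cleanup sequence described for obd_refcount (set a stopping flag, disconnect the exports, wait until the device is unused) can be modelled in user space roughly as follows. This is a sketch of the pattern only: the Device class, its method names, and the timeout parameter are hypothetical, and the kernel code does not use Python threading.

```python
import threading


class Device:
    """Toy model of a refcounted device with a forced cleanup path."""

    def __init__(self):
        self._cond = threading.Condition()
        self.refcount = 0
        self.stopping = False
        self.exports = set()

    def get(self):
        """Take a reference; refused once the device is stopping."""
        with self._cond:
            if self.stopping:
                return False
            self.refcount += 1
            return True

    def put(self):
        """Drop a reference and wake any cleanup that is waiting."""
        with self._cond:
            self.refcount -= 1
            self._cond.notify_all()

    def cleanup(self, force=False, timeout=None):
        """Return True once the device is unused and cleaned up."""
        with self._cond:
            if self.refcount and not force:
                return False          # still in use; caller may retry
            if force:
                self.stopping = True  # 1. block new users
                self.exports.clear()  # 2. disconnect the exports
                # 3. wait until the existing users have gone away
                if not self._cond.wait_for(
                        lambda: self.refcount == 0, timeout):
                    return False
            return True
```

Without force, a busy device simply refuses cleanup; with force, new references are refused while the existing ones drain.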
rread [Wed, 7 May 2003 19:12:10 +0000 (19:12 +0000)]
b=1026
r=adilger@clusterfs.com
- add initial support for creating client connections on the MDS,
configs will need to be recreated. (The version has been updated
to force this.)
- create a new config object, "filesystem" which represents a
specific filesystem. In the zero-config future, the MDS will
need to be aware of what filesystems it "owns," so this is perhaps one
tiny step in that direction.
wangdi [Wed, 7 May 2003 11:14:29 +0000 (11:14 +0000)]
add config-linux-2.4.20-uml
wangdi [Wed, 7 May 2003 11:06:35 +0000 (11:06 +0000)]
add uml.spec
pschwan [Mon, 5 May 2003 04:29:49 +0000 (04:29 +0000)]
partial merge of b_open into b_devel:
- Lustre kernel patch version 16
- extN no longer built; ext3 patched as part of the kernel patch
- related changes to lconf/lmc, to load and mkfs ext3 instead of extN now
pschwan [Mon, 5 May 2003 03:32:49 +0000 (03:32 +0000)]
fixed -ESTALE bug in vfs_intent-2.4.18-18.patch (already fixed in other
vfs_intent patches)
wangdi [Sun, 4 May 2003 11:04:14 +0000 (11:04 +0000)]
remove exports_2.4.20-rh.pc, which can be replaced by exports_hp_2.4.20.pc
wangdi [Sun, 4 May 2003 11:01:41 +0000 (11:01 +0000)]
remove exports_2.4.20-rh.patch, which can be replaced by exports_hp_2.4.20.patch
wangdi [Sun, 4 May 2003 10:22:16 +0000 (10:22 +0000)]
add ext3-noread-2.4.20.patch
braam [Sun, 4 May 2003 05:04:40 +0000 (05:04 +0000)]
- fixes to b_open
- new UML patch 2.4.20-4 which works with Red Hat 9
- minor fixes to handling handles in open/close to avoid oopsing
- fix erroneous mds_mfd_put in mds_open
- fix the CVS tests added to sanity
- the tests exposed more problems in this branch, probably related to
the lack of a file handle cache. Only good parts of the branch will
be merged.
adilger [Sun, 4 May 2003 00:33:30 +0000 (00:33 +0000)]
Merge from b_devel to b_orphan (ORPHAN_BASE5 -> ORPHAN_BASE6).
braam [Sat, 3 May 2003 07:48:44 +0000 (07:48 +0000)]
- this is a reconstruction of b_kernel and a merge from current
b_devel (as of OPEN_BASE). Due to tag confusion we had to do this
by hand.
- DO NOT USE B_KERNEL, MAIL DIFF's TO PETER & PHIL
pschwan [Sat, 3 May 2003 06:00:59 +0000 (06:00 +0000)]
merge b_devel into b_proto
from DEVEL_PROTO_UPDATE_PARENT_20030502 tag on b_devel
to DEVEL_PROTO_UPDATE_CHILD_20030502 tag on b_proto
adilger [Fri, 2 May 2003 23:24:54 +0000 (23:24 +0000)]
Adding it to b_devel too.
pschwan [Sun, 27 Apr 2003 17:49:18 +0000 (17:49 +0000)]
b=1178
Preparations for removing unused "addr" from lustre_handle
pschwan [Fri, 25 Apr 2003 06:06:04 +0000 (06:06 +0000)]
- merge b_devel into b_proto
- simplify the check_set/set_wait logic a bit
adilger [Thu, 24 Apr 2003 08:28:30 +0000 (08:28 +0000)]
Merge from b_devel -> b_orphan (up to ORPHAN_BASE5 tag on b_devel)
rread [Thu, 24 Apr 2003 01:36:34 +0000 (01:36 +0000)]
b=1075
* added --failover option to lconf and lctl cleanup
* failover flag added to obd_disconnect and obd_cleanup
* if failover ==1, then MDS and OST will not update the client
state in last_rcvd.
* Also changes to lconf to add the --group support to MDS.
rread [Mon, 21 Apr 2003 20:00:03 +0000 (20:00 +0000)]
b=1075
If a reply comes back with -ENOTCONN, then this means that we've
either been evicted, or the device has been removed from the
server. First attempt to reconnect, then call the upcall and attempt
recovery.
shaver [Sun, 20 Apr 2003 16:55:04 +0000 (16:55 +0000)]
Fix build: ldlm_hooks_referenced wasn't defined or set.
pschwan [Sun, 20 Apr 2003 15:43:29 +0000 (15:43 +0000)]
b=1111
- add ptlrpc_ldlm_hooks_referenced(), so we can determine in
ldlm_cleanup if "force" is needed
- add a little bit more debug information to hunt connection leaks
pschwan [Sat, 19 Apr 2003 15:53:46 +0000 (15:53 +0000)]
Merge b_devel changes into b_proto
rread [Sat, 19 Apr 2003 01:07:24 +0000 (01:07 +0000)]
b=1075
The config part of failback. Lconf now supports --group for setup and
cleanup. If --group is used, lconf will only setup/cleanup the
devices that are in the group AND are currently active on this node. Lconf
will still setup the modules needed by the devices (networking, routing),
but cleanup will leave these untouched if there are still other devices
left after cleaning up the group.
* small fix to enable config version support in ldap
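The --group selection rule can be sketched as a filter over device records. This is an assumption-laden illustration: the dict keys (name, group, active_node) and both function names are hypothetical and do not reflect lconf's actual data model.

```python
def select_devices(devices, node, group=None):
    """Devices to set up or clean up on `node`.

    With a group, only devices that are in the group AND currently
    active on this node are selected; without one, all devices
    active on the node are.
    """
    return [d for d in devices
            if d["active_node"] == node
            and (group is None or d["group"] == group)]


def can_unload_modules(devices, node, group=None):
    """True if no other device would remain on `node` after cleanup.

    Networking and routing modules are left untouched while other
    devices on the node still need them.
    """
    selected = select_devices(devices, node, group)
    remaining = [d for d in devices
                 if d["active_node"] == node and d not in selected]
    return not remaining
```

For example, cleaning up one group on a node that also hosts a device from another group selects only the first group's devices and reports that the modules must stay loaded.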
rread [Tue, 15 Apr 2003 21:46:42 +0000 (21:46 +0000)]
- add version to config, so lconf can make sure the xml/ldap was created
by a compatible lmc.
- recreate clients if remounting
meijia [Sun, 13 Apr 2003 14:52:58 +0000 (14:52 +0000)]
[liblustre]:
- initial symlink support
meijia [Sun, 13 Apr 2003 13:56:51 +0000 (13:56 +0000)]
[liblustre]:
- add support for mkdir
meijia [Sun, 13 Apr 2003 01:45:05 +0000 (01:45 +0000)]
[liblustre]:
- support setattr, but currently no locking is involved, so this
contains an ugly workaround that may be totally wrong.
pschwan [Sat, 12 Apr 2003 20:07:45 +0000 (20:07 +0000)]
- merge b_devel into b_proto (incl b_swab changes)
- fix leak-all-bulk-pages bug
- Mike fixed a bug where a failed ptlrpc_queue_wait send buf would
fail, but the user would still have to wait for the timeout to pass
before being able to interrupt
pschwan [Sat, 12 Apr 2003 17:42:08 +0000 (17:42 +0000)]
b=1111
- Pass 'force' down to all of the cleanup methods, instead of just
having special meaning for exports
- in ldlm_cleanup, if forcing, remove the ldlm hooks from ptlrpc so we
can unload the ldlm module
rread [Fri, 11 Apr 2003 19:27:59 +0000 (19:27 +0000)]
b=1084
* add a --group=tag option to mds and ost devices, so related devices
can be failed over together.
* rewrite lactive to fail over a specific device group, instead of all
devices on a node.
meijia [Fri, 11 Apr 2003 07:48:34 +0000 (07:48 +0000)]
[liblustre]:
- do proper cleanup when an inode is about to be destroyed
meijia [Fri, 11 Apr 2003 07:12:39 +0000 (07:12 +0000)]
[liblustre]:
- add inode operation close()
meijia [Fri, 11 Apr 2003 06:12:54 +0000 (06:12 +0000)]
[liblustre]:
- actually retrieve stripe ea data from server
- fix logic in name resolution, save fid for each file
nfshp [Thu, 10 Apr 2003 02:07:12 +0000 (02:07 +0000)]
[liblustre]:
- add a fake ioctl which could dump mount options to file
- sysio_mount uses UUIDs from the mount options
- minor fix for pnode count
nfshp [Wed, 9 Apr 2003 09:43:19 +0000 (09:43 +0000)]
[liblustre]: now we don't need the syscall workaround in libsysio anymore
nfshp [Wed, 9 Apr 2003 08:43:21 +0000 (08:43 +0000)]
[liblustre]: following the recent libsysio changes.
rread [Wed, 9 Apr 2003 00:20:49 +0000 (00:20 +0000)]
fix cut and paste braino
nfshp [Tue, 8 Apr 2003 11:33:54 +0000 (11:33 +0000)]
[liblustre]: must feed NULL-terminated string to mdc_getattr_name()
nfshp [Tue, 8 Apr 2003 10:22:09 +0000 (10:22 +0000)]
land b_cray on b_devel
eeb [Tue, 8 Apr 2003 00:32:50 +0000 (00:32 +0000)]
* Landing b_swab
nfshp [Mon, 7 Apr 2003 07:21:35 +0000 (07:21 +0000)]
merge b_devel into b_cray
nfshp [Fri, 4 Apr 2003 16:16:39 +0000 (16:16 +0000)]
merge b_devel into b_cray
shaver [Thu, 3 Apr 2003 21:20:53 +0000 (21:20 +0000)]
- set_osc_active becomes ptlrpc_set_import_active (though it still only works on
OSCs) and is automatically called for non-recoverable OSCs before we invoke the
upcall. (part of b=1038)
- When aborting recovery, shut down the recovery timer, abort the recovery
queue's requests, and don't do it twice.
- Make sure we call obd_disconnect correctly for all non-lctl imports when
aborting recovery or forcing shutdown, so that we don't leave the last_rcvd bits
around for the next time.
- Fix leak of last_rcvd-spawn exports (every one!).
- Get and put exports for recovery-queued requests and replies, so that
we don't race badly with teardown.
rread [Thu, 3 Apr 2003 18:02:21 +0000 (18:02 +0000)]
b=1089
Python does not automatically convert strings to integers when it might
seem appropriate.
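The pitfall named above is easy to reproduce. A minimal sketch, assuming a numeric value that arrives as a string (as values read from XML or the command line do); the function names are hypothetical and this is not the code that was fixed.

```python
def add_one(value):
    """Convert explicitly before doing arithmetic."""
    return int(value) + 1


def add_one_unconverted(value):
    """Raises TypeError when `value` is a string such as "41"."""
    return value + 1


def same_number(a, b):
    """Compare as integers; "10" == 10 is False without conversion."""
    return int(a) == int(b)
```

Comparisons are the quieter failure: a string never equals an integer, so an unconverted check silently takes the wrong branch instead of raising.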
pschwan [Wed, 2 Apr 2003 03:56:25 +0000 (03:56 +0000)]
b=957
- Infrastructure for the pinger thread; needs multi-rpc and some refinements
- Add last_request_time to obd_export
- Fixed export leak in lov_brw success case
- test22 rears its head again: tar gets pissy if it can't access $PWD,
which is common for homedirs at LLNL. cd to /tmp before running tar.
Merged many of the changes from b_proto:
- multi-rpc infrastructure, but not the key bits
- removed some bitfield flags in favour of individual struct members
- merged obd_brw_write and obd_brw_read into a single obd_brw_internal
- move the Elan-related page wrangling from the IOD into the OSC
amrutjoshi [Tue, 1 Apr 2003 08:08:17 +0000 (08:08 +0000)]
adding cmdline.py to pymod_SCRIPTS
amrutjoshi [Tue, 1 Apr 2003 07:38:03 +0000 (07:38 +0000)]
adding scripts to EXTRA_DIST.
pschwan [Mon, 31 Mar 2003 18:33:36 +0000 (18:33 +0000)]
landing b_recovery on b_devel
* bug fixes
- imports and exports cleanup too early, need refcounts (349, 879, 1045)
- per-import/export recovery handling (958, 931, 959)
- multiple last-rcvd slots, for serving multiple FSes (949)
- connections are again shared between multiple imp/exports (963, 964)
- "umount -f" would hang if any requests needed to be sent (393, 978)
- avoid pinning large req buffer by copying for queued messages (989)
- add "uuid" to "lctl device" command to help upcalls (991)
- "open" RPCs with transnos would confuse recovery counters (1037)
* major user-visible changes
- fail out/fail over policy now controlled by the upcall (993)
* protocol changes
- add OBD_PING to check server availability and failure (954)
pschwan [Mon, 31 Mar 2003 18:05:30 +0000 (18:05 +0000)]
last merge of b_devel into b_recovery
shaver [Wed, 26 Mar 2003 07:09:48 +0000 (07:09 +0000)]
Merge from b_devel.
shaver [Wed, 19 Mar 2003 18:32:17 +0000 (18:32 +0000)]
Merge b_devel into b_recovery.
pengzhao [Tue, 11 Mar 2003 18:10:12 +0000 (18:10 +0000)]
Bug828 fix. Approved by Andreas.
zab [Tue, 11 Mar 2003 02:34:52 +0000 (02:34 +0000)]
- purge the write cache before truncate stomps on the local i_size so that we
don't zero-extend the file when truncate's lock acquisition purges the
writeback after having updated the i_size. I'm not sure how 2.5 will do this,
but it probably needs to.
shaver [Sun, 9 Mar 2003 21:29:15 +0000 (21:29 +0000)]
Merge from b_devel, mainly to get the fixes for 977/978/722.