Whamcloud - gitweb
shaver [Thu, 26 Jun 2003 20:20:01 +0000 (20:20 +0000)]
adding otree files (not yet used)
mmex [Thu, 26 Jun 2003 19:37:36 +0000 (19:37 +0000)]
Script to test ls speed on a single host using two mount points.
Not a great test but can give basic information on ls speed issues.
pschwan [Thu, 26 Jun 2003 17:49:35 +0000 (17:49 +0000)]
- Many unused variable warning fixes (several of which were not really a
problem, I think)
- list_splice_init() isn't defined in 2.4.18
- FSTYPE change for lov.sh
pschwan [Wed, 25 Jun 2003 22:38:32 +0000 (22:38 +0000)]
- fixes the maximum OST packet size, a lot of list management, and a
couple other bugs
- starts threads at OST and MDS startup
- able to complete a round of create/setattr/cancel if SENDNOW is set
- some corruption happens when I ask it to build up and cancel whole
pages; I need to run on multiple systems to start narrowing down where.
meijia [Wed, 25 Jun 2003 05:09:24 +0000 (05:09 +0000)]
[liblustre]: add back the script
adilger [Tue, 24 Jun 2003 23:34:21 +0000 (23:34 +0000)]
Fix for bug 1466, delete thread locking up machine during orphan recovery.
adilger [Tue, 24 Jun 2003 00:04:20 +0000 (00:04 +0000)]
Simple utime testing script, as I can't concentrate on anything more complex.
adilger [Mon, 23 Jun 2003 18:20:54 +0000 (18:20 +0000)]
Use a better variable name than "i" for the open file descriptor.
meijia [Mon, 23 Jun 2003 11:56:54 +0000 (11:56 +0000)]
[liblustre]:
- treat an closed inode as invalid.
- reap inode right after unlink/rmdir and when found stale ones.
braam [Sun, 22 Jun 2003 21:12:01 +0000 (21:12 +0000)]
add patches for 2.5.72. Minor Portals miscompilation still needs a
fix, but I cannot do that now I think.
braam [Sat, 21 Jun 2003 20:34:46 +0000 (20:34 +0000)]
many changes to the intent patch. Most of sanity runs now.
behlendo [Fri, 20 Jun 2003 23:57:46 +0000 (23:57 +0000)]
- Ensure stdout is flushed after each new line, this makes it possible to
redirect the resulting output to a file.
- %Lu doesn't format properly, changed to %lu
girish_c [Fri, 20 Jun 2003 22:43:18 +0000 (22:43 +0000)]
Fixes Connectathon bigfile(30 MB)test bug
pschwan [Fri, 20 Jun 2003 16:54:24 +0000 (16:54 +0000)]
b_orphan builds again; now for a some testing
eeb [Wed, 18 Jun 2003 22:08:38 +0000 (22:08 +0000)]
* 1439 fix
adilger [Wed, 18 Jun 2003 18:10:06 +0000 (18:10 +0000)]
Fix patches that were eaten by patch scripts.
girish_c [Wed, 18 Jun 2003 16:39:43 +0000 (16:39 +0000)]
NFS export patch for lustre
1) Patch is updated with review comments
2) All Connectathon tests passes except bigfile(30 MB)test
adilger [Wed, 18 Jun 2003 09:07:19 +0000 (09:07 +0000)]
Fix delete thread so that it doesn't sleep uninterruptibly and raise load avg.
Do not start delete thread if asyncdel mount option is not given.
pschwan [Tue, 17 Jun 2003 09:58:09 +0000 (09:58 +0000)]
haven't even tried to build, but getting closer:
- move the recovd_thread to ptlrpc, because it uses ptlrpc functions
- have one global llog_commit_master, in ptlrpc
meijia [Tue, 17 Jun 2003 03:12:53 +0000 (03:12 +0000)]
[liblustre]: add script to generate libraries
zab [Sat, 14 Jun 2003 20:45:52 +0000 (20:45 +0000)]
These patches bring mcore style dumping to 2.5 via lkcd. Suparana did this
work a while back and committed it to lkcd. She describes the
functionality in http://lwn.net/Articles/22085/ before it was committed.
The patches aren't added to a series/pc quite yet as they're against the
stand-alone 2.5.69 tree. But they give an idea of what is required and
needed to be recorded somewhere.
coopcoopbware [Fri, 13 Jun 2003 23:25:01 +0000 (23:25 +0000)]
b=1383
r=robert
- a less invasive way of propagating OBD_MAX_IOCTL_BUFFER. Still set by passing the --with-obd-buffer-size=[size] arg to configure
eeb [Fri, 13 Jun 2003 13:21:58 +0000 (13:21 +0000)]
* Merged b_devel changes into b_eq (again). b_eq and b_devel have same
acceptance-small symptoms viz....
1/ Need to sleep before lconf -d on llite client to ensure all
connection refcounts are 0 (llmountcleanup.sh on b_eq has a 10 second
sleep).
2/ When running dbench 50, the OST complains about long-running
obdfilter ops. Eventually the MDS or OST will crash/hang.
adilger [Fri, 13 Jun 2003 08:39:50 +0000 (08:39 +0000)]
Script from Evan (with tweaks) for loading modules into crash, if the
"mod -S" command does not work (e.g. if the modules have been moved).
meijia [Fri, 13 Jun 2003 08:21:44 +0000 (08:21 +0000)]
[liblustre]: add back missing files, update test code
meijia [Fri, 13 Jun 2003 07:30:09 +0000 (07:30 +0000)]
[liblustre]: again add back liblustre.c
zab [Fri, 13 Jun 2003 05:36:50 +0000 (05:36 +0000)]
Stabalize the clump paths by clamping the number of pending sets and
cleaning up the sleeping logic. Also tie in ll_readpage. Now to start
profiling and tuning this..
- record the rw mode in llwp so completion knows what to do with the pages
- introduce a simple little 'page list' data structure that uses page->private
to chain pages off llis in fifo order
- have both read and write plists in the lli
- get rid of the old unused work_pending member
- always put the lli back on the list in _give_plist so that we don't
lose llis
- introduce a 'new_arrival' member so lliod can only come out of l_wait_event
when new material arrives
- pass rc into llwp completion so we can make use of it someday
- move the concurrent set checks into the set building loop, duh
- add some debugging here and there
freym [Thu, 12 Jun 2003 23:03:25 +0000 (23:03 +0000)]
checking statfs cache expiration in obd_statfs
adilger [Thu, 12 Jun 2003 18:46:01 +0000 (18:46 +0000)]
Change ia64 to compile with -g instead of -gstabs now that we are using crash.
eeb [Thu, 12 Jun 2003 17:34:04 +0000 (17:34 +0000)]
* Merged in b_devel again
girish_c [Thu, 12 Jun 2003 14:43:39 +0000 (14:43 +0000)]
updated lstat utility
Changes:
1. Summary info for client lproc tree
2. added "--all" option to print the contents of lproc
meijia [Thu, 12 Jun 2003 09:36:40 +0000 (09:36 +0000)]
- fix compilation of liblustre, not test yet.
pschwan [Thu, 12 Jun 2003 07:28:57 +0000 (07:28 +0000)]
more extN cleanup
adilger [Thu, 12 Jun 2003 05:38:42 +0000 (05:38 +0000)]
Remove use of local stack variables (size, ptr) for {PORTAL,OBD}_{ALLOC,FREE}().
I think these were intended to avoid doing multiple evals of math/function
calls for the size, but all of our usage so far is passing in a constant
or a variable that has already computed the size.
By reordering the *_FREE() macros slightly, we do not need to store the size
of the free in a local stack variable either. We have to be careful though,
because "size" may be part of the struct we are poisoning/freeing.
thantry [Wed, 11 Jun 2003 22:01:15 +0000 (22:01 +0000)]
Bug#1380-added to read /proc/fs/lustre*/stat in obdecho, obdfilter, llite
eeb [Wed, 11 Jun 2003 18:04:11 +0000 (18:04 +0000)]
* Merged latest b_devel
* **** NOT TESTED ****
adilger [Wed, 11 Jun 2003 11:06:31 +0000 (11:06 +0000)]
Disable zerocopy by default while kernel symbols get straightened out.
adilger [Tue, 10 Jun 2003 08:58:55 +0000 (08:58 +0000)]
Use GFP_NOFS in portals, so that we don't call into the filesystem to try
and free memory when we are trying to send an RPC for a write (which is
trying to free memory). With this change, we can now run dbench and iozone
on a single UML again. Of importance is the addition of sk->allocation to
socknal.
adilger [Tue, 10 Jun 2003 03:45:11 +0000 (03:45 +0000)]
Fix the socknal TCP zero-copy detection. "zercopy" does not a SOCKNAL_ZC make.
wangdi [Mon, 9 Jun 2003 08:12:57 +0000 (08:12 +0000)]
remove 2.4.19 hp patch
wangdi [Mon, 9 Jun 2003 07:53:43 +0000 (07:53 +0000)]
remove 2.4.19 hp patch
adilger [Sun, 8 Jun 2003 20:57:44 +0000 (20:57 +0000)]
Add fixed from MDS for bug 1403 to obdfilter also.
Make mds_client_add() and filter_client_add() more closely resemble each
other (one day we may want to move the last_rcvd handling from mds/obdfilter
into some common code in obdclass or something).
adilger [Fri, 6 Jun 2003 21:43:36 +0000 (21:43 +0000)]
Fix compile warnings on ia64 (especially the many, many in wirecheck.c).
adilger [Fri, 6 Jun 2003 21:34:09 +0000 (21:34 +0000)]
Fix lock ordering deadlock on the MDS when clients are connecting at the
same time that regular MDS operations are ongoing.
adilger [Fri, 6 Jun 2003 21:08:04 +0000 (21:08 +0000)]
Fix braces around math, so GCC doesn't complain on ia64.
pschwan [Fri, 6 Jun 2003 19:09:42 +0000 (19:09 +0000)]
b=1228
More debugging: pass the real inode down from ll_clear_inode, so we
can determine in cancel_unused whether the lock's data pointer is wrong
adilger [Fri, 6 Jun 2003 17:49:55 +0000 (17:49 +0000)]
Go back to STACK_SIZE - frame_address() now that we use __builtin_dwarf_cfa().
eeb [Fri, 6 Jun 2003 16:02:50 +0000 (16:02 +0000)]
* fixes for multinet
braam [Fri, 6 Jun 2003 07:42:46 +0000 (07:42 +0000)]
- This brings 2.5 forward to pass most, but not all of sanity.
A few problems remain with dangling symlinks.
- Mostly this is a fairly dramatic simplification of many issues,
maybe not possible like this in 2.4.
adilger [Thu, 5 Jun 2003 23:57:13 +0000 (23:57 +0000)]
Quiet compiler warnings in portals. Basically, just pass void parameters to
match the function pointer declarations and cast back to the original type
inside the function.
adilger [Thu, 5 Jun 2003 23:31:29 +0000 (23:31 +0000)]
Allow runvmstat to log to a file for perf monitoring.
Add a similar logging obdstat script also.
eeb [Thu, 5 Jun 2003 17:39:43 +0000 (17:39 +0000)]
* Added (cluster-wide) NID offset capability to qswnal
* Made lctl::mynid accept NIDs specified as a hostname string, a.b.c.d IP
address, %i and %x (just like the routing commands). This command now
works with the qswnal, to tell it its cluster-wide NID offset. Temp
hack in lconf to actually do this function when network is 'elan' as
well as 'tcp' or 'toe'.
* Added lctl::shownid (no args) to print the NID
rread [Thu, 5 Jun 2003 04:12:56 +0000 (04:12 +0000)]
merge from b_devel -> b_ad
DEVEL_AD_PARENT_20030526 -> DEVEL_AD_PARENT_20030605
Large, squirrelly merge. It mounts, and acceptance_small.sh finishes
the test and then fails in cleanup, just like b_devel.
zab [Wed, 4 Jun 2003 23:40:35 +0000 (23:40 +0000)]
plug into ->writepages so the vm can send us batches of pages for writeback.
In the process we make ->writepage non-blocking and hand writeback off to a
single worker thread.
we build a singly linked list of pages off of lli as writepage receives them.
when our writepage or writepages decides it wants to start io it puts the lli
into a list for the writeback thread (lliod) and wakes it.
- make llwp allocing and freeing a little more regular
-
- introduce lliod, putting its control state in the super block.
- ad ll_info2i() so that lliod can find the inode from the lli
- abstract out what 2.4 and 2.5 have to do to end writeback on a page
- consolidate initialization of lli into one place
- move set_page_clean to above the only place it was used
pschwan [Wed, 4 Jun 2003 23:39:18 +0000 (23:39 +0000)]
__arch_ia64__ is apparently not set, so we were hitting the gcc bug
and eating enormous amounts of stack on ia64. Changed to __ia64__
rread [Wed, 4 Jun 2003 22:27:19 +0000 (22:27 +0000)]
make the portals' definition of USERMODEHELPER blocking, too.
rread [Wed, 4 Jun 2003 22:13:21 +0000 (22:13 +0000)]
b=1400 timeout causes crash.
vfs_fstat should call intent_release before fput, and only if f != NULL
wangdi [Wed, 4 Jun 2003 03:05:19 +0000 (03:05 +0000)]
add ext3-2.4-ino_t-2.4.19.pc
wangdi [Wed, 4 Jun 2003 03:01:16 +0000 (03:01 +0000)]
add ext3-2.4-ino_t-2.4.19.patch
wangdi [Wed, 4 Jun 2003 02:56:44 +0000 (02:56 +0000)]
fix a flaw in iopen-2.4.19
wangdi [Wed, 4 Jun 2003 02:41:44 +0000 (02:41 +0000)]
add ext3-orhan_lock-2.4.19.pc
wangdi [Wed, 4 Jun 2003 02:39:38 +0000 (02:39 +0000)]
add ext3-orphan_lock-2.4.19.patch
wangdi [Wed, 4 Jun 2003 02:34:25 +0000 (02:34 +0000)]
add ext3-delete_thread-2.4.19.pc
wangdi [Wed, 4 Jun 2003 02:32:35 +0000 (02:32 +0000)]
add ext3-delete_thread-2.4.19.patch
adilger [Wed, 4 Jun 2003 02:05:52 +0000 (02:05 +0000)]
Put an obd_ucred struct in the mds_update_record struct so we can pass it
directly to push_ctxt(). This saves allocating an additional obd_run_ctxt
struct on the stack in mds_reint_rec() (in one of the major stack using paths).
adilger [Wed, 4 Jun 2003 00:00:30 +0000 (00:00 +0000)]
Make LBUG() just a special case of LBUG_WITH_LOC().
adilger [Tue, 3 Jun 2003 23:27:01 +0000 (23:27 +0000)]
Remove bogus extra arg from delete thread warning message.
adilger [Tue, 3 Jun 2003 22:52:06 +0000 (22:52 +0000)]
Include the EXT3_MOUNT_ASYNCDEL addition from ext3-delete-thread into the
context of the iopen patch (as it is in the 2.4.20 version), otherwise the
iopen patch fails to apply.
braam [Tue, 3 Jun 2003 13:32:26 +0000 (13:32 +0000)]
- intent fixes for exec.c
- it->disposition status fix (should be backported, sent email to
it->phil)
- pick up Alex sunrpc fixes.
- redo ll_follow_link for 2.5
alex_tomas [Tue, 3 Jun 2003 10:55:06 +0000 (10:55 +0000)]
- net/sunrpc/rpc_pipe.c didn't pass 3rd argument to lookup_hash()
adilger [Tue, 3 Jun 2003 07:51:16 +0000 (07:51 +0000)]
Implement cancel thread:
- we only start two threads to begin with
- new threads are started if it is possible that all threads will be blocked,
up to a predefined maximum number of threads (should be ~num_osts)
- cookie pages are kept in a pool (refilled by the threads normally), but if
the pool is empty and we need a page we try a GFP_ATOMIC allocation
- donot block the cookie commit callback when sending log cancels
Todo
- start up first commit callback thread on the MDS
- figure out where to stash llog_commit_parent pointer (ideally in MDS, since
it is the one who will be allocating it and starting the thread, but we
need it when sending the cancel cookies). Parameter to obd_log_cancel?
- clean up all commit callback threads at shutdown
- emulate "no recovery" behaviour of osc_brw_prep_request()/osc_brw_internal()
so the cancel thread can detect OST failures and start MDS<->OST recovery
wangdi [Tue, 3 Jun 2003 01:34:31 +0000 (01:34 +0000)]
add ext3-delete_thread-2.4.18.pc
wangdi [Tue, 3 Jun 2003 01:31:44 +0000 (01:31 +0000)]
add ext3-delete_thread-2.4.18.patch
rread [Mon, 2 Jun 2003 17:25:56 +0000 (17:25 +0000)]
add --force to cleanup in the clumanager service script.
i think that is also necessary now.
braam [Mon, 2 Jun 2003 09:33:25 +0000 (09:33 +0000)]
- many further changes to clean up intent handling
- indicate clearly to the FS when the last component is hit in
lookup with an nd flag LOOKUP_TAIL
- change prototype of lookup_it and revalidate_it to have struct
nameidata * for this reason.
- upshot: createmany works, finally!
wangdi [Mon, 2 Jun 2003 09:26:14 +0000 (09:26 +0000)]
remove config-2.4.18-linux-2.4.18-p4smp-52chaos
wangdi [Mon, 2 Jun 2003 09:25:30 +0000 (09:25 +0000)]
remove config-2.4.18-linux-2.4.18-i386-52chaos
wangdi [Mon, 2 Jun 2003 09:09:34 +0000 (09:09 +0000)]
add config-files
pschwan [Mon, 2 Jun 2003 06:28:04 +0000 (06:28 +0000)]
update invalidate_show again for 2.4.20-9; 2.4.20-8 is no longer supported.
braam [Mon, 2 Jun 2003 05:05:44 +0000 (05:05 +0000)]
- more 2.5 cleanups:
- initialization function for intents
- remove d_it
- clean up llite accordingly
- multiple mcreates now work
pschwan [Mon, 2 Jun 2003 04:02:12 +0000 (04:02 +0000)]
- print the error code if portals_do_debug_dumplog fails
- demote "mmap readpage - check locks" message to a CDEBUG
- don't try to set mtime on OSTs if there's no LSM (file not yet opened)
- print which client (nid) returned an error from a blocking AST
pschwan [Sun, 1 Jun 2003 18:49:20 +0000 (18:49 +0000)]
b=1384
r=braam
Clear supplementary groups in push_ctxt (restore in pop_ctxt), to
avoid inheriting all supp. groups for all operations.
braam [Sun, 1 Jun 2003 07:20:07 +0000 (07:20 +0000)]
- remove unused references to d_it etc.
- make fixes for getattr and setattr methods
braam [Sun, 1 Jun 2003 02:26:47 +0000 (02:26 +0000)]
- Makefile for tests
zab [Sun, 1 Jun 2003 01:33:20 +0000 (01:33 +0000)]
- NULL d_it in the 2.5 LL_SAVE_INTENT macro
- get rid of the early ll_intent_release EXIT, it should be fine now that
d_it is always null..
- bring the 'called from open' argument to 2.5's do_truncate so mdc_reint
and mds_reint don't deadlock on O_TRUNC
thantry [Fri, 30 May 2003 16:51:00 +0000 (16:51 +0000)]
Bug1275 - enhance debug_daemon to dump out previouse log, as debug_daemon, on starts up
braam [Fri, 30 May 2003 10:23:54 +0000 (10:23 +0000)]
- retore -O2, it really doesn't work without it.
braam [Fri, 30 May 2003 09:35:57 +0000 (09:35 +0000)]
- fixes to yesterday morning's check in: fix broken vfs_create_it
shaver [Thu, 29 May 2003 21:30:14 +0000 (21:30 +0000)]
b=1378: (bizarro-twin of 1272, also DBA 1366) --force again means "don't talk to
the servers" when shutting down, regardless of recovery state. (This
will go back to a try-once mode after the pinger appears.)
braam [Thu, 29 May 2003 14:40:10 +0000 (14:40 +0000)]
- fixes for open, it _may_ just work now, but I suspect that we also need to
give the open method itself a nameidata as a parameter to pass the intent
(we no longer save intents in 2.5)
- some fixes to the patches: the scripts have a problem, poppatch works too
easily.
- Zach over to you for the night / day shift!
meijia [Thu, 29 May 2003 08:56:47 +0000 (08:56 +0000)]
[liblustre]:
- various fixes, mostly related to intent. Now iozone and IOR2 could
run again.
braam [Thu, 29 May 2003 08:47:13 +0000 (08:47 +0000)]
- add new patch for the VFS race conditions we are fixing.
braam [Thu, 29 May 2003 08:43:55 +0000 (08:43 +0000)]
- continue to reorganize the patches
wangdi [Thu, 29 May 2003 06:56:22 +0000 (06:56 +0000)]
add tcp-zero-copy patch
wangdi [Thu, 29 May 2003 06:19:59 +0000 (06:19 +0000)]
add tcp-zero-copy patch
wangdi [Thu, 29 May 2003 06:11:18 +0000 (06:11 +0000)]
rename tcp-zero-copy-2.4.18.patch tcp-zero-copy.patch
wangdi [Thu, 29 May 2003 06:07:14 +0000 (06:07 +0000)]
fix a flaw in invalidate_show-2.4.20-rh.patch
wangdi [Thu, 29 May 2003 02:52:26 +0000 (02:52 +0000)]
add include to extra_dist
shaver [Thu, 29 May 2003 00:15:32 +0000 (00:15 +0000)]
More fixing for recovery-small/b=1300:
- fix type of o_destroy_export
- getattr reconstruction can't rely on the mcd or exp_outstanding_reply,
because getattrs no longer have transnos. Happily, they don't need to.
- reconstruct_open doesn't need to worry about the lock enqueued on the
client's behalf, because we already fix all that up in
fixup_handle_for_resent_req -- and it really doesn't need to LBUG.
Fix for MCR's MDS-side recovery LBUG today: pack a reply to send when aborting
queued recovery requests at timer-expiry.
When we fail an LASSERT, call the LBUG upcall with the location of the
assertion, not the constant-and-useless location of kportal_assertion_failed.
shaver [Wed, 28 May 2003 21:17:27 +0000 (21:17 +0000)]
b=1300: Factor disconnect into disconnect and destroy_export, so that we can
delay freeing of export-specific data until all in-play requests have
finished with it.
Removed an assertion in ptlrpc_abort_reply: if we just skip sending the reply,
it's OK for rq_want_ack to still be 1 when we fail to unlink the MD.
A little bit of debugging for the waiting-lock list.
$FSTYPE for recovery-small.sh.