Whamcloud - gitweb
pjkirner [Thu, 22 Sep 2005 03:23:27 +0000 (03:23 +0000)]
* Fixed asserts
pjkirner [Thu, 22 Sep 2005 03:07:33 +0000 (03:07 +0000)]
* Additional checks for correct locking
pjkirner [Thu, 22 Sep 2005 03:04:31 +0000 (03:04 +0000)]
* Add support for irqs_disabled() on all platforms
pjkirner [Wed, 21 Sep 2005 21:19:21 +0000 (21:19 +0000)]
* Add additional asserts to check caller context
pjkirner [Wed, 21 Sep 2005 20:41:14 +0000 (20:41 +0000)]
* Cleaned up send path
* Fixed lock being held while calling lnet_finalize (only in error path)
* Fixed calls to kptllnd_peer_queue_tx_locked(), to actually hold the lock.
eeb [Wed, 21 Sep 2005 18:45:40 +0000 (18:45 +0000)]
* Moved some #defines out of klnds/ptllnd.h into a shared place
pjkirner [Wed, 21 Sep 2005 18:31:25 +0000 (18:31 +0000)]
* Adjusted TX Pool to meet new LNET requirements (no blocking)
* Fixed bug with routed REPLY
pjkirner [Wed, 21 Sep 2005 18:15:26 +0000 (18:15 +0000)]
* Fix a typo
eeb [Wed, 21 Sep 2005 17:45:46 +0000 (17:45 +0000)]
* Fixed LND registration / setting the_lnet.ln_init ordering
eeb [Wed, 21 Sep 2005 16:54:30 +0000 (16:54 +0000)]
* Added lnet/ulnds/ptllnd
* Changed userspace LNET to register all LNDs it has been linked with
and construct the default set of networks from them.
* Added support for userspace LNET config via environment variables
LNET_NETWORKS and LNET_ROUTES. They work just like the kernel module
parameters.
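
As a rough illustration of the environment-variable configuration described above, a userspace program could look the two variables up like this. This is a minimal sketch, not the LNET code itself: the helper name `lnet_env_or_default` and the fallback behaviour are assumptions; only the variable names LNET_NETWORKS and LNET_ROUTES come from the commit message.

```c
#include <stdlib.h>

/* Hypothetical helper (not from the LNET source): return the value of the
 * environment variable 'name', or 'fallback' if it is unset or empty. */
static const char *
lnet_env_or_default(const char *name, const char *fallback)
{
        const char *value = getenv(name);

        if (value == NULL || value[0] == '\0')
                return fallback;
        return value;
}
```

A caller would invoke it once per variable, for example `lnet_env_or_default("LNET_NETWORKS", NULL)`, and treat a NULL result as "use the default set of networks".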
pjkirner [Wed, 21 Sep 2005 14:30:36 +0000 (14:30 +0000)]
* Fixed dist problem with ptllnd_wire.h file that moved
* Fixed dist problem with missing file in libcfs/linux
eeb [Wed, 21 Sep 2005 08:21:08 +0000 (08:21 +0000)]
* fixed a ./config warning
* moved klnds/ptllnd/ptllnd_wire.h into lnet/include/lnet/ so
ulnds/ptllnd can get at it.
* fixed/added some .cvsignore files
* fixed ulnds/tcplnd to give itself some send credits
pjkirner [Wed, 21 Sep 2005 05:13:14 +0000 (05:13 +0000)]
* Fixed some XT3 Compilation issues
* Clarified a few LNET vs PTL ambiguities
* Added code for mapping kiov -> iovec
pjkirner [Wed, 21 Sep 2005 02:08:05 +0000 (02:08 +0000)]
b=7982
* Portals LND
pjkirner [Wed, 21 Sep 2005 00:38:08 +0000 (00:38 +0000)]
* Undoing changes from the b_newconfig_rdmarouting landing that have negatively affected the ptlrpc build.
pjkirner [Tue, 20 Sep 2005 18:10:37 +0000 (18:10 +0000)]
* Fix buffalo build error with removed file.
pjkirner [Tue, 20 Sep 2005 17:43:10 +0000 (17:43 +0000)]
* Removed problematic building of ut (unit test tool) so we can move ahead with buffalo testing.
eeb [Tue, 20 Sep 2005 17:19:08 +0000 (17:19 +0000)]
* removed lnet_parse() rc ambiguity: 0 on success
pjkirner [Tue, 20 Sep 2005 17:00:04 +0000 (17:00 +0000)]
b=7981
* Landing of b_newconfig_rdmarouting
* Passed sanity.sh
* 9348 is still open, but this landing hasn't introduced it.
pjkirner [Mon, 19 Sep 2005 21:56:31 +0000 (21:56 +0000)]
* Fix incorrect path in ogdb file
pjkirner [Mon, 19 Sep 2005 20:41:31 +0000 (20:41 +0000)]
* Fixed problem with distributed build.
pjkirner [Mon, 19 Sep 2005 13:50:41 +0000 (13:50 +0000)]
* Add simple LNET unit test modules
pjkirner [Fri, 16 Sep 2005 16:42:38 +0000 (16:42 +0000)]
* Fixed build warning (and possible 64-bit error)
pjkirner [Fri, 16 Sep 2005 15:33:32 +0000 (15:33 +0000)]
b=8021
* Landing EEB's b_newconfig_rdmarouting branch
pjkirner [Fri, 16 Sep 2005 13:21:26 +0000 (13:21 +0000)]
* Apply Nikita's patch from portals tree to lnet
pjkirner [Fri, 16 Sep 2005 13:10:48 +0000 (13:10 +0000)]
* Removed extra unnecessary message
liangzhen [Fri, 16 Sep 2005 04:12:40 +0000 (04:12 +0000)]
Fix ptllnd build problem.
pjkirner [Thu, 15 Sep 2005 13:32:28 +0000 (13:32 +0000)]
* Fix 2.6.5 build issue
liangzhen [Thu, 15 Sep 2005 13:17:32 +0000 (13:17 +0000)]
Remove unused socklnd files in ulnds; they have been
moved to ulnds/socklnd.
liangzhen [Thu, 15 Sep 2005 09:56:38 +0000 (09:56 +0000)]
1. two options for build
a. --with-portals=<path to portals>, build ptllnd with external portals
b. --with-lustre-portals, build ptllnd and lustre portals
2. ulnd build patch, tcplnd is built as lnet/ulnds/socklnd
3. small fix for lnet/ulnds/socklnd
eeb [Thu, 15 Sep 2005 08:54:50 +0000 (08:54 +0000)]
* Moved PTL_{MTU,MD_MAX_IOV} into types.h
* Removed ref in lustre_net.h to lib-types.h and simplified how
PTLRPC_MAX_BRW_{SIZE,PAGES} are defined.
nathan [Wed, 14 Sep 2005 19:04:23 +0000 (19:04 +0000)]
fix build
pjkirner [Wed, 14 Sep 2005 13:22:02 +0000 (13:22 +0000)]
b=9318
r=eeb
* Removed MOST of the instances of CRAY_PORTALS, especially the ones that were related to the networking portion.
* In addition to what was in the patch for 9318, also included the removal of build_check.h and references.
Note: This does NOT complete the work on this bug; there are still a number of outstanding references to CRAY_PORTALS.
pjkirner [Wed, 14 Sep 2005 03:56:39 +0000 (03:56 +0000)]
* Fixed LNET undefined symbol PDE() problem on 2.4 kernel (specifically Cray XT3)
nathan [Tue, 13 Sep 2005 03:28:32 +0000 (03:28 +0000)]
b=8080
update from b1_4
eeb [Mon, 12 Sep 2005 17:41:23 +0000 (17:41 +0000)]
* Changed nal_send() to include 'target_is_router' and 'routing' flags
Where 'target_is_router' == the immediate destination is a router
and 'routing' == This message is being forwarded from another LND.
NB The routing flag isn't set yet (but will be when all routing is done in
lib-move).
* Added support for RDMA-ed REPLYs in all relevant LNDs ready for RDMA
routing. LNDs must send IMMEDIATE GETs if the local node or the target
are routers, but may RDMA the REPLY (just like a PUT) on the return
route.
eeb [Mon, 12 Sep 2005 14:17:32 +0000 (14:17 +0000)]
* viblnd: applied ARP retry patch
eeb [Mon, 12 Sep 2005 13:49:36 +0000 (13:49 +0000)]
* tidied up NID printing (s/LPX64/%s/ && s/nid/libcfs_nid2str(nid)/)
eeb [Sun, 11 Sep 2005 13:54:35 +0000 (13:54 +0000)]
* Cleaned up portals compatibility tests into a couple of inlines
in lib-lnet.h
* Added (but didn't test) portals compatibility support for
gm, openib and ra.
eeb [Sat, 10 Sep 2005 17:12:16 +0000 (17:12 +0000)]
* Added check for portals compatibility mode in LNDs that don't support it
yet.
eeb [Sat, 10 Sep 2005 17:05:04 +0000 (17:05 +0000)]
* Got vibnal LNET/portals wire compatibility working
* Removed bad LPSZ in format strings (I got rid of lnet_size_t)
* Changed vibnal NID printing from LPX64 to %s(libcfs_nid2str(nid))
eeb [Sat, 10 Sep 2005 03:48:48 +0000 (03:48 +0000)]
* LNET/portals wire compatibility working on elan and tcp. Set the lnet
module parameter "portals_compatibility" to...
"strong" Compatible with portals and LNET "strong" and "weak"
"weak" Compatible with any value of LNET portals_compatibility
"none" Compatible with LNET "weak" and "none". This is the default.
Old XML and existing old configuration profiles (logs) can be used as-is.
* Updated GM README
* Backed out most of the change to lconf that used hostaddr to construct the
LNET NID. It now signals an error if the XML contains > 1 --hostaddr, or
if the --hostaddr doesn't match the NID, since it's likely manual
intervention will be required in these cases.
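
The three "portals_compatibility" settings listed above imply a small compatibility matrix between LNET nodes. The sketch below encodes only that LNET-to-LNET matrix as described in the commit message; the enum and function names are hypothetical, and it does not model legacy portals peers or show how the module performs the check.

```c
#include <stdbool.h>

/* Hypothetical names for the three settings quoted in the commit message. */
enum compat_mode { COMPAT_NONE, COMPAT_WEAK, COMPAT_STRONG };

/* Two LNET nodes can interoperate unless one is "strong" and the other is
 * "none"; "weak" is compatible with every setting. */
static bool
lnet_modes_compatible(enum compat_mode a, enum compat_mode b)
{
        if (a == COMPAT_WEAK || b == COMPAT_WEAK)
                return true;
        return a == b;
}
```

Read this way, "weak" acts as the bridge setting: a site can move nodes from "strong" to "none" in stages as long as the nodes that must talk to both groups are set to "weak".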
liangzhen [Fri, 9 Sep 2005 15:40:08 +0000 (15:40 +0000)]
optional build for portals.
1. Portals will not be built by default
2. To build portals: configure --with-portals=yes .....
pjkirner [Thu, 8 Sep 2005 20:37:10 +0000 (20:37 +0000)]
* Fix ogdb-host file generation: pick up the correct lnet modules as well as the legacy portals modules (for testing purposes only)
eeb [Thu, 8 Sep 2005 17:18:24 +0000 (17:18 +0000)]
* Added GM README
eeb [Thu, 8 Sep 2005 15:18:55 +0000 (15:18 +0000)]
* Removed unused parameters from LNet??? APIs (e.g. interface handle)
* Removed unused LNet??? APIs.
* Removed many scalar typedefs inherited from portals.
* fixed up alignment in some decls that s/ptl/lnet/ had unaligned.
* updated sanity.sh to s/portals.debug/lnet.debug/
* verified lnet can zeroconf mount a pre-lnet filesystem after
lctl --write_config <pre-lnet-xml>
liangzhen [Wed, 7 Sep 2005 13:33:08 +0000 (13:33 +0000)]
Small fix for function definition.
eeb [Tue, 6 Sep 2005 09:30:54 +0000 (09:30 +0000)]
* Added support for routable RDMA-ed REPLY messages to qswlnd
eeb [Tue, 6 Sep 2005 07:42:56 +0000 (07:42 +0000)]
* ptllnd: added .cvsignore
eeb [Mon, 5 Sep 2005 19:21:16 +0000 (19:21 +0000)]
* Removed nal_{send,recv}_pages() LND APIs (send and receive are passed
either VM frags (iov != NULL) or page frags (kiov != NULL) but not both).
* Ensure that the order of networks declared in the "networks" and "routes"
parameters breaks ties when determining which peer NID to use.
eeb [Fri, 2 Sep 2005 18:56:56 +0000 (18:56 +0000)]
* Added hopcounts to route table
* Changed ptlrpc_uuid_to_peer() to choose the matching UUID with the shortest
hopcount
* Changed lconf to use a single UUID string for all target NIDs, so the
client can choose which one to use at runtime.
* Stripped out all the unused network configuration stuff from lconf
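
The "shortest hopcount" selection mentioned above can be pictured as a simple minimum search over the candidate NIDs. This is a hedged sketch only: `struct peer_candidate` and `pick_shortest_hop` are illustrative names rather than the ptlrpc_uuid_to_peer() implementation, and keeping the earliest entry on a tie is an assumption.

```c
#include <stddef.h>

/* Hypothetical candidate record: one peer NID plus its route hopcount. */
struct peer_candidate {
        const char  *nid;   /* textual NID, illustrative only */
        unsigned int hops;  /* hopcount taken from the route table */
};

/* Return the index of the candidate with the fewest hops, or -1 if the
 * list is empty.  The strict '<' keeps the earliest entry on a tie. */
static int
pick_shortest_hop(const struct peer_candidate *cands, size_t n)
{
        int    best = -1;
        size_t i;

        for (i = 0; i < n; i++) {
                if (best < 0 || cands[i].hops < cands[best].hops)
                        best = (int)i;
        }
        return best;
}
```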
pjkirner [Fri, 2 Sep 2005 12:52:24 +0000 (12:52 +0000)]
Add PTLLND to the LND enum.
liangzhen [Fri, 2 Sep 2005 09:32:11 +0000 (09:32 +0000)]
Patch for Lustre Networking Reorganization
DONE:
1. Fixing of building both lnet and portals
2. Fixing of conflicting symbols in lnet and portals
- exported APIs of lnet/libcfs with name like ptl_* are renamed to libcfs_*
- exported APIs of lnet/lnet with name like ptl_* are renamed to lnet_*
- exported APIs of portals/libcfs with name like libcfs_* are renamed to libptl_*
- module portals/libcfs/libcfs.ko is renamed to portals/libcfs/libptl.ko
3. /proc entry for lnet is /proc/sys/lnet
4. Listen port of socklnd is 988, listen port of socknal is 989
5. Pseudo device for lnet is /dev/lnet
6. Fixing of the lnet/klnds/ptllnd build
7. Fixing of module path and /proc path in lnet/utils lustre/utils lustre/tests
TODO:
1. Renaming of unexported symbols in lnet.
2. Renaming of types and macros
3. Add option for building portals
4. Misc fix and testing
pjkirner [Thu, 1 Sep 2005 16:21:57 +0000 (16:21 +0000)]
Added build infrastructure for PTLLND.
Plus a dummy PTLLND for basic interactions between LNET and PORTALS.
pjkirner [Thu, 1 Sep 2005 15:16:49 +0000 (15:16 +0000)]
Reorganize LNET API files, so that PTLLND
can include both LNET and PORTALS.
pjkirner [Thu, 1 Sep 2005 13:02:53 +0000 (13:02 +0000)]
Fix missing NAL->LND
liangzhen [Thu, 1 Sep 2005 04:11:07 +0000 (04:11 +0000)]
Small fix for lnet build
pjkirner [Thu, 1 Sep 2005 03:46:00 +0000 (03:46 +0000)]
Changes necessary to make liblustre build after LNET rename
pjkirner [Thu, 1 Sep 2005 02:52:35 +0000 (02:52 +0000)]
Changes for LNET rename of NAL -> LND
pjkirner [Thu, 1 Sep 2005 00:21:57 +0000 (00:21 +0000)]
Rename Directories in LNET
knals -> klnds
unals -> ulnds
And associated build fixes.
eeb [Wed, 31 Aug 2005 21:34:02 +0000 (21:34 +0000)]
* Applied Andreas' tcpnal compiler optimization bugfix patch
from HEAD portals (different way of constructing tcp HELLO
header to avoid pointer aliasing) to lnet
* Applied qswnal build fix to lustre-portals.m4 from HEAD
portals to lnet
* lnet version of gmnal running @ HP
* fixed bad 64bit cast in acceptor.c
* fixed lconf to work with newconfig modules under lnet
eeb [Wed, 31 Aug 2005 12:51:50 +0000 (12:51 +0000)]
* Applied implicit_loopback fixes to portals (it was previously
applied to lnet)
* Minor formatting changes to lnet/lnet/{lib-move,router}.c
pjkirner [Wed, 31 Aug 2005 03:32:53 +0000 (03:32 +0000)]
Fixes for LNET Directory rename
eeb [Tue, 30 Aug 2005 20:36:52 +0000 (20:36 +0000)]
* fixed implicit_loopback bug
pjkirner [Tue, 30 Aug 2005 20:18:33 +0000 (20:18 +0000)]
LNET Reorg Phase 1
r=pjk
r=eeb
pjkirner [Tue, 30 Aug 2005 20:15:51 +0000 (20:15 +0000)]
Phase 1 of LNET Reorg
r=pjk
r=eeb
liangzhen [Wed, 17 Aug 2005 14:16:06 +0000 (14:16 +0000)]
Portals -> LNet:
rename Ptl_* APIs to LNet_*
lincent [Mon, 15 Aug 2005 18:24:27 +0000 (18:24 +0000)]
mgs operations
eeb [Wed, 10 Aug 2005 11:04:18 +0000 (11:04 +0000)]
* iibnal and openibnal packed wire format updates from HEAD portals
eeb [Sun, 31 Jul 2005 13:32:25 +0000 (13:32 +0000)]
* 7164 fix newconfig acceptor startup assertion
nathan [Mon, 25 Jul 2005 23:55:57 +0000 (23:55 +0000)]
b=6663
adding lvfs_mount, begin changes to llite_lib.c
nathan [Thu, 14 Jul 2005 23:29:44 +0000 (23:29 +0000)]
Branch b_hd_newconfig
Remove the "-O2" requirement for htonl references
eeb [Mon, 11 Jul 2005 16:43:47 +0000 (16:43 +0000)]
* better ranal error message on FMA CQ overflow
* fixed socknal modparam that should have been RO under /proc or /sys
* skip the loopback interface in ptl_set_ip_niaddr() when no interfaces have
been specified.
* #if 0 out the 'allow_destination_aliases' portals module param. It's too
hard to describe to punters.
* change the meaning of the 'implicit_loopback' portals module parameter to
be more precisely just that.
* don't fail when adding a duplicate route; just ignore it.
eeb [Fri, 8 Jul 2005 16:39:24 +0000 (16:39 +0000)]
* More intuitive newconfig acceptor module params
eeb [Fri, 8 Jul 2005 14:51:27 +0000 (14:51 +0000)]
* newconfig liblustre/tcpnal runs sanity OK
eeb [Thu, 7 Jul 2005 17:51:53 +0000 (17:51 +0000)]
* first signs of life from newconfig tcpnal + liblustre echo_client
eeb [Thu, 7 Jul 2005 10:04:03 +0000 (10:04 +0000)]
* socknal working again
eeb [Wed, 6 Jul 2005 15:54:56 +0000 (15:54 +0000)]
* socknal: removed unused var
eeb [Tue, 5 Jul 2005 18:35:31 +0000 (18:35 +0000)]
* Compiles after merging b1_4
* Changed socknal...
- use PID as well as NID to match connections so userspace (tcpnal) clients
can be distinguished without changing the NID format.
- unprivileged port == userspace client
- don't create new connections to userspace clients
- derive the NID/PID of a userspace client from the remote IP/port
* Changed tcpnal...
- use non-privileged ports
- no concept of own NID (peer assigns)
- don't accept connections
eeb [Fri, 1 Jul 2005 09:33:29 +0000 (09:33 +0000)]
* Changed default vib tx descriptor pool sizes
eeb [Thu, 30 Jun 2005 17:33:32 +0000 (17:33 +0000)]
* More work on config error messages + some code restructure to trap config
errors better.
* Fixed bug in 'implicit loopback'
* Completely removed duplicate interface usage checks from generic network
setup. The NAL should be doing this. Currently socknal is the only one
that supports multiple instances and it doesn't mind at all if 2 different
network instances use the same interfaces, and that's dead useful for
testing.
* Tightened up lonal refcounting; there was a remote chance of a shutdown
race.
eeb [Wed, 29 Jun 2005 21:33:57 +0000 (21:33 +0000)]
* Improved router error messages and move some of the checking out of qswnal
and socknal and into the router itself.
* Don't force FMR unmap on error if it wasn't mapped in the first place.
* Make socknal close the incoming connection if it receives junk for
forwarding
* allow "lctl network down" == "lctl network unconfigure" (I kept wanting to
type 'down' and now I can :)
eeb [Tue, 28 Jun 2005 16:13:16 +0000 (16:13 +0000)]
* started getting tcpnal working with portals newconfig
eeb [Tue, 28 Jun 2005 09:43:53 +0000 (09:43 +0000)]
* vibnal: ensure FMR is flushed on error
behlendo [Mon, 27 Jun 2005 23:47:52 +0000 (23:47 +0000)]
Added router.h to EXTRA_DIST to fix 'make rpms'
eeb [Tue, 21 Jun 2005 11:56:32 +0000 (11:56 +0000)]
* Added console error to report acceptor startup failure
eeb [Tue, 21 Jun 2005 11:48:33 +0000 (11:48 +0000)]
* debugged single acceptor
* fixed PtlGetId() to skip the loopback NI
eeb [Mon, 20 Jun 2005 20:23:42 +0000 (20:23 +0000)]
* first cut single acceptor for SOCKNAL, RANAL, OPENIBNAL, with
backwards compatibility.
eeb [Fri, 17 Jun 2005 09:29:28 +0000 (09:29 +0000)]
* Updated vibnal from b1_4 to use FMR
eeb [Sat, 11 Jun 2005 13:15:26 +0000 (13:15 +0000)]
* vibnal: added module parameters for some low-level IB tunables (bug #6443)
* vibnal: changed some default tunables (bug #6474)
eeb [Wed, 8 Jun 2005 19:04:22 +0000 (19:04 +0000)]
* noticed difference between userspace and kernel x86_64 __u64 etc typedefs
and adjusted LPU64 etc in <libcfs/linux/kp30.h> appropriately
* libcfs_pause() -> inline cfs_pause() in <libcfs/linux/linux-prim.h> and
made more use of it
* cleaned up NAL loading/instantiated console messages
* fixed reconnect_interval in connection NALs, where I'd left out * HZ
* made ???_npeers atomic again; using ???_global lock round the decrement in
???_destroy_peer() could deadlock.
* fixed uninitialised list in vib:kibnal_peer_connect_failed()
* changed to "nicer" format for /proc/sys/portals/routes
* more x86_64 testing
eeb [Tue, 7 Jun 2005 23:18:56 +0000 (23:18 +0000)]
* Got rid of portals/router; getting it to build in portals/portals using
symlinks didn't work on 2.6 and router.c has changed enough for the diffs
not to make sense any more (and life is too short).
* reworked how reconnect timeouts work in all connection based NALs, to close
out a bug where reconnection could be delayed until jiffies wraps.
* tested vibnal with a single echo client + server @ LLNL on 2.6, x86_64
* make configure-on-demand rather than configure-on-load the default; modprobe
in 2.6 does locking and deadlocks if the module it is loading tries to load
more modules in its init sequence.
* removed some unused vars etc caught by better warnings when building for
2.6/x86_64
eeb [Mon, 6 Jun 2005 21:22:25 +0000 (21:22 +0000)]
* brought openibnal into the newconfig fold
* moved socket address query function out of NALs and into libcfs
eeb [Fri, 3 Jun 2005 11:20:52 +0000 (11:20 +0000)]
* merged 6361 vibnal fixes (IBNAL_OUS_DST_RD = 1, HCA names)
* vibnal to use HCA name as the interface name
- add hca and IPoIB interface name prefixes to tunables
- use 1:1 HCA number and IPoIB interface number to determine IP address
of given HCA
eeb [Fri, 3 Jun 2005 09:03:44 +0000 (09:03 +0000)]
* merged in 5858 fix (vibnal arp retries)
eeb [Thu, 2 Jun 2005 17:57:59 +0000 (17:57 +0000)]
* vibnal takes a single interface name which defaults to 'ipoib0'
eeb [Thu, 2 Jun 2005 07:09:25 +0000 (07:09 +0000)]
* moved #defines in qswnal.h around to separate fixed constants
from ones that can be set via modparams
* placeholder vibnal for newconfig; still need to get the IPoIB
IP address somehow.
eeb [Wed, 1 Jun 2005 21:23:21 +0000 (21:23 +0000)]
* Added interface and socket queries to libcfs (libcfs/linux-tcpip.c). Some
of this came from socknal_lib-linux.c but was generalised a little so all
the NALs that use TCP/IP to do connection establishment can use it.
CAVEAT EMPTOR! Just the linux versions are done; the darwin versions are
not implemented yet.
Changed socknal and ranal to use this.
* Brought ranal up-to-date with new config + mod params for all tunables.
Ranal gets its local address from the IP of a (single) specified
interface, or the first "suitable" one found if no interface is specified.
eeb [Thu, 19 May 2005 17:53:51 +0000 (17:53 +0000)]
* fixed use of %n at end of string in nidstrings.c
eeb [Thu, 19 May 2005 14:05:33 +0000 (14:05 +0000)]
* changed router lookups to always return the local destination
socknal + qswnal changes to match
simplified "will this be forwarded" predicate in ptl_send()
* fixed bug in multi-instance socknal to pass the correct NI to the tx
teardown function to allow for the case where the tx hasn't even had a conn
assigned yet.
* fixed routerstat stats formatting
* removed osc::check_elan_limit() and replaced with max_unfragmented_pages()
which actually works. NB it depends on the assumption that file and page
offsets are aligned!
* fixed lconf to stay away from portals, except to 'modprobe' for it first in
'load_module()', and to rmmod all the NALs + portals when ptlrpc gets
unloaded.