tba  Cluster File Systems, Inc.
       * version 1.4.4
       * bug fixes

Severity   : major
Frequency  : rare (large Voltaire clusters only)
Bugzilla   : 6993
Description: the default number of reserved transmit descriptors was too low
             for some large clusters
Details    : As a workaround, the number was increased. A proper fix includes
             a run-time tunable.

2005-06-02  Cluster File Systems, Inc.
       * version 1.4.3
       * bug fixes

Severity   : major
Frequency  : occasional (large-scale events, cluster reboot, network failure)
Bugzilla   : 6411
Description: too many error messages on console obscure actual problem and
             can slow down/panic server, or cause recovery to fail repeatedly
Details    : enable rate-limiting of console error messages, and some messages
             that were console errors now only go to the kernel log

Severity   : enhancement
Bugzilla   : 1693
Description: add /proc/sys/portals/catastrophe entry which will report if
             that node has previously LBUGged

2005-04-06  Cluster File Systems, Inc.
       * bugs
        - update gmnal to use PTL_MTU, fix module refcounting (b=5786)

2005-04-04  Cluster File Systems, Inc.
       * bugs
        - handle error return code in kranal_check_fma_rx() (5915,6054)

2005-02-04  Cluster File Systems, Inc.
       * miscellania
        - update vibnal (Voltaire IB NAL)
        - update gmnal (Myrinet NAL), gmnalid

2005-02-04  Eric Barton
       * Landed portals:b_port_step as follows...
        - removed CFS_DECL_SPIN*
          just use 'spinlock_t' and initialise with spin_lock_init()
        - removed CFS_DECL_MUTEX*
          just use 'struct semaphore' and initialise with init_mutex()
        - removed CFS_DECL_RWSEM*
          just use 'struct rw_semaphore' and initialise with init_rwsem()
        - renamed cfs_sleep_chan -> cfs_waitq
                  cfs_sleep_link -> cfs_waitlink
        - fixed race in linux version of arch-independent socknal
          (the ENOMEM/EAGAIN decision).
        - Didn't fix problems in Darwin version of arch-independent socknal
          (resetting socket callbacks, eager ack hack, ENOMEM/EAGAIN decision)
        - removed libcfs types from non-socknal header files (only some types
          in the header files had been changed; the .c files hadn't been
          updated at all).