* Support for kernels:
2.6.9-67.0.4.EL (RHEL 4),
2.6.16.54-0.2.5 (SLES 10),
- 2.6.18-53.1.13.el5 (RHEL 5).
+ 2.6.18-53.1.14.el5 (RHEL 5),
+ 2.6.22.14 vanilla (kernel.org).
* Client support for unpatched kernels:
(see http://wiki.lustre.org/index.php?title=Patchless_Client)
2.6.16 - 2.6.21 vanilla (kernel.org)
- * Recommended e2fsprogs version: 1.40.4-cfs1
+ * Recommended e2fsprogs version: 1.40.7-sun1
* Note that reiserfs quotas are disabled on SLES 10 in this kernel.
* RHEL 4 and RHEL 5/SLES 10 clients behave differently on 'cd' to a
removed cwd "./" (refer to Bugzilla 14399).
+Severity : normal
+Bugzilla : 12652
+Description: Add FMODE_EXEC file flag for SLES10 SP1 kernel.
+
+Severity : enhancement
+Bugzilla : 13397
+Description: Update to support 2.6.22.14 vanilla kernel.
+
+Severity : normal
+Bugzilla : 14533
+Frequency : rare, on recovery
+Description: reading procfs can produce a deadlock in some situations
+Details : Holding the lprocfs lock while sending an RPC can block the
+ destruction of obd objects, which also blocks reconnect with -EALREADY.
+ This does not fix all lprocfs bugs, but it makes them rare.
+
+Severity : enhancement
+Bugzilla : 15152
+Description: Update kernel to RHEL5 2.6.18-53.1.14.el5.
+
+Severity : major
+Frequency : frequent on X2 node
+Bugzilla : 15010
+Description: mdc_set_open_replay_data LBUG
+Details : Set replay data for requests that are eligible for replay.
+
+Severity : normal
+Bugzilla : 14321
+Description: lustre_mgs: operation 101 on unconnected MGS
+Details : When the MGC is disconnected from the MGS long enough, the MGS will
+ evict the MGC, and later on the MGC cannot successfully connect to the
+ MGS, producing many error messages complaining that the MGS is not connected.
+
Severity : major
Frequency : on start mds
Bugzilla : 14884
Bugzilla : 13537
Description: Correctly check stale fid, not start epoch if ost not support SOM
Details : open with flag O_CREATE need set old fid in op_fid3 because op_fid2
- overwrited with new generated fid, but mds can anwer with one of these
- two fids and both is not stale. setattr incorectly start epoch and
- assume will be called done_writeting, but without SOM done_writing
- never called.
+ overwritten with the newly generated fid, but mds can answer with one of these
+ two fids and neither is stale. setattr incorrectly starts an epoch and
+ assumes done_writing will be called, but without SOM done_writing is
+ never called.
Severity : major
Frequency : rare, depends on device drivers and load
Frequency : occasional
Bugzilla : 13730
Description: Do not fail import if osc_interpret_create gets -EAGAIN
-Details : If osc_interpret_create got -EAGAIN it immediately exits and
- wakeup oscc_waitq. After wakeup oscc_wait_for_objects call
- oscc_has_objects and see OSC has no objests and call
+Details : If osc_interpret_create gets -EAGAIN it immediately exits and
+ wakes up oscc_waitq. After wakeup, oscc_wait_for_objects calls
+ oscc_has_objects, sees that the OSC has no objects, and calls
oscc_internal_create to resend create request.
Severity : enhancement
Severity : normal
Bugzilla : 12228
Description: LBUG in ptlrpc_check_set() bad phase ebc0de00
-Details : access to bitfield in structure is always rounded to long
+Details : access to a bitfield in a structure is always rounded to long,
and this causes problems because changing any single bit is not atomic.
Severity : normal
produce deadlock in llog code.
Details : If llog thread has last reference to obd and call class_import_put
this produce deadlock because llog_cleanup_commit_master wait when
- last llog_commit_thread exited, but this never success because was
+ last llog_commit_thread exited, but this never succeeds because it was
called from llog_commit_thread.
Severity : normal
Description: access already free / zero obd_namespace.
Details : if client_disconnect_export was called without force flag set,
and exist connect request in flight, this can produce access to
- NULL pointer (or already free pointer) when connect_interpret
+ NULL pointer (or already free pointer) when connect_interpret
store ocd flags in obd_namespace.
Severity : minor
Severity : enhancement
Bugzilla : 12702
Description: lost problems with lov objid file
-Details : Fixes some scability and access to not inited memory problems
+Details : Fixes some scalability and uninitialized-memory access problems
when working with the lov objid file.
Severity : major
Severity : enhancement
Bugzilla : 13690
Description: Build SLES10 patchless client fails
-Details : The configure was broken by run ./configure with
+Details : The configure was broken by running ./configure with the
--with-linux-obj=.... argument for patchless client. When the
configure use --with-linux-obj, the LINUXINCLUDE= -Iinclude
- can't search header adequately. Use absolute path such as
- -I($LINUX)/include instead.
+ cannot find headers properly. Use an absolute path such as
+ -I($LINUX)/include instead.
Severity : normal
Bugzilla : 13888
Description: interrupting oig_wait produces a panic on resend.
Details : brw_redo_request can be used for resend requests from ptlrpcd and
private set, and this produce situation when rq_ptlrpcd_data not
- copyed to new allocated request and triggered LBUG on assert
+ copied to the newly allocated request, triggering an LBUG on assert
req->rq_ptlrpcd_data != NULL. But this member is used only to wake up the
ptlrpcd set if the request is changed, and can safely be changed to use
rq_set directly.
Severity : enhancement
Bugzilla : 2262
Description: self-adjustable client's lru lists
-Details : use adaptive algorithm for managing client cached locks lru
+Details : use adaptive algorithm for managing client cached locks lru
lists according to current server load, other client's work
- pattern, memory activities, etc. Both, server and client
- side namespaces provide number of proc tunables for controlling
+ pattern, memory activities, etc. Both, server and client
+ side namespaces provide number of proc tunables for controlling
things
Severity : enhancement
Severity : normal
Bugzilla : 12192
Description: llapi_file_create() does not allow some changes
-Details : add llapi_file_open() that allows specifying the mode and
+Details : add llapi_file_open() that allows specifying the mode and
open flags, and also returns an open file handle.
Severity : normal
Severity : normal
Bugzilla : 10657
Description: Add journal checksum support.(Kernel part)
-Details : The journal checksum feature adds two new flags i.e
- JBD2_FEATURE_INCOMPAT_ASYNC_COMMIT and
- JBD2_FEATURE_COMPAT_CHECKSUM. JBD2_FEATURE_CHECKSUM flag
+Details : The journal checksum feature adds two new flags i.e
+ JBD2_FEATURE_INCOMPAT_ASYNC_COMMIT and
+ JBD2_FEATURE_COMPAT_CHECKSUM. JBD2_FEATURE_CHECKSUM flag
indicates that the commit block contains the checksum for
the blocks described by the descriptor blocks. Now commit
record can be sent to disk without waiting for descriptor
Severity : major
Bugzilla : 12223
Description: mds_obd_create error creating tmp object
-Details : When the user sets quota on root, llog will be affected and can't
+Details : When the user sets quota on root, llog will be affected and can't
create files and write files.
Severity : normal
Bugzilla : 12826
Description: Add EXPORT_SYMBOL check for node_to_cpumask symbol.
Details : This allows the patchless client to be loaded on architectures
- without this export.
+ without this export.
Severity : normal
Bugzilla : 13039
Severity : normal
Bugzilla : 11230
-Description: Tune the kernel for good SCSI performance.
+Description: Tune the kernel for good SCSI performance.
Details : Set the value of /sys/block/{dev}/queue/max_sectors_kb
to the value of /sys/block/{dev}/queue/max_hw_sectors_kb
in mount_lustre.
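
The tuning described in bug 11230 can be sketched in a few lines of Python.
This is only an illustration of the sysfs read and write involved, not the
code in mount_lustre. The function name raise_max_sectors and the sys_block
parameter are hypothetical and exist only so the sketch can be tried against
a scratch directory.

```python
from pathlib import Path


def raise_max_sectors(dev: str, sys_block: Path = Path("/sys/block")) -> int:
    """Copy max_hw_sectors_kb into max_sectors_kb for one block device.

    `dev` is a device name such as "sda". `sys_block` is a hypothetical
    parameter that lets the sketch run against a test directory.
    """
    queue = sys_block / dev / "queue"
    # Read the hardware limit reported by the driver.
    hw_limit = int((queue / "max_hw_sectors_kb").read_text().strip())
    # Raise the soft limit to match it (needs root on a real system).
    (queue / "max_sectors_kb").write_text(f"{hw_limit}\n")
    return hw_limit
```

On a real system the write requires root privileges and the device must
expose both queue attributes.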
Bugzilla : 12234
Description: /proc/fs/lustre/devices broken on ppc
Details : The patch as applied to 1.6.2 doesn't look correct for all arches.
- We should make sure the type of 'index' is loff_t and then cast
- explicitly as needed below. Do not assign an explicitly cast
+ We should make sure the type of 'index' is loff_t and then cast
+ explicitly as needed below. Do not assign an explicitly cast
loff_t to an int.
Severity : normal
Bugzilla : 13304
Frequency : Always, for kernels after 2.6.16
Description: Fix warning idr_remove called for id=.. which is not allocated.
-Details : Last kernels save old s_dev before kill super and not allow
+Details : Recent kernels save the old s_dev before kill super and do not allow
restoring it from the callback, so restore it before calling kill_anon_super.
Severity : minor
Bugzilla : 12948
Description: buffer overruns could theoretically occur
Details : llapi_semantic_traverse() modifies the "path" argument by
- appending values to the end of the origin string, and a
+ appending values to the end of the original string, and an
overrun may occur. Add a buffer overrun check in liblustreapi.
Severity : normal
Bugzilla : 13751
Description: Kernel patches update for RHEL5 2.6.18-8.1.14.el5.
Details : Modify target file & which_patch.
- A flaw was found in the IA32 system call emulation provided
- on AMD64 and Intel 64 platforms. An improperly validated 64-bit
- value could be stored in the %RAX register, which could trigger an
- out-of-bounds system call table access. An untrusted local user
- could exploit this flaw to run code in the kernel
- (ie a root privilege escalation). (CVE-2007-4573).
+ A flaw was found in the IA32 system call emulation provided
+ on AMD64 and Intel 64 platforms. An improperly validated 64-bit
+ value could be stored in the %RAX register, which could trigger an
+ out-of-bounds system call table access. An untrusted local user
+ could exploit this flaw to run code in the kernel
+ (i.e. a root privilege escalation). (CVE-2007-4573).
Severity : major
Bugzilla : 13093
Description: Add jbd statistics patch for RHEL5 and 2.6.18-vanilla
Severity : minor
-Bugzilla : 13732
+Bugzilla : 13732
Description: change order of libsysio includes
Details : '#include sysio.h' should always come before '#include xtio.h'
Bugzilla : 14864
Description: better handle error messages in extents code
+Severity : normal
+Frequency : RHEL4 only
+Bugzilla : 14618
+Description: mkfs is very slow on IA64/RHEL4
+Details : A performance regression has been discovered in the MPT Fusion
+ driver between versions 3.02.73rh and 3.02.99.00rh. As a
+ consequence, we have downgraded the MPT Fusion driver in the RHEL4
+ kernel from 3.02.99.00 to 3.02.73 until this problem is fixed.
+
+Severity : enhancement
+Bugzilla : 14729
+Description: SNMP support enhancement
+Details : Add the total number of sampled requests for an MDS node to SNMP
+ support.
+
+Severity : enhancement
+Bugzilla : 14748
+Description: Optimize ldlm waiting list processing for PR extent locks
+Details : When processing the waiting list for a read extent lock and meeting
+ a read lock that is the same or wider and is not contended, skip
+ processing the rest of the list and immediately return the current
+ conflict status, since we are guaranteed there are no
+ conflicting locks in the rest of the list.
+
+Severity : normal
+Bugzilla : 14774
+Description: Time out and refuse to reconnect
+Details : When the failover node is the primary node, it is possible
+ to have two identical connections in imp_conn_list. We must
+ compare not conn's pointers but NIDs, otherwise we can defeat
+ connection throttling.
+
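
The difference between comparing connection pointers and comparing NIDs
(bug 14774) can be shown with a small Python sketch. The Conn class, the two
helper functions, and the NID value are illustrative assumptions. They are not
Lustre structures or the actual fix.

```python
from dataclasses import dataclass


@dataclass(eq=False)  # eq=False: instances compare by identity, like C pointers
class Conn:
    nid: str  # network identifier of the peer


def known_by_pointer(conns, target):
    """True only if this exact object is already in the list."""
    return any(c is target for c in conns)


def known_by_nid(conns, target):
    """True if any listed connection refers to the same NID."""
    return any(c.nid == target.nid for c in conns)


primary = Conn("10.0.0.1@tcp")
failover = Conn("10.0.0.1@tcp")  # second object, same peer
conn_list = [primary]
```

Here known_by_pointer(conn_list, failover) is false even though both objects
name the same peer, while known_by_nid(conn_list, failover) is true, so only
the NID comparison recognises the duplicate.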
+Severity : normal
+Bugzilla : 13821
+Description: port llog fixes from b1_6 into HEAD
+Details : Port llog reference counting and some llog cleanups from b1_6
+ (bug 10800) into HEAD, to protect against panics and access to
+ already freed llog structures.
+
+Severity : normal
+Bugzilla : 14483
+Description: Detect stride IO mode in read-ahead
+Details : When a client does stride read, read-ahead should detect that and
+ read-ahead pages according to the detected stride pattern.
+
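
A minimal Python sketch of stride detection (bug 14483), under the assumption
that a stride is a constant non-zero gap between recent read offsets. The
function names, the min_repeats threshold, and the window size are
hypothetical. The real read-ahead code may detect strides differently.

```python
from typing import List, Optional


def detect_stride(offsets: List[int], min_repeats: int = 3) -> Optional[int]:
    """Return the constant gap between the most recent reads, or None."""
    if len(offsets) < min_repeats + 1:
        return None  # not enough history to decide
    recent = offsets[-(min_repeats + 1):]
    gaps = {b - a for a, b in zip(recent, recent[1:])}
    if len(gaps) != 1:
        return None  # gaps differ, so no stride pattern
    gap = gaps.pop()
    return gap if gap != 0 else None


def readahead_targets(offsets: List[int], window: int = 4,
                      min_repeats: int = 3) -> List[int]:
    """Predict the next `window` offsets along the detected stride."""
    stride = detect_stride(offsets, min_repeats)
    if stride is None:
        return []
    last = offsets[-1]
    return [last + stride * i for i in range(1, window + 1)]
```

With a history of offsets 0, 8, 16, 24 the detected stride is 8, and the
predicted read-ahead offsets continue that pattern rather than reading the
pages in between.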
+Severity : normal
+Bugzilla : 13805
+Description: data checksumming impacts single node performance
+Details : add support for several checksum algorithms. Currently, only CRC32
+ and Adler-32 are supported. The checksum type can be changed on
+ the fly via /proc/fs/lustre/osc/*/checksum_type.
+
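
Choosing between the two algorithms named in bug 13805 can be sketched with
Python's zlib module, which provides both. The page_checksum function and the
"adler" and "crc32" labels are hypothetical. The values actually accepted by
checksum_type are not stated here.

```python
import zlib


def page_checksum(data: bytes, kind: str = "adler") -> int:
    """Return a 32-bit checksum of `data` using the named algorithm."""
    if kind == "adler":
        return zlib.adler32(data) & 0xFFFFFFFF
    if kind == "crc32":
        return zlib.crc32(data) & 0xFFFFFFFF
    raise ValueError(f"unsupported checksum type: {kind}")


page = bytes(4096)  # one zero-filled 4 KiB page, for illustration only
adler_sum = page_checksum(page, "adler")
crc_sum = page_checksum(page, "crc32")
```

Both calls return an unsigned 32-bit value, so either algorithm fits the same
field. The two algorithms differ in computation cost and in how well they
detect errors.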
+Severity : normal
+Bugzilla : 14648
+Description: use adler32 for page checksums
+Details : when available, use the Adler-32 algorithm instead of CRC32 for
+ page checksums.
+
+Severity : normal
+Bugzilla : 15033
+Description: build for x2 fails
+Details : fix compile issue on Cray systems.
+
+Severity : normal
+Bugzilla : 14379
+Description: Properly match for duplicate locks
+Details : Due to different lock order from skiplists code, we need to
+ traverse entire list for now
+
+Severity : normal
+Frequency : only on PPC/SLES10
+Bugzilla : 14855
+Description: "BITS_PER_LONG is not 32 or 64" in linux/idr.h
+Details : On SLES10/PPC, fs.h includes idr.h which requires BITS_PER_LONG to
+ be defined. Add a hack in mkfs_lustre.c to work around this compile
+ issue.
+
+Severity : normal
+Bugzilla : 14257
+Description: LASSERT on MDS when client holding flock lock dies
+Details : ldlm pool logic depends on the number of granted locks being equal
+ to the number of released locks, which is not true for flock locks,
+ so just exclude such locks from consideration.
+
+Severity : normal
+Bugzilla : 15188
+Description: MDS deadlock with many ll_sync_lov threads and I/O stalled
+Details : Use fsfilt_sync() for both the whole filesystem sync and
+ individual file sync to eliminate dangerous inode locking
+ with I_LOCK that can lead to a deadlock.
+
--------------------------------------------------------------------------------
2007-08-10 Cluster File Systems, Inc. <info@clusterfs.com>