+tbd Sun Microsystems, Inc.
* version 1.8.0
* Support for kernels:
- 2.6.5-7.287.3 (SLES 9),
- 2.6.9-55.0.12.EL (RHEL 4),
- 2.6.16.53-0.8 (SLES 10),
- 2.6.18-8.1.14.EL5 (RHEL 5).
- 2.6.18.8 vanilla (kernel.org)
+ 2.6.16.54-0.2.5 (SLES 10),
+ 2.6.18-53.1.14.el5 (RHEL 5),
+ 2.6.22.14 vanilla (kernel.org).
* Client support for unpatched kernels:
(see http://wiki.lustre.org/index.php?title=Patchless_Client)
- 2.6.9-42.0.10.EL (RHEL 4),
2.6.16 - 2.6.21 vanilla (kernel.org)
- * Recommended e2fsprogs version: 1.40.2-cfs1
+ * Recommended e2fsprogs version: 1.40.7-sun1
* Note that reiserfs quotas are disabled on SLES 10 in this kernel.
+ * RHEL 4 and RHEL 5/SLES 10 clients behaves differently on 'cd' to a
+ removed cwd "./" (refer to Bugzilla 14399).
+
+Severity : normal
+Bugzilla : 12888
+Description: mds_mfd_close() ASSERTION(rc == 0)
+Details : In mds_mfd_close(), we need protect inode's writecount change
+ within its orphan write semaphore to prevent possible races.
+
+Severity : minor
+Bugzilla : 14929
+Description: Obsolete CURRENT_SECONDS and use cfs_time_current_sec() instead.
+
+Severity : minor
+Bugzilla : 14645
+Frequency : rare, on shutdown ost
+Description: don't hit live lock with umount ost.
+Details : shrink_dcache_parent can be in long loop with destroy dentries,
+ use shrink_dcache_sb instead.
+
+Severity : minor
+Bugzilla : 14949
+Description: don't panic with use echo client
+Details : echo client pass NULL as client nid pointer and this produce null
+ pointer dereference.
+
+Severity : normal
+Bugzilla : 15278
+Description: fix build on ppc32
+Details : compile code with -m64 flag produce wrong object file for ppc32.
+
+Severity : normal
+Bugzilla : 12191
+Description: add message levels for liblustreapi
+
+Severity : normal
+Bugzilla : 13380
+Description: fix for occasional failure case of -ENOSPC in recovery-small tests
+Details : Move the 'good_osts' check before the 'total_bavail' check. This
+ will result in an -EAGAIN and in the exit call path we call
+ alloc_rr() which will with increasing aggressiveness attempt to
+ aquire precreated objects on the minimum number of required OSCs.
+
+Severity : major
+Bugzilla : 14326
+Description: Use old size assignment to avoid deadlock
+Details : This reverts the changes in bugs 2369 and bug 14138 that introduced
+ the scheduling while holding a spinlock. We do not need locking
+ for size in ll_update_inode() because size is only updated from
+ the MDS for directories or files without objects, so there is no
+ other place to do the update, and concurrent access to such inodes
+ are protected by the inode lock.
+
+Severity : normal
+Bugzilla : 14746
+Description: resolve "_IOWR redefined" build error on SLES10
+
+Severity : normal
+Bugzilla : 14763
+Description: dump the memory debugging after all modules are unloaded to
+ suppress false negative in conf_sanity test 39
+
+Severity : enhancement
+Bugzilla : 15316
+Description: build kernel-ib packages for OFED 1.3 in our release cycle
+
+Severity : minor
+Bugzilla : 13969
+Frequency : always
+Description: fix SLES kernel versioning
+Details : the kernel version for our SLES 10 kernel did not include a "-"
+ before the "smp" at the end. while this was not a problem in
+ general, it did mean that software trying to use the kernel
+ version to try to detect a vendor specific kernel would fail.
+ this was most evident by the OFED build scripts.
+
+Severity : normal
+Bugzilla : 14803
+Description: Don't update lov_desc members until making sure they are valid
+Details : When updating lov_desc members via proc fs, need fix their
+ validities before doing the real update.
+
+Severity : normal
+Bugzilla : 15069
+Description: don't put request into delay list while invalidate in flight.
+Details : ptlrpc_delay_request sometimes put in delay list while invalidate
+ import in flight. this produce timeout for invalidate and sometimes
+ can cause stale data.
+
+Severity : minor
+Bugzilla : 14856
+Frequency : on ppc only
+Description: not convert ost objects for directory because it's not exist.
+Details : ll_dir_getstripe assume dirrectory has ost objects but this wrong.
+
+Severity : normal
+Bugzilla : 12652
+Description: Add FMODE_EXEC file flag for SLES10 SP1 kernel.
+
+Severity : enhancement
+Bugzilla : 13397
+Description: Update to support 2.6.22.14 vanilla kernel.
+
+Severity : normal
+Bugzilla : 14533
+Frequency : rare, on recovery
+Description: read procfs can produce deadlock in some situation
+Details : Holding lprocfs lock which send rpc can produce block for destroy
+ obd objects and this also block reconnect with -EALREADY. This isn't
+ fix all lprocfs bugs - but make it rare.
+
+Severity : enhancement
+Bugzilla : 15152
+Description: Update kernel to RHEL5 2.6.18-53.1.14.el5.
+
+Severity : major
+Frequency : frequent on X2 node
+Bugzilla : 15010
+Description: mdc_set_open_replay_data LBUG
+Details : Set replay data for requests that are eligible for replay.
+
+Severity : normal
+Bugzilla : 14321
+Description: lustre_mgs: operation 101 on unconnected MGS
+Details : When MGC is disconnected from MGS long enough, MGS will evict the
+ MGC, and late on MGC cannot successfully connect to MGS and a lot
+ of the error messages complaining that MGS is not connected.
+
+Severity : major
+Frequency : on start mds
+Bugzilla : 14884
+Description: Implement get_info(last_id) in obdfilter.
+
+Severity : normal
+Frequency : occasional
+Bugzilla : 13537
+Description: Correctly check stale fid, not start epoch if ost not support SOM
+Details : open with flag O_CREATE need set old fid in op_fid3 because op_fid2
+ overwrited with new generated fid, but mds can anwer with one of these
+ two fids and both is not stale. setattr incorectly start epoch and
+ assume will be called done_writeting, but without SOM done_writing
+ never called.
+
+Severity : major
+Frequency : rare, depends on device drivers and load
+Bugzilla : 14529
+Description: MDS or OSS nodes crash due to stack overflow
+Details : Code changes in 1.8.0 increased the stack usage of some functions.
+ In some cases, in conjunction with device drivers that use a lot
+ of stack the MDS (or possibly OSS) service threads could overflow
+ the stack. One change which was identified to consume additional
+ stack has been reworked to avoid the extra stack usage.
+
+Severity : normal
+Frequency : occasional
+Bugzilla : 13730
+Description: Do not fail import if osc_interpret_create gets -EAGAIN
+Details : If osc_interpret_create got -EAGAIN it immediately exits and
+ wakeup oscc_waitq. After wakeup oscc_wait_for_objects call
+ oscc_has_objects and see OSC has no objests and call
+ oscc_internal_create to resend create request.
+
+Severity : enhancement
+Bugzilla : 14858
+Description: Update to SLES10 SP1 latest kernel-2.6.16.54-0.2.5.
+
+Severity : enhancement
+Bugzilla : 14876
+Description: Update to RHEL5 latest kernel-2.6.18-53.1.13.el5.
+
+Severity : normal
+Frequency : very rare
+Bugzilla : 3462
+Description: Fix replay if there is an un-replied request and open
+Details : In some cases, older replay request will revert the
+ mcd->mcd_last_xid on MDS which is used to record the client's
+ latest sent request.
+
+Severity : enhancement
+Bugzilla : 14720
+Description: Update to RHEL5 latest kernel-2.6.18-53.1.6.el5.
+
+Severity : enhancement
+Bugzilla : 14482
+Description: Add rhel5 support to HEAD.
+
+Serverity : enhancement
+Bugzilla : 14793
+Description: Update RHEL4 kernel to 2.6.9-67.0.4.
+
+Severity : minor
+Frequency : rare
+Bugzilla : 13196
+Description: Don't allow skipping OSTs if index has been specified.
+Details : Don't allow skipping OSTs if index has been specified, make locking
+ in internal create lots better.
+
+Severity : normal
+Bugzilla : 12228
+Description: LBUG in ptlrpc_check_set() bad phase ebc0de00
+Details : access to bitfield in structure is always rounded to long
+ and this produce problem with not atomic change any bit.
+
+Severity : normal
+Bugzilla : 13647
+Description: Lustre make rpms failed.
+Details : Remove ldiskfs spec file to avoids rpmbuild be confused when
+ builds Lustre rpms from tarball.
+
+Severity : normal
+Frequency : rare on shutdown ost
+Bugzilla : 14608
+Description: If llog cancel was not send before clean_exports phase, this can
+ produce deadlock in llog code.
+Details : If llog thread has last reference to obd and call class_import_put
+ this produce deadlock because llog_cleanup_commit_master wait when
+ last llog_commit_thread exited, but this never success because was
+ called from llog_commit_thread.
+
+Severity : normal
+Bugzilla : 9977
+Description: allow userland application know is lost one of stripes.
+Details : fill lvb_blocks with error code on ost and return it to
+ application if error flag found.
+
+Severity : normal
+Bugzilla : 14607
+Description: NULL lov_tgts causing MDS oops
+Details : more safe checks for NULL lov_tgts for avoid oops.
+
+Severity : enhancement
+Bugzilla : 14531
+Description: Update to RHEL4 latest kernel-2.6.9-67.0.1.EL.
+
+Severity : normal
+Bugzilla : 13375
+Descriptoin: make lov_create() will not stuck in obd_statfs_rqset()
+Details : If an OST is down the MDS will hang indefinitely in
+ obd_statfs_rqset() waiting for the statfs data. While for
+ MDS QOS usage of statfs, it should not stuck in waiting.
+
+Severity : enhancement
+Bugzilla : 11842
+Description: remote_acl support
+Details : Support ACL-based permission check for remote user.
+ Support setfacl/getfacl for remote user with the utils
+ "lfs {l,r}{s,g}etfacl" which follow the same parameter format as
+ the system "{s,g}etfacl" utils.
+
+Severity : enhancement
+Bugzilla : 14288
+Description: Update to RHEL4 U6 kernel-2.6.9-67.EL.
+
+Severity : enhancement
+Bugzilla : 14368
+Description: Update to RHEL5 latest kernel-2.6.18-53.1.4.el5.
+
+Severity : normal
+Bugzilla : 14136
+Description: make mgs_setparam() handle fsname containing dash
+Details : fsname containing a dash does not work with lctl conf_param
+
+Severity : enhancement
+Bugzilla : 14388
+Description: Update to SLES10 SP1 latest kernel-2.6.16.54-0.2.3.
+
+Severity : enhancement
+Bugzilla : 14289
+Description: Update to RHEL5 Update-1 kernel 2.6.18-53.el5.
+
+Severity : major
+Bugzilla : 14260
+Frequency : rare, at shutdown
+Description: access already free / zero obd_namespace.
+Details : if client_disconnect_export was called without force flag set,
+ and exist connect request in flight, this can produce access to
+ NULL pointer (or already free pointer) when connect_interpret
+ store ocd flags in obd_namespace.
+
+Severity : minor
+Bugzilla : 14418
+Frequency : only at startup
+Description: not alloc memory with spinlock held.
+Details : allocation memory with GFP_KERNEL can produce sleep deadlock,
+ if any spinlock held.
+
+Severity : enhancement
+Bugzilla : 12211
+Description: make lustre randomly fail allocating memory
+Details : Make lustre randomly failed allocating memory for testing purpose.
+
+Severity : enhancement
+Bugzilla : 12702
+Description: lost problems with lov objid file
+Details : Fixes some scability and access to not inited memory problems
+ in work with lov objdid file.
+
+Severity : major
+Frequency : always
+Bugzilla : 14270
+Description: lfs find does not continue on file error
+Details : Continue other files processing when a file/dir is absent.
+
+Severity : normal
+Bugzilla : 11791
+Description: Inconsistent usage of lustre_pack_reply()
+Details : Standardize the usage of lustre_pack_reply() such that it
+ always generate a CERROR on failure.
+
+Severity : major
+Frequency : occasional
+Bugzilla : 13917
+Description: MDS hang or stay in waiting lock
+Details : If client receive lock with CBPENDING flag ldlm need send lock
+ cancel as separate rpc, to avoid situation when cancel request
+ can't processed due all i/o threads stay in wait lock.
+
+Severity : normal
+Bugzilla : 13969
+Description: Update to RHEL5 kernel 2.6.18-8.1.15.el5.
+
+Severity : normal
+Bugzilla : 13874
+Description: Update to SLES10 SP1 kernel 2.6.16.53-0.16
Severity : normal
Bugzilla : 13889
Bugzilla : 14041
Description: Update to RHEL4 latest kernel.
-Severity : normal
-Bugzilla : 13568
-Description: Remove 2.4 kernel support from lustre 1.8.0
-Details : Remove 2.5.0 check from quota_check.c, quota_ctl.c,
- quota_interface.c in quota subsystem.
- Remove 2.4 kernel check from obd_mount.c, linux/linux-sysctl.c in
- obdclass subsystem.
- Remove lock_24kernel from fsfilt_ext3.c fsfilt_reiserfs.c
- lvfs_linux.c upcall_cache.c in lvfs subsystem.
- Remove 2.5.0 check from dcache.c dir.c file.c llite_internal.h
- llite_lib.c llite_mmap.c lproc_llite.c namei.c rw.c symlink.c
- in llite subsystem.
- Remove 2.5.0 check from handler.c lproc_mds.c mds_fs.c
- mds_internal.h mds_lib.c mds_open.c mds_reint.c mds_xattr.c
- in mds subsystem.
- Remove 2.4 kernel check from lproc_osc.c, osc_request.c in osc
- subsystem.
- Remove 2.4 kernel check from filter.c, lproc_obdfilter.c in
- obdfilter subsystem.
- Remove 2.4 kernel code from lustre/obdecho/echo_client.c.
- Remove 2.4 kernel check from lustre/mgs/lproc_mgs.c,
- lustre/mgs/mgs_fs.c.
- Remove 2.4 kernel code from lustre/mgc/mgc_request.c.
- Remove 2.4 kernel check from lustre/mdt/mdt_lproc.c.
- Remove 2.4 kernel code from lustre/mdc/mdc_request.c.
- Remove 2.4 kernel check from lustre/lov/lproc_lov.c.
- Remove 2.4 kernel check from lustre/lmv/lproc_lmv.c.
- Remove lock_24kernel from lustre/lvfs/fsfilt_ext3.c.
- Remove 2.4 kernel code from include/liblustre.h,
- include/lustre_disk.h, include/lustre_quota.h,
- include/linux/lprocfs_status.h, include/linux/lvfs.h,
- include/linux/lustre_lite.h, include/linux/lustre_net.h,
- include/linux/lustre_user.h, include/linux/lvfs_linux.h,
- include/linux/obd_class.h, include/linux/lustre_compat25.h.
-
Severity : enhancement
Bugzilla : 13690
Description: Build SLES10 patchless client fails
-Details : The configure was broken by run ./configure with
- --with-linux-obj=.... argument for patchless client. When the
+Details : The configure was broken by run ./configure with
+ --with-linux-obj=.... argument for patchless client. When the
configure use --with-linux-obj, the LINUXINCLUDE= -Iinclude
- can't search header adequately. Use absolute path such as
- -I($LINUX)/include instead.
+ can't search header adequately. Use absolute path such as
+ -I($LINUX)/include instead.
Severity : normal
Bugzilla : 13888
Description: interrupt oig_wait produce painc on resend.
Details : brw_redo_request can be used for resend requests from ptlrpcd and
private set, and this produce situation when rq_ptlrpcd_data not
- copyed to new allocated request and triggered LBUG on assert
+ copyed to new allocated request and triggered LBUG on assert
req->rq_ptlrpcd_data != NULL. But this member used only for wakeup
ptlrpcd set if request is changed and can be safety changed to use
rq_set directly.
Bugzilla : 13497
Description: LASSERT_{REQ,REP}SWAB macros are buggy
Details : If SWAB_PARANOIA is disabled, the LASSERT_REQSWAB and
- LASSERT_REPSWAB macros become no-ops, which is incorrect. Drop
- these macros and replace them with their difinitions instead.
+ LASSERT_REPSWAB macros become no-ops, which is incorrect. Drop
+ these macros and replace them with their difinitions instead.
Severity : normal
Bugzilla : 13521
Bugzilla : 12411
Description: Remove client patches from SLES 10 kernel.
Details : This causes SLES 10 clients to behave as patchless clients
- even on a Lustre-patched (server) kernel.
+ even on a Lustre-patched (server) kernel.
Severity : enhancement
Bugzilla : 2262
Description: self-adjustable client's lru lists
-Details : use adaptive algorithm for managing client cached locks lru
- lists according to current server load, other client's work
- pattern, memory activities, etc. Both, server and client
- side namespaces provide number of proc tunables for controlling
- things
+Details : use adaptive algorithm for managing client cached locks lru
+ lists according to current server load, other client's work
+ pattern, memory activities, etc. Both, server and client
+ side namespaces provide number of proc tunables for controlling
+ things
Severity : enhancement
Bugzilla : 13641
Severity : normal
Bugzilla : 12192
Description: llapi_file_create() does not allow some changes
-Details : add llapi_file_open() that allows specifying the mode and
+Details : add llapi_file_open() that allows specifying the mode and
open flags, and also returns an open file handle.
Severity : normal
Severity : normal
Bugzilla : 10657
Description: Add journal checksum support.(Kernel part)
-Details : The journal checksum feature adds two new flags i.e
- JBD2_FEATURE_INCOMPAT_ASYNC_COMMIT and
- JBD2_FEATURE_COMPAT_CHECKSUM. JBD2_FEATURE_CHECKSUM flag
+Details : The journal checksum feature adds two new flags i.e
+ JBD2_FEATURE_INCOMPAT_ASYNC_COMMIT and
+ JBD2_FEATURE_COMPAT_CHECKSUM. JBD2_FEATURE_CHECKSUM flag
indicates that the commit block contains the checksum for
the blocks described by the descriptor blocks. Now commit
record can be sent to disk without waiting for descriptor
Severity : major
Bugzilla : 12223
Description: mds_obd_create error creating tmp object
-Details : When the user sets quota on root, llog will be affected and can't
+Details : When the user sets quota on root, llog will be affected and can't
create files and write files.
Severity : normal
Bugzilla : 12826
Description: Add EXPORT_SYMBOL check for node_to_cpumask symbol.
Details : This allows the patchless client to be loaded on architectures
- without this export.
+ without this export.
Severity : normal
Bugzilla : 13039
Severity : normal
Bugzilla : 12398
-Description: testing performance impact of enabling checksumming
-Details : enable checksum by default, allow --disable-checksum
- configure option and "-o nochecksum" mount option
+Description: enable data checksumming by default
+Details : enable checksum by default, allow --disable-checksum
+ configure option and "-o nochecksum" mount option. Checksums
+ can also be disabled at runtime via $LPROC/osc/*/checksum_pages.
Severity : normal
Bugzilla : 11802
Details : void sending multiple quota reqs to mds, which will keep the
status between the reqs.
-Severity : normal
+Severity : normal
Bugzilla : 13125
Description: osts not allocated evenly to files
-Details : change the condition to increase offset_idx
+Details : change the condition to increase offset_idx
Severity : critical
Frequency : Always for filesystems larger than 2TB on 32-bit systems.
Severity : normal
Bugzilla : 11230
-Description: Tune the kernel for good SCSI performance.
+Description: Tune the kernel for good SCSI performance.
Details : Set the value of /sys/block/{dev}/queue/max_sectors_kb
to the value of /sys/block/{dev}/queue/max_hw_sectors_kb
in mount_lustre.
Bugzilla : 12234
Description: /proc/fs/lustre/devices broken on ppc
Details : The patch as applied to 1.6.2 doesn't look correct for all arches.
- We should make sure the type of 'index' is loff_t and then cast
- explicitly as needed below. Do not assign an explicitly cast
+ We should make sure the type of 'index' is loff_t and then cast
+ explicitly as needed below. Do not assign an explicitly cast
loff_t to an int.
Severity : normal
Severity : normal
Bugzilla : 13304
Frequency : Always, for kernels after 2.6.16
-Description: Fix warning idr_remove called for id=.. which is not
- allocated.
-Details : Last kernels save old s_dev before kill super and not allow
- to restore from callback - restore it before call kill_anon_super.
+Description: Fix warning idr_remove called for id=.. which is not allocated.
+Details : Last kernels save old s_dev before kill super and not allow
+ to restore from callback - restore it before call kill_anon_super.
Severity : minor
Bugzilla : 12948
Description: buffer overruns could theoretically occur
Details : llapi_semantic_traverse() modifies the "path" argument by
- appending values to the end of the origin string, and a
- overrun may occur. Adding buffer overrun check in liblustreapi.
+ appending values to the end of the origin string, and a
+ overrun may occur. Adding buffer overrun check in liblustreapi.
Severity : normal
Bugzilla : 13334
Bugzilla : 13452
Description: Kernel config for 2.6.18-vanilla.
Details : Modify targets/2.6-vanilla.target.in.
- Add config file kernel-2.6.18-2.6-vanilla-i686.config.
- Add config file kernel-2.6.18-2.6-vanilla-i686-smp.config.
- Add config file kernel-2.6.18-2.6-vanilla-x86_64.config.
- Add config file kernel-2.6.18-2.6-vanilla-x86_64-smp.config.
+ Add config file kernel-2.6.18-2.6-vanilla-i686.config.
+ Add config file kernel-2.6.18-2.6-vanilla-i686-smp.config.
+ Add config file kernel-2.6.18-2.6-vanilla-x86_64.config.
+ Add config file kernel-2.6.18-2.6-vanilla-x86_64-smp.config.
Severity : major
Bugzilla : 11710
Description: improve handling recoverable errors
Details : if request processig with error which can be recoverable on server
- request should be resend, otherwise page released from cache and
- marked as error.
+ request should be resend, otherwise page released from cache and
+ marked as error.
Severity : critical
Bugzilla : 13751
Description: Kernel patches update for RHEL5 2.6.18-8.1.14.el5.
Details : Modify target file & which_patch.
- A flaw was found in the IA32 system call emulation provided
- on AMD64 and Intel 64 platforms. An improperly validated 64-bit
- value could be stored in the %RAX register, which could trigger an
- out-of-bounds system call table access. An untrusted local user
- could exploit this flaw to run code in the kernel
- (ie a root privilege escalation). (CVE-2007-4573).
+ A flaw was found in the IA32 system call emulation provided
+ on AMD64 and Intel 64 platforms. An improperly validated 64-bit
+ value could be stored in the %RAX register, which could trigger an
+ out-of-bounds system call table access. An untrusted local user
+ could exploit this flaw to run code in the kernel
+ (ie a root privilege escalation). (CVE-2007-4573).
Severity : major
Bugzilla : 13093
Description: O_DIRECT bypasses client statistics.
Details : When running with O_DIRECT I/O, neither the client rpc_stats nor
- read_ahead_stats were updated. copied the stats section from
- osc_send_oap_rpc() into async_internal().
+ read_ahead_stats were updated. copied the stats section from
+ osc_send_oap_rpc() into async_internal().
Severity : normal
Bugzilla : 13454
Description: Add jbd statistics patch for RHEL5 and 2.6.18-vanilla
Severity : minor
-Bugzilla : 13732
+Bugzilla : 13732
Description: change order of libsysio includes
Details : '#include sysio.h' should always come before '#include xtio.h'
Bugzilla : 11673
Description: handle "serious error: objid * already exists" more gracefully
Details : If LAST_ID value on disk is smaller than the objects existing in
- the O/0/d* directories, it indicates disk corruption and causes an
- LBUG(). If the object is 0-length, then we should use the existing
- object. This will help to avoid a full fsck in most cases.
+ the O/0/d* directories, it indicates disk corruption and causes an
+ LBUG(). If the object is 0-length, then we should use the existing
+ object. This will help to avoid a full fsck in most cases.
Severity : enhancement
Bugzilla : 13207
Description: adapt the lustre_config script to support the upgrade case
Details : Add "-u" option for lustre_config script to support upgrading 1.4
- server targets to 1.6 in parallel.
+ server targets to 1.6 in parallel.
Severity : normal
Bugzilla : 13570
Description: To avoid grant space > avaible space when the disk is almost
- full. Without this patch you might see the error "grant XXXX >
- available" or some LBUG about grant, when the disk is almost
- full.
+ full. Without this patch you might see the error "grant XXXX >
+ available" or some LBUG about grant, when the disk is almost
+ full.
Details : In filter_check_grant, for non_grant cache write, we should
- check the left space by if (*left > ungranted + bytes), instead
- of (*left > ungranted), because only we are sure the left space
- is enough for another "bytes", then the ungrant space should be
- increase. In client, we should update cl_avail_grant only there
- is OBD_MD_FLGRANT in the reply.
+ check the left space by if (*left > ungranted + bytes), instead
+ of (*left > ungranted), because only we are sure the left space
+ is enough for another "bytes", then the ungrant space should be
+ increase. In client, we should update cl_avail_grant only there
+ is OBD_MD_FLGRANT in the reply.
Severity : critical
Bugzilla : 13748
described in CVE-2007-4573. This problem could allow untrusted
local users to gain root access.
-
Severity : normal
Frequency : when using O_DIRECT and quotas
Bugzilla : 13930
Details : block usage reported by 'lfs quota' does not take into account
files that have been written with O_DIRECT.
+Severity : normal
+Frequency : always
+Bugzilla : 13976
+Description: touch file failed when fs is not full
+Details : OST in recovery should not be discarded by MDS in alloc_qos(),
+ otherwise we can get ENOSP while fs is not full.
+
+Severity : normal
+Bugzilla : 11301
+Description: parallel lock callbacks
+Details : Instead of sending blocking and completion callbacks as separated
+ requests, adding them to a set and sending in parallel.
+
+Severity : normal
+Frequency : only for Cray XT3
+Bugzilla : 12829/13455
+Description: Changing primary group doesn't change the group lustre assigns to
+ a file
+Details : When CRAY_XT3 is defined, the fsgid supplied by the client is
+ overridden with the primary group provided by the group upcall,
+ whereas the supplied fsgid can be trusted if it is in the list of
+ supplementary groups returned by the group upcall.
+
+Severity : enhancement
+Bugzilla : 14398
+Description: Allow masking D_WARNING, D_ERROR messages from console
+Details : Console messages can now be disabled via lnet.printk.
+
+Severity : normal
+Bugzilla : 14614
+Description: User code with malformed file open parameter crashes client node
+Details : Before packing join_file req, all the related reference should be
+ checked carefully in case some malformed flags cause fake
+ join_file req on client.
+
+Severity : normal
+Bugzilla : 14225
+Description: LDLM_ENQUEUE races with LDLM_CP_CALLBACK
+Details : ldlm_completion_ast() assumes that a lock is granted when the req
+ mode is equal to the granted mode. However, it should also check
+ that LDLM_FL_CP_REQD is not set.
+
+Severity : normal
+Bugzilla : 14360
+Description: Heavy nfs access might result in deadlocks
+Details : After ELC code landed, it is now improper to enqueue any mds
+ locks under och_sem, because enqueue might want to decide to
+ cancel open locks for same inode we are holding och_sem for.
+
+Severity : normal
+Bugzilla : 13843
+Description: Client eviction while running blogbench
+Details : A lot of unlink operations with concurrent I/O can lead to a
+ deadlock causing evictions. To address the problem, the number of
+ oustanding OST_DESTROY requests is now throttled to
+ max_rpcs_in_flight per OSC and LDLM_FL_DISCARD_DATA blocking
+ callbacks are processed in priority.
+
+Severity : normal
+Bugzilla : 13829
+Description: enable ACLs on MDS by default
+Details : ACLs must be enabled on MDS by default.
+
+Severity : normal
+Frequency : PPC/PPC64 only
+Bugzilla : 14845
+Description: conflicts between asm-ppc64/types.h and lustre_types.h
+Details : fix duplicated definitions between asm-ppc64/types.h and
+ lustre_types.h on PPC.
+
+Severity : normal
+Frequency : PPC/PPC64 only
+Bugzilla : 14844
+Description: asm-ppc/segment.h does not exist
+Details : fix compile issue on PPC.
+
+Severity : normal
+Bugzilla : 14864
+Description: better handle error messages in extents code
+
+Severity : normal
+Frequency : RHEL4 only
+Bugzilla : 14618
+Description: mkfs is very slow on IA64/RHEL4
+Details : A performance regression has been discovered in the MPT Fusion
+ driver between versions 3.02.73rh and 3.02.99.00rh. As a
+ consequence, we have downgraded the MPT Fusion driver in the RHEL4
+ kernel from 3.02.99.00 to 3.02.73 until this problem is fixed.
+
+Severity : enhancement
+Bugzilla : 14729
+Description: SNMP support enhancement
+Details : Adding total number of sampled request for an MDS node in snmp
+ support.
+
+Severity : enhancement
+Bugzilla : 14748
+Description: Optimize ldlm waiting list processing for PR extent locks
+Details : When processing waiting list for read extent lock and meeting read
+ lock that is same or wider to it that is not contended, skip
+ processing rest of the list and immediatelly return current
+ status of conflictness, since we are guaranteed there are no
+ conflicting locks in the rest of the list.
+
+Severity : normal
+Bugzilla : 14774
+Description: Time out and refuse to reconnect
+Details : When the failover node is the primary node, it is possible
+ to have two identical connections in imp_conn_list. We must
+ compare not conn's pointers but NIDs, otherwise we can defeat
+ connection throttling.
+
+Severity : normal
+Bugzilla : 13821
+Description: port llog fixes from b1_6 into HEAD
+Details : Port llog reference couting and some llog cleanups from b1_6
+ (bug 10800) into HEAD, for protect from panic and access to already
+ free llog structures.
+
+Severity : normal
+Bugzilla : 14483
+Description: Detect stride IO mode in read-ahead
+Details : When a client does stride read, read-ahead should detect that and
+ read-ahead pages according to the detected stride pattern.
+
+Severity : normal
+Bugzilla : 13805
+Description: data checksumming impacts single node performance
+Details : add support for several checksum algorithm. Currently, only CRC32
+ and Adler-32 are supported. The checksum type can be changed on
+ the fly via /proc/fs/lustre/osc/*/checksum_type.
+
+Severity : normal
+Bugzilla : 14648
+Description: use adler32 for page checksums
+Details : when available, use the Adler-32 algorithm instead of CRC32 for
+ page checksums.
+
+Severity : normal
+Bugzilla : 15033
+Description: build for x2 fails
+Details : fix compile issue on Cray systems.
+
+Severity : normal
+Bugzilla : 14379
+Description: Properly match for duplicate locks
+Details : Due to different lock order from skiplists code, we need to
+ traverse entire list for now
+
+Severity : normal
+Frequency : only on PPC/SLES10
+Bugzilla : 14855
+Description: "BITS_PER_LONG is not 32 or 64" in linux/idr.h
+Details : On SLES10/PPC, fs.h includes idr.h which requires BITS_PER_LONG to
+ be defined. Add a hack in mkfs_lustre.c to work around this compile
+ issue.
+
+Severity : normal
+Bugzilla : 14257
+Description: LASSERT on MDS when client holding flock lock dies
+Details : ldlm pool logic depends on number of granted locks equal to
+ number of released locks which is not true for flock locks, so
+ just exclude such locks from consideration.
+
+Severity : normal
+Bugzilla : 15188
+Description: MDS deadlock with many ll_sync_lov threads and I/O stalled
+Details : Use fsfilt_sync() for both the whole filesystem sync and
+ individual file sync to eliminate dangerous inode locking
+ with I_LOCK that can lead to a deadlock.
+
+Severity : normal
+Bugzilla : 14410
+Description: performance in 1.6.3
+Details : Force q->max_phys_segments to MAX_PHYS_SEGMENTS on SLES10 to be
+ sure that 1MB requests are not fragmented by the block layer.
+
+Severity : enhancement
+Bugzilla : 11089
+Description: organize the server-side client stats on per-nid basis
+Details : Change the structure of stats under obdfilter and mds to
+ New structure:
+ +- exports
+ +- nid#1
+ | + stats
+ | + uuids
+ +- nid#2...
+ +- clear
+ The "uuid"s file would list the uuids of _active_ exports.
+ And the clear entry is to clear all stats and stale nids.
+
+Severity : enhancement
+Bugzilla : 11270
+Description: eliminate client locks in face of contention
+Details : file contention detection and lockless i/o implementation
+ for contended files.
+
+Severity : normal
+Bugzilla : 15212
+Description: Reinitialize optind to 0 so that interactive lfs works in all cases
+
--------------------------------------------------------------------------------
2007-08-10 Cluster File Systems, Inc. <info@clusterfs.com>
Severity : minor
Bugzilla : 11706
Description: service threads may hog cpus when there are a lot of requests
- coming
-Details : Insert cond_resched to give other threads a chance to use some of
- the cpu
+Details : Insert cond_resched to give other threads a chance to use some CPU
Severity : normal
Frequency : rare
Bugzilla : 12597
Description: brw_stats were being printed incorrectly
Details : brw_stats were being printed as log2 but all of them were not
- recorded as log2. Also remove some code duplication arising from
- filter_tally_{read,write}.
+ recorded as log2. Also remove some code duplication arising from
+ filter_tally_{read,write}.
Severity : normal
Bugzilla : 11674
Frequency : rare, only in recovery.
Description: ASSERTION(req->rq_type != LI_POISON) failed
Details : imp_lock should be held while iterating over imp_sending_list for
- prevent destroy request after get timeout in ptlrpc_queue_wait.
+ prevent destroy request after get timeout in ptlrpc_queue_wait.
Severity : normal
Bugzilla : 12689
Description: replay-single.sh test 52 fails
Details : A lock's skiplist need to be cleanup when it being unlinked
- from its resource list.
+ from its resource list.
Severity : normal
Bugzilla : 11737
Description: Short directio read returns full requested size rather than
- actual amount read.
+ actual amount read.
Details : Direct I/O operations should return actual amount of bytes
- transferred rather than requested size.
+ transferred rather than requested size.
Severity : enhancement
Bugzilla : 10589
Description: metadata RPC reduction (e.g. for rm performance)
Details : decrease the amount of synchronous RPC between clients and servers
- by canceling conflicing lock before the operation on the client side
- and packing thier handles into the main operation RPC to server.
+ by canceling conflicing lock before the operation on the client
+ and packing thier handles into the main operation RPC to server.
Severity : enhancement
Bugzilla : 4900
Description: Async OSC create to avoid the blocking unnecessarily.
Details : If a OST has no remain object, system will block on the creating
- when need to create a new object on this OST. Now, ways use
- pre-created objects when available, instead of blocking on an
- empty osc while others are not empty. If we must block, we block
- for the shortest possible period of time.
+ when need to create a new object on this OST. Now, ways use
+ pre-created objects when available, instead of blocking on an
+ empty osc while others are not empty. If we must block, we block
+ for the shortest possible period of time.
+
+Severity : enhancement
+Bugzilla : 12702
+Description: refine locking for avoid write wrong info into lov_objid file
+Details : fix possible races with add new target and write/update data in
+ lov_objid file.
--------------------------------------------------------------------------------
this on patchless clients the deathrow inode reaper is turned
off, and we depend on the VM to clean up old inodes. This
dependency was during via the fix for bug 12181.
-
+
--------------------------------------------------------------------------------
2007-04-19 Cluster File Systems, Inc. <info@clusterfs.com>
Bugzilla : 4226
Description: Permanently set tunables
Details : All writable /proc/fs/lustre tunables can now be permanently
- set on a per-server basis, at mkfs time or on a live
- system.
+ set on a per-server basis, at mkfs time or on a live system.
Severity : enhancement
Bugzilla : 10547
Bugzilla : 6062
Description: SPEC SFS validation failure on NFS v2 over lustre.
Details : Changes the blocksize for regular files to be 2x RPC size,
- and not depend on stripe size.
+ and not depend on stripe size.
Severity : enhancement
Bugzilla : 9293
Description: Multiple MD RPCs in flight.
Details : Further unserialise some read-only MDS RPCs - learn about intents.
- To avoid overly-overloading MDS, introduce a limit on number of
- MDS RPCs in flight for a single client and add /proc controls
- to adjust this limit.
+ To avoid overly-overloading MDS, introduce a limit on number of
+ MDS RPCs in flight for a single client and add /proc controls
+ to adjust this limit.
Severity : enhancement
Bugzilla : 22484