X-Git-Url: https://git.whamcloud.com/?a=blobdiff_plain;f=lustre%2FChangeLog;h=f9ebc36882edd67ade4405b2a822019f27fbe1f7;hb=9b3a6b6f5bd13e79c6a5509f6032211549df004f;hp=7b8ecd5d7ed6af0b96d7e8ed7e39cbd2700842d3;hpb=ed587060e225a85aa5eff3ba583a8b6508eaab7b;p=fs%2Flustre-release.git diff --git a/lustre/ChangeLog b/lustre/ChangeLog index 7b8ecd5..f9ebc36 100644 --- a/lustre/ChangeLog +++ b/lustre/ChangeLog @@ -1,8 +1,8 @@ tbd Sun Microsystems, Inc. * version 2.0.0 * Support for kernels: - 2.6.16.60-0.23 (SLES 10), - 2.6.18-92.1.6.el5 (RHEL 5), + 2.6.16.60-0.27 (SLES 10), + 2.6.18-92.1.10.el5 (RHEL 5), 2.6.22.14 vanilla (kernel.org). * Client support for unpatched kernels: (see http://wiki.lustre.org/index.php?title=Patchless_Client) @@ -12,6 +12,117 @@ tbd Sun Microsystems, Inc. * RHEL 4 and RHEL 5/SLES 10 clients behaves differently on 'cd' to a removed cwd "./" (refer to Bugzilla 14399). +Severity : enhancement +Bugzilla : 16643 +Description: Generic /proc file permissions +Details : Set /Proc file permissions in a more generic way to enable non- + root users operate on some /proc files. + +Severity : major +Bugzilla : 16561 +Description: Hitting mdc_commit_close() ASSERTION +Details : Properly handle request reference release in + ll_release_openhandle(). + +Severity : normal +Bugzilla : 15975 +Frequency : only patchless client +Description: add workaround for race between add/remove dentry from hash + +Severity : enhancement +Bugzilla : 16845 +Description: Allow OST glimpses to return PW locks + +Severity : minor +Bugzilla : 16717 +Description: LBUG when llog conf file is full +Details : When llog bitmap is full, ENOSPC should be returned for plain + log. + +Severity : normal +Bugzilla : 16907 +Description: Prevent import from entering FULL state when server in recovery + +Severity : major +Bugzilla : 16750 +Description: service mount cannot take device name with ":" +Details : Only when device name contains ":/" will mount treat it as + client mount. + +Severity : normal +Bugzilla : 15927 +Frequency : rare +Description: replace ptlrpcd with the statahead thread to interpret the async + statahead RPC callback + +Severity : normal +Bugzilla : 16611 +Frequency : on recovery +Description: I/O failures after umount during fail back +Details : if client reconnected to restarted server we need join to recovery + instead of find server handler is changed and process self eviction + with cancel all locks. + +Severity : enhancement +Bugzilla : 16633 +Description: Update to RHEL5 kernel-2.6.18-92.1.10.el5. + +Severity : enhancement +Bugzilla : 16547 +Description: Update to SLES10 SP2 kernel-2.6.16.60-0.27. + +Severity : enhancement +Bugzilla : 16566 +Description: Upcall on Lustre log has been dumped +Details : Allow for a user mode script to be called once a Lustre log has + been dumped. It passes the filename of the dumped log to the + script, the location of the script can be specified via + /proc/sys/lnet/debug_log_upcall. + +Severity : minor +Bugzilla : 16583 +Frequency : rare +Description: avoid idr_remove called for id which is not allocated. +Details : Move assigment s_dev for clustered nfs to end of initialization, + to avoid problem with error handling. + +Severity : minor +Bugzilla : 16109 +Frequency : rare +Description: avoid Already found the key in hash [CONN_UNUSED_HASH] messages +Details : When connection is reused this not moved from CONN_UNUSED_HASH + into CONN_USED_HASH and this prodice warning when put connection + again in unused hash. + +Severity : enhancement +Bugzilla : 16573 +Description: Export bytes_read/bytes_write count on OSC/OST. + +Severity : normal +Bugzilla : 16237 +Description: Early reply size mismatch, MGC loses connection +Details : Apply the MGS_CONNECT_SUPPORTED mask at reconnect time so + the connect flags are properly negotiated. + +Severity : normal +Bugzilla : 16006 +Description: Properly propagate oinfo flags from lov to osc for statfs +Details : restore missing copy oi_flags to lov requests. + +Severity : enhancement +Bugzilla : 16581 +Description: Add man pages for llobdstat(8), llstat(8), plot-llstat(8), + : l_getgroups(8), lst(8), routerstat(8) +Details : included man pages for llobdstat(8), llstat(8), + : plot-llstat(8), l_getgroups(8), lst(8), routerstat(8) + +Severity : enhancement +Bugzilla : 16091 +Description: configure's --enable-quota should check the + : kernel .config for CONFIG_QUOTA +Details : configure is terminated if --enable-quota is passed but + : no quota support is in kernel + Severity : normal Bugzilla : 13139 Description: Remove portals compatibility @@ -56,7 +167,7 @@ Severity : normal Bugzilla : 15825 Description: Kernel BUG tries to release flock Details : Lustre does not destroy flock lock before last reference goes - away. So always drop flock locks when client is evicted and + away. So always drop flock locks when client is evicted and perform unlock regardless of successfulness of speaking to MDS. Severity : normal @@ -970,7 +1081,7 @@ Details : Console messages can now be disabled via lnet.printk. Severity : normal Bugzilla : 14614 -Description: User code with malformed file open parameter crashes client node +Description: User code with malformed file open parameter crashes client node Details : Before packing join_file req, all the related reference should be checked carefully in case some malformed flags cause fake join_file req on client. @@ -1101,7 +1212,7 @@ Severity : normal Bugzilla : 14257 Description: LASSERT on MDS when client holding flock lock dies Details : ldlm pool logic depends on number of granted locks equal to - number of released locks which is not true for flock locks, so + number of released locks which is not true for flock locks, so just exclude such locks from consideration. Severity : normal @@ -1121,15 +1232,15 @@ Severity : enhancement Bugzilla : 11089 Description: organize the server-side client stats on per-nid basis Details : Change the structure of stats under obdfilter and mds to - New structure: - +- exports - +- nid#1 - | + stats - | + uuids - +- nid#2... - +- clear - The "uuid"s file would list the uuids of _active_ exports. - And the clear entry is to clear all stats and stale nids. + New structure: + +- exports + +- nid#1 + | + stats + | + uuids + +- nid#2... + +- clear + The "uuid"s file would list the uuids of _active_ exports. + And the clear entry is to clear all stats and stale nids. Severity : enhancement Bugzilla : 11270 @@ -1159,15 +1270,15 @@ Severity : normal Bugzilla : 15346 Description: skiplist implementation simplification Details : skiplists are used to group compatible locks on granted list - that was implemented as tracking first and last lock of each lock group - the patch changes that to using doubly linked lists + that was implemented as tracking first and last lock of each + lock group the patch changes that to using doubly linked lists Severity : normal Bugzilla : 15574 Description: MDS LBUG: ASSERTION(!IS_ERR(dchild)) Details : Change LASSERTs to client eviction (i.e. abort client's recovery) - because LASSERT on both the data supplied by a client, and the data - on disk is dangerous and incorrect. + because LASSERT on both the data supplied by a client, and the + data on disk is dangerous and incorrect. Severity : enhancement Bugzilla : 10718 @@ -1269,6 +1380,27 @@ Bugzilla : 16140 Description: journal_dev option does not work in b1_6 Details : pass mount option during pre-mount. +Severity : enhancement +Bugzilla : 10555 +Description: Add a FIEMAP(FIle Extent MAP) ioctl +Details : FIEMAP ioctl will allow an application to efficiently fetch the + extent information of a file. It can be used to map logical blocks + in a file to physical blocks in the block device. + +Severity : normal +Bugzilla : 15198 +Description: LDLM soft lockups - improvement +Details : It is be possible to send the lock handle along with each read + or write request because the client is already doing a lock match + itself so there isn't any reason the OST should have to re-do that + search. + +Severity : normal +Frequency : only X2 +Bugzilla : 16813 +Description: X2 build failures +Details : fix build failures on Cray X2. + -------------------------------------------------------------------------------- 2007-08-10 Cluster File Systems, Inc. @@ -1299,7 +1431,7 @@ Severity : minor Frequency : at statup only Bugzilla : 12860 Description: mds_lov_synchronize race leads to various problems -Details : simultaneous MDT->OST connections at startup can cause the +Details : simultaneous MDT->OST connections at startup can cause the sync to abort, leaving the OSC in a bad state. Severity : enhancement @@ -1468,7 +1600,7 @@ Details : When osc reconnect ost, OST(filter) should check whether it to update the client grant space info. Severity : normal -Frequency : when client reconnect to OST +Frequency : when client reconnect to OST Bugzilla : 11662 Description: Grant Leak when osc do resend and replay bulk write Details : When osc reconnect to OST, OST(filter)should clear grant info of @@ -1477,33 +1609,33 @@ Details : When osc reconnect to OST, OST(filter)should clear grant info of these of resend/replay write req. Severity : normal -Frequency : rare +Frequency : rare Bugzilla : 11662 Description: Grant space more than avaiable left space sometimes. Details : When then OST is about to be full, if two bulk writing from different clients came to OST. Accord the avaliable space of the OST, the first req should be permitted, and the second one - should be denied by ENOSPC. But if the seconde arrived before + should be denied by ENOSPC. But if the seconde arrived before the first one is commited. The OST might wrongly permit second writing, which will cause grant space > avaiable space. Severity : normal -Frequency : when client is evicted +Frequency : when client is evicted Bugzilla : 12371 Description: Grant might be wrongly erased when osc is evicted by OST -Details : when the import is evicted by server, it will fork another - thread ptlrpc_invalidate_import_thread to invalidate the - import, where the grant will be set to 0. While the original - thread will update the grant it got when connecting. So if - the former happened latter, the grant will be wrongly errased +Details : when the import is evicted by server, it will fork another + thread ptlrpc_invalidate_import_thread to invalidate the + import, where the grant will be set to 0. While the original + thread will update the grant it got when connecting. So if + the former happened latter, the grant will be wrongly errased because of this race. Severity : normal -Frequency : rare +Frequency : rare Bugzilla : 12401 -Description: Checking Stale with correct fid -Details : ll_revalidate_it should uses de_inode instead of op_data.fid2 - to check whether it is stale, because sometimes, we want the +Description: Checking Stale with correct fid +Details : ll_revalidate_it should uses de_inode instead of op_data.fid2 + to check whether it is stale, because sometimes, we want the enqueue happened anyway, and op_data.fid2 will not be initialized. Severity : enhancement @@ -1519,29 +1651,29 @@ Details : size of struct ll_inode_info is to big for union inode.u and this can be cause of random memory corruption. Severity : normal -Frequency : rare +Frequency : rare Bugzilla : 10818 Description: Memory leak in recovery Details : Lov_mds_md was not free in an error handler in mds_create_object. - It should also check obd_fail before fsfilt_start, otherwise if + It should also check obd_fail before fsfilt_start, otherwise if fsfilt_start return -EROFS,(failover mds during mds recovery). - then the req will return with repmsg->transno = 0 and rc = EROFS. + then the req will return with repmsg->transno = 0 and rc = EROFS. and we met hit the assert LASSERT(req->rq_reqmsg->transno == - req->rq_repmsg->transno) in ptlrpc_replay_interpret. Fcc should + req->rq_repmsg->transno) in ptlrpc_replay_interpret. Fcc should be freed no matter whether fsfilt_commit success or not. Severity : minor Frequency : only with huge count clients Bugzilla : 11817 -Description: Prevents from taking the superblock lock in llap_from_page for +Description: Prevents from taking the superblock lock in llap_from_page for a soon died page. -Details : using LL_ORIGIN_REMOVEPAGE origin flag instead of LL_ORIGIN_UNKNOW - for llap_from_page call in ll_removepage prevents from taking the +Details : using LL_ORIGIN_REMOVEPAGE origin flag instead of LL_ORIGIN_UNKNOW + for llap_from_page call in ll_removepage prevents from taking the superblock lock for a soon died page. Severity : normal Frequency : rare -Bugzilla : 11935 +Bugzilla : 11935 Description: Not check open intent error before release open handle Details : in some rare cases, the open intent error is not checked before release open handle, which may cause @@ -1550,9 +1682,9 @@ Details : in some rare cases, the open intent error is not checked before Severity : normal Frequency : rare -Bugzilla : 12556 -Description: Set cat log bitmap only after create log success. -Details : in some rare cases, the cat log bitmap is set too early. and it +Bugzilla : 12556 +Description: Set cat log bitmap only after create log success. +Details : in some rare cases, the cat log bitmap is set too early. and it should be set only after create log success. Severity : major @@ -1569,11 +1701,11 @@ Details : Insert cond_resched to give other threads a chance to use some CPU Severity : normal Frequency : rare -Bugzilla : 12086 -Description: the cat log was not initialized in recovery +Bugzilla : 12086 +Description: the cat log was not initialized in recovery Details : When mds(mgs) do recovery, the tgt_count might be zero, so the unlink log on mds will not be initialized until mds post - recovery. And also in mds post recovery, the unlink log will + recovery. And also in mds post recovery, the unlink log will initialization will be done asynchronausly, so there will be race between add unlink log and unlink log initialization. @@ -1594,7 +1726,7 @@ Details : imp_lock should be held while iterating over imp_sending_list for Severity : normal Bugzilla : 12689 Description: replay-single.sh test 52 fails -Details : A lock's skiplist need to be cleanup when it being unlinked +Details : A lock's skiplist need to be cleanup when it being unlinked from its resource list. Severity : normal @@ -1615,15 +1747,22 @@ Severity : enhancement Bugzilla : 4900 Description: Async OSC create to avoid the blocking unnecessarily. Details : If a OST has no remain object, system will block on the creating - when need to create a new object on this OST. Now, ways use - pre-created objects when available, instead of blocking on an - empty osc while others are not empty. If we must block, we block - for the shortest possible period of time. + when need to create a new object on this OST. Now, ways use + pre-created objects when available, instead of blocking on an + empty osc while others are not empty. If we must block, we block + for the shortest possible period of time. + +Severity : major +Bugzilla : 11710 +Description: improve handling recoverable errors +Details : if request processig with error which can be recoverable on server + request should be resend, otherwise page released from cache and + marked as error. Severity : enhancement Bugzilla : 12702 Description: refine locking for avoid write wrong info into lov_objid file -Details : fix possible races with add new target and write/update data in +Details : fix possible races with add new target and write/update data in lov_objid file. -------------------------------------------------------------------------------- @@ -1657,7 +1796,7 @@ Details : The __iget() symbol export is missing. To avoid the need for special upgrade step is needed. Please read the user documentation before upgrading any part of a live system. * WIRE PROTOCOL CHANGE from previous 1.6 beta versions. This - version will not interoperate with 1.6 betas before beta5 (1.5.95). + version will not interoperate with 1.6 betas before beta5 (1.5.95). * WARNING: Lustre configuration and startup changes are required with this release. See https://mail.clusterfs.com/wikis/lustre/MountConf for details. @@ -1696,7 +1835,7 @@ Bugzilla : 9862 Description: optimized stripe assignment Details : stripe assignments are now made based on ost space available, ost previous usage, and OSS previous usage, in order to try - to optimize storage space and networking resources. + to optimize storage space and networking resources. Severity : enhancement Bugzilla : 4226 @@ -1733,16 +1872,16 @@ Severity : enhancement Bugzilla : 22484 Description: client read/write statistics Details : Add client read/write call usage stats for performance - analysis of user processes. + analysis of user processes. /proc/fs/lustre/llite/*/offset_stats shows non-sequential file access. extents_stats shows chunk size distribution. extents_stats_per_process show chunk size distribution per - user process. + user process. Severity : enhancement Bugzilla : 22486 Description: mds statistics -Details : Add detailed mds operations statistics in +Details : Add detailed mds operations statistics in /proc/fs/lustre/mds/*/stats. Severity : minor @@ -3982,7 +4121,7 @@ Severity : Minor Frequency : Rare Bugzilla : 11248 Description: merge and cleanup kernel patches. -Details : +Details : -----------------------------------------------------------------------------