Whamcloud - gitweb
Landing of b_recovery (at last).
authorshaver <shaver>
Thu, 28 Nov 2002 20:21:24 +0000 (20:21 +0000)
committershaver <shaver>
Thu, 28 Nov 2002 20:21:24 +0000 (20:21 +0000)
commit4d477d1468cf4be4c37681610b3d726fd27f229f
treeab7629e37f957523e34d16f319bd123a98096055
parentef1e48d8856d66a6b23bca3fe6ef9deb81a9b90d
Landing of b_recovery (at last).

Highlights:
  - b=324: MDS recovery must replay transactions in strict transno sequence
  - b=325: getattr after OST failure returns -EIO
  - b=326: unlink after OST failure returns -EIO
  - b=400: new client can't join cluster after OST failure
  - b=403: multi-client access failure when OST fails
  - b=410: After an OST failure, lfind incorrectly displays file information
  - b=417: Freeing unreplayable requests twice (aed's fix from b_md)
  - b=402: (partial) give error for lstripe request that exceeds configured OSTs
  - much better support for reconnecting to MDS after network partition
    (still some lock-repeating issues to be resolved for some requests)
  - better support for connecting to multiple MDSes on one host (xid and
    transno and request_list are all per-import now)
  - track disconnecting clients in last_rcvd, for more reliable recovery
  - also, sync last_rcvd after connect/disconnect
  - reduced syslog/CERROR output for recovery (hi, Terry!)
  - server (DLM) timeout is half the system-wide timeout, to avoid cascading
    failure in the face of a dead client
  - don't wait for recovery to finish in order to send disconnect messages
  - removal of c_dying_head
  - don't wait for timeout to trigger recovery after ptl_send_rpc error
  - strict MDS transno ordering via mds_transno_sem (non-optimal, but correct)
  - many !handle -> IS_ERR(handle) fixes around mds_fs_start callers.
  - turn on client-eviction for bulk-timeouts in OST and MDS
31 files changed:
lustre/include/linux/lustre_export.h
lustre/include/linux/lustre_ha.h
lustre/include/linux/lustre_idl.h
lustre/include/linux/lustre_import.h
lustre/include/linux/lustre_lib.h
lustre/include/linux/lustre_mds.h
lustre/include/linux/lustre_net.h
lustre/include/linux/obd.h
lustre/ldlm/ldlm_lockd.c
lustre/ldlm/ldlm_request.c
lustre/lib/client.c
lustre/lib/target.c
lustre/llite/file.c
lustre/llite/recover.c
lustre/llite/super.c
lustre/lov/lov_obd.c
lustre/mdc/mdc_reint.c
lustre/mdc/mdc_request.c
lustre/mds/handler.c
lustre/mds/mds_fs.c
lustre/mds/mds_reint.c
lustre/obdclass/class_obd.c
lustre/obdclass/genops.c
lustre/osc/osc_request.c
lustre/ost/ost_handler.c
lustre/ptlrpc/client.c
lustre/ptlrpc/connection.c
lustre/ptlrpc/recovd.c
lustre/ptlrpc/recover.c
lustre/ptlrpc/rpc.c
lustre/utils/lconf