LU-16736 quota: set revoke time to avoid endless wait
author Hongchao Zhang <hongchao@whamcloud.com>
Fri, 26 Jan 2024 10:57:34 +0000 (18:57 +0800)
committer Andreas Dilger <adilger@whamcloud.com>
Sat, 27 Apr 2024 22:32:47 +0000 (22:32 +0000)
The revoke time of an lquota entry should be set when its qunit
reaches the least qunit. In some rare cases it is never set, which
could be related to a broken quota LDLM lock. Set it explicitly in
"qmt_acquire" to avoid an endless wait in QSD.

Lustre-change: https://review.whamcloud.com/50626
Lustre-commit: 49730821c4e5116f188c931830ce23b2da2d8a41

Signed-off-by: Hongchao Zhang <hongchao@whamcloud.com>
Change-Id: Ib68c5dc881346e0e619d43553ee490847ae5e225
Reviewed-on: https://review.whamcloud.com/c/ex/lustre-release/+/54907
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Sergey Cheremencev <scherementsev@ddn.com>
Tested-by: jenkins <devops@whamcloud.com>
Tested-by: Maloo <maloo@whamcloud.com>
lustre/quota/qmt_entry.c

index 9fdd79a..a3ff599 100644
@@ -536,8 +536,17 @@ bool qmt_adjust_edquot(struct lquota_entry *lqe, __u64 now)
                                RETURN(false);
 
                        /* least qunit value not sent to all slaves yet */
-                       if (lqe->lqe_revoke_time == 0)
+                       if (lqe->lqe_revoke_time == 0 &&
+                           !lqe->lqe_gl && list_empty(&lqe->lqe_link)) {
+                               /* LU-16736: the revoke_time should be set when
+                                * the qunit reaches the least qunit, but the
+                                * quota LDLM lock could encounter some issue;
+                                * set it here to avoid endless wait in QSD. */
+                               LQUOTA_ERROR(lqe, "set revoke_time explicitly");
+
+                               lqe->lqe_revoke_time = ktime_get_seconds();
                                RETURN(false);
+                       }
 
                        /* Let's give more time to slave to release space */
                        lapse = ktime_get_seconds() - QMT_REBA_TIMEOUT;
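
The branch added by this hunk can be modelled in user space to make its effect easier to see. The sketch below is illustrative only and is not Lustre code. The struct toy_lqe, its fields, the enum toy_result and the function toy_check_revoke are hypothetical names invented for the example. It assumes that lqe_gl marks a glimpse in progress and that a non-empty lqe_link means the entry is queued on some list; the hunk itself does not confirm either reading. The timeout logic that follows the branch in qmt_adjust_edquot() is not modelled.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical, simplified stand-in for struct lquota_entry. */
struct toy_lqe {
	int64_t revoke_time; /* models lqe_revoke_time; 0 means "never set" */
	bool    gl;          /* models lqe_gl (assumed: glimpse in progress) */
	bool    linked;      /* models !list_empty(&lqe->lqe_link) */
};

enum toy_result {
	TOY_STAMPED,      /* revoke time set explicitly; real code returns false */
	TOY_FALL_THROUGH  /* branch not taken; real code goes on to the timeout check */
};

/*
 * Mirrors only the patched condition: stamp the revoke time when it is
 * still unset and nothing else appears to be working on the entry.
 * "now" stands in for ktime_get_seconds().
 */
static enum toy_result toy_check_revoke(struct toy_lqe *lqe, int64_t now)
{
	assert(lqe != NULL);

	if (lqe->revoke_time == 0 && !lqe->gl && !lqe->linked) {
		lqe->revoke_time = now;
		return TOY_STAMPED;
	}
	return TOY_FALL_THROUGH;
}
```

In this model an idle entry with an unset revoke time is stamped once and falls through on every later call, so a waiter that depends on the revoke time is no longer blocked indefinitely. An entry with gl or linked set is left untouched, which corresponds to the case where the patched code no longer returns early and continues to the code after the branch.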