[Top] [All Lists]

Re: [PATCH 8/8] xfs: fix dquot shaker deadlock

To: Christoph Hellwig <hch@xxxxxxxxxxxxx>
Subject: Re: [PATCH 8/8] xfs: fix dquot shaker deadlock
From: Dave Chinner <david@xxxxxxxxxxxxx>
Date: Thu, 27 Jan 2011 12:54:20 +1100
Cc: xfs@xxxxxxxxxxx
In-reply-to: <20110125095242.GB23990@xxxxxxxxxxxxx>
References: <1295945444-29488-1-git-send-email-david@xxxxxxxxxxxxx> <1295945444-29488-9-git-send-email-david@xxxxxxxxxxxxx> <20110125095242.GB23990@xxxxxxxxxxxxx>
User-agent: Mutt/1.5.20 (2009-06-14)
On Tue, Jan 25, 2011 at 04:52:42AM -0500, Christoph Hellwig wrote:
> On Tue, Jan 25, 2011 at 07:50:44PM +1100, Dave Chinner wrote:
> > From: Dave Chinner <dchinner@xxxxxxxxxx>
> > 
> > Commit 368e136 ("xfs: remove duplicate code from dquot reclaim") fails
> > to unlock the dquot freelist when the number of loop restarts is
> > exceeded in xfs_qm_dqreclaim_one(). This causes hangs in memory
> > reclaim. Remove the bogus loop exit check that causes the problem.
> The fix looks correct, but it's a bit inconsequential about when
> to adhere the retry limit and when not.  Shouldn't we just turn the
> exit condition into:
>       if (dqout || restarts >= XFS_QM_RECLAIM_MAX_RESTARTS)
>               break;

I'm guessing that you are suggesting changing the code increments
and checks restarts to:

        goto startagain;

otherwise this doesn't make sense as restarts will never go above
XFS_QM_RECLAIM_MAX_RESTARTS at this point in the loop.

Even so, I don't think this is a good idea, because it means we have
to get to the end of the loop before we check restarts for loop
termination. Given that the restarts is counting the number of times
we fail to get locks, encounter dquots that are being recycled, etc,
this would render then restart counter ineffective as a method of
determining whether we are making progress or not. Hence I think
that simply removing the check that I did is the correct fix.

> also the failure to acquite qi_dqlist_lock increments the restart
> count twice, which was also added in the same commit.

I'll fix that, though.


Dave Chinner

<Prev in Thread] Current Thread [Next in Thread>