To: xfs@xxxxxxxxxxx
Subject: Re: XFS hung task in xfs_ail_push_all_sync() when unmounting FS after disk failure/recovery
From: Dave Chinner <david@xxxxxxxxxxxxx>
Date: Thu, 24 Mar 2016 09:37:47 +1100
Delivered-to: xfs@xxxxxxxxxxx
In-reply-to: <20160323153221.GA19456@xxxxxxxxxx>
References: <f049419a2ab10f8e3c4fef0e4f4ca1ba@xxxxxxxxxxxxxx> <20160322121922.GA53693@xxxxxxxxxxxxxxx> <6457b1d9de271ec6cca6bc2626aac161@xxxxxxxxxxxxxx> <20160322140345.GA54245@xxxxxxxxxxxxxxx> <0f3832c45509f444f55fda2aaf9c9deb@xxxxxxxxxxxxxx> <20160323123010.GA43073@xxxxxxxxxxxxxxx> <20160323153221.GA19456@xxxxxxxxxx>
User-agent: Mutt/1.5.21 (2010-09-15)
On Wed, Mar 23, 2016 at 04:32:21PM +0100, Carlos Maiolino wrote:
> I'm still trying to get a reliable reproducer, at least exactly with what I
> have seen a few days ago.
> 
> Shyam, could you try to reproduce it with a recent/upstream kernel? That would
> be great to make sure we are seeing the same issue.
> 
> AFAICT, it happens in the following situation:
> 
> 1 - Something is written to the filesystem
> 2 - log checkpoint is done for the previous write
> 3 - Disk failure
> 4 - XFS tries to writeback metadata logged in [2]
> 
> When [4] happens, I can't trigger xfs_log_force messages all the time; most of
> the time I just get an infinite loop of these messages:
> 
> [12694.318109] XFS (dm-0): Failing async write on buffer block 0xffffffffffffffff. Retrying async write.
>
> Sometimes I can trigger the xfs_log_force() loop.

This all smells like the filesystem is getting IO errors but is not
in a shutdown state. What happens when you run 'xfs_io -x -c
"shutdown" /mnt/pt' on a filesystem in this state? Can you then
unmount it?
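
A minimal sketch of that test, assuming the filesystem is mounted at
/mnt/pt as in the command above. The run and shutdown_and_umount
helpers and the MNT and DRY_RUN variables are hypothetical names for
illustration, not part of xfsprogs. The script defaults to a dry run
that only prints the commands; set DRY_RUN=0 (as root) to actually
execute them.

```shell
#!/bin/sh
# Sketch: force an XFS shutdown, then try to unmount.
# MNT and DRY_RUN are illustrative variables, not xfsprogs settings.
MNT="${MNT:-/mnt/pt}"
DRY_RUN="${DRY_RUN:-1}"

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "would run: $*"
    else
        "$@"
    fi
}

shutdown_and_umount() {
    # -x enables xfs_io expert mode, which provides the shutdown command.
    run xfs_io -x -c "shutdown" "$MNT" || return 1
    # If the shutdown took effect, the unmount should no longer hang.
    run umount "$MNT"
}

shutdown_and_umount
```

If the unmount completes after the forced shutdown, that would be
consistent with the filesystem having been stuck retrying IO without
ever shutting down.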

Cheers,

Dave.
-- 
Dave Chinner
david@xxxxxxxxxxxxx
