To: xfs@xxxxxxxxxxx
Subject: Re: XFS hung task in xfs_ail_push_all_sync() when unmounting FS after disk failure/recovery
From: Carlos Maiolino <cmaiolino@xxxxxxxxxx>
Date: Thu, 24 Mar 2016 12:08:59 +0100
Delivered-to: xfs@xxxxxxxxxxx
In-reply-to: <20160323223747.GX30721@dastard>
Mail-followup-to: xfs@xxxxxxxxxxx
References: <f049419a2ab10f8e3c4fef0e4f4ca1ba@xxxxxxxxxxxxxx> <20160322121922.GA53693@xxxxxxxxxxxxxxx> <6457b1d9de271ec6cca6bc2626aac161@xxxxxxxxxxxxxx> <20160322140345.GA54245@xxxxxxxxxxxxxxx> <0f3832c45509f444f55fda2aaf9c9deb@xxxxxxxxxxxxxx> <20160323123010.GA43073@xxxxxxxxxxxxxxx> <20160323153221.GA19456@xxxxxxxxxx> <20160323223747.GX30721@dastard>
User-agent: Mutt/1.5.24 (2015-08-30)
On Thu, Mar 24, 2016 at 09:37:47AM +1100, Dave Chinner wrote:
> On Wed, Mar 23, 2016 at 04:32:21PM +0100, Carlos Maiolino wrote:
> > I'm still trying to get a reliable reproducer, at least for exactly what I
> > saw a few days ago.
> > 
> > Shyam, could you try to reproduce it with a recent/upstream kernel? That
> > would be great, to make sure we are seeing the same issue.
> > 
> > AFAICT, it happens in the following situation:
> > 
> > 1 - Something is written to the filesystem
> > 2 - log checkpoint is done for the previous write
> > 3 - Disk failure
> > 4 - XFS tries to writeback metadata logged in [2]
> > 
> > When [4] happens, I can't trigger the xfs_log_force messages all the time;
> > most of the time I just get an infinite loop of these messages:
> > 
> > [12694.318109] XFS (dm-0): Failing async write on buffer block 0xffffffffffffffff. Retrying async write.
> >
> > Sometimes I can trigger the xfs_log_force() loop.
> 
> This all smells like the filesystem is getting IO errors but is not
> in a shutdown state. What happens when you run 'xfs_io -x -c
> "shutdown" /mnt/pt' on a filesystem in this state? Can you then
> unmount it?
> 

I'll give it a try today, although I can't do it while the umount command is
hung: before the command gets stuck, the mount point is removed from the mount
namespace, so I have no access to the mount point from userspace while the
command is 'running'.
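
So the shutdown has to be forced before umount is called, while the mount
point is still reachable. A minimal sketch of that ordering, assuming the
filesystem is mounted at /mnt/pt as in Dave's example. The function name and
the DRY_RUN switch are hypothetical helpers for illustration only; by default
the commands are just printed, not executed:

```shell
#!/bin/sh
# Sketch: force an XFS shutdown first, then try to unmount.
# Assumptions: mount point defaults to /mnt/pt (from the example above);
# shutdown_then_umount and DRY_RUN are illustrative names, not existing tools.
shutdown_then_umount() {
    mnt=${1:-/mnt/pt}

    # In dry-run mode (the default) only print each command.
    run() {
        if [ "${DRY_RUN:-1}" = 1 ]; then
            echo "$*"
        else
            "$@"
        fi
    }

    # The shutdown must come first: once umount has started, the mount
    # point is no longer accessible from userspace.
    run xfs_io -x -c shutdown "$mnt" && run umount "$mnt"
}
```

Running `DRY_RUN=0 shutdown_then_umount /mnt/pt` as root would actually issue
the two commands; whether the umount then completes is exactly what needs to
be tested.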


> Cheers,
> 
> Dave.
> -- 
> Dave Chinner
> david@xxxxxxxxxxxxx

-- 
Carlos
