xfs
[Top] [All Lists]

Re: [xfs-masters] xfs deadlock in stable kernel 3.0.4

To: Christoph Hellwig <hch@xxxxxxxxxxxxx>
Subject: Re: [xfs-masters] xfs deadlock in stable kernel 3.0.4
From: Alex Elder <aelder@xxxxxxx>
Date: Tue, 13 Sep 2011 16:58:13 -0500
Cc: Stefan Priebe - Profihost AG <s.priebe@xxxxxxxxxxxx>, "xfs-masters@xxxxxxxxxxx" <xfs-masters@xxxxxxxxxxx>, "xfs@xxxxxxxxxxx" <xfs@xxxxxxxxxxx>
In-reply-to: <1315950742.2159.89.camel@doink>
References: <1D2B34A7-7BB9-4E4E-9CA2-382C210E125F@xxxxxxxxxxxx> <20110912152133.GA8345@xxxxxxxxxxxxx> <C6515E45-5724-43DD-95A8-1F89AFE29601@xxxxxxxxxxxx> <20110912200543.GA22409@xxxxxxxxxxxxx> <4E6EF274.7050007@xxxxxxxxxxxx> <20110913205018.GA8543@xxxxxxxxxxxxx> <1315950742.2159.89.camel@doink>
Reply-to: <aelder@xxxxxxx>
On Tue, 2011-09-13 at 16:52 -0500, Alex Elder wrote:
> On Tue, 2011-09-13 at 16:50 -0400, Christoph Hellwig wrote:
> > On Tue, Sep 13, 2011 at 08:04:36AM +0200, Stefan Priebe - Profihost AG 
> > wrote:
> > > I just reported it to the scsi list as i didn't knew where the
> > > problems is. But then some people told be it must be a XFS problem.
> > > 
> > > Some more informations:
> > > 1.) It's running with 2.6.32 and 2.6.38
> > > 2.) I can also write to another ext2 part on the same disk
> > > array(aacraid driver) while xfs stucks - so i think it must be an
> > > xfs problem
> > 
> > That points a bit more towards XFS, although we've seen storage setups
> > create issues depending on the exact workload.  The prime culprit for
> > used to be the md software RAID driver, though.
> > 
> > > 3.) I've also tried running 3.1-rc5 but then i'm seeing this error:
> > > 
> > > BUG: unable to handle kernel NULL pointer dereference at 000000000000012c
> > > IP: [] inode_dio_done+0x4/0x25
> > 
> > Oops, that's a bug that I actually introduced myself.  Fix below:
> 
> Yikes.  I'll prepare that one to send to Linus for 3.1.
> I'll wait for your formal signoff, though, Christoph.
> 
> Reviewed-by: Alex Elder <aelder@xxxxxxx>

Nevermind--the latest code doesn't look quite
like that and doesn't suffer the same problem.

Christoph, will you please ensure the fix gets
to the stable folks though?  You have my review
for the change.

                                        -Alex


<Prev in Thread] Current Thread [Next in Thread>