| To: | Markus Trippelsdorf <markus@xxxxxxxxxxxxxxx> |
|---|---|
| Subject: | Re: 2.6.39-rc3, 2.6.39-rc4: XFS lockup - regression since 2.6.38 |
| From: | Bruno Prémont <bonbons@xxxxxxxxxxxxxxxxx> |
| Date: | Sat, 30 Apr 2011 16:18:10 +0200 |
| Cc: | Dave Chinner <david@xxxxxxxxxxxxx>, xfs-masters@xxxxxxxxxxx, xfs@xxxxxxxxxxx, Christoph Hellwig <hch@xxxxxxxxxxxxx>, Alex Elder <aelder@xxxxxxx>, Dave Chinner <dchinner@xxxxxxxxxx>, linux-kernel@xxxxxxxxxxxxxxx, James Bottomley <James.Bottomley@xxxxxxxxxxxxxxxxxxxxx> |
| In-reply-to: | <20110429213524.449e003b@xxxxxxxxxxxx> |
| References: | <20110423224403.5fd1136a@xxxxxxxxxxxx> <20110427050850.GG12436@dastard> <20110427182622.05a068a2@xxxxxxxxxxxx> <20110428194528.GA1627@xxxxxxxxxxxxxx> <20110429011929.GA13542@dastard> <20110429151841.GA893@xxxxxxxxxxxxxx> <20110429213524.449e003b@xxxxxxxxxxxx> |

On Fri, 29 April 2011 Bruno Prémont wrote:
> On Fri, 29 April 2011 Markus Trippelsdorf wrote:
> > On 2011.04.29 at 11:19 +1000, Dave Chinner wrote:
> > > OK, so the common elements here appears to be root filesystems
> > > with small log sizes, which means they are tail pushing all the
> > > time metadata operations are in progress. Definitely seems like a
> > > race in the AIL workqueue trigger mechanism. I'll see if I can
> > > reproduce this and cook up a patch to fix it.
> >
> > Hmm, I'm wondering if this issue is somehow related to the hrtimer bug,
> > that Thomas Gleixner fixed yesterday:
> > http://git.us.kernel.org/?p=linux/kernel/git/tip/linux-2.6-tip.git;a=commit;h=ce31332d3c77532d6ea97ddcb475a2b02dd358b4
> > http://thread.gmane.org/gmane.linux.kernel.mm/61909/
> >
> > It also looks similar to the issue that James Bottomley reported
> > earlier: http://thread.gmane.org/gmane.linux.kernel.mm/62185/
>
> I'm going to see, I've applied Thomas' fix on the box seeing XFS freeze
> (without other changes to kernel).
> Going to run that kernel for the week-end and beyond if it survives to see
> what happens.

Happened again (after a few hours of uptime), so it definitely is not
caused by the hrtimer bug that Thomas Gleixner fixed.

Bruno