Re: RHEL4/SL4 XFS stack problem?

To: Michael Mansour <mic@xxxxxxxxxxx>
Subject: Re: RHEL4/SL4 XFS stack problem?
From: Eric Sandeen <sandeen@xxxxxxx>
Date: Tue, 3 Jan 2006 23:06:38 -0600 (CST)
Cc: linux-xfs@xxxxxxxxxxx
In-reply-to: <20060104045336.M58734@xxxxxxxxxxx>
Sender: linux-xfs-bounce@xxxxxxxxxxx
On Wed, 4 Jan 2006, Michael Mansour wrote:

> Hi Eric,
> 
> > > After building a couple of clusters using xfs on the shared storage device
> > > (and using md and lvm on top of that), I'm getting this error now which
> > > hard crashes my machines:
> > > 
> > >  do_IRQ: stack overflow: 284
> > >   [<c01078a2>] do_IRQ+0x44/0x130
> > 
> > The rest of the message would be most interesting, to see what your 
> > stack actually looks like.
> 
> What I've shown above is the only bit I can see on the console, can't use the
> keyboard or anything at that point and I have to physically powercycle the 
> server.

Hm, hard to say then.

> > Recent xfs is reasonable on 4k stacks and there are a few things in 
> > the works to make it better.  But depending on what you stack up in 
> > your IO path you could probably still blow it.
> 
> Hmm... ok, my stack is:
> 
> md   for IDE disk mirrors
> lvm  for LV support
> drbd for the shared storage
> xfs  formatted the filesystem

Hm, yes, that's pretty optimistic :)

Just for kicks, you could run http://oss.sgi.com/~sandeen/stackcheck-i386
against each of those modules and see if any large stack users show up that
might matter.
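
If that script isn't handy, the same idea can be roughed out by hand. The
sketch below is not the stackcheck-i386 script, just a minimal approximation
under some assumptions: the input is i386 disassembly text in the AT&T syntax
that "objdump -d" prints, a function's frame size is taken from its
"sub $0x...,%esp" instructions, and the 100-byte threshold and the name
large_stack_users are arbitrary choices for illustration. It will miss frames
set up in other ways.

```python
import re

# Matches i386 AT&T-syntax frame allocations such as "sub    $0x1c8,%esp".
SUB_ESP = re.compile(r"\bsub\s+\$0x([0-9a-fA-F]+),%esp")
# Matches objdump function labels such as "00000040 <xfs_bmapi>:".
FUNC_LABEL = re.compile(r"^[0-9a-fA-F]+ <([^>]+)>:")


def large_stack_users(disassembly, threshold=100):
    """Return (function, bytes) pairs with frames >= threshold, largest first.

    The threshold is a hypothetical cutoff chosen for illustration only.
    """
    current = None
    frames = {}
    for line in disassembly.splitlines():
        label = FUNC_LABEL.match(line)
        if label:
            # A new function starts; later "sub" lines belong to it.
            current = label.group(1)
            continue
        match = SUB_ESP.search(line)
        if match and current is not None:
            size = int(match.group(1), 16)
            # Keep the largest single allocation seen in this function.
            frames[current] = max(frames.get(current, 0), size)
    hits = [(name, size) for name, size in frames.items() if size >= threshold]
    return sorted(hits, key=lambda item: item[1], reverse=True)
```

Save the output of "objdump -d" for each module to a file, read it in, and
pass the text to large_stack_users(); anything near the top of the list for
md, lvm, drbd, or xfs is worth a closer look.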

> I run the linuxha.net HA software which uses drbd for network-linked shared
> storage.
> 
> Do you think all that stacking is the problem? would the previous email
> stating that I can build from kernel.org using RH config file but changing to
> 8k stack make this work?

It might.  There are some arguments that because 8k stacks must share with IRQ
stacks, you're just as likely to have problems, but it seems that usually 8k
stacks are a bit more forgiving...
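
Before rebuilding, it may be worth confirming which stack size a given kernel
config actually selects. On i386 2.6 kernels of this era that is controlled by
the CONFIG_4KSTACKS option. The sketch below only inspects config text that
you hand it; the function name is made up for illustration, and where the
config file lives on your system (often somewhere under /boot) is an
assumption you'd need to check.

```python
def stack_size_from_config(config_text):
    """Report "4k", "8k", or "unknown" from i386 kernel config text.

    Only the CONFIG_4KSTACKS option is examined.
    """
    for line in config_text.splitlines():
        line = line.strip()
        if line == "CONFIG_4KSTACKS=y":
            return "4k"
        if line == "# CONFIG_4KSTACKS is not set":
            return "8k"
    # Option not mentioned at all (e.g. a different arch or kernel version).
    return "unknown"
```

Read the config file into a string and pass it in; a result of "4k" means the
rebuilt kernel would still be using the small stacks.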

-Eric

> Thanks.
> 
> Michael.
> 
> > -Eric
> > 
> > > I'm using Scientific Linux 4.2 (RHEL4 Update 2) with a SL Contrib kernel of:
> > > 
> > > kernel-smp-2.6.9-11.EL.XFS
> > > 
> > > which has xfs support. I also use the xfsprogs rpm supplied by Dag Wieers.
> > > I run on an x86 platform.
> > > 
> > > After googling quite a bit, it seems that RH have caused an issue with
> > > their RHEL4 release by only enabling a 4k stack, where it seems that XFS
> > > requires an 8k stack?
> > > 
> > > I'd really like to know how to fix this problem as I just finished months
> > > of work building a couple of SL4 clustered environments using XFS, and now
> > > with this problem am looking at the unpleasant alternative of getting rid
> > > of the XFS filesystems and changing them to ext3, which will take me
> > > approximately half a day of work per cluster for the added benefit of a
> > > slower filesystem.
> > > 
> > > I just visited the SGI site to see if there's any hints to fixes of this
> > > problem there, which is where I got this email address from.
> > > 
> > > Any help is very much appreciated.
> > > 
> > > Michael.
> > > 
> > >
> ------- End of Original Message -------
> 

