On Wed, 4 Jan 2006, Michael Mansour wrote:
> Hi,
>
> After building a couple of clusters using xfs on the shared storage device
> (and using md and lvm on top of that), I'm getting this error now which hard
> crashes my machines:
>
> do_IRQ: stack overflow: 284
> [<c01078a2>] do_IRQ+0x44/0x130
The rest of the message would be most interesting, to see what your stack
actually looks like.
Recent xfs is reasonable on 4k stacks and there are a few things in the works
to make it better. But depending on what you stack up in your IO path
you could probably still blow it.
-Eric
> I'm using Scientific Linux 4.2 (RHEL4 Update 2) with a SL Contrib kernel of:
>
> kernel-smp-2.6.9-11.EL.XFS
>
> which has xfs support. I also use the xfsprogs rpm supplied by Dag Wieers. I
> run on an x86 platform.
>
> After googling quite a bit, it seems that RH have caused an issue with their
> RHEL4 release by only enabling a 4k stack, where it seems that XFS requires an
> 8k stack?
>
> I'd really like to know how to fix this problem as I just finished months of
> works building a couple of SL4 clustered environments using XFS, and now with
> this problem am looking at the unpleasant alternative of getting rid of the
> XFS filesystems and changing them to ext3, which will take me approximately
> half a day of work per cluster for the added benefit of a slower filesystem.
>
> I just visited the SGI site to see if there's any hints to fixes of this
> problem there, which is where I got this email address from.
>
> Any help is very much appreciated.
>
> Michael.
>
>
|