xfs
[Top] [All Lists]

Re: kernel panic-xfs errors

To: xfs@xxxxxxxxxxx
Subject: Re: kernel panic-xfs errors
From: blacknred <leo1783@xxxxxxxxxxxxx>
Date: Thu, 9 Dec 2010 05:23:35 -0800 (PST)
In-reply-to: <20101209005944.GD32766@dastard>
References: <30397503.post@xxxxxxxxxxxxxxx> <20101207222558.GC29333@dastard> <30403823.post@xxxxxxxxxxxxxxx> <20101209005944.GD32766@dastard>
>Is the version 7.x firmware certified with such an old kernel?
Yes, it is...

It hung again today and dmesg said
XFS: bad magic number
XFS: SB validate failed

But when I do dd if=/dev/cciss/c0d0 bs=512 count=1 |od -c I get below which
suggests its a valid XFS superblock magic number as per your reply, correct?

I couldn't unmount the partition to do a xfs_repair -n

1+0 records in
1+0 records out
0000000    X   F   S   B  \0  \0 020  \0  \0  \0  \0  \0   + 251 262   ^
512 bytes (512 B) copied0000020   \0  \0  \0  \0  \0  \0  \0  \0  \0  \0  \0 
\0  \0  \0  \0  \0
0000040  354   B  \b 277   ) 376   @ 333 267 232 304 326   *   L 344 322
0000060   \0  \0  \0  \0      \0  \0   @  \0  \0  \0  \0  \0  \0  \0 200
0000100   \0  \0  \0  \0  \0  \0  \0 201  \0  \0  \0  \0  \0  \0  \0 202
0000120   \0  \0  \0 001  \n 352   l 300  \0  \0  \0 004  \0  \0  \0  \0
, 0.000190895 seconds, 2.7 MB/s
0000140   \0  \0 200  \0 265 244 002  \0  \b  \0  \0 002  \0  \0  \0  \0
0000160   \0  \0  \0  \0  \0  \0  \0  \0  \b  \t  \v 001 034  \0  \0 005
0000200   \0  \0  \0  \0  \0  \0  \v  \0  \0  \0  \0  \0  \0  \0  \t   .
0000220   \0  \0  \0  \0 030 243 275 267  \0  \0  \0  \0  \0  \0  \0  \0
0000240   \0  \0  \0  \0  \0  \0  \0  \0  \0  \0  \0  \0  \0  \0  \0  \0
0000260   \0  \0  \0  \0  \0  \0  \0 002  \0  \0  \0   @  \0  \0 001  \0
0000300   \0  \0  \0  \0  \0 004  \0  \0  \0  \0  \0  \b  \0  \0  \0  \b
0000320   \0  \0  \0  \0  \0  \0  \0  \0  \0  \0  \0  \0  \0  \0  \0  \0
*
0001000


Dave Chinner wrote:
> 
> On Wed, Dec 08, 2010 at 01:39:10AM -0800, blacknred wrote:
>> 
>> 
>> >You've done a forced module load. No guarantee your kernel is in any
>> >sane shape if you've done that....
>> 
>> Agree, but I'm reasonably convinced that module isn't the issue, because
>> it
>> works fine with my other servers......
>> 
>> >Strange failure. Hmmm - i386 arch and fedora - are you running with
>> 4k stacks? If so, maybe it blew the stack...
>> 
>> i386 arch, rhel 5.0
> 
> Yup, 4k stacks. This is definitely smelling like a stack blowout.
> 
> XFS on 4k stacks is a ticking timebomb - it will explode and you've
> got no idea of when it will go boom. Recompile your kernel with 8k
> stacks or move to x86_64.
> 
>> ># dd if=<device> bs=512 count=1 | od -c
>> This is what i get now, but now server's been rebooted and running OK,
>> what
>> should i be expecting or rather what are we looking for in this output at
>> point of failure?
> 
> Well, what you see here:
> 
>> 0000000    X   F   S   B  \0  \0 020  \0  \0  \0  \0  \0 025 324 304  \0
>              ^^^^^^^^^^^^^
> Is a valid XFS superblock magic number.
> 
> If you are getting this error:
> 
>> >> XFS: bad magic number
>> >> XFS: SB validate failed 
> 
> Then I'd expect to see anything other than "XFSB" as the magic
> number. Of course, if you smashed the stack during mount, then there
> will most likely be nothing wrong with the value on disk...
> 
>> >why did I flash the controller
>> I was on 5.22 fw version which has a known 'lockup' issue which is fixed
>> in
>> 7.x ver.
>> This is a critical fix.
> 
> Is the version 7.x firmware certified with such an old kernel? It's
> not uncommon for different firmware versions to only be supported on
> specific releases/kernel versions.
> 
> Cheers,
> 
> Dave.
> -- 
> Dave Chinner
> david@xxxxxxxxxxxxx
> 
> _______________________________________________
> xfs mailing list
> xfs@xxxxxxxxxxx
> http://oss.sgi.com/mailman/listinfo/xfs
> 
> 

-- 
View this message in context: 
http://old.nabble.com/kernel-panic-xfs-errors-tp30397503p30416451.html
Sent from the Xfs - General mailing list archive at Nabble.com.

<Prev in Thread] Current Thread [Next in Thread>