
Re: Major XFS problems...

To: linux-xfs@xxxxxxxxxxx
Subject: Re: Major XFS problems...
From: Anders Saaby <as@xxxxxxxxxxxx>
Date: Wed, 08 Sep 2004 17:58:26 +0200
Delivered-to: news2mail@news.cohaesio.com
Organization: Cohaesio A/S
References: <20040908133954.GB390@unthought.net> <413F1C6E.9040009@sgi.com> <chn85e$2ja$1@harrier.cohaesio.com>
Sender: linux-xfs-bounce@xxxxxxxxxxx

Anders Saaby wrote:

> Hi Eric,
> 
> I am the primary technician on the "large" system Jakob wrote about. I can
> give you the details regarding the problems.
> 
> Eric Sandeen wrote:
> 
>> Jakob Oestergaard wrote:
>> 
>>> Second XFS bug:
>>> ---------------
>>> Also causes the 'kernel BUG at fs/xfs/support/debug.c:106' message to be
>>> printed. This bug is not solved by applying the simple patch to the
>>> first problem.
>>> 
>>> How well known this problem is, I don't know - I can get more details on
>>> this if anyone is actually interested in working on fixing XFS.
>> 
>> Do you have -any- details on this problem... pretty much nothing to go
>> on here.
> 
> I have some details here... The following is a snip of the kernel log
> right before it reboots itself:

OK - the server crashed again just now, and I thought you guys would be
interested in one more log snippet, to compare for differences/similarities:

<SNIP>
Sep  8 17:46:43 st1 kernel: xfs_iget_core: ambiguous vns: vp/0xe9537e00, invp/0xf2883200
Sep  8 17:46:43 st1 kernel: ------------[ cut here ]------------
Sep  8 17:46:44 st1 kernel: kernel BUG at fs/xfs/support/debug.c:106!
Sep  8 17:46:44 st1 kernel: invalid operand: 0000 [#1]
Sep  8 17:46:44 st1 kernel: SMP
Sep  8 17:46:44 st1 kernel: Modules linked in: nfs e1000 rtc
Sep  8 17:46:44 st1 kernel: CPU:    0
Sep  8 17:46:44 st1 kernel: EIP:    0060:[<c021111c>]    Not tainted
Sep  8 17:46:44 st1 kernel: EFLAGS: 00010246   (2.6.8.1)
Sep  8 17:46:44 st1 kernel: EIP is at cmn_err+0x8c/0xa0
Sep  8 17:46:44 st1 kernel: eax: 00000040   ebx: 00000293   ecx: 00000000   edx: c0351544
Sep  8 17:46:44 st1 kernel: esi: c03145f1   edi: c042e0fe   ebp: 00000000   esp: f3a059ec
Sep  8 17:46:44 st1 kernel: ds: 007b   es: 007b   ss: 0068
Sep  8 17:46:44 st1 kernel: Process nfsd (pid: 1452, threadinfo=f3a04000 task=f012ac70)
Sep  8 17:46:44 st1 kernel: Stack: f3a04000 f2883200 ce058760 f7642238 c01e3f12 00000000 c031bc00 e9537e00
Sep  8 17:46:44 st1 kernel:        f2883200 00000001 03d03e23 f6cb527c 00000000 00000000 4045bb20 00000000
Sep  8 17:46:44 st1 kernel:        ce058760 c0162315 f7fb0e00 c2522b98 4045bb20 f2883220 4045bb20 f2883200
Sep  8 17:46:44 st1 kernel: Call Trace:
Sep  8 17:46:44 st1 kernel:  [<c01e3f12>] xfs_iget_core+0x1a2/0x590
Sep  8 17:46:44 st1 kernel:  [<c0162315>] iget_locked+0x95/0xa0
Sep  8 17:46:44 st1 kernel:  [<c01e43a4>] xfs_iget+0xa4/0x170
Sep  8 17:46:44 st1 kernel:  [<c01fff6b>] xfs_vget+0x4b/0xc0
Sep  8 17:46:44 st1 kernel:  [<c0210631>] vfs_vget+0x21/0x30
Sep  8 17:46:44 st1 kernel:  [<c0210098>] linvfs_get_dentry+0x48/0x80
Sep  8 17:46:44 st1 kernel:  [<c01d1ac6>] xfs_dir2_block_lookup+0x96/0xb0
Sep  8 17:46:44 st1 kernel:  [<c018ef98>] find_exported_dentry+0x38/0x5d0
Sep  8 17:46:44 st1 kernel:  [<c01622e4>] iget_locked+0x64/0xa0
Sep  8 17:46:44 st1 kernel:  [<c01e43fe>] xfs_iget+0xfe/0x170
Sep  8 17:46:44 st1 kernel:  [<c01fdf71>] xfs_dir_lookup_int+0x61/0xd0
Sep  8 17:46:44 st1 kernel:  [<c01fdf81>] xfs_dir_lookup_int+0x71/0xd0
Sep  8 17:46:44 st1 kernel:  [<c0202a3e>] xfs_lookup+0x3e/0x70
Sep  8 17:46:44 st1 kernel:  [<c0202a5b>] xfs_lookup+0x5b/0x70
Sep  8 17:46:44 st1 kernel:  [<c029b113>] sock_alloc_send_pskb+0x73/0x200
Sep  8 17:46:44 st1 kernel:  [<c02aae14>] qdisc_restart+0x14/0x180
Sep  8 17:46:44 st1 kernel:  [<c029b2bb>] sock_alloc_send_skb+0x1b/0x20
Sep  8 17:46:44 st1 kernel:  [<c02ab1e0>] pfifo_fast_enqueue+0x0/0x90
Sep  8 17:46:44 st1 kernel:  [<c02a12cf>] dev_queue_xmit+0x13f/0x280
Sep  8 17:46:44 st1 kernel:  [<c02b717b>] ip_finish_output2+0x13b/0x18f
Sep  8 17:46:44 st1 kernel:  [<c02a9eed>] nf_iterate+0x3d/0xa0
Sep  8 17:46:44 st1 kernel:  [<c02b7040>] ip_finish_output2+0x0/0x18f
Sep  8 17:46:44 st1 kernel:  [<c02b7040>] ip_finish_output2+0x0/0x18f
Sep  8 17:46:44 st1 kernel:  [<c02aa213>] nf_hook_slow+0x63/0xe0
Sep  8 17:46:44 st1 kernel:  [<c02b7040>] ip_finish_output2+0x0/0x18f
Sep  8 17:46:44 st1 kernel:  [<c02aa24e>] nf_hook_slow+0x9e/0xe0
Sep  8 17:46:44 st1 kernel:  [<c0196640>] exp_find_key+0x90/0xa0
Sep  8 17:46:44 st1 kernel:  [<c018f852>] export_decode_fh+0x62/0x6a
Sep  8 17:46:44 st1 kernel:  [<c0191600>] nfsd_acceptable+0x0/0xe0
Sep  8 17:46:44 st1 kernel:  [<c0191a73>] fh_verify+0x393/0x540
Sep  8 17:46:44 st1 kernel:  [<c0191600>] nfsd_acceptable+0x0/0xe0
Sep  8 17:46:44 st1 kernel:  [<c02aa213>] nf_hook_slow+0x63/0xe0
Sep  8 17:46:44 st1 kernel:  [<c02b6fd0>] dst_output+0x0/0x20
Sep  8 17:46:44 st1 kernel:  [<c02b6fe1>] dst_output+0x11/0x20
Sep  8 17:46:44 st1 kernel:  [<c02aa24e>] nf_hook_slow+0x9e/0xe0
Sep  8 17:46:44 st1 kernel:  [<c0192f0c>] nfsd_open+0x2c/0x130
Sep  8 17:46:44 st1 kernel:  [<c019356e>] nfsd_write+0x4e/0x2d0
Sep  8 17:46:44 st1 kernel:  [<c029d73b>] skb_copy_and_csum_bits+0x22b/0x2a0
Sep  8 17:46:44 st1 kernel:  [<c029c068>] kfree_skbmem+0x18/0x20
Sep  8 17:46:44 st1 kernel:  [<c029c123>] __kfree_skb+0xb3/0xc0
Sep  8 17:46:44 st1 kernel:  [<c029bf51>] skb_drop_fraglist+0x41/0x50
Sep  8 17:46:44 st1 kernel:  [<c029c02b>] skb_release_data+0x9b/0xc0
Sep  8 17:46:44 st1 kernel:  [<c029c068>] kfree_skbmem+0x18/0x20
Sep  8 17:46:44 st1 kernel:  [<c029e404>] skb_free_datagram+0x24/0x30
Sep  8 17:46:44 st1 kernel:  [<c02f9f6c>] svcauth_unix_accept+0x22c/0x2b0
Sep  8 17:46:44 st1 kernel:  [<c0199ed4>] nfsd3_proc_write+0xd4/0xf0
Sep  8 17:46:44 st1 kernel:  [<c018fee6>] nfsd_dispatch+0xc6/0x16c
Sep  8 17:46:44 st1 kernel:  [<c02f653a>] svc_process+0x40a/0x618
Sep  8 17:46:44 st1 kernel:  [<c018fca7>] nfsd+0x1f7/0x370
Sep  8 17:46:44 st1 kernel:  [<c018fab0>] nfsd+0x0/0x370
Sep  8 17:46:44 st1 kernel:  [<c01024bd>] kernel_thread_helper+0x5/0x18
Sep  8 17:46:44 st1 kernel: Code: 0f 0b 6a 00 f5 45 31 c0 5b 5e 5f 5d c3 8d b4 26 00 00 00 00

</SNIP>
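
For anyone who wants to compare this trace against the earlier one, here is
a rough Python sketch that pulls the symbol names out of the "Call Trace"
lines of two saved log snippets and lists the symbols they share. The
function names (parse_frames, common_symbols) are made up for illustration,
and the regular expression simply assumes the "[<address>] symbol+0xoff/0xlen"
layout seen in the log above; it is not an official tool and may need
adjusting for other kernel versions.

```python
import re

# Assumed frame layout, taken from the log above, e.g.:
#   Sep  8 17:46:44 st1 kernel:  [<c01e3f12>] xfs_iget_core+0x1a2/0x590
FRAME_RE = re.compile(
    r"\[<([0-9a-f]{8,16})>\]\s+(\w+)\+0x([0-9a-f]+)/0x([0-9a-f]+)"
)


def parse_frames(log_text):
    """Return (symbol, offset) pairs for every trace frame, in log order."""
    frames = []
    for line in log_text.splitlines():
        match = FRAME_RE.search(line)
        if match:
            frames.append((match.group(2), int(match.group(3), 16)))
    return frames


def common_symbols(log_a, log_b):
    """Return symbols found in both logs, ordered as they appear in log_a."""
    in_b = {symbol for symbol, _ in parse_frames(log_b)}
    seen = set()
    shared = []
    for symbol, _ in parse_frames(log_a):
        if symbol in in_b and symbol not in seen:
            seen.add(symbol)
            shared.append(symbol)
    return shared


if __name__ == "__main__":
    # Two frames copied from the trace above, used as a tiny self-check.
    sample = (
        "Sep  8 17:46:44 st1 kernel:  [<c01e3f12>] xfs_iget_core+0x1a2/0x590\n"
        "Sep  8 17:46:44 st1 kernel:  [<c0162315>] iget_locked+0x95/0xa0\n"
    )
    for symbol, offset in parse_frames(sample):
        print(f"{symbol} +{offset:#x}")
```

Saving each oops to its own file and passing the two file contents to
common_symbols() should show which functions both crashes went through.
Lines without a bracketed address, such as "EIP is at cmn_err+0x8c/0xa0",
are deliberately not matched.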

/Saaby

