Anders Saaby wrote:
> Hi Eric,
>
> I am the primary technician on the "large" system Jakob wrote about. I can
> give you the details regarding the problems.
>
> Eric Sandeen wrote:
>
>> Jakob Oestergaard wrote:
>>
>>> Second XFS bug:
>>> ---------------
>>> Also causes the 'kernel BUG at fs/xfs/support/debug.c:106' message to be
>>> printed. This bug is not solved by applying the simple patch to the
>>> first problem.
>>>
>>> How well known this problem is, I don't know - I can get more details on
>>> this if anyone is actually interested in working on fixing XFS.
>>
>> Do you have -any- details on this problem... pretty much nothing to go
>> on here.
>
> I have some details here... The following is a snip of the kernel log
> right before it reboots itself:
OK, the server just crashed again, and I thought you would be interested
in one more log snippet to compare for differences and similarities:
<SNIP>
Sep 8 17:46:43 st1 kernel: xfs_iget_core: ambiguous vns: vp/0xe9537e00, invp/0xf2883200
Sep 8 17:46:43 st1 kernel: ------------[ cut here ]------------
Sep 8 17:46:44 st1 kernel: kernel BUG at fs/xfs/support/debug.c:106!
Sep 8 17:46:44 st1 kernel: invalid operand: 0000 [#1]
Sep 8 17:46:44 st1 kernel: SMP
Sep 8 17:46:44 st1 kernel: Modules linked in: nfs e1000 rtc
Sep 8 17:46:44 st1 kernel: CPU: 0
Sep 8 17:46:44 st1 kernel: EIP: 0060:[<c021111c>] Not tainted
Sep 8 17:46:44 st1 kernel: EFLAGS: 00010246 (2.6.8.1)
Sep 8 17:46:44 st1 kernel: EIP is at cmn_err+0x8c/0xa0
Sep 8 17:46:44 st1 kernel: eax: 00000040 ebx: 00000293 ecx: 00000000 edx: c0351544
Sep 8 17:46:44 st1 kernel: esi: c03145f1 edi: c042e0fe ebp: 00000000 esp: f3a059ec
Sep 8 17:46:44 st1 kernel: ds: 007b es: 007b ss: 0068
Sep 8 17:46:44 st1 kernel: Process nfsd (pid: 1452, threadinfo=f3a04000 task=f012ac70)
Sep 8 17:46:44 st1 kernel: Stack: f3a04000 f2883200 ce058760 f7642238 c01e3f12 00000000 c031bc00 e9537e00
Sep 8 17:46:44 st1 kernel: f2883200 00000001 03d03e23 f6cb527c 00000000 00000000 4045bb20 00000000
Sep 8 17:46:44 st1 kernel: ce058760 c0162315 f7fb0e00 c2522b98 4045bb20 f2883220 4045bb20 f2883200
Sep 8 17:46:44 st1 kernel: Call Trace:
Sep 8 17:46:44 st1 kernel: [<c01e3f12>] xfs_iget_core+0x1a2/0x590
Sep 8 17:46:44 st1 kernel: [<c0162315>] iget_locked+0x95/0xa0
Sep 8 17:46:44 st1 kernel: [<c01e43a4>] xfs_iget+0xa4/0x170
Sep 8 17:46:44 st1 kernel: [<c01fff6b>] xfs_vget+0x4b/0xc0
Sep 8 17:46:44 st1 kernel: [<c0210631>] vfs_vget+0x21/0x30
Sep 8 17:46:44 st1 kernel: [<c0210098>] linvfs_get_dentry+0x48/0x80
Sep 8 17:46:44 st1 kernel: [<c01d1ac6>] xfs_dir2_block_lookup+0x96/0xb0
Sep 8 17:46:44 st1 kernel: [<c018ef98>] find_exported_dentry+0x38/0x5d0
Sep 8 17:46:44 st1 kernel: [<c01622e4>] iget_locked+0x64/0xa0
Sep 8 17:46:44 st1 kernel: [<c01e43fe>] xfs_iget+0xfe/0x170
Sep 8 17:46:44 st1 kernel: [<c01fdf71>] xfs_dir_lookup_int+0x61/0xd0
Sep 8 17:46:44 st1 kernel: [<c01fdf81>] xfs_dir_lookup_int+0x71/0xd0
Sep 8 17:46:44 st1 kernel: [<c0202a3e>] xfs_lookup+0x3e/0x70
Sep 8 17:46:44 st1 kernel: [<c0202a5b>] xfs_lookup+0x5b/0x70
Sep 8 17:46:44 st1 kernel: [<c029b113>] sock_alloc_send_pskb+0x73/0x200
Sep 8 17:46:44 st1 kernel: [<c02aae14>] qdisc_restart+0x14/0x180
Sep 8 17:46:44 st1 kernel: [<c029b2bb>] sock_alloc_send_skb+0x1b/0x20
Sep 8 17:46:44 st1 kernel: [<c02ab1e0>] pfifo_fast_enqueue+0x0/0x90
Sep 8 17:46:44 st1 kernel: [<c02a12cf>] dev_queue_xmit+0x13f/0x280
Sep 8 17:46:44 st1 kernel: [<c02b717b>] ip_finish_output2+0x13b/0x18f
Sep 8 17:46:44 st1 kernel: [<c02a9eed>] nf_iterate+0x3d/0xa0
Sep 8 17:46:44 st1 kernel: [<c02b7040>] ip_finish_output2+0x0/0x18f
Sep 8 17:46:44 st1 kernel: [<c02b7040>] ip_finish_output2+0x0/0x18f
Sep 8 17:46:44 st1 kernel: [<c02aa213>] nf_hook_slow+0x63/0xe0
Sep 8 17:46:44 st1 kernel: [<c02b7040>] ip_finish_output2+0x0/0x18f
Sep 8 17:46:44 st1 kernel: [<c02aa24e>] nf_hook_slow+0x9e/0xe0
Sep 8 17:46:44 st1 kernel: [<c0196640>] exp_find_key+0x90/0xa0
Sep 8 17:46:44 st1 kernel: [<c018f852>] export_decode_fh+0x62/0x6a
Sep 8 17:46:44 st1 kernel: [<c0191600>] nfsd_acceptable+0x0/0xe0
Sep 8 17:46:44 st1 kernel: [<c0191a73>] fh_verify+0x393/0x540
Sep 8 17:46:44 st1 kernel: [<c0191600>] nfsd_acceptable+0x0/0xe0
Sep 8 17:46:44 st1 kernel: [<c02aa213>] nf_hook_slow+0x63/0xe0
Sep 8 17:46:44 st1 kernel: [<c02b6fd0>] dst_output+0x0/0x20
Sep 8 17:46:44 st1 kernel: [<c02b6fe1>] dst_output+0x11/0x20
Sep 8 17:46:44 st1 kernel: [<c02aa24e>] nf_hook_slow+0x9e/0xe0
Sep 8 17:46:44 st1 kernel: [<c0192f0c>] nfsd_open+0x2c/0x130
Sep 8 17:46:44 st1 kernel: [<c019356e>] nfsd_write+0x4e/0x2d0
Sep 8 17:46:44 st1 kernel: [<c029d73b>] skb_copy_and_csum_bits+0x22b/0x2a0
Sep 8 17:46:44 st1 kernel: [<c029c068>] kfree_skbmem+0x18/0x20
Sep 8 17:46:44 st1 kernel: [<c029c123>] __kfree_skb+0xb3/0xc0
Sep 8 17:46:44 st1 kernel: [<c029bf51>] skb_drop_fraglist+0x41/0x50
Sep 8 17:46:44 st1 kernel: [<c029c02b>] skb_release_data+0x9b/0xc0
Sep 8 17:46:44 st1 kernel: [<c029c068>] kfree_skbmem+0x18/0x20
Sep 8 17:46:44 st1 kernel: [<c029e404>] skb_free_datagram+0x24/0x30
Sep 8 17:46:44 st1 kernel: [<c02f9f6c>] svcauth_unix_accept+0x22c/0x2b0
Sep 8 17:46:44 st1 kernel: [<c0199ed4>] nfsd3_proc_write+0xd4/0xf0
Sep 8 17:46:44 st1 kernel: [<c018fee6>] nfsd_dispatch+0xc6/0x16c
Sep 8 17:46:44 st1 kernel: [<c02f653a>] svc_process+0x40a/0x618
Sep 8 17:46:44 st1 kernel: [<c018fca7>] nfsd+0x1f7/0x370
Sep 8 17:46:44 st1 kernel: [<c018fab0>] nfsd+0x0/0x370
Sep 8 17:46:44 st1 kernel: [<c01024bd>] kernel_thread_helper+0x5/0x18
Sep 8 17:46:44 st1 kernel: Code: 0f 0b 6a 00 f5 45 31 c0 5b 5e 5f 5d c3 8d b4 26 00 00 00 00
</SNIP>
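
In case it helps with comparing this trace against the earlier one, below is
a minimal sketch of a script that pulls the function names out of an oops
log like the one above. It is only an illustration: the names FRAME_RE,
parse_frames and common_symbols are made up here and are not part of any
kernel or XFS tooling, and the regular expression assumes the
"[<address>] symbol+0xoff/0xlen" frame format seen in this log.

```python
import re

# Assumed frame format, as seen in the log above:
#   [<c01e3f12>] xfs_iget_core+0x1a2/0x590
FRAME_RE = re.compile(
    r"\[<([0-9a-f]{8})>\]\s+(\w+)\+0x([0-9a-f]+)/0x([0-9a-f]+)"
)


def parse_frames(log_text):
    """Return (symbol, offset) pairs for every frame found, in log order."""
    return [
        (m.group(2), int(m.group(3), 16))
        for m in FRAME_RE.finditer(log_text)
    ]


def common_symbols(trace_a, trace_b):
    """Symbols present in both traces, in the order they occur in trace_a."""
    in_b = {sym for sym, _ in trace_b}
    result = []
    for sym, _ in trace_a:
        if sym in in_b and sym not in result:
            result.append(sym)
    return result


# Three frames copied from the top of the call trace above.
sample = """\
Sep 8 17:46:44 st1 kernel: [<c01e3f12>] xfs_iget_core+0x1a2/0x590
Sep 8 17:46:44 st1 kernel: [<c0162315>] iget_locked+0x95/0xa0
Sep 8 17:46:44 st1 kernel: [<c01e43a4>] xfs_iget+0xa4/0x170
"""

frames = parse_frames(sample)
for symbol, offset in frames:
    print(f"{symbol} +{offset:#x}")
```

Running parse_frames over two saved log snippets and passing both results to
common_symbols lists the functions the two crashes share. Lines without a
"symbol+0x.../0x..." part, such as the EIP line, are simply skipped.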
/Saaby