xfs-masters
[Top] [All Lists]

[Bug 13375] Kernel crash with 2.6.29 + nfs + xfs (radix-tree)

To: xfs-masters@xxxxxxxxxxx
Subject: [Bug 13375] Kernel crash with 2.6.29 + nfs + xfs (radix-tree)
From: bugzilla-daemon@xxxxxxxxxxxxxxxxxxx
Date: Thu, 25 Jun 2009 08:21:32 GMT
Auto-submitted: auto-generated
In-reply-to: <bug-13375-470@xxxxxxxxxxxxxxxxxxxxxxxx/>
References: <bug-13375-470@xxxxxxxxxxxxxxxxxxxxxxxx/>
http://bugzilla.kernel.org/show_bug.cgi?id=13375


Gabriel Barazer <gabriel@xxxxxxxx> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |gabriel@xxxxxxxx




--- Comment #17 from Gabriel Barazer <gabriel@xxxxxxxx>  2009-06-25 08:21:30 ---
Hi,

Same bug here on 2.6.29.4 on the server, various versions on the clients
(2.6.27 / 2.6.29). I tried to disable the swap space to not use kswapd with no
success and the same bug. I also have my syslog filled with "reconnect_path:
npd != pd" from the kernel log and stale NFS file handles one the clients (I
suspect there is 1 line each time the client tries to access but get a stale
NFS file handle error), but I'm not sure this is related. Here are my
backtraces, all related to lib/radix-tree.c:473 AND 485:


   [43251.547520] ------------[ cut here ]------------
   [43251.547714] kernel BUG at lib/radix-tree.c:485!
   [43251.547902] invalid opcode: 0000 [#1] SMP
   [43251.548151] last sysfs file: /sys/devices/virtual/block/dm-11/dm/name
   [43251.548345] CPU 7
   [43251.548564] Pid: 52, comm: kswapd0 Not tainted 2.6.29.4-filer #3 X7DCX
   [43251.548758] RIP: 0010:[<ffffffff811e5fca>] [<ffffffff811e5fca>]
radix_tree_tag_set+0x88/0xc8
   [43251.549144] RSP: 0018:ffff88031d3d9c78 EFLAGS: 00010246
   [43251.549336] RAX: ffff8802639eff58 RBX: 0000000000000000 RCX:
000000000000001d
   [43251.549532] RDX: 0000000000000000 RSI: 0000000000000000 RDI:
ffff8802639efd40
   [43251.549727] RBP: 0000000000000001 R08: ffff8801edd2c8c0 R09:
ffff8801edd2c798
   [43251.549751] R10: ffff88031cd86800 R11: ffffffff811bb75b R12:
ffff88031cdd7040
   [43251.549751] R13: 0000000000000000 R14: 000000000053159d R15:
0000000000000000
   [43251.549751] FS: 0000000000000000(0000) GS:ffff88031f849200(0000)
knlGS:0000000000000000
   [43251.549751] CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
   [43251.549751] CR2: 00007fcb6fc1d2d5 CR3: 00000002d0dd9000 CR4:
00000000000406e0
   [43251.549751] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
0000000000000000
   [43251.549751] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7:
0000000000000400
   [43251.549751] Process kswapd0 (pid: 52, threadinfo ffff88031d3d8000, task
ffff88031fa44fa0)
   [43251.549751] Stack:
   [43251.549751] ffff88030000001d 0000000000000000 ffff8801edd2c740
ffff88031cdd7000
   [43251.549751] ffff88031cdd703c ffff8801edd2c838 ffff88031cd86800
ffff88031d3d9d50
   [43251.552255] ffffffff811bcecb ffff8801edd2c740 ffff8801edd2c8d0
000000000000007b
   [43251.553006] Call Trace:
   [43251.553006] [<ffffffff811bcecb>] ? xfs_inode_set_reclaim_tag+0x65/0x80
   [43251.553006] [<ffffffff811af041>] ? xfs_reclaim+0x99/0x9f
   [43251.553006] [<ffffffff811bb791>] ? xfs_fs_destroy_inode+0x36/0x54
   [43251.553006] [<ffffffff81091aa3>] ? dispose_list+0xc1/0xf0
   [43251.553006] [<ffffffff81091cc6>] ? shrink_icache_memory+0x1f4/0x226
   [43251.553006] [<ffffffff81063f7a>] ? shrink_slab+0xde/0x154
   [43251.553006] [<ffffffff81064433>] ? kswapd+0x443/0x59d
   [43251.553006] [<ffffffff810621d9>] ? isolate_pages_global+0x0/0x25b
   [43251.553006] [<ffffffff81043a1f>] ? autoremove_wake_function+0x0/0x2a
   [43251.553006] [<ffffffff81063ff0>] ? kswapd+0x0/0x59d
   [43251.553006] [<ffffffff8104370b>] ? kthread+0x47/0x71
   [43251.553006] [<ffffffff8100ceba>] ? child_rip+0xa/0x20
   [43251.556254] [<ffffffff810436c4>] ? kthread+0x0/0x71
   [43251.556254] [<ffffffff8100ceb0>] ? child_rip+0x0/0x20
   [43251.556254] Code: 0c 24 e8 de fb ff ff 8b 0c 24 85 c0 75 10 48 8b 54 24
08 48 8d 84 13 18 02 00 00 0f ab 08 48 63 c9 48 8b 5c cb 18 48 85 db 75 04 <0f>
0b eb fe 41 83 ef 06 ff cd 85 ed 75 b0 48 85 db 74 1b 41 8b
   [43251.557004] RIP [<ffffffff811e5fca>] radix_tree_tag_set+0x88/0xc8
   [43251.557004] RSP <ffff88031d3d9c78>
   [43251.560005] ---[ end trace 7d232a2ba1ee9003 ]---



   [504384.225844] ------------[ cut here ]------------
   [504384.226057] kernel BUG at lib/radix-tree.c:473!
   [504384.226249] invalid opcode: 0000 [#1] SMP
   [504384.226504] last sysfs file: /sys/devices/virtual/block/dm-4/dm/name
   [504384.226697] CPU 2
   [504384.226911] Pid: 52, comm: kswapd0 Not tainted 2.6.29.4-filer #3 X7DCX
   [504384.227103] RIP: 0010:[<ffffffff811e5f67>] [<ffffffff811e5f67>]
radix_tree_tag_set+0x25/0xc8
   [504384.227487] RSP: 0018:ffff88031d3d9c78 EFLAGS: 00010206
   [504384.227678] RAX: 0000000000000000 RBX: ffff88031bd0e180 RCX:
000000000000001a
   [504384.228019] RDX: 0000000000000000 RSI: 0000000000000100 RDI:
ffff88031faf7290
   [504384.228359] RBP: 0000000000000000 R08: ffff88031bd0e300 R09:
0000000000000052
   [504384.228698] R10: ffff88031bd0ddc0 R11: ffffffff811bb75b R12:
ffff88031faf7290
   [504384.229036] R13: 0000000000000000 R14: 0000000000000100 R15:
ffff88031d3d9d50
   [504384.229175] FS: 0000000000000000(0000) GS:ffff88031f848300(0000)
knlGS:0000000000000000
   [504384.229175] CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
   [504384.229175] CR2: 00007f2e4ca912d5 CR3: 000000029ae62000 CR4:
00000000000406e0
   [504384.229175] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
0000000000000000
   [504384.229175] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7:
0000000000000400
   [504384.229175] Process kswapd0 (pid: 52, threadinfo ffff88031d3d8000, task
ffff88031fa44fa0)
   [504384.229175] Stack:
   [504384.229175] ffff88031cc55800 0000000000000202 ffff88031bd0e180
ffff88031faf7250
   [504384.229175] ffff88031faf728c ffff88031bd0e278 ffff88031c9d5000
ffff88031d3d9d50
   [504384.229175] ffffffff811bcecb ffff88031bd0e180 ffff88031bd0e310
0000000000000053
   [504384.232254] Call Trace:
   [504384.232254] [<ffffffff811bcecb>] xfs_inode_set_reclaim_tag+0x65/0x80
   [504384.232254] [<ffffffff811af041>] xfs_reclaim+0x99/0x9f
   [504384.232254] [<ffffffff811bb791>] xfs_fs_destroy_inode+0x36/0x54
   [504384.232254] [<ffffffff81091aa3>] dispose_list+0xc1/0xf0
   [504384.232254] [<ffffffff81091cc6>] shrink_icache_memory+0x1f4/0x226
   [504384.232254] [<ffffffff81063f7a>] shrink_slab+0xde/0x154
   [504384.232254] [<ffffffff81064433>] kswapd+0x443/0x59d
   [504384.232254] [<ffffffff810621d9>] isolate_pages_global+0x0/0x25b
   [504384.232254] [<ffffffff81043a1f>] autoremove_wake_function+0x0/0x2a
   [504384.232254] [<ffffffff81063ff0>] kswapd+0x0/0x59d
   [504384.232254] [<ffffffff8104370b>] kthread+0x47/0x71
   [504384.232254] [<ffffffff8100ceba>] child_rip+0xa/0x20
   [504384.232254] [<ffffffff810436c4>] kthread+0x0/0x71
   [504384.232254] [<ffffffff8100ceb0>] child_rip+0x0/0x20
   [504384.232254] Code: 5d 41 5e 41 5f c3 41 57 41 56 49 89 f6 41 55 41 89 d5
41 54 49 89 fc 55 53 48 83 ec 10 8b 2f 89 e8 48 3b 34 c5 e0 7f 62 81 76 04 <0f>
0b eb fe 48 8b 5f 08 44 6b fd 06 89 d0 48 83 e3 fe 48 c1 e0
   [504384.236253] RIP [<ffffffff811e5f67>] radix_tree_tag_set+0x25/0xc8
   [504384.236253] RSP <ffff88031d3d9c78>
   [504384.239401] ---[ end trace 6f17673e48e005b5 ]---


   [ 7256.039741] ------------[ cut here ]------------
   [ 7256.039948] kernel BUG at lib/radix-tree.c:485!
   [ 7256.040136] invalid opcode: 0000 [#1] SMP
   [ 7256.040388] last sysfs file:
/sys/devices/pci0000:00/0000:00:1a.7/usb1/1-2/1-2:1.0/host3/target3:0:0/3:0:0:0/block/sdf/uevent
   [ 7256.040739] CPU 3
   [ 7256.040959] Pid: 52, comm: kswapd0 Not tainted 2.6.29.4-filer #3 X7DCX
   [ 7256.041155] RIP: 0010:[<ffffffff811e5fca>] [<ffffffff811e5fca>]
radix_tree_tag_set+0x88/0xc8
   [ 7256.041540] RSP: 0018:ffff88031d3d9c78 EFLAGS: 00010246
   [ 7256.041727] RAX: ffff880243c6c448 RBX: 0000000000000000 RCX:
000000000000002d
   [ 7256.041918] RDX: 0000000000000000 RSI: 0000000000000000 RDI:
ffff880243c6c230
   [ 7256.042110] RBP: 0000000000000001 R08: ffff8802a0886e40 R09:
ffff8802d087b680
   [ 7256.042302] R10: ffff88031cdc0400 R11: ffffffff811bb75b R12:
ffff88031faf7640
   [ 7256.042495] R13: 0000000000000000 R14: 00000000001423ed R15:
0000000000000000
   [ 7256.042688] FS: 0000000000000000(0000) GS:ffff88031f848600(0000)
knlGS:0000000000000000
   [ 7256.043028] CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
   [ 7256.043219] CR2: 0000000001934fe8 CR3: 00000001b9127000 CR4:
00000000000406e0
   [ 7256.043386] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
0000000000000000
   [ 7256.043386] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7:
0000000000000400
   [ 7256.043386] Process kswapd0 (pid: 52, threadinfo ffff88031d3d8000, task
ffff88031fa44fa0)
   [ 7256.043386] Stack:
   [ 7256.043386] ffff88030000002d 0000000000000000 ffff8802a0886cc0
ffff88031faf7600
   [ 7256.043386] ffff88031faf763c ffff8802a0886db8 ffff88031cdc0400
ffff88031d3d9d50
   [ 7256.043386] ffffffff811bcecb ffff8802a0886cc0 ffff8802a0886e50
0000000000000005
   [ 7256.043386] Call Trace:
   [ 7256.043386] [<ffffffff811bcecb>] ? xfs_inode_set_reclaim_tag+0x65/0x80
   [ 7256.043386] [<ffffffff811af041>] ? xfs_reclaim+0x99/0x9f
   [ 7256.043386] [<ffffffff811bb791>] ? xfs_fs_destroy_inode+0x36/0x54
   [ 7256.043386] [<ffffffff81091aa3>] ? dispose_list+0xc1/0xf0
   [ 7256.043386] [<ffffffff81091cc6>] ? shrink_icache_memory+0x1f4/0x226
   [ 7256.043386] [<ffffffff81063f7a>] ? shrink_slab+0xde/0x154
   [ 7256.043386] [<ffffffff81064433>] ? kswapd+0x443/0x59d
   [ 7256.043386] [<ffffffff810621d9>] ? isolate_pages_global+0x0/0x25b
   [ 7256.043386] [<ffffffff81043a1f>] ? autoremove_wake_function+0x0/0x2a
   [ 7256.043386] [<ffffffff81063ff0>] ? kswapd+0x0/0x59d
   [ 7256.043386] [<ffffffff8104370b>] ? kthread+0x47/0x71
   [ 7256.043386] [<ffffffff8100ceba>] ? child_rip+0xa/0x20
   [ 7256.043386] [<ffffffff810436c4>] ? kthread+0x0/0x71
   [ 7256.043386] [<ffffffff8100ceb0>] ? child_rip+0x0/0x20
   [ 7256.043386] Code: 0c 24 e8 de fb ff ff 8b 0c 24 85 c0 75 10 48 8b 54 24
08 48 8d 84 13 18 02 00 00 0f ab 08 48 63 c9 48 8b 5c cb 18 48 85 db 75 04 <0f>
0b eb fe 41 83 ef 06 ff cd 85 ed 75 b0 48 85 db 74 1b 41 8b
   [ 7256.043386] RIP [<ffffffff811e5fca>] radix_tree_tag_set+0x88/0xc8
   [ 7256.043386] RSP <ffff88031d3d9c78>
   [ 7256.052293] ---[ end trace c542ca92b3353b53 ]---

Note: I switched to 2.6.30 tonight and didn't get any "kernel BUG" message
since (finger crossed), but the "reconnect_path: npd != pd" bug, forced me to
reboot the server to finally mount the NFS exports on the client without
getting a stale NFS file handle error at mount time.

-- 
Configure bugmail: http://bugzilla.kernel.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

<Prev in Thread] Current Thread [Next in Thread>