Crash with XFS
Alexander Naumann
alexandernaumann at gmx.de
Mon Nov 7 08:36:51 CST 2011
Hi!
I would be glad if anybody can give me any hint on the following
subject.
I have a linux server running formatted with XFS. Afer a couple of days
(about 6 or 7) I get the following crash:
Nov 6 15:44:03 archive kernel: BUG: unable to handle kernel NULL
pointer dereference at 0000000000000044
Nov 6 15:44:03 archive kernel: IP: [<ffffffff811d3248>]
xfs_inode_ag_iterator+0x4a/0xce
Nov 6 15:44:03 archive kernel: PGD 0
Nov 6 15:44:03 archive kernel: Oops: 0000 [#1] SMP
Nov 6 15:44:03 archive kernel: last sysfs
file: /sys/devices/pci0000:00/0000:00:1c.0/0000:08:00.1/host2/rport-2:0-0/target2:0:0/fc_transport/target2:0:0/port_name
Nov 6 15:44:03 archive kernel: CPU 13
Nov 6 15:44:03 archive kernel: Modules linked in: iscsi_tcp
libiscsi_tcp libiscsi scsi_transport_iscsi wcte11xp wctc4xxp wct4xxp
wct1xxp wcte12xp dahdi_voicebus dahdi_transcode dahdi dm_round_robin
qla2xxx scsi_dh_rdac scsi_dh_emc scsi_
dh_alua scsi_dh_hp_sw af_packet ipt_REDIRECT iptable_nat nf_nat
ipt_REJECT xt_tcpudp nf_conntrack_ipv4 nf_defrag_ipv4 xt_state
nf_conntrack iptable_filter ip_tables x_tables ipmi_si ipmi_watchdog
ipmi_devintf ipmi_msghandler fan ac ipv6
fuse dm_multipath scsi_dh psmouse usbhid hid evdev ehci_hcd uhci_hcd
iTCO_wdt rtc_cmos pcspkr serio_raw iTCO_vendor_support usbcore thermal
bnx2 rtc_core rtc_lib button processor thermal_sys unix
Nov 6 15:44:03 archive kernel:
Nov 6 15:44:03 archive kernel: Pid: 659, comm: kswapd0 Not tainted
2.6.34.7-64bit #9 0P658H/PowerEdge R910
Nov 6 15:44:03 archive kernel: RIP: 0010:[<ffffffff811d3248>]
[<ffffffff811d3248>] xfs_inode_ag_iterator+0x4a/0xce
Nov 6 15:44:03 archive kernel: RSP: 0018:ffff88085eb1bcb0 EFLAGS:
00010282
Nov 6 15:44:03 archive kernel: RAX: 0000000000000000 RBX:
ffff88085841b800 RCX: 0000000000000000
Nov 6 15:44:03 archive kernel: RDX: 0000000000000001 RSI:
0000000000000000 RDI: 0000000000000000
Nov 6 15:44:03 archive kernel: RBP: ffff88085eb1bd10 R08:
0000000000000001 R09: ffff88085eb1bd24
Nov 6 15:44:03 archive kernel: R10: ffffffffff000000 R11:
ffff88047d42b0e8 R12: ffff88085eb1bd24
Nov 6 15:44:03 archive kernel: R13: 0000000000000000 R14:
ffff88085eb1bd24 R15: 0000000000000000
Nov 6 15:44:03 archive kernel: FS: 0000000000000000(0000)
GS:ffff8800023a0000(0000) knlGS:0000000000000000
Nov 6 15:44:03 archive kernel: CS: 0010 DS: 0000 ES: 0000 CR0:
000000008005003b
Nov 6 15:44:03 archive kernel: CR2: 0000000000000044 CR3:
00000000016cb000 CR4: 00000000000006a0
Nov 6 15:44:03 archive kernel: DR0: 0000000000000000 DR1:
0000000000000000 DR2: 0000000000000000
Nov 6 15:44:03 archive kernel: DR3: 0000000000000000 DR6:
00000000ffff0ff0 DR7: 0000000000000400
Nov 6 15:44:03 archive kernel: Process kswapd0 (pid: 659, threadinfo
ffff88085eb1a000, task ffff88085f4706b0)
Nov 6 15:44:03 archive kernel: Stack:
Nov 6 15:44:03 archive kernel: ffff88085eb1bce4 000000018102914a
0000000000000000 ffffffff811d268f
Nov 6 15:44:03 archive kernel: <0> ffff88085841b800 0000000000000000
0000005500000283 ffff88085841b800
Nov 6 15:44:03 archive kernel: <0> ffff88085eb1bd24 00000000ffffffff
00000000000002bc 00000000000000d0
Nov 6 15:44:03 archive kernel: Call Trace:
Nov 6 15:44:03 archive kernel: [<ffffffff811d268f>] ?
xfs_reclaim_inode+0x0/0x212
Nov 6 15:44:03 archive kernel: [<ffffffff811d332d>]
xfs_reclaim_inode_shrink+0x61/0x123
Nov 6 15:44:03 archive kernel: [<ffffffff81075f45>] shrink_slab
+0xd8/0x148
Nov 6 15:44:03 archive kernel: [<ffffffff810765da>] kswapd+0x625/0x89d
Nov 6 15:44:03 archive kernel: [<ffffffff8107445f>] ?
isolate_pages_global+0x0/0x23f
Nov 6 15:44:03 archive kernel: [<ffffffff81040882>] ?
autoremove_wake_function+0x0/0x38
Nov 6 15:44:03 archive kernel: [<ffffffff81075fb5>] ? kswapd+0x0/0x89d
Nov 6 15:44:03 archive kernel: [<ffffffff81040472>] kthread+0x7d/0x85
Nov 6 15:44:03 archive kernel: [<ffffffff81002c74>]
kernel_thread_helper+0x4/0x10
Nov 6 15:44:03 archive kernel: [<ffffffff810403f5>] ? kthread+0x0/0x85
Nov 6 15:44:03 archive kernel: [<ffffffff81002c70>] ?
kernel_thread_helper+0x0/0x10
Nov 6 15:44:03 archive kernel: Code: c0 48 89 75 b8 89 55 b4 89 4d b0
44 89 45 ac 74 03 41 8b 01 89 45 d4 45 31 ff 45 31 ed eb 69 48 8b 7d c0
44 89 ee e8 2d d1 fe ff <83> 78 44 00 49 89 c4 75 0a 48 89 c7 e8 ad c2
fe ff eb 47 44 8b
Nov 6 15:44:03 archive kernel: RIP [<ffffffff811d3248>]
xfs_inode_ag_iterator+0x4a/0xce
Nov 6 15:44:03 archive kernel: RSP <ffff88085eb1bcb0>
Nov 6 15:44:03 archive kernel: CR2: 0000000000000044
Nov 6 15:44:03 archive kernel: ---[ end trace 3bcf38b06227bae0 ]---
Or like this:
Oct 22 08:30:05 archive kernel: BUG: unable to handle kernel NULL
pointer dereference at 0000000000000044
Oct 22 08:30:05 archive kernel: IP: [<ffffffff811d338f>]
xfs_reclaim_inode_shrink+0xc3/0x123
Oct 22 08:30:05 archive kernel: [<ffffffff81075f38>] shrink_slab
+0xcb/0x148
Oct 22 08:30:05 archive kernel: [<ffffffff810765da>] kswapd+0x625/0x89d
Oct 22 08:30:05 archive kernel: [<ffffffff8107445f>] ?
isolate_pages_global+0x0/0x23f
Oct 22 08:30:05 archive kernel: [<ffffffff81040882>] ?
autoremove_wake_function+0x0/0x38
Oct 22 08:30:05 archive kernel: [<ffffffff81075fb5>] ? kswapd+0x0/0x89d
Oct 22 08:30:05 archive kernel: [<ffffffff81040472>] kthread+0x7d/0x85
Oct 22 08:30:05 archive kernel: [<ffffffff81002c74>]
kernel_thread_helper+0x4/0x10
Oct 22 08:30:05 archive kernel: [<ffffffff810403f5>] ? kthread+0x0/0x85
Oct 22 08:30:05 archive kernel: [<ffffffff81002c70>] ?
kernel_thread_helper+0x0/0x10
The system is a Dell R910 (Intek Xeon CPU E7530, 24 cores (with
hyperthreading).
32GB Ram, Raid Controller is Perc H700, SAS discs, Raid 5 with 1.7TB,
formatted with XFS.
Kernel version 2.6.34.7 is running (64bit kernel on a 32bit system,
Debian packages).
There is a multipathing fibrechannel conenction to an external storage,
which partition is also formatted with XFS.
xfs-tools are version 3.0.4.
Host 2 is one of the FC connections.
Does anybody has a hint to this crash?
I could not find any solution in any bugtracker so I am not sure if it
is already fixed.
The system itself is under load (Load-Average is about 20 / 18 / 17).
Are any other informations needed?
Local filesystem informations:
xfs_info /
meta-data=/dev/root isize=256 agcount=32,
agsize=13683646 blks
= sectsz=512 attr=0
data = bsize=4096 blocks=437876672,
imaxpct=25
= sunit=0 swidth=0 blks
naming =version 2 bsize=4096 ascii-ci=0
log =internal bsize=4096 blocks=32768, version=1
= sectsz=512 sunit=0 blks, lazy-count=0
realtime =none extsz=65536 blocks=0, rtextents=0
Thanks in advance
Alex
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://oss.sgi.com/pipermail/xfs/attachments/20111107/ad3a77bd/attachment.htm>
More information about the xfs
mailing list