xfs
[Top] [All Lists]

BUG: unable to handle kernel paging request xfs_is_delayed_page

To: xfs-masters@xxxxxxxxxxx, xfs@xxxxxxxxxxx
Subject: BUG: unable to handle kernel paging request xfs_is_delayed_page
From: Török Edwin <edwintorok@xxxxxxxxx>
Date: Thu, 21 Jul 2011 23:55:04 +0300
Cc: Linux Kernel Mailing List <linux-kernel@xxxxxxxxxxxxxxx>
Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=message-id:date:from:user-agent:mime-version:to:cc:subject :x-enigmail-version:content-type; bh=xYpegBxfBpwSDhb9LxiR2NVOGAMWflGHuUYa5L7tLEU=; b=xk7DNubhPffftnS7MsNKWKeck1YOLl8fH0MyIy/CchXpd7vhtAvAAzgCpGZ50pomYJ Yx+VvCnKYXNKc1dbarMFB6AxUd1pF86tVMQgamStirYeBL07WBQD0kTfRUhYizcBF4OB gcfBM8ugfSgx+LLtixlEcFEmf+4e8srdASpoc=
User-agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.2.18) Gecko/20110626 Icedove/3.1.11
Hi,

Just got this BUG in my dmesg:
[47504.938446] BUG: unable to handle kernel paging request at ffff884058ec3270
[47504.938488] IP: [<ffffffff8127baf1>] xfs_is_delayed_page+0x41/0xa0
[47504.938520] PGD 1a24063 PUD 0
[47504.938536] Oops: 0000 [#1] PREEMPT SMP
[47504.938558] CPU 5
[47504.938568] Modules linked in: ipt_REDIRECT xt_CHECKSUM iptable_mangle 
bridge stp llc powernow_k8 cpufreq_stats cpufreq_powersave cpufreq_ondemand 
cpufreq_conservative cpufreq_userspace rfcomm bnep
binfmt_misc fuse ipt_REJECT xt_state xt_tcpudp ipv6 iptable_filter 
ipt_MASQUERADE iptable_nat nf_nat nf_conntrack_ipv4 nf_conntrack nf_defrag_ipv4 
ip_tables x_tables radeon ttm drm_kms_helper drm
i2c_algo_bit cfbcopyarea cfbimgblt cfbfillrect freq_table mperf it87 hwmon_vid 
loop firewire_sbp2 snd_hda_codec_hdmi hid_a4tech usbhid snd_hda_codec_realtek 
k10temp hwmon snd_hda_intel snd_hda_codec
snd_hwdep snd_pcm_oss snd_mixer_oss evdev i2c_piix4 btusb bluetooth snd_pcm 
snd_seq_oss snd_seq_midi_event snd_seq snd_timer snd_seq_device snd soundcore 
snd_page_alloc parport_pc parport button wmi
processor thermal_sys ohci_hcd sr_mod cdrom firewire_ohci firewire_core 
crc_itu_t ssb ehci_hcd xhci_hcd usbcore [last unloaded: scsi_wait_scan]
[47504.938999]
[47504.939002] Pid: 11511, comm: flush-9:1 Not tainted 3.0.0-rc7 #157 Gigabyte 
Technology Co., Ltd. GA-870A-UD3/GA-870A-UD3
[47504.939002] RIP: 0010:[<ffffffff8127baf1>]  [<ffffffff8127baf1>] 
xfs_is_delayed_page+0x41/0xa0
[47504.939002] RSP: 0018:ffff8802256836e0  EFLAGS: 00010287
[47504.939002] RAX: 0000000000000001 RBX: ffffea0000c01c08 RCX: 0000000000000020
[47504.939002] RDX: ffff884058ec3270 RSI: 0000000000000003 RDI: ffff880058ec3270
[47504.939002] RBP: ffff8802256836e0 R08: 00000000000fe2e8 R09: ffff880225683cb8
[47504.939002] R10: 57ffe6ff91801bd0 R11: 0000000000000024 R12: 0000000000000001
[47504.939002] R13: ffff880100fa7190 R14: ffff880225683900 R15: ffff880225683cb8
[47504.939002] FS:  00007fc28c6e8700(0000) GS:ffff88022fd40000(0000) 
knlGS:00000000f74286c0
[47504.939002] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[47504.939002] CR2: ffff884058ec3270 CR3: 0000000225c70000 CR4: 00000000000006e0
[47504.939002] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[47504.939002] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[47504.939002] Process flush-9:1 (pid: 11511, threadinfo ffff880225682000, task 
ffff8800cfa52d20)
[47504.939002] Stack:
[47504.939002]  ffff880225683780 ffffffff8127cb01 00000000000fe2db 
ffff8802256838e0
[47504.939002]  00000000000fe2e8 ffff880225683cb8 ffff880225683900 
ffff8802256838f0
[47504.939002]  ffff8802256838e0 0000000100000000 ffff880058ec3208 
00000000fe2e8000
[47504.939002] Call Trace:
[47504.939002]  [<ffffffff8127cb01>] xfs_convert_page.isra.10+0xa1/0x300
[47504.939002]  [<ffffffff8127ce55>] xfs_cluster_write+0xf5/0x160
[47504.939002]  [<ffffffff8127d711>] xfs_vm_writepage+0x481/0x510
[47504.939002]  [<ffffffff811109c2>] __writepage+0x12/0x40
[47504.939002]  [<ffffffff81110e19>] write_cache_pages+0x1b9/0x4c0
[47504.939002]  [<ffffffff810a6f7c>] ? up+0x2c/0x50
[47504.939002]  [<ffffffff811109b0>] ? bdi_set_max_ratio+0x90/0x90
[47504.939002]  [<ffffffff8111115b>] generic_writepages+0x3b/0x60
[47504.939002]  [<ffffffff8127befc>] xfs_vm_writepages+0x4c/0x60
[47504.939002]  [<ffffffff8111204c>] do_writepages+0x1c/0x30
[47504.939002]  [<ffffffff81179886>] writeback_single_inode+0xf6/0x260
[47504.939002]  [<ffffffff81179c5b>] writeback_sb_inodes+0xdb/0x1b0
[47504.939002]  [<ffffffff8117a79b>] writeback_inodes_wb+0x7b/0x150
[47504.939002]  [<ffffffff8117ad3b>] wb_writeback+0x4cb/0x520
[47504.939002]  [<ffffffff8117ae58>] wb_check_old_data_flush+0xc8/0xd0
[47504.939002]  [<ffffffff8117afed>] wb_do_writeback+0x18d/0x270
[47504.939002]  [<ffffffff81090ce2>] ? del_timer_sync+0x32/0x50
[47504.939002]  [<ffffffff8108f720>] ? init_timer_deferrable_key+0x20/0x20
[47504.939002]  [<ffffffff8117b154>] bdi_writeback_thread+0x84/0x300
[47504.939002]  [<ffffffff810e1410>] ? call_rcu+0x10/0x20
[47504.939002]  [<ffffffff8117b0d0>] ? wb_do_writeback+0x270/0x270
[47504.939002]  [<ffffffff810a1457>] kthread+0x87/0x90
[47504.939002]  [<ffffffff814c2a94>] kernel_thread_helper+0x4/0x10
[47504.939002]  [<ffffffff810a13d0>] ? kthread_worker_fn+0x190/0x190
[47504.939002]  [<ffffffff814c2a90>] ? gs_change+0xb/0xb
[47504.939002] Code: 31 c0 80 e6 08 74 4b 48 8b 07 f6 c4 08 74 6f 48 8b 7f 10 
31 c0 48 89 fa eb 11 31 c0 83 fe 02 0f 94 c0 48 8b 52 08 48 39 d7 74 27
[47504.939002]  8b 0a 80 e5 10 75 e7 48 8b 0a 80 e5 02 74 27 48 8b 52 08 31
[47504.939002] RIP  [<ffffffff8127baf1>] xfs_is_delayed_page+0x41/0xa0
[47504.939002]  RSP <ffff8802256836e0>
[47504.939002] CR2: ffff884058ec3270
[47504.939002] ---[ end trace 3745557f071632d2 ]---


I don't have a way to reproduce this, but this is what I was doing when it 
happened:
 - 2 KVM guests running, 1 doing about 1MB/s I/O ocasionally
 - 1 process running and dumping core to /var/tmp/cores (on XFS filesystem). 
Coredump is 4.1G, so takes a while
 - wireshark, firefox, icedove etc. running but that shouldn't matter

After I got this error I can't 'sync' anymore, it just hangs, and some windows 
in X are frozen as well (konsole with all its tabs),
but launching new processes still works.

Attached is my .config and dmesg.

Looking at top:
top - 23:54:01 up 13:35, 10 users,  load average: 5.79, 6.16, 11.34
Tasks: 274 total,   1 running, 251 sleeping,  21 stopped,   1 zombie
Cpu(s):  4.4%us,  0.7%sy,  0.0%ni, 62.1%id, 32.8%wa,  0.0%hi,  0.0%si,  0.0%st
Mem:   8180900k total,  8058764k used,   122136k free,    76112k buffers
Swap:  4194264k total,    77560k used,  4116704k free,  3377104k cached

Not sure what the BUG means: did XFS ran out of memory when trying to write the 
4GB coredump, as it wanted to allocate an extra 4G temporarely to hold it?

Best regards,
--Edwin

Attachment: .config
Description: Text document

Attachment: dmesg
Description: Text document

<Prev in Thread] Current Thread [Next in Thread>