http://oss.sgi.com/bugzilla/show_bug.cgi?id=904
Summary: XFS CPU soft lockup: xfs_write > try_to_free_pages > find_get_pages
Product: XFS
Version: unspecified
Platform: All
OS/Version: Linux
Status: NEW
Severity: critical
Priority: P5
Component: XFS kernel code
AssignedTo: xfs-masters@xxxxxxxxxxx
ReportedBy: linux.news@xxxxxxxxxxx
Estimated Hours: 0.0
Classification: Unclassified
Created an attachment (id=302)
--> (http://oss.sgi.com/bugzilla/attachment.cgi?id=302)
Crash log: dmesg, df -h, mount
Kernel version: 2.6.27-9-generic (Ubuntu)
This is a fresh filesystem, neither fragmented nor full. Nevertheless, a write
to XFS soft-locks the CPU in find_get_pages.
1. Created a fresh XFS filesystem, on LVM on RAID5, with 1.68 TB of space
(Created it a month ago, but did not use it at all until now.)
2. dmesg showed no errors at this point.
3. Started copying files from another array to this filesystem, as root:
# time nice cp -a /media/jkl/foo/* /media/ijk/
4. Next day, found:
- Target XFS filesystem holds 1.1 TB, 671 GB are free, 62% in use.
- dmesg shows the soft lockup below, repeated many times.
- cp seems hung; Ctrl-C does not abort it.
- FS usage (GB) does not change. The lockup messages in dmesg continue as I type.
# df -h
Filesystem Size Used Avail Use% Mounted on
/dev/sda2 9,4G 1,3G 8,1G 14% /
...
/dev/mapper/array1-ijk
1,7T 1,1T 671G 62% /media/ijk
/dev/mapper/array2-jkl
6,4T 6,3T 165G 98% /media/jkl
(All filesystems are XFS)
I should mention that I have had some problems with this machine: disks
sometimes stop responding. I have not found the cause yet; a weak power supply
is one idea. The symptoms I saw then (SATA link errors) did not appear here,
though: not for several days, and not while writing to this new FS.
[498641.408996] BUG: soft lockup - CPU#1 stuck for 61s! [cp:19037]
[498641.409005] Modules linked in: nfsd auth_rpcgss exportfs kvm_amd kvm ipv6
nfs lockd nfs_acl sunrpc iptable_filter ip_tables x_tables parport_pc lp
parport loop pcspkr evdev k8temp dm_multipath scsi_dh shpchp pci_hotplug wmi
button xfs pata_acpi sd_mod crc_t10dif sg ata_generic mptsas mptscsih ahci
sata_sil sata_sil24 mptbase scsi_transport_sas libata scsi_mod ohci_hcd
ehci_hcd dock forcedeth usbcore raid10 raid456 async_xor async_memcpy async_tx
xor raid1 raid0 multipath linear md_mod dm_mirror dm_log dm_snapshot dm_mod
thermal processor fan fbcon tileblit font bitblit softcursor fuse
[498641.409005] CPU 1:
[498641.409005] Modules linked in: nfsd auth_rpcgss exportfs kvm_amd kvm ipv6
nfs lockd nfs_acl sunrpc iptable_filter ip_tables x_tables parport_pc lp
parport loop pcspkr evdev k8temp dm_multipath scsi_dh shpchp pci_hotplug wmi
button xfs pata_acpi sd_mod crc_t10dif sg ata_generic mptsas mptscsih ahci
sata_sil sata_sil24 mptbase scsi_transport_sas libata scsi_mod ohci_hcd
ehci_hcd dock forcedeth usbcore raid10 raid456 async_xor async_memcpy async_tx
xor raid1 raid0 multipath linear md_mod dm_mirror dm_log dm_snapshot dm_mod
thermal processor fan fbcon tileblit font bitblit softcursor fuse
[498641.409005] Pid: 19037, comm: cp Not tainted 2.6.27-9-generic #1
[498641.409005] RIP: 0010:[<ffffffff802ac0b1>] [<ffffffff802ac0b1>] find_get_pages+0x71/0x110
[498641.409005] RSP: 0018:ffff880004bb3378 EFLAGS: 00000282
[498641.409005] RAX: ffff8800d9177e70 RBX: ffff880004bb33b8 RCX: 0000000000000003
[498641.409005] RDX: 0000000000000004 RSI: 0000000000000000 RDI: ffffe2000369f880
[498641.409005] RBP: ffff880004bb32f8 R08: ffffe20002441588 R09: 0000000000000009
[498641.409005] R10: 0000000000000008 R11: 0000000000031b49 R12: ffff8800db5f0658
[498641.409005] R13: ffff8800db5f0620 R14: 01ffffffa00c1d20 R15: ffff880004bb32f8
[498641.409005] FS: 00007f6a55fd0770(0000) GS:ffff8800dd802880(0000) knlGS:00000000f7d796c0
[498641.409005] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[498641.409005] CR2: 00007f66af6f7000 CR3: 000000007c8f0000 CR4: 00000000000006e0
[498641.409005] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[498641.409005] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[498641.409005]
[498641.409005] Call Trace:
[498641.409005] [<ffffffff802ac083>] ? find_get_pages+0x43/0x110
[498641.409005] [<ffffffff802b6b94>] ? pagevec_lookup+0x24/0x30
[498641.409005] [<ffffffffa029802d>] ? xfs_cluster_write+0xad/0x180 [xfs]
[498641.409005] [<ffffffffa0298598>] ? xfs_page_state_convert+0x498/0x760 [xfs]
[498641.409005] [<ffffffffa02989c1>] ? xfs_vm_writepage+0x71/0x120 [xfs]
[498641.409005] [<ffffffff802b9474>] ? pageout+0x124/0x280
[498641.409005] [<ffffffff802ab1da>] ? page_waitqueue+0xa/0x90
[498641.409005] [<ffffffff802b9a7d>] ? shrink_page_list+0x34d/0x530
[498641.409005] [<ffffffff802b9049>] ? __isolate_lru_page+0x79/0xb0
[498641.409005] [<ffffffff802b910a>] ? isolate_lru_pages+0x8a/0x220
[498641.409005] [<ffffffff802b9e02>] ? shrink_inactive_list+0x1a2/0x4b0
[498641.409005] [<ffffffff802ba18b>] ? shrink_zone+0x7b/0x160
[498641.409005] [<ffffffff802ba2fd>] ? shrink_zones+0x8d/0x150
[498641.409005] [<ffffffff802ba446>] ? do_try_to_free_pages+0x86/0x2e0
[498641.409005] [<ffffffff802ba797>] ? try_to_free_pages+0x67/0x70
[498641.409005] [<ffffffff802b92a0>] ? isolate_pages_global+0x0/0x50
[498641.409005] [<ffffffff802b2a69>] ? __alloc_pages_internal+0x239/0x520
[498641.409005] [<ffffffff802d58cd>] ? alloc_pages_current+0xad/0x110
[498641.409005] [<ffffffff802ac617>] ? __page_cache_alloc+0x67/0x80
[498641.409005] [<ffffffff802ad253>] ? __grab_cache_page+0x63/0xb0
[498641.409005] [<ffffffff80316dc9>] ? block_write_begin+0x89/0xf0
[498641.409005] [<ffffffffa029748a>] ? xfs_vm_write_begin+0x2a/0x30 [xfs]
[498641.409005] [<ffffffffa0297050>] ? xfs_get_blocks+0x0/0x20 [xfs]
[498641.409005] [<ffffffff802ab93c>] ? generic_perform_write+0xbc/0x1c0
[498641.409005] [<ffffffff80301086>] ? touch_atime+0x36/0x140
[498641.409005] [<ffffffff802ad6a2>] ? generic_file_buffered_write+0x92/0x170
[498641.409005] [<ffffffffa02a02f3>] ? xfs_write+0x6b3/0x9b0 [xfs]
[498641.409005] [<ffffffff802ae3a1>] ? generic_file_aio_read+0xd1/0x1b0
[498641.409005] [<ffffffffa0273abc>] ? xfs_iunlock+0x8c/0xc0 [xfs]
[498641.409005] [<ffffffffa02a0a69>] ? xfs_read+0x179/0x2b0 [xfs]
[498641.409005] [<ffffffffa029bca8>] ? xfs_file_aio_write+0x58/0x60 [xfs]
[498641.409005] [<ffffffff802e97d9>] ? do_sync_write+0xf9/0x140
[498641.409005] [<ffffffff8026afaf>] ? hrtimer_start+0xdf/0x1b0
[498641.409005] [<ffffffff80267050>] ? autoremove_wake_function+0x0/0x40
[498641.409005] [<ffffffff80386bb1>] ? aa_file_permission+0x21/0xf0
[498641.409005] [<ffffffff80386cd8>] ? apparmor_file_permission+0x28/0x30
[498641.409005] [<ffffffff80361796>] ? security_file_permission+0x16/0x20
[498641.409005] [<ffffffff802e9e9b>] ? vfs_write+0xcb/0x130
[498641.409005] [<ffffffff802e9ff5>] ? sys_write+0x55/0x90
[498641.409005] [<ffffffff8021285a>] ? system_call_fastpath+0x16/0x1b
[498641.409005]
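Since the lockup message repeats many times in dmesg, it may help to tally the reports per process and CPU. The following is only a rough sketch, assuming the "BUG: soft lockup" line format shown at the top of the trace; the function name summarize_lockups and the regular expression are illustrative and not part of any kernel tooling.

```python
import re
from collections import Counter

# Assumed line format, taken from the report above:
# [498641.408996] BUG: soft lockup - CPU#1 stuck for 61s! [cp:19037]
LOCKUP_RE = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\]\s+BUG: soft lockup - CPU#(?P<cpu>\d+) "
    r"stuck for (?P<secs>\d+)s! \[(?P<comm>[^\]]+):(?P<pid>\d+)\]"
)


def summarize_lockups(log_text):
    """Count soft-lockup reports, keyed by (command, pid, cpu)."""
    counts = Counter()
    for line in log_text.splitlines():
        match = LOCKUP_RE.search(line)
        if match:
            key = (match.group("comm"),
                   int(match.group("pid")),
                   int(match.group("cpu")))
            counts[key] += 1
    return counts


# Example input: the first line of the trace in this report.
sample = "[498641.408996] BUG: soft lockup - CPU#1 stuck for 61s! [cp:19037]"
for (comm, pid, cpu), n in summarize_lockups(sample).items():
    print(f"{comm}[{pid}] on CPU#{cpu}: {n} soft lockup report(s)")
```

Feeding the full dmesg text to summarize_lockups instead of the single sample line would show whether every report comes from the same cp process and CPU.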
--
Configure bugmail: http://oss.sgi.com/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.