xfs
[Top] [All Lists]

Re: XFS: Observed Crash followed by deadlock of khubd/sync/XFS

To: Christoph Hellwig <hch@xxxxxxxxxxxxx>
Subject: Re: XFS: Observed Crash followed by deadlock of khubd/sync/XFS
From: Amit Sahrawat <amit.sahrawat83@xxxxxxxxx>
Date: Mon, 12 Sep 2011 16:45:15 +0530
Cc: xfs@xxxxxxxxxxx
Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type:content-transfer-encoding; bh=N1KfbbSjCMQoW4axStJdUdHj9+DkFy+RtRXS0ORcSLE=; b=obo0SBvtx3W440AgAW/oJZPcpxKkPFuEVEidoO/0WFKyXm4qnYtFvnY0VMrcGHHJrQ +RkOWBh2cpv5/NNec1vQjD2V9ymd9NUeNkUvIAwcGpHyTbmV9lL2uS4VTlMaWHowv3FN Gqd/Nbf11wt9aHofWJ4gPxx10NVsEOEbZy0BY=
In-reply-to: <CADDb1s3WbZr7aqOAs13vu6Z+QOd7f+JtLyHXqhnM7foqUVar9Q@xxxxxxxxxxxxxx>
References: <CADDb1s2QDk7y+JgMikoje35LviYQwzpgFihndUPuZx2VXYV4Ew@xxxxxxxxxxxxxx> <20110910183028.GA23537@xxxxxxxxxxxxx> <CADDb1s1fr8fiRMAUTvnSqpKeLGJdUPAbEfLNpwb2-iVd+mjw4w@xxxxxxxxxxxxxx> <CADDb1s3WbZr7aqOAs13vu6Z+QOd7f+JtLyHXqhnM7foqUVar9Q@xxxxxxxxxxxxxx>
Kernel Version:
Linux version 3.0.3 (root@xxxxxxxxxxxxxxxxxxxxx) (gcc version 4.4.4
20100503 (Red Hat 4.4.4-2) (GCC) ) #5 SMP Fri Sep 9 11:00:53 IST 2011
Target: x86

XFS (sdb3): Mounting Filesystem
XFS (sdb3): Ending clean mount
usb 2-6: USB disconnect, device number 3
XFS (sdb3): I/O error occurred: meta-data dev sdb3 block 0x56f159
 ("xlog_iodone") error 5 buf count 1024
XFS (sdb3): xfs_do_force_shutdown(0x2) called from line 891 of file
fs/xfs/xfs_log.c.  Return address = 0xf7b20ae1
XFS (sdb3): Log I/O Error Detected.  Shutting down filesystem
XFS (sdb3): Please umount the filesystem and rectify the problem(s)
XFS (sdb3): Unable to update superblock counters. Freespace may not be
correct on next mount.
XFS (€): xfs_trans_ail_delete_bulk: attempting to delete a log item
that is not in the AIL
XFS (€): xfs_do_force_shutdown(0x8) called from line 740 of file
fs/xfs/xfs_trans_ail.c.  Return address = 0xf7b2dc7a
BUG: unable to handle kernel NULL pointer dereference at 00000208
IP: [<f7b20f73>] xfs_log_force_umount+0x1d/0x1b5 [xfs]
*pde = 00000000
Oops: 0000 [#1] SMP
Modules linked in: vfat fat usb_storage xfs exportfs fuse sunrpc
cpufreq_ondemand acpi_cpufreq mperf ip6t_REJECT nf_conntrack_ipv6
nf_defrag_ipv6 ip6table_filter ip6_tables ipv6 uinput r8169 microcode
snd_hda_codec_realtek snd_hda_intel snd_hda_codec snd_hwdep i2c_i801
snd_seq snd_seq_device snd_pcm snd_timer snd soundcore snd_page_alloc
mii ppdev parport_pc parport iTCO_wdt iTCO_vendor_support pcspkr i915
drm_kms_helper drm i2c_algo_bit i2c_core video [last unloaded:
scsi_wait_scan]

Pid: 2233, comm: umount Not tainted 3.0.3 #5 Hewlett-Packard HP dx2480
MT(KL969AV)/0B08h
EIP: 0060:[<f7b20f73>] EFLAGS: 00210202 CPU: 1
EIP is at xfs_log_force_umount+0x1d/0x1b5 [xfs]
EAX: f2a1b380 EBX: 000001f4 ECX: f3ccdd5c EDX: 00000000
ESI: f2a1b380 EDI: 00000000 EBP: f3ccdd98 ESP: f3ccdd80
DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068
Process umount (pid: 2233, ti=f3ccc000 task=f1f957f0 task.ti=f3ccc000)
Stack:
 f7b32547 f2a1b380 00000000 f2a1b380 00000008 00000000 f3ccddb0 f7b32564
 00000000 f2a1b000 f2a1b380 00000000 f3ccddec f7b2dc7a 000002e4 f2a1b380
 00000004 f7b48bbf f7b42efc f7b2364d 00000001 f3ccddf4 f2a1b004 00000000
Call Trace:
 [<f7b32547>] ? xfs_do_force_shutdown+0x39/0xd6 [xfs]
 [<f7b32564>] xfs_do_force_shutdown+0x56/0xd6 [xfs]
 [<f7b2dc7a>] xfs_trans_ail_delete_bulk+0x83/0xfa [xfs]
 [<f7b2364d>] ? xlog_cil_push+0x2d1/0x2f6 [xfs]
 [<f7b35623>] ? xfs_buf_iodone_work+0x14/0x23 [xfs]
 [<f7b09d6f>] xfs_buf_iodone+0x31/0x3d [xfs]
 [<f7b09d1b>] xfs_buf_do_callbacks+0x24/0x31 [xfs]
 [<f7b0a5b7>] xfs_buf_iodone_callbacks+0x16f/0x1a2 [xfs]
 [<f7b35623>] xfs_buf_iodone_work+0x14/0x23 [xfs]
 [<f7b357ec>] xfs_buf_ioend+0x95/0xa5 [xfs]
 [<f7b35954>] ? xfs_bioerror+0x34/0x3c [xfs]
 [<f7b36217>] ? xfs_flush_buftarg+0x9e/0xe9 [xfs]
 [<f7b35954>] xfs_bioerror+0x34/0x3c [xfs]
 [<f7b36169>] xfs_bdstrat_cb+0x5f/0x6f [xfs]
 [<f7b36217>] xfs_flush_buftarg+0x9e/0xe9 [xfs]
 [<f7b36284>] xfs_free_buftarg+0x22/0x45 [xfs]
 [<f7b3c390>] xfs_close_devices+0x55/0x59 [xfs]
 [<f7b3c3e4>] xfs_fs_put_super+0x50/0x61 [xfs]
 [<c04e1432>] generic_shutdown_super+0x52/0xb0
 [<c04e14b2>] kill_block_super+0x22/0x5e
 [<c04e1997>] deactivate_locked_super+0x1f/0x40
 [<c04e21bf>] deactivate_super+0x37/0x3c
 [<c04f40c3>] mntput_no_expire+0x114/0x11a
 [<c04f4a2b>] sys_umount+0x26e/0x295
 [<c04f4a64>] sys_oldumount+0x12/0x14
 [<c07c401f>] sysenter_do_call+0x12/0x28
Code: 05 b8 05 00 00 00 83 c4 10 5b 5e 5f 5d c3 55 89 e5 57 56 53 83
ec 0c 3e 8d 74 26 00 89 55 f0 8b 98 18 01 00 00 89 c6 85 db 74 06 <f6>
43 14 02 74 27 8b 86 e0 00 00 00 31 ff 83 8e cc 01 00 00 10
EIP: [<f7b20f73>] xfs_log_force_umount+0x1d/0x1b5 [xfs] SS:ESP 0068:f3ccdd80
CR2: 0000000000000208
 ---[ end trace f3af361b30e84114 ]---
------------[ cut here ]------------
WARNING: at kernel/exit.c:909 do_exit+0x37/0x621()
Hardware name: HP dx2480 MT(KL969AV)
Modules linked in: vfat fat usb_storage xfs exportfs fuse sunrpc
cpufreq_ondemand acpi_cpufreq mperf ip6t_REJECT nf_conntrack_ipv6
nf_defrag_ipv6 ip6table_filter ip6_tables ipv6 uinput r8169 microcode
snd_hda_codec_realtek snd_hda_intel snd_hda_codec snd_hwdep i2c_i801
snd_seq snd_seq_device snd_pcm snd_timer snd soundcore snd_page_alloc
mii ppdev parport_pc parport iTCO_wdt iTCO_vendor_support pcspkr i915
drm_kms_helper drm i2c_algo_bit i2c_core video [last unloaded:
scsi_wait_scan]
Pid: 2233, comm: umount Tainted: G      D     3.0.3 #5
Call Trace:
 [<c0437f03>] warn_slowpath_common+0x6a/0x7f
 [<c043b016>] ? do_exit+0x37/0x621
 [<c0437f2c>] warn_slowpath_null+0x14/0x18
 [<c043b016>] do_exit+0x37/0x621
 [<c043840d>] ? kmsg_dump+0x3a/0xb3
 [<c07bfd15>] oops_end+0x9d/0xa5
 [<c042155b>] no_context+0x115/0x11f
 [<c0421659>] __bad_area_nosemaphore+0xf4/0xfc
 [<c04216b0>] bad_area+0x3a/0x40
 [<c07c15db>] do_page_fault+0x227/0x376
 [<c07c13b4>] ? spurious_fault+0xba/0xba
 [<c07bf3d7>] error_code+0x67/0x6c
 [<f7b20f73>] ? xfs_log_force_umount+0x1d/0x1b5 [xfs]
 [<f7b32547>] ? xfs_do_force_shutdown+0x39/0xd6 [xfs]
 [<f7b32564>] xfs_do_force_shutdown+0x56/0xd6 [xfs]
 [<f7b2dc7a>] xfs_trans_ail_delete_bulk+0x83/0xfa [xfs]
 [<f7b2364d>] ? xlog_cil_push+0x2d1/0x2f6 [xfs]
 [<f7b35623>] ? xfs_buf_iodone_work+0x14/0x23 [xfs]
 [<f7b09d6f>] xfs_buf_iodone+0x31/0x3d [xfs]
 [<f7b09d1b>] xfs_buf_do_callbacks+0x24/0x31 [xfs]
 [<f7b0a5b7>] xfs_buf_iodone_callbacks+0x16f/0x1a2 [xfs]
 [<f7b35623>] xfs_buf_iodone_work+0x14/0x23 [xfs]
 [<f7b357ec>] xfs_buf_ioend+0x95/0xa5 [xfs]
 [<f7b35954>] ? xfs_bioerror+0x34/0x3c [xfs]
 [<f7b36217>] ? xfs_flush_buftarg+0x9e/0xe9 [xfs]
 [<f7b35954>] xfs_bioerror+0x34/0x3c [xfs]
 [<f7b36169>] xfs_bdstrat_cb+0x5f/0x6f [xfs]
 [<f7b36217>] xfs_flush_buftarg+0x9e/0xe9 [xfs]
 [<f7b36284>] xfs_free_buftarg+0x22/0x45 [xfs]
 [<f7b3c390>] xfs_close_devices+0x55/0x59 [xfs]
 [<f7b3c3e4>] xfs_fs_put_super+0x50/0x61 [xfs]
 [<c04e1432>] generic_shutdown_super+0x52/0xb0
 [<c04e14b2>] kill_block_super+0x22/0x5e
 [<c04e1997>] deactivate_locked_super+0x1f/0x40
 [<c04e21bf>] deactivate_super+0x37/0x3c
 [<c04f40c3>] mntput_no_expire+0x114/0x11a
 [<c04f4a2b>] sys_umount+0x26e/0x295
 [<c04f4a64>] sys_oldumount+0x12/0x14
 [<c07c401f>] sysenter_do_call+0x12/0x28
---[ end trace f3af361b30e84115 ]---


On Mon, Sep 12, 2011 at 4:36 PM, Amit Sahrawat
<amit.sahrawat83@xxxxxxxxx> wrote:
> Please ignore previous message, there is formatting issue with the
> back trace. I will post again.
>
> On Sun, Sep 11, 2011 at 10:16 PM, Amit Sahrawat
> <amit.sahrawat83@xxxxxxxxx> wrote:
>> Yes, the patch is applied and the crash is still appearing easily.
>>
>> On Sun, Sep 11, 2011 at 12:00 AM, Christoph Hellwig <hch@xxxxxxxxxxxxx> 
>> wrote:
>>> On Thu, Sep 08, 2011 at 04:35:28PM +0530, Amit Sahrawat wrote:
>>>> Kernel Version: 2.6.39.4
>>>> Target: ARM
>>>>
>>>> Observed while doing:
>>>> Copy some file (any size, I tried with 10MB, 100MB) to XFS partition
>>>> After Copy do???sync???
>>>> Now immediately, unplug the device.
>>>
>>> Does this still happen with the patch below applied?
>>>
>>>
>>
>

<Prev in Thread] Current Thread [Next in Thread>