xfs
[Top] [All Lists]

[Bug 269] New: XFS Data Corruption on Power Failure(on unclean unmount)

To: xfs-master@xxxxxxxxxxx
Subject: [Bug 269] New: XFS Data Corruption on Power Failure(on unclean unmount)
From: bugzilla-daemon@xxxxxxxxxxx
Date: Wed, 30 Jul 2003 06:25:00 -0700
Sender: linux-xfs-bounce@xxxxxxxxxxx
http://oss.sgi.com/bugzilla/show_bug.cgi?id=269

           Summary: XFS Data Corruption on Power Failure(on unclean unmount)
           Product: Linux XFS
           Version: Current
          Platform: IA32
        OS/Version: Linux
            Status: NEW
          Severity: normal
          Priority: High
         Component: XFS kernel code
        AssignedTo: xfs-master@xxxxxxxxxxx
        ReportedBy: ashahi@xxxxxxxxxxx
                CC: cattelan@xxxxxxx


I am using linux 2.4.20 + LVM 1.0.7 + XFS(snapshot-xfs-2.4.20-2003-04-
07_05:19_UTC with ACLs, no debug enabled). 

 I created couple of Logical Volumes using LVM, and then created/mounted XFS
 file  system over it. I am running some NFS Client doing I/O over different 
 files  in these file systems. I am doing Failover/Failback testing. That is 
 I have one filestem attached to one node and other to the second node. When 
 I fail one of the node, the other node takes over the file system of the 
 second node. That is it tried to mount a file system which was previously 
 not cleanly unmounted. When trying to mount the file system of the second 
 node, I am getting File System Data corruption. I am using our own HA software 
 which takes care of the fact that LVM is not cluster aware.
 
 Attached here is the output from "dmesg".


Jul 29 13:17:25 localhost kernel: Linux version 2.4.20 
(root@xxxxxxxxxxxxxxxxxxxxxxxx) (gcc version 3.2 20020903 (Red Hat Linux 8.0 
3.2-7)) #1 SMP Tue Jul 29 11:31:57 EDT 2003
Jul 29 13:17:26 localhost kernel: LVM version 1.0.7(28/03/2003)
Jul 29 13:17:26 localhost kernel: NET4: Linux TCP/IP 1.0 for NET4.0
Jul 29 13:17:26 localhost kernel: qla2x00: Found  VID=1077 DID=2312 SSVID=1077 
SSDID=100
Jul 29 13:17:26 localhost kernel: scsi(0:0:1:1): Enabled tagged queuing, queue 
depth 16.
Jul 29 13:17:26 localhost kernel: Attached scsi disk sda at scsi0, channel 0, 
id 0, lun 0
Jul 29 13:17:26 localhost kernel: Attached scsi disk sdb at scsi0, channel 0, 
id 0, lun 1
Jul 29 13:17:26 localhost kernel: Attached scsi disk sdc at scsi0, channel 0, 
id 1, lun 0
Jul 29 13:17:26 localhost kernel: Attached scsi disk sdd at scsi0, channel 0, 
id 1, lun 1
Jul 29 13:17:26 localhost kernel: SCSI device sda: 573498800 512-byte hdwr 
sectors (293631 MB)
Jul 29 13:17:26 localhost kernel:  sda: sda1 sda2 sda3
Jul 29 13:17:26 localhost kernel: SCSI device sdb: 573498800 512-byte hdwr 
sectors (293631 MB)
Jul 29 13:17:26 localhost kernel:  sdb: sdb1 sdb2 sdb3
Jul 29 13:17:26 localhost kernel: SCSI device sdc: 573498800 512-byte hdwr 
sectors (293631 MB)
Jul 29 13:17:26 localhost kernel:  sdc: sdc1 sdc2 sdc3
Jul 29 13:17:26 localhost kernel: SCSI device sdd: 573498800 512-byte hdwr 
sectors (293631 MB)
Jul 29 13:17:26 localhost kernel:  sdd: sdd1 sdd2 sdd3
Jul 29 13:17:26 localhost kernel: reiserfs: checking transaction log (device 
03:03) ...
Jul 29 13:17:26 localhost kernel: Warning, log replay starting on readonly 
filesystem
Jul 29 13:17:26 localhost kernel: reiserfs: replayed 63 transactions in 1 
seconds
Jul 29 13:17:26 localhost kernel: Using r5 hash to sort names
Jul 29 13:17:26 localhost kernel: ReiserFS version 3.6.25
Jul 29 13:17:32 localhost modprobe: modprobe: Can't locate module block-major-43
Jul 29 13:17:35 localhost kernel: hydra uses obsolete (PF_INET,SOCK_PACKET)
Jul 29 13:17:35 localhost modprobe: modprobe: Can't locate module block-major-43
Jul 29 13:17:35 localhost last message repeated 31 times
Jul 29 13:17:35 localhost nfs: Starting NFS services:  succeeded
Jul 29 13:17:35 localhost nfs: rpc.nfsd startup succeeded
Jul 29 13:17:36 localhost nfs: rpc.mountd startup succeeded
Jul 29 13:17:52 localhost modprobe: modprobe: Can't locate module block-major-43
Jul 29 13:17:52 localhost last message repeated 31 times
Jul 29 13:17:56 localhost kernel: SGI XFS snapshot-xfs-2.4.20-2003-04-
07_05:19_UTC with ACLs, no debug enabled
Jul 29 13:17:56 localhost kernel: SGI XFS Quota Management subsystem
Jul 29 13:17:56 localhost kernel: XFS mounting filesystem lvm(58,1)
Jul 29 13:35:25 localhost modprobe: modprobe: Can't locate module block-major-43
Jul 29 13:36:21 localhost last message repeated 192 times
Jul 29 13:36:23 localhost last message repeated 127 times
Jul 29 13:36:29 localhost kernel: XFS mounting filesystem lvm(58,1)
Jul 29 13:36:30 localhost kernel: XFS quotacheck lvm(58,1): Please wait.
Jul 29 13:36:32 localhost kernel: XFS quotacheck lvm(58,1): Done.
Jul 29 13:43:47 localhost kernel: e1000: eth2 NIC Link is Down
Jul 29 13:43:49 localhost kernel: e1000: eth2 NIC Link is Up 100 Mbps Full 
Duplex
Jul 29 13:44:20 localhost kernel: XFS mounting filesystem lvm(58,0)
Jul 29 13:44:20 localhost kernel: Filesystem "lvm(58,0)": XFS internal error 
xlog_clear_stale_blocks(2) at line 1135 of file xfs_log_recover.c.  Caller 
0xf8b27f8a
Jul 29 13:44:20 localhost kernel: eb7e3bf0 f8b13775 f8b137ef 00000008 00000000 
00000001 f219b000 f8b5b6e0 
Jul 29 13:44:20 localhost kernel:        f8b5a95e 0000046f f8b5a8b2 f8b27f8a 
00000007 00002400 00001200 f8b286ea 
Jul 29 13:44:20 localhost kernel:        f8b5a95e 00000001 f219b000 f8b5a8b2 
0000046f f8b27f8a 00000008 00000007 
Jul 29 13:44:20 localhost kernel: Call Trace:
Jul 29 13:44:20 localhost kernel:  
[qla2300:__insmod_qla2300_S.bss_L22432+2805589/92572491] 
xfs_stack_trace+0x5/0x10 [xfs]
Jul 29 13:44:20 localhost kernel:  [<f8b13775>] xfs_stack_trace+0x5/0x10 [xfs]
Jul 29 13:44:20 localhost kernel:  
[qla2300:__insmod_qla2300_S.bss_L22432+2805711/92572369] 
xfs_error_report+0x6f/0xb0 [xfs]
Jul 29 13:44:20 localhost kernel:  [<f8b137ef>] xfs_error_report+0x6f/0xb0 [xfs]
Jul 29 13:44:20 localhost kernel:  
[qla2300:__insmod_qla2300_S.bss_L22432+3100352/92277728] .rodata.str1.32+0x240/0
x2c00 [xfs]
Jul 29 13:44:20 localhost kernel:  [<f8b5b6e0>] .rodata.str1.32+0x240/0x2c00 
[xfs]
Jul 29 13:44:20 localhost kernel:  
[qla2300:__insmod_qla2300_S.bss_L22432+3096894/92281186] .rodata.str1.1+0x83a/0x
137c [xfs]
Jul 29 13:44:20 localhost kernel:  [<f8b5a95e>] .rodata.str1.1+0x83a/0x137c 
[xfs]
Jul 29 13:44:20 localhost kernel:  
[qla2300:__insmod_qla2300_S.bss_L22432+3096722/92281358] .rodata.str1.1+0x78e/0x
137c [xfs]
Jul 29 13:44:20 localhost kernel:  [<f8b5a8b2>] .rodata.str1.1+0x78e/0x137c 
[xfs]
Jul 29 13:44:20 localhost kernel:  
[qla2300:__insmod_qla2300_S.bss_L22432+2889578/92488502] 
xlog_find_tail+0x27a/0x440 [xfs]
Jul 29 13:44:20 localhost kernel:  [<f8b27f8a>] xlog_find_tail+0x27a/0x440 [xfs]
Jul 29 13:44:20 localhost kernel:  
[qla2300:__insmod_qla2300_S.bss_L22432+2891466/92486614] 
xlog_clear_stale_blocks+0x14a/0x1a0 [xfs]
Jul 29 13:44:20 localhost kernel:  [<f8b286ea>] 
xlog_clear_stale_blocks+0x14a/0x1a0 [xfs]
Jul 29 13:44:20 localhost kernel:  
[qla2300:__insmod_qla2300_S.bss_L22432+3096894/92281186] .rodata.str1.1+0x83a/0x
137c [xfs]
Jul 29 13:44:20 localhost kernel:  [<f8b5a95e>] .rodata.str1.1+0x83a/0x137c 
[xfs]
Jul 29 13:44:20 localhost kernel:  
[qla2300:__insmod_qla2300_S.bss_L22432+3096722/92281358] .rodata.str1.1+0x78e/0x
137c [xfs]
Jul 29 13:44:20 localhost kernel:  [<f8b5a8b2>] .rodata.str1.1+0x78e/0x137c 
[xfs]
Jul 29 13:44:20 localhost kernel:  
[qla2300:__insmod_qla2300_S.bss_L22432+2889578/92488502] 
xlog_find_tail+0x27a/0x440 [xfs]
Jul 29 13:44:20 localhost kernel:  [<f8b27f8a>] xlog_find_tail+0x27a/0x440 [xfs]
Jul 29 13:44:20 localhost kernel:  
[qla2300:__insmod_qla2300_S.bss_L22432+2889578/92488502] 
xlog_find_tail+0x27a/0x440 [xfs]
Jul 29 13:44:20 localhost kernel:  [<f8b27f8a>] xlog_find_tail+0x27a/0x440 [xfs]
Jul 29 13:44:20 localhost kernel:  
[qla2300:__insmod_qla2300_S.bss_L22432+2906167/92471913] 
xlog_recover+0x37/0x100 [xfs]
Jul 29 13:44:20 localhost kernel:  [<f8b2c057>] xlog_recover+0x37/0x100 [xfs]
Jul 29 13:44:20 localhost kernel:  
[qla2300:__insmod_qla2300_S.bss_L22432+2869341/92508739] 
xfs_log_mount+0x8d/0xf0 [xfs]
Jul 29 13:44:20 localhost kernel:  [<f8b2307d>] xfs_log_mount+0x8d/0xf0 [xfs]
Jul 29 13:44:20 localhost kernel:  
[qla2300:__insmod_qla2300_S.bss_L22432+2912227/92465853] 
xfs_mountfs+0x503/0xf20 [xfs]
Jul 29 13:44:20 localhost kernel:  [<f8b2d803>] xfs_mountfs+0x503/0xf20 [xfs]
Jul 29 13:44:20 localhost kernel:  
[qla2300:__insmod_qla2300_S.bss_L22432+2909972/92468108] xfs_readsb+0x134/0x1f0 
[xfs]
Jul 29 13:44:20 localhost kernel:  [<f8b2cf34>] xfs_readsb+0x134/0x1f0 [xfs]
Jul 29 13:44:20 localhost kernel:  
[qla2300:__insmod_qla2300_S.bss_L22432+2858466/92519614] xfs_ioinit+0x42/0x50 
[xfs]
Jul 29 13:44:20 localhost kernel:  [<f8b20602>] xfs_ioinit+0x42/0x50 [xfs]
Jul 29 13:44:20 localhost kernel:  
[qla2300:__insmod_qla2300_S.bss_L22432+2947982/92430098] xfs_mount+0x2ce/0x400 
[xfs]
Jul 29 13:44:20 localhost kernel:  [<f8b363ae>] xfs_mount+0x2ce/0x400 [xfs]
Jul 29 13:44:20 localhost kernel:  
[qla2300:__insmod_qla2300_S.bss_L22432+3043747/92334333] vfs_mount+0x43/0x50 
[xfs]
Jul 29 13:44:20 localhost kernel:  [<f8b4d9c3>] vfs_mount+0x43/0x50 [xfs]
Jul 29 13:44:20 localhost kernel:  
[qla2300:__insmod_qla2300_S.bss_L22432+3075484/92302596] xfs_qm_mount+0x4c/0x70 
[xfs]
Jul 29 13:44:20 localhost kernel:  [<f8b555bc>] xfs_qm_mount+0x4c/0x70 [xfs]
Jul 29 13:44:20 localhost kernel:  
[qla2300:__insmod_qla2300_S.bss_L22432+3043747/92334333] vfs_mount+0x43/0x50 
[xfs]
Jul 29 13:44:20 localhost kernel:  [<f8b4d9c3>] vfs_mount+0x43/0x50 [xfs]
Jul 29 13:44:20 localhost kernel:  
[qla2300:__insmod_qla2300_S.bss_L22432+3042987/92335093] 
linvfs_read_super+0x9b/0x1c0 [xfs]
Jul 29 13:44:20 localhost kernel:  [<f8b4d6cb>] linvfs_read_super+0x9b/0x1c0 
[xfs]
Jul 29 13:44:20 localhost kernel:  [kmalloc+75/96] kmalloc+0x4b/0x60 [kernel]
Jul 29 13:44:20 localhost kernel:  [<c013ae5b>] kmalloc+0x4b/0x60 [kernel]
Jul 29 13:44:20 localhost kernel:  [alloc_super+58/432] alloc_super+0x3a/0x1b0 
[kernel]
Jul 29 13:44:20 localhost kernel:  [<c014bbfa>] alloc_super+0x3a/0x1b0 [kernel]
Jul 29 13:44:20 localhost kernel:  [insert_super+100/128] 
insert_super+0x64/0x80 [kernel]
Jul 29 13:44:20 localhost kernel:  [<c014bee4>] insert_super+0x64/0x80 [kernel]
Jul 29 13:44:20 localhost kernel:  [get_sb_bdev+446/752] 
get_sb_bdev+0x1be/0x2f0 [kernel]
Jul 29 13:44:20 localhost kernel:  [<c014c8ee>] get_sb_bdev+0x1be/0x2f0 [kernel]
Jul 29 13:44:20 localhost kernel:  
[qla2300:__insmod_qla2300_S.bss_L22432+3160236/92217844] xfs_fs_type+0x0/0x34 
[xfs]
Jul 29 13:44:20 localhost kernel:  [<f8b6a0cc>] xfs_fs_type+0x0/0x34 [xfs]
Jul 29 13:44:20 localhost kernel:  [do_kern_mount+289/320] 
do_kern_mount+0x121/0x140 [kernel]
Jul 29 13:44:20 localhost kernel:  [<c014ccf1>] do_kern_mount+0x121/0x140 
[kernel]
Jul 29 13:44:20 localhost kernel:  
[qla2300:__insmod_qla2300_S.bss_L22432+3160236/92217844] xfs_fs_type+0x0/0x34 
[xfs]
Jul 29 13:44:20 localhost kernel:  [<f8b6a0cc>] xfs_fs_type+0x0/0x34 [xfs]
Jul 29 13:44:20 localhost kernel:  [do_add_mount+147/400] 
do_add_mount+0x93/0x190 [kernel]
Jul 29 13:44:20 localhost kernel:  [<c0163993>] do_add_mount+0x93/0x190 [kernel]
Jul 29 13:44:20 localhost kernel:  [do_mount+352/432] do_mount+0x160/0x1b0 
[kernel]
Jul 29 13:44:20 localhost kernel:  [<c0163cc0>] do_mount+0x160/0x1b0 [kernel]
Jul 29 13:44:20 localhost kernel:  [copy_mount_options+121/208] 
copy_mount_options+0x79/0xd0 [kernel]
Jul 29 13:44:20 localhost kernel:  [<c0163b09>] copy_mount_options+0x79/0xd0 
[kernel]
Jul 29 13:44:20 localhost kernel:  [sys_mount+215/352] sys_mount+0xd7/0x160 
[kernel]
Jul 29 13:44:20 localhost kernel:  [<c01640f7>] sys_mount+0xd7/0x160 [kernel]
Jul 29 13:44:20 localhost kernel:  [system_call+51/56] system_call+0x33/0x38 
[kernel]
Jul 29 13:44:20 localhost kernel:  [<c01094ef>] system_call+0x33/0x38 [kernel]
Jul 29 13:44:20 localhost kernel: 
Jul 29 13:44:20 localhost kernel: XFS: failed to locate log tail
Jul 29 13:44:20 localhost kernel: XFS: log mount/recovery failed
Jul 29 13:44:20 localhost kernel: XFS: log mount failed
Jul 29 13:44:31 localhost kernel: XFS mounting filesystem lvm(58,0)



------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.


<Prev in Thread] Current Thread [Next in Thread>
  • [Bug 269] New: XFS Data Corruption on Power Failure(on unclean unmount), bugzilla-daemon <=