xfs-masters
[Top] [All Lists]

[xfs-masters] [Bug 791] New: endless xfs_repiar loop

To: xfs-master@xxxxxxxxxxx
Subject: [xfs-masters] [Bug 791] New: endless xfs_repiar loop
From: bugzilla-daemon@xxxxxxxxxxx
Date: Thu, 18 Sep 2008 14:07:14 -0700
Reply-to: xfs-masters@xxxxxxxxxxx
Sender: xfs-masters-bounce@xxxxxxxxxxx
http://oss.sgi.com/bugzilla/show_bug.cgi?id=791

           Summary: endless xfs_repiar loop
           Product: Linux XFS
           Version: Current
          Platform: PC
        OS/Version: Linux
            Status: NEW
          Severity: normal
          Priority: P2
         Component: xfsprogs
        AssignedTo: xfs-master@xxxxxxxxxxx
        ReportedBy: glaucon.hunn@xxxxxxxxx


distro: debian etch
kernel: 2.6.26-1-686 (etch version)
xfsprogs: 2.9.8-1

(also ran memtest86+ overnight, everything looked good)

summary:
xfs_repair (lots of errors reported but seems to exit ok with "done")
xfs_repair (more errors...)
xfs_repair (more errors...)
...

The other day I thought it might be a good Idea to run xfs_repair on my 2.8T
(mdadm) raid6 filesystem.  Mainly because xfs_check wont work because I dont
have enough memory... (nothing seemed broken just thought it would be a good 
idea)

So I ran xfs_repair -n, it segfaulted due to a (seemingly) corrupt filesystem. 
I say seemingly because I have no idea how it got corrupt... Anyways re-ran
without the -n to let it do its magic, it threw a bunch of files in lost+found/.
 When I ls'ed them the kernel oopsed with things like:

Sep 16 21:52:31 mandy kernel: [114207.316008] Filesystem "dm-0": corrupt dinode
1276311, extent total = 134217728, nblocks = 0.  Unmount and run xfs_repair.
Sep 16 21:52:31 mandy kernel: [114207.316008] 00000000: 49 4e 43 ff 02 01 00 00
00 00 03 e8 00 00 00 65  INC............e
Sep 16 21:52:31 mandy kernel: [114207.318280] Filesystem "dm-0": XFS internal
error xfs_iformat(1) at line 362 of file fs/xfs/xfs_inode.c.  Caller 0xf8b31d7e
Sep 16 21:52:31 mandy kernel: [114207.321250] Pid: 1308, comm: find Not tainted
2.6.26-1-686 #1
Sep 16 21:52:31 mandy kernel: [114207.321269]  [<f8b31a26>]
xfs_iformat+0x1a4/0x434 [xfs]
Sep 16 21:52:31 mandy kernel: [114207.321348]  [<f8b31d7e>] xfs_iread+0xc8/0x1ba
[xfs]
Sep 16 21:52:31 mandy kernel: [114207.321379]  [<f8b31d7e>] xfs_iread+0xc8/0x1ba
[xfs]
Sep 16 21:52:31 mandy kernel: [114207.321414]  [<f8b31d7e>] xfs_iread+0xc8/0x1ba
[xfs]
Sep 16 21:52:31 mandy kernel: [114207.321447]  [<f8b2d43d>]
xfs_iget_core+0x28a/0x47d [xfs]
Sep 16 21:52:31 mandy kernel: [114207.321485]  [<f8b2d6d1>] xfs_iget+0xa1/0x128
[xfs]
Sep 16 21:52:31 mandy kernel: [114207.321520]  [<f8b45ad6>] xfs_lookup+0x6e/0x8a
[xfs]
Sep 16 21:52:31 mandy kernel: [114207.321564]  [<f8b502f6>]
xfs_vn_lookup+0x35/0x6b [xfs]
Sep 16 21:52:31 mandy kernel: [114207.321604]  [<c017a347>] do_lookup+0xb6/0x153
Sep 16 21:52:31 mandy kernel: [114207.321619]  [<c017bf3b>]
__link_path_walk+0x726/0xb0d
Sep 16 21:52:31 mandy kernel: [114207.321627]  [<c01867ac>]
mntput_no_expire+0x13/0xd9
Sep 16 21:52:31 mandy kernel: [114207.321640]  [<c017c359>] path_walk+0x37/0x70
Sep 16 21:52:31 mandy kernel: [114207.321646]  [<c017c608>]
do_path_lookup+0x122/0x184
Sep 16 21:52:31 mandy kernel: [114207.321653]  [<c017ce65>] 
__user_walk_fd+0x29/0x3a
Sep 16 21:52:31 mandy kernel: [114207.321659]  [<c0176f4d>] 
vfs_lstat_fd+0x12/0x39
Sep 16 21:52:31 mandy kernel: [114207.321669]  [<c0134396>] ktime_get+0xd/0x21
Sep 16 21:52:31 mandy kernel: [114207.321676]  [<c01029cb>] 
__switch_to+0x102/0x125
Sep 16 21:52:31 mandy kernel: [114207.321683]  [<c011d358>] hrtick_set+0x7a/0xd8
Sep 16 21:52:31 mandy kernel: [114207.321690]  [<c0176fe1>] sys_lstat64+0xf/0x23
Sep 16 21:52:31 mandy kernel: [114207.321695]  [<c02b7dd7>] schedule+0x63d/0x66f
Sep 16 21:52:31 mandy kernel: [114207.321706]  [<c01867ac>]
mntput_no_expire+0x13/0xd9
Sep 16 21:52:31 mandy kernel: [114207.321712]  [<c01728d5>] filp_close+0x4d/0x53
Sep 16 21:52:31 mandy kernel: [114207.321719]  [<c0103853>]
sysenter_past_esp+0x78/0xb1


So I unmounted, re-ran xfs_repair, it found a bunch more problems.  So without
remounting I re-ran xfs_repiar -n, found more problems, ran xfs_repair... rinse
and repeat...  Meanwhile I keep loosing more and more data (started out with
about > 1T on disk now down to ~500G in lost+found and ~100 left in the 
filesystem)


Now im using xfs_copy and getting oops on the dest filesystem (due to the src
being corrupted?):

[ 2114.906171] Filesystem "sdd1": XFS internal error xfs_btree_check_sblock at
line 334 of file fs/xfs/xfs_btree.c.  Caller 0xf8afae76
[ 2114.906231] Pid: 4152, comm: xfs_copy Not tainted 2.6.26-1-686 #1
[ 2114.906254]  [<f8b10f82>] xfs_btree_check_sblock+0x93/0xa2 [xfs]
[ 2114.906329]  [<f8afae76>] xfs_alloc_lookup+0x130/0x33d [xfs]
[ 2114.906358]  [<f8afae76>] xfs_alloc_lookup+0x130/0x33d [xfs]
[ 2114.906392]  [<f8af9406>] xfs_alloc_ag_vextent_size+0x3c/0x3b4 [xfs]
[ 2114.906427]  [<f8afa035>] xfs_alloc_ag_vextent+0x24/0xe5 [xfs]
[ 2114.906454]  [<f8afa874>] xfs_alloc_vextent+0x2fc/0x448 [xfs]
[ 2114.906487]  [<f8b0ae80>] xfs_bmap_btalloc+0x77a/0xa22 [xfs]
[ 2114.906518]  [<f8b22dcf>] xfs_iext_bno_to_ext+0x48/0x191 [xfs]
[ 2114.906570]  [<f8b0b9de>] xfs_bmapi+0x894/0x126a [xfs]
[ 2114.906599]  [<f8b22dcf>] xfs_iext_bno_to_ext+0x48/0x191 [xfs]
[ 2114.906630]  [<f8b22e5f>] xfs_iext_bno_to_ext+0xd8/0x191 [xfs]
[ 2114.906675]  [<f8b30000>] xlog_recover_do_buffer_trans+0xe8/0x21d [xfs]
[ 2114.906723]  [<f8b288ab>] xfs_iomap_write_direct+0x3c0/0x4b3 [xfs]
[ 2114.906776]  [<f8b28bcb>] xfs_iomap+0x22d/0x346 [xfs]
[ 2114.906820]  [<f8b3fae1>] __xfs_get_blocks+0x5f/0x1e9 [xfs]
[ 2114.906858]  [<c01641cc>] get_user_pages+0x2a0/0x334
[ 2114.906876]  [<f8b3fc7c>] xfs_get_blocks_direct+0x11/0x15 [xfs]
[ 2114.906911]  [<c0195878>] __blockdev_direct_IO+0x5eb/0xb5b
[ 2114.906923]  [<c0135eec>] getnstimeofday+0x37/0xbc
[ 2114.906943]  [<f8b3f86c>] xfs_vm_direct_IO+0x96/0xb8 [xfs]
[ 2114.906979]  [<f8b3fc6b>] xfs_get_blocks_direct+0x0/0x15 [xfs]
[ 2114.907012]  [<f8b3f481>] xfs_end_io_direct+0x0/0x5a [xfs]
[ 2114.907047]  [<c01570e8>] generic_file_direct_IO+0x103/0x118
[ 2114.907061]  [<c0157149>] generic_file_direct_write+0x4c/0x11c
[ 2114.907071]  [<f8b45f06>] xfs_write+0x426/0x70d [xfs]
[ 2114.907116]  [<c0133c2e>] enqueue_hrtimer+0xc9/0xd4
[ 2114.907126]  [<f8b42d41>] xfs_file_aio_write+0x52/0x59 [xfs]
[ 2114.907162]  [<c0173f17>] do_sync_write+0xbf/0x100
[ 2114.907170]  [<c011b61d>] default_wake_function+0x0/0x8
[ 2114.907185]  [<c0126419>] __do_softirq+0x66/0xd3
[ 2114.907190]  [<c013177c>] autoremove_wake_function+0x0/0x2d
[ 2114.907201]  [<c0105f3f>] do_IRQ+0x52/0x63
[ 2114.907213]  [<c01b918f>] security_file_permission+0xc/0xd
[ 2114.907221]  [<c0173e58>] do_sync_write+0x0/0x100
[ 2114.907225]  [<c0174681>] vfs_write+0x83/0x120
[ 2114.907232]  [<c0174c53>] sys_write+0x3c/0x63
[ 2114.907238]  [<c0103853>] sysenter_past_esp+0x78/0xb1
[ 2114.907251]  =======================

Any Ideas?

-- 
Configure bugmail: http://oss.sgi.com/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.


<Prev in Thread] Current Thread [Next in Thread>
  • [xfs-masters] [Bug 791] New: endless xfs_repiar loop, bugzilla-daemon <=