xfs
[Top] [All Lists]

Re: something very strange w/ filestreams...

To: "David Chinner" <dgc@xxxxxxx>, "Eric Sandeen" <sandeen@xxxxxxxxxxx>
Subject: Re: something very strange w/ filestreams...
From: "Barry Naujok" <bnaujok@xxxxxxx>
Date: Mon, 24 Sep 2007 15:50:57 +1000
Cc: xfs-oss <xfs@xxxxxxxxxxx>
In-reply-to: <20070923092444.GQ995458@xxxxxxx>
Organization: SGI
References: <46F49C80.60007@xxxxxxxxxxx> <20070923092444.GQ995458@xxxxxxx>
Sender: xfs-bounce@xxxxxxxxxxx
User-agent: Opera Mail/9.10 (Win32)
On Sun, 23 Sep 2007 19:24:44 +1000, David Chinner <dgc@xxxxxxx> wrote:

Barry - I think xfs_repair might be finding the incorrect superblock
for the repair. Tests 172, 173 and 174 use less than the whole disk,
so there are going to be stale superblocks all over the place....

hm, no zone name, length of 0x22222274?

I already provided a metadump image to Barry, but I wonder why the
timing(?) seems to make a difference here... first sign of things going
awry in repair is:

Phase 2 - using internal log
        - zero log...
        - scan filesystem freespace and inode maps...
bad length 131072 for agf 0, should be 4096
bad length # 131072 for agi 0, should be 4096

Yes - test 173 uses 1GB filesystem with 64x16MB AGs - 4096 * 4k block
size = 16MB AG. definitely looks like a stale superblock being
found.

Barry, I think that the secondary superblock needs better verification
(e.g. that there really are AG headers where the sb says there
are supposed to be and all the lengths match up).

Eric - you can relax. Filestreams is not hosing your filesystem; xfs_reapir
is....

Test 178 is designed to test mkfs.xfs in
http://oss.sgi.com/archives/xfs/2007-07/msg00139.html and
will still make xfs_repair go bananas if there is other
old AG headers.

So, before running this test, you should make sure your test
partitions are completely zeroed from mkfs's that occurred
before that recent version of mkfs.xfs was installed.

I tried on my test box and sure enough, xfs_repair barfed.
After zeroing the devices, 172, 174 & 178 sequence succeeded.

If you have failures after the zeroing and ONLY using the
latest mkfs.xfs then something else is wrong. Also,
xfs_copy/xfs_mdrestore of different images could still
trigger the problem.

There is a TODO to improve xfs_repair's handling of this
scenario.

Barry.


<Prev in Thread] Current Thread [Next in Thread>