On Thu, Apr 14, 2005 at 12:53:05AM -0400, Ian Westmacott wrote:
> Well, I can provide a bit more information.
>
> -- We have a number of these hardware systems. As I said, it is very
> easy to reproduce, at some of them. As it goes, it is at our Beta
> sites where it is easy to reproduce, and in our lab where it is
> tough to reproduce. We are looking into why.
Sounds familiar. :(
> -- I was unable to try the sunit=0 & swidth=0 experiment: no matter
> what parameters I give to mkfs.xfs (sunit, swidth, su, sw, various
> args), or what options I use in mount, the filesystem is always
> created/mounted with the geometry read from the RAID. (perhaps this
> is a known issue)
You need a very recent mkfs (cvs), which has a noalign (-d suboption)
command line argument now.
> -- we are currently verifying a workaround: we added a pseudo-service
> during shutdown that does
>
> dd if=/dev/zero of=/xfs_filesystem/junk bs=64k count=8k
>
> (and removes junk on startup). On a system where this was
> nearly 100% repeatable, we have now gone though 10 reboot cycles
> without a problem (tests continue -- tough at a beta site).
>
> -- The problem remains unchanged if Linux Software RAID is removed
> from the equation. I stopped the RAID, formatted one of the disks
> as XFS (installed Postgres, etc.), and got the corruption on the
> first reboot.
If you can, try to come up with a step-by-step test case so that
we can reproduce your situation locally - thats a huge help.
thanks!
--
Nathan
|