xfs
[Top] [All Lists]

Re: linux software RAID, 2.6.6, XFS, Postgres: corrupt files

To: Ian Westmacott <ianw@xxxxxxxxxxxxxx>
Subject: Re: linux software RAID, 2.6.6, XFS, Postgres: corrupt files
From: Nathan Scott <nathans@xxxxxxx>
Date: Thu, 14 Apr 2005 15:24:47 +1000
Cc: James Foris <jforis@xxxxxxxxx>, linux-xfs@xxxxxxxxxxx
In-reply-to: <000401c540ad$d865d780$76d81e42@hsd1.ma.comcast.net>
References: <425DD4CD.1090108@wi.rr.com> <000401c540ad$d865d780$76d81e42@hsd1.ma.comcast.net>
Sender: linux-xfs-bounce@xxxxxxxxxxx
User-agent: Mutt/1.5.3i
On Thu, Apr 14, 2005 at 12:53:05AM -0400, Ian Westmacott wrote:
> Well, I can provide a bit more information.
> 
> -- We have a number of these hardware systems.  As I said, it is very
>    easy to reproduce, at some of them.  As it goes, it is at our Beta
>    sites where it is easy to reproduce, and in our lab where it is
>    tough to reproduce.  We are looking into why.

Sounds familiar. :(

> -- I was unable to try the sunit=0 & swidth=0 experiment: no matter
>    what parameters I give to mkfs.xfs (sunit, swidth, su, sw, various
>    args), or what options I use in mount, the filesystem is always
>    created/mounted with the geometry read from the RAID.  (perhaps this
>    is a known issue)

You need a very recent mkfs (cvs), which has a noalign (-d suboption)
command line argument now.

> -- we are currently verifying a workaround:  we added a pseudo-service
>    during shutdown that does
> 
>    dd if=/dev/zero of=/xfs_filesystem/junk bs=64k count=8k
> 
>    (and removes junk on startup).  On a system where this was
>    nearly 100% repeatable, we have now gone though 10 reboot cycles
>    without a problem (tests continue -- tough at a beta site).
> 
> -- The problem remains unchanged if Linux Software RAID is removed
>    from the equation.  I stopped the RAID, formatted one of the disks
>    as XFS (installed Postgres, etc.), and got the corruption on the
>    first reboot.

If you can, try to come up with a step-by-step test case so that
we can reproduce your situation locally - thats a huge help.

thanks!

-- 
Nathan


<Prev in Thread] Current Thread [Next in Thread>