xfs
[Top] [All Lists]

Re: Data corruption with xfs+nfs+lvm

To: Nathan Scott <nathans@xxxxxxx>
Subject: Re: Data corruption with xfs+nfs+lvm
From: Russell Cattelan <cattelan@xxxxxxx>
Date: Thu, 29 Jan 2004 22:33:21 -0600
Cc: cattelan@xxxxxxx, linux-xfs@xxxxxxxxxxx, Craig Tierney <ctierney@xxxxxxxx>
In-reply-to: <20040130024343.GC1062@frodo>
References: <1075423747.3859.280.camel@hpti7.fsl.noaa.gov> <20040130024343.GC1062@frodo>
Sender: linux-xfs-bounce@xxxxxxxxxxx

On Jan 29, 2004, at 8:43 PM, Nathan Scott wrote:

On Thu, Jan 29, 2004 at 05:49:07PM -0700, Craig Tierney wrote:
I have just discovered that I am having problems with data corruption
on my NFS servers and XFS. It happens in several different cases, but
all under load. Here are the cases that I have gotten data corruption
for reads and writes. Corruption happens on different servers and
on different filesystems (some configured with LVM striping, some not).

Can you descibe your test case in more detail? In particular, do you have a program/programs that demonstrates the problem? That is always a huge help. Or a list of things to run - what sort of IO is being done, and what does "under load" mean in your context.

We tested the new linux-2.4.21 kernel on the dual P3.

"new" and "2.4.21" don't really go together. :)

The file writes are from single processes.  Some codes are MPI, but
all the IO, reads and writes, go through the rank 0 node.  We can
reproduce the corruption relatively easy when 16 processes are active.

Can you give me a recipe so that I can reproduce it locally? Does NFS have to be in the picture for this to fail? And is it reproducible without LVM too?

Russell, does this sound like that NFS corruption that you
were looking into awhile back?
Yes it does.
I still don't have any idea as to what is going wrong, I think somehow there is a race
someplace when a close happens and the pages beyond eof that get flushed.
http://oss.sgi.com/bugzilla/show_bug.cgi?id=198


comment out the xfs_refcache_purge_some and see if the corruption goes away.


cheers.

--
Nathan



<Prev in Thread] Current Thread [Next in Thread>