Re: XFS filesystem corruption

To: Ric Wheeler <rwheeler@xxxxxxxxxx>
Subject: Re: XFS filesystem corruption
From: Stan Hoeppner <stan@xxxxxxxxxxxxxxxxx>
Date: Fri, 08 Mar 2013 02:39:18 -0600
Cc: Emmanuel Florac <eflorac@xxxxxxxxxxxxxx>, Julien FERRERO <jferrero06@xxxxxxxxx>, xfs@xxxxxxxxxxx
Delivered-to: xfs@xxxxxxxxxxx
In-reply-to: <5137CD46.6070909@xxxxxxxxxx>
References: <CAPcwv6wZJSBtgF-L6KNSn6N6Y+wUZJFXdbcg+zYRwoaB2sDdjw@xxxxxxxxxxxxxx> <20130306161519.2c28d911@xxxxxxxxxxxxxx> <CAPcwv6wqv0b_CPqDpBfOwVDg23uBi=tpGQSy9XuH2uWS5oVMWQ@xxxxxxxxxxxxxx> <20130306232100.6286f640@xxxxxxxxxxxxxx> <5137CD46.6070909@xxxxxxxxxx>
Reply-to: stan@xxxxxxxxxxxxxxxxx
User-agent: Mozilla/5.0 (Windows NT 5.1; rv:17.0) Gecko/20130215 Thunderbird/17.0.3
On 3/6/2013 5:12 PM, Ric Wheeler wrote:

> We actually test brutal "Power off" for xfs, ext4 and other file
> systems. If your storage is configured properly and you have barriers
> enabled, they all pass without corruption.

Something that none of us mentioned WRT write barriers is that while the
filesystem structure may survive a power cut intact, file contents may
still be lost or corrupted under any or all of these conditions:

1.  unwritten data still in buffer cache
2.  drive caches are enabled
3.  BBWC not working properly

If the techs are determined to hard cut power because they don't have
the time or the knowledge to do a clean shutdown, it may be well worth
your time/effort to write a script and teach the field techs to execute
it, before flipping the master switch.  Your simple script would run as
root, or you'd need to do some sudo foo within, and would contain
something like:

#!/bin/sh
# sync writes dirty pages to disk; drop_caches only discards clean caches
sync
echo "Ready for power down."

This will flush pending writes in buffer cache to disk, and assumes of
course that drive caches are disabled, and/or that BBWC, if present, is
functioning properly.  It also assumes no applications are still
actively writing files, in which case you're screwed regardless.  It's
not a perfect solution and there's no guarantee you won't suffer file
corruption, but this greatly increases your odds against it.
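
If you want the techs to get a little feedback before they flip the
switch, a slightly more defensive sketch of the same idea follows.  It
assumes a Linux box with /proc/meminfo, and the helper name dirty_kb is
made up for illustration.  It runs sync and then reports how much data
the kernel still counts as dirty or under writeback.  A number that stays
well above zero would suggest something is still writing, which is the
"screwed regardless" case above.

```shell
#!/bin/sh
# dirty_kb (hypothetical helper): print the sum of the Dirty and Writeback
# counters, in kB, from a meminfo-style file. Defaults to /proc/meminfo.
dirty_kb() {
    f="${1:-/proc/meminfo}"
    # No readable meminfo file (e.g. not Linux): report 0 rather than fail.
    [ -r "$f" ] || { echo 0; return 0; }
    # The trailing colon keeps WritebackTmp out of the match.
    awk '/^(Dirty|Writeback):/ { sum += $2 } END { print sum + 0 }' "$f"
}

# Ask the kernel to write all dirty data out to stable storage.
sync

echo "Dirty+Writeback after sync: $(dirty_kb) kB"
echo "Ready for power down."
```

This still does nothing about drive caches or a broken BBWC, so the same
caveats apply.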

