<div dir="ltr">Hey, just sharing some hard-won (believe me) professional experience. I have seen xfs_repair take a bad situation and make it worse many times. I don't know that a filesystem fuzzer or any other simulation can ever provide true simulation of users absolutely pounding the tar out of a system. There seems to be a real disconnect between what developers are able to test and observe directly, and what happens in the production environment in a very high-throughput environment.<div><br></div><div><div>Best,</div><div><br></div><div>Sean</div><div><br></div></div></div><div class="gmail_extra"><br><div class="gmail_quote">On Tue, Sep 9, 2014 at 6:24 PM, Eric Sandeen <span dir="ltr"><<a href="mailto:sandeen@sandeen.net" target="_blank">sandeen@sandeen.net</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><span class="">On 9/9/14 11:03 AM, Sean Caron wrote:<br>
<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Barring rare cases, xfs_repair is bad juju.<br>
</blockquote>
<br></span>
No, it's not. It is the appropriate tool to use for filesystem repair.<br>
<br>
But it is not the appropriate tool for recovery from mangled storage.<br>
<br>
I've actually been running a filesystem fuzzer over xfs images, randomly<br>
corrupting data and testing repair, 1000s of times over. It does<br>
remarkably well.<br>
<br>
If you scramble your raid, which means your block device is no longer<br>
an xfs filesystem, but is instead a random tangle of bits and pieces of<br>
other things, of course xfs_repair won't do well, but it's not the right<br>
tool for the job at that stage.<span class="HOEnZb"><font color="#888888"><br>
<br>
-Eric<br>
</font></span></blockquote></div><br></div>