xfs
[Top] [All Lists]

Re: Advice needed with file system corruption

To: Emmanuel Florac <eflorac@xxxxxxxxxxxxxx>, Steve Brooks <sjb14@xxxxxxxxxxxxxxxx>
Subject: Re: Advice needed with file system corruption
From: Steve Brooks <sjb14@xxxxxxxxxxxxxxxx>
Date: Mon, 8 Aug 2016 17:16:05 +0100
Cc: xfs@xxxxxxxxxxx
Delivered-to: xfs@xxxxxxxxxxx
In-reply-to: <20160808161132.1d76eb5c@xxxxxxxxxxxxxxxxxxxx>
References: <5787852A.7030900@xxxxxxxxxxxxxxxx> <20160808161132.1d76eb5c@xxxxxxxxxxxxxxxxxxxx>
User-agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:45.0) Gecko/20100101 Thunderbird/45.2.0
Hi,

I chose the words "rebuilding a replaced disk" deliberately as I removed a disk that (according to adaptec's software) had some "media errors" even though the SMART attributes showed there were no "pending sectors" or "reallocated sectors", in fact all the SMART attributes were clean. As I was also using "RAID 6" I did not expect any issues leaving the filesystem online while rebuilding. Previous to this the RAID had been running live 24/7 for 0ver three years.

Steve

 On 08/08/2016 15:11, Emmanuel Florac wrote:
Le Thu, 14 Jul 2016 13:27:22 +0100
Steve Brooks <sjb14@xxxxxxxxxxxxxxxx> Ãcrivait:

We have a RAID system with file system issues as follows,

50 TB in RAID 6 hosted on an Adaptec 71605 controller using
WD4000FYYZ drives.

Centos 6.7  2.6.32-642.el6.x86_64   :   xfsprogs-3.1.1-16.el6

While rebuilding a replaced disk, with the file system online and in
use, the system logs showed multiple entries of;

XFS (sde): Corruption detected. Unmount and run xfs_repair.

Late to the game, I just wanted to remark that I've unfortunately
verified many times that write activity during rebuilds on Adaptec RAID
controllers often creates corruption. I've reported that to Adaptec,
but they don't seem to care much...


<Prev in Thread] Current Thread [Next in Thread>