xfs_force_shutdown after Raid crash

Michael Monnerie michael.monnerie at is.it-management.at
Wed Feb 4 10:18:15 CST 2009


On Mittwoch 04 Februar 2009 Ralf Liebenow wrote:
> Should Battery backed RAID controllers not always set their discs
> cache off ?
>
> As I see it (in case of a power failure):
>   - the discs are connectet to the main power, so if there is a power
>     failure they're offline at that moment in time and their (write)
> cache will be gone in that instance of time too

Normally a server is on a UPS, and that should report when there's a 
power outage so the server has enough time to gracefully shut down. 
Still, there can be other events such as:
- power supply error. Even with redundant PS, an outage can exist
- human error (coffee into the server, someone unplugging the cable 
between UPS and server,...)
- and of course mainboard/cpu/ram total crashes
so you are basically never safe.

>   - if a RAID controller does not turn off the disks write cache, the
> controller cannot know if previous writes have made it to the disk.

The controller could keep in-transfer blocks in it's cache, waiting for 
a confirm from the disk that the blocks are on the media, and only 
afterwards remove it from cache. I don't know if controllers do that 
actually. I'll ask Areca support on that.

> good RAID Controller would also use its cache to re-organise the disc
> writes to minimize seek times doing somthing like intelligent command
> queuing. This would also mean, that any order of writes to a disk
> could have been changed by the controller. This would ultimately
> break any filesystem which does not explicitly fsyncing consistent
> checkpoints to the disk, which would make battery backed RAID Systems
> pretty useless ... would it ?
>
>     So .. a battery backed RAID controller should default to "no disk
> write cache" should it ? Otherwise why should anyone want to use such
> expensive controllers ... it just does not make sense to have a
> battery backed cache on the controller, when things get inconsistent
> at a power outage ... It wouldn't have any purpuse ... I hope
> developers of battery backed RAID controllers are aware of that
> implication ...

Yes, imagine you have a RAID with 8 hard disks each having 32MB cache... 
up to 256MB data lost, with a very big chance of having filesystem 
metadata in cache, as that's written very often...
I'll be back on that once I have an official answer from Areca.

mfg zmi
-- 
// Michael Monnerie, Ing.BSc    -----      http://it-management.at
// Tel: 0660 / 415 65 31                      .network.your.ideas.
// PGP Key:         "curl -s http://zmi.at/zmi.asc | gpg --import"
// Fingerprint: AC19 F9D5 36ED CD8A EF38  500E CE14 91F7 1C12 09B4
// Keyserver: wwwkeys.eu.pgp.net                  Key-ID: 1C1209B4

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 197 bytes
Desc: This is a digitally signed message part.
URL: <http://oss.sgi.com/pipermail/xfs/attachments/20090204/73c63245/attachment.sig>


More information about the xfs mailing list