XFS-filesystem corrupted by defragmentation

Robert Brockway robert at timetraveller.org
Tue Apr 13 09:58:22 CDT 2010


On Tue, 13 Apr 2010, Bernhard Gschaider wrote:

>> xfs_db -r /dev/mapper/VolGroup00-LogVol04
> xfs_db: unexpected XFS SB magic number 0x00000000
> xfs_db: read failed: Invalid argument
> xfs_db: data size check failed
> cache_node_purge: refcount was 1, not zero (node=0x2a25c20)
> xfs_db: cannot read root inode (22)

Hi Bernhard.  Hmm that doesn't sound good.

> The file-system is still mounted and working and I don't dare to do
> anything about it (am in a mild state of panic) because I think it
> might not come back if I do.

I think your choice to sit back and evaluate your options before acting is 
a wise one, especially since the filesystem is apparently mounted and 
functioning.

Depending on how worried you are there are various options available.   Eg 
you could declare an emergency on the server and use xfs_freeze to freeze 
the filesystem while you take a backup.  Note - I have never used 
xfs_freeze like this, it is just a suggestion.  Naturally this will cause 
an outage and problems for users.

Alternatively you could use xfsdump to capture an incremental or full 
backup on the running system. (depending on whether you already have a 
level 0 xfs dump file or not).  The developers have confirmed (on this 
list) that xfsdump will provide a consistent backup on a live filesystem.

Please note that any heavy I/O (like a backup) has the potential to cause 
problems on a sick filesystem.  In my experience xfs is inclined to 
automatically remount read-only if it detects problems.  While this can be 
catastrophic for running processes it is helpful in protecting data so I'm 
happy it works this way.

One last note.  I hope you have good backups already.  If you don't then 
this is the time to start taking good backups.

These are the notes from my backup talk:

http://www.timetraveller.org/talks/backup_talk.pdf

> I swear to god: I did not do anything else with the xfs_*-commands
> than the stuff mentioned above

I defrag XFS filesystems from cron as recommended by SGI and I've never 
had a problem.  Maybe defragmentation didn't cause the problem - maybe it 
just revealed an underlying problem.

Cheers,

Rob

-- 
Email: robert at timetraveller.org
IRC: Solver
Web: http://www.practicalsysadmin.com
Open Source: The revolution that silently changed the world




More information about the xfs mailing list