Hi!
We seen in-memory corruption on two XFS filesystem on a server heartbeat
cluster of one of our customers:
XFS internal error XFS_WANT_CORRUPTED_GOTO at line 1563 of file
fs/xfs/xfs_alloc.c. Caller 0xffffffff8824eb5d
Call Trace:
[<ffffffff8824cff3>] :xfs:xfs_free_ag_extent+0x1a6/0x6b5
[<ffffffff8824eb5d>] :xfs:xfs_free_extent+0xa9/0xc9
[<ffffffff88258636>] :xfs:xfs_bmap_finish+0xf0/0x169
[<ffffffff88278b4c>] :xfs:xfs_itruncate_finish+0x180/0x2c1
[<ffffffff8829071a>] :xfs:xfs_setattr+0x841/0xe59
[<ffffffff8022e868>] sock_common_recvmsg+0x30/0x45
[<ffffffff8829adc8>] :xfs:xfs_vn_setattr+0x121/0x144
[<ffffffff8022a06d>] notify_change+0x156/0x2ef
[<ffffffff883bf9c6>] :nfsd:nfsd_setattr+0x334/0x4b1
[<ffffffff883c61d6>] :nfsd:nfsd3_proc_setattr+0xa2/0xae
[<ffffffff883bb24d>] :nfsd:nfsd_dispatch+0xdd/0x19e
[<ffffffff8833a10e>] :sunrpc:svc_process+0x3cb/0x6d9
[<ffffffff8025b20b>] __down_read+0x12/0x9a
[<ffffffff883bb816>] :nfsd:nfsd+0x192/0x2b0
[<ffffffff80255f38>] child_rip+0xa/0x12
[<ffffffff883bb684>] :nfsd:nfsd+0x0/0x2b0
[<ffffffff80255f2e>] child_rip+0x0/0x12
xfs_force_shutdown(dm-1,0x8) called from line 4261 of file fs/xfs/xfs_bmap.c.
Return address = 0xffffffff88258673
Filesystem "dm-1": Corruption of in-memory data detected. Shutting down
filesystem: dm-1
Please umount the filesystem, and rectify the problem(s)
on
Linux version 2.6.21-1-amd64 (Debian 2.6.21-4~bpo.1) (nobse@xxxxxxxxxxxxx)
(gcc version 4.1.2 20061115 (prerelease) (Debian 4.1.1-21)) #1 SMP Tue Jun 5
07:43:32 UTC 2007
We plan to do a takeover so that the server which appears to have memory
errors can be memtested.
After the takeover we would like to make sure that the XFS filesystems are
intact. Is it possible to do so without taking the filesystem completely
offline?
I thought about mounting read only and it might be the best choice available,
but then it will *fail* write accesses. I would prefer if these are just
stalled.
I tried xfs_freeze -f on my laptop home directory, but then did not machine to
get it check via xfs_check or xfs_repair -nd... is it possible at all?
Ciao,
--
Martin Steigerwald - team(ix) GmbH - http://www.teamix.de
gpg: 19E3 8D42 896F D004 08AC A0CA 1E10 C593 0399 AE90
|