On Wed, Feb 13, 2008 at 11:51:51AM +0100, Christian Røsnes wrote:
> Over the past month I've been hit with two cases of "xfs_trans_cancel
> at line 1150"
> The two errors occurred on different raid sets. In both cases the
> error happened during
> rsync from a remote server to this server, and the local partition
> which reported
> the error was 99% full (as reported by df -k, see below for details).
>
> System: Dell 2850
> Mem: 4GB RAM
> OS: Debian 3 (32-bit)
> Kernel: 2.6.17.7 (custom compiled)
>
> I've been running this kernel since Aug 2006 without any of these
> problems, until a month ago.
>
> I've not used any of the previous kernel in the 2.6.17 series.
>
> /usr/src/linux-2.6.17.7# grep 4K .config
> # CONFIG_4KSTACKS is not set
>
>
> Are there any known XFS problems with this kernel version and nearly
> full partitions ?
Yes. Deadlocks that weren't properly fixed until 2.6.18 (partially
fixed in 2.6.17) and an accounting problem in the transaction code
that leads to the shutdown you are seeing. The accounting problem is
fixed by this commit:
http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=45c34141126a89da07197d5b89c04c6847f1171a
which I think went into 2.6.22.
Luckily, neither of these problems result in corruption.
> I'm thinking about upgrading the kernel to a newer version, to see if
> it fixes this problem.
> Are there any known XFS problems with version 2.6.24.2 ?
Yes - a problem with readdir. The fix is currently in the stable
queue (i.e for 2.6.24.3):
http://git.kernel.org/?p=linux/kernel/git/stable/stable-queue.git;a=commit;h=ee864b866419890b019352412c7bc9634d96f61b
So we are just waiting for Greg to release 2.6.24.3 now.
Cheers,
Dave.
--
Dave Chinner
Principal Engineer
SGI Australian Software Group
|