> > >on a particular host (Linux 2.4.6-prexy-xfs, RH 7.1 + LVM + XFS, Amanda
> > >2.4.2p2) sometimes sendsize fails to estimate sizes with xfsdump.
>
> I recognized kernel oopses in syslog when Amanda tried to backup this
> host (maxdumps=3):
[...]
> Ok, I'll upgrade the kernel from 2.4.6-pre5-xfs to 2.4.7-xfs before any
> further investigation...
Boy, that was a story...
- Upgraded to Kernel 2.4.7-xfs and rebooted.
- Mounting of /var failed with OOPS (sorry, no copy)
- Rebooted to single user mode
- "xfs_repair /dev/vg1/var" ran forever (> 30 min) with no output.
Instead, each RAID array on that Compaq ML 530 containing 13 hard disks
was read sequentially (that's what the LED's suggested).
- xfs_repair of other XFS filesystems showed that same behaviour
- no interruption possible, had to power off to reboot
- Commented out the XFS filesystems in /etc/fstab avoiding them to be
mounted by RedHat's init scripts even in single user mode - / was still
on ext2fs
- Reboot (i.e., power off)
- xfs_repair for the /var filesystem produced a very ugly output (sorry,
no copy)
- xfs_repair of other XFS filesystems was ok
- After editing /etc/fstab again I rebooted and went back to normal
operation
Conclusions:
- Under rare circumstances it is possible to crash the kernel by trying
to mount a damaged XFS filesystem with no chance to further xfs_repair
with that kernel
Many thanks to John R. Jackson for his help how to drive Amanda's
sendsize by hand.
|