Bug 406 - gentoo-ppc64 : XFS dies under load
: gentoo-ppc64 : XFS dies under load
Status: RESOLVED FIXED
Product: XFS
Classification: Unclassified
Component: XFS kernel code
: Current
: All Linux
: blocker
: ---
Assigned To: XFS power people
:
:
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2005-04-09 10:58 CDT by Omkhar Arasaratnam
Modified: 2005-04-12 20:46 CDT (History)
0 users

See Also:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Omkhar Arasaratnam 2005-04-09 10:58:43 CDT
I'm assisting with the gentoo-ppc64 (testing on IBM POWER5) port. In testing 
XFS I've found that XFS dies under moderate load. I noticed a number of I/O 
errors during an rsync operatation. This prompted me to unmount and run 
xfs_check which reported a journal needed to be commited. I umounted/remounted 
and got a kernel panic. 
Other fs's (ext2/3,reiserfs, jfs) seem to work without issue. I'm a little 
stumped and would like some advice as to how to continue.
Comment 1 Nathan Scott 2005-04-10 16:18:44 CDT
Whats your kernel version there?

And what does "dies" mean in this context - kernel panic?  Or just the I/O
errors?  Can you give more details of these I/O errors too - are they being
reported by the kernel or by rsync?

| This prompted me to unmount and run xfs_check which reported a journal
| needed to be commited

That should never happen after a clean unmount... do you see the same issues
with a non-patched kernel.org kernel?  You using a non-patched gcc there?

thanks.
Comment 2 Omkhar Arasaratnam 2005-04-11 20:17:30 CDT
|Whats your kernel version there?

I've tried on 2.6.9+Gentoo patches. I will try 2.6.11.6 in a couple of days 
(more on this below)

|And what does "dies" mean in this context - kernel panic?  Or just the I/O
|errors?  Can you give more details of these I/O errors too - are they being
|reported by the kernel or by rsync?

both the kernel (dmesg) and rsync (stderr) reported I/O errors. No IMMEDIATE 
kernel panic, but the kernel did panic after I attempted to remount the fs. 
Each subsequent time the server would reboot it would hang when attempting to 
mount the xfs filesystem - but no panic

|| This prompted me to unmount and run xfs_check which reported a journal
|| needed to be commited

|That should never happen after a clean unmount... do you see the same issues
|with a non-patched kernel.org kernel?  You using a non-patched gcc there?

The gcc is patched (I'm not sure that I know of any distro that used vanilla 
gcc... That said here is my idea:

I'm in the middle of spinning up a couple of release discs over the next 
couple of days - once this is complete I will try reverting the filespace to 
the xfs system. Are there any special flags I should use with mkfs.xfs ?
Comment 3 Eric Sandeen 2005-04-11 21:08:56 CDT
can you paste in the "I/O" errors you're seeing?  Some actual description of
how xfs "died" would be a good place to start.  dmesg messages would be most
useful.

Thanks,

-Eric
Comment 4 Omkhar Arasaratnam 2005-04-12 12:44:49 CDT
I began testing again today, I started by zeroing the partition and using 
2.6.11.6 (instead of the 2.6.9-gentoo kernel). Things seem to be working well 
thus far - will keep the bug open while I test with a couple of more kernels - 
thanks.

Comment 5 Omkhar Arasaratnam 2005-04-12 18:46:14 CDT
Latest Gentoo versioned kernel works perfectly after some intense testing - 
closing out (that was an easy one eh ?)

;-)