-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
It appears that the machine doesn't fully lock up. Even though the cp
processes never finish and all HDD activity stops, I can still ping the
machine; however, neither the console nor ssh responds.
I can get into kdb though. I'm not a kernel hacker (otherwise I would have
fixed this) so I'm unsure what to do to make any sense out of it.
Once the machine stops responding I enter kdb and it tells me which pid I
have interrupted. The results of the 2 attempts follow (shorthand for the
moment - if you need the real output just tell me):
The only things running both times were kupdated and a couple of cp
processes.
btp (1st go):
shrink_cache (kernel)
try_to_free_pages (kernel)
balance_classzone (kernel)
__alloc_pages (kernel)
[xfs] linvfs_lookup (xfs)
link_path_walk (kernel)
open_namei (kernel)
dentry_open (kernel)
btp (2nd go):
try_to_release_page (kernel)
shrink_cache (kernel)
try_to_free_pages (kernel)
balance_classzone (kernel)
__alloc_pages (kernel)
[xfs] linvfs_write (xfs)
sys_write (kernel)
system_call (kernel)
To me at least it seems to be an interaction problem between XFS and kupdated.
Does anyone have any ideas of where to go from here in my quest to sort this
out? Any help would be appreciated.
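
For anyone who wants to try to reproduce this, here is a minimal sketch of the
kind of concurrent copy test described in the quoted message below. It is an
illustration only, not the script actually used: the function names, file
counts, file sizes and number of simulated users are all assumptions, and it
uses Python threads with shutil.copytree rather than separate cp processes.

```python
import shutil
import tempfile
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def make_source_tree(root, n_files=50, size=4096):
    """Create n_files files of `size` bytes under root (made-up test data)."""
    root.mkdir(parents=True, exist_ok=True)
    for i in range(n_files):
        (root / f"file_{i:04d}.dat").write_bytes(b"x" * size)


def run_copy_test(src, dst_root, n_users=8):
    """Copy src into n_users separate destinations at the same time.

    Each worker stands in for one simulated user. Returns the list of
    destination directories.
    """
    dst_root.mkdir(parents=True, exist_ok=True)
    targets = [dst_root / f"user_{u:02d}" for u in range(n_users)]
    with ThreadPoolExecutor(max_workers=n_users) as pool:
        # list() waits for every copy and re-raises any worker exception.
        list(pool.map(lambda target: shutil.copytree(src, target), targets))
    return targets


def count_files(path):
    """Count regular files below path."""
    return sum(1 for p in path.rglob("*") if p.is_file())


if __name__ == "__main__":
    # A temporary directory is used here; to exercise a real volume,
    # pass directories on that volume instead (an assumption of this sketch).
    with tempfile.TemporaryDirectory() as tmp:
        base = Path(tmp)
        make_source_tree(base / "src")
        copies = run_copy_test(base / "src", base / "copies")
        for d in copies:
            print(d.name, count_files(d))
```

Pointing the source and destination at directories on the filesystem under
test, and raising the file count and size well beyond available RAM, would be
closer to the load described; the defaults above are deliberately tiny.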
- --
Adrian Head
(Public Key available on request.)
On Mon, 17 Dec 2001 10:21, you wrote:
> I am again in the process of building a couple of file servers for various
> purposes over the last week and thought that I'd have another serious look
> at using XFS on the new file servers.
>
> Base system is Redhat 7.1 with a custom kernel - 2.4.16+xfs.
>
> My standard process before putting servers into production is to run a few
> tests to make sure that I can trust the hardware/software combination. One
> of my standard tests is to simulate many users simultaneously copying many
> files across the filesystem. The volume is a software raid5 over 4 IDE
> drives.
>
> It is during this test that the machine hangs after getting almost 95%
> complete. I have tried running this test using XFS, ext2, ext3, reiserfs
> and only XFS fails to complete. This situation is completely reproducible
> every time I have run this test to date.
>
> I'm at a loss as to what to do next to troubleshoot this problem or even
> what info people need.
>
>
> Attached is some info:
- --
Adrian Head
(Public Key available on request.)
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.0.6 (GNU/Linux)
Comment: For info see http://www.gnupg.org
iD8DBQE8HT3w8ZJI8OvSkAcRAuPaAJ9eVZlbmjg+nGxXLjbjeYuL8JuIuwCfQELV
jFlQjLbn/F6Q/qYNePj/4u4=
=IzNB
-----END PGP SIGNATURE-----