[Top] [All Lists]

Re: Important regression with XFS update for 2.6.24-rc6

To: David Chinner <dgc@xxxxxxx>
Subject: Re: Important regression with XFS update for 2.6.24-rc6
From: Damien Wyart <damien.wyart@xxxxxxx>
Date: Tue, 18 Dec 2007 15:30:31 +0100
Cc: Christoph Hellwig <hch@xxxxxxxxxxxxx>, Lachlan McIlroy <lachlan@xxxxxxx>, Peter Leckie <pleckie@xxxxxxx>, Linus Torvalds <torvalds@xxxxxxxxxxxxxxxxxxxx>, linux-xfs@xxxxxxxxxxx, LKML <linux-kernel@xxxxxxxxxxxxxxx>
In-reply-to: <20071218122445.GJ4396912@xxxxxxx> (David Chinner's message of "Tue, 18 Dec 2007 23:24:45 +1100")
References: <20071218112804.GA3069@xxxxxxxxxxxxxxxxxxxxx> <20071218122445.GJ4396912@xxxxxxx>
Sender: xfs-bounce@xxxxxxxxxxx
User-agent: Gnus/5.110007 (No Gnus v0.7) Emacs/23.0.50
* David Chinner <dgc@xxxxxxx> [071218 13:24]:
> Ok. I haven't noticed anything wrong with directories up to about
> 250,000 files in the last few days. The ls -l I just did on
> a directory with 15000 entries (btree format) used about 5MB of RAM.
> extent format directories appear to work fine as well (tested 500
> entries).

Ok, nice to know the problem is not so frequent.

> Can you:

>       a) isolate the problem to one patch or the other. My guess
>          would be the directory mod, but.....

Yes, it is indeed the directory patch. But even if I still sometimes get
huge memory usage with ls (using the patched kernel), this is quite
rare, and the problem is now mainly getting entries in the listing
repeated, and the ls process taking longer than without the patch. But
this is mainly after booting. I guess the cache plays a role and even
using drop_caches, I can't reproduce the problem. Only on fresh reboot
do I get it systematically, but much less often the memory problem. And
as said earlier, after fresh boot on rc5-git5 without the directory
patch, the ls -l goes normal (no repeated entries).

>       b) show your working ;)

Sorry, I forgot this part in my initial report.

>               - what platform (i386, x86_64, etc)


>               - what debug options

Nothing special, the kernel has 4K stacks, and xfs partitions are
mounted with noatime,nodiratime.

>               - commands and output that shows the problem

It is mainly "ls -l" in a quite crowded directory.

>               - strace of ls -l going bad
>               - xfs_info from filesystem in question

I have put the files at http://damien.wyart.free.fr/xfs/

strace_xfs_problem.1.gz and strace_xfs_problem.2.gz have been created
with the problematic kernel, and are quite bigger than
strace_xfs_problem.normal.gz, which has been created with the vanilla
rc5-git5. There is also xfs_info.

I can provide further details if needed (maybe kernel config, but
nothing special on the xfs side), but I confirm the behavior is
different with and without the directory patch
(041388b54ed95cd169546bd83bacd08ee32bd7ea on oss.sgi), and doesn't look
normal with the patch.

Damien Wyart

<Prev in Thread] Current Thread [Next in Thread>