X-Spam-Checker-Version: SpamAssassin 3.4.0-r929098 (2010-03-30) on oss.sgi.com X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00 autolearn=ham version=3.4.0-r929098 Received: from cuda.sgi.com (cuda1.sgi.com [192.48.157.11]) by oss.sgi.com (8.14.3/8.14.3/SuSE Linux 0.8) with ESMTP id p4341Hdb258218 for ; Mon, 2 May 2011 23:01:17 -0500 X-ASG-Debug-ID: 1304395492-503501960000-NocioJ X-Barracuda-URL: http://cuda.sgi.com:80/cgi-bin/mark.cgi Received: from trent.utfs.org (localhost [127.0.0.1]) by cuda.sgi.com (Spam Firewall) with ESMTP id 95E7C11C6221 for ; Mon, 2 May 2011 21:04:53 -0700 (PDT) Received: from trent.utfs.org (trent.utfs.org [194.246.123.103]) by cuda.sgi.com with ESMTP id oMgu2lNO4gvFFolF for ; Mon, 02 May 2011 21:04:53 -0700 (PDT) Received: by trent.utfs.org (Postfix, from userid 8) id 85A403DDC9; Tue, 3 May 2011 06:04:52 +0200 (CEST) Received: from trent.utfs.org (localhost [127.0.0.1]) by trent.utfs.org (Postfix) with ESMTP id A65393DD45; Tue, 3 May 2011 06:04:40 +0200 (CEST) Received: from localhost (localhost [127.0.0.1]) by trent.utfs.org (Postfix) with ESMTP id 7F74D3DBCA; Tue, 3 May 2011 06:04:40 +0200 (CEST) Date: Mon, 2 May 2011 21:04:40 -0700 (PDT) From: Christian Kujau To: Dave Chinner cc: Markus Trippelsdorf , LKML , xfs@oss.sgi.com, minchan.kim@gmail.com X-ASG-Orig-Subj: Re: 2.6.39-rc4+: oom-killer busy killing tasks Subject: Re: 2.6.39-rc4+: oom-killer busy killing tasks In-Reply-To: <20110503005114.GE2978@dastard> Message-ID: References: <20110427102824.GI12436@dastard> <20110428233751.GR12436@dastard> <20110429201701.GA13166@x4.trippels.de> <20110501080149.GD13542@dastard> <20110502121958.GA2978@dastard> <20110503005114.GE2978@dastard> User-Agent: Alpine 2.01 (DEB 1266 2009-07-14) MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII X-AV-Checked: ClamAV using ClamSMTP (127.0.0.1) X-Barracuda-Connect: trent.utfs.org[194.246.123.103] X-Barracuda-Start-Time: 1304395493 X-Barracuda-Bayes: INNOCENT GLOBAL 0.0000 1.0000 -2.0210 X-Barracuda-Virus-Scanned: by cuda.sgi.com at sgi.com X-Barracuda-Spam-Score: -2.02 X-Barracuda-Spam-Status: No, SCORE=-2.02 using per-user scores of TAG_LEVEL=2.0 QUARANTINE_LEVEL=1000.0 KILL_LEVEL=2.1 tests= X-Barracuda-Spam-Report: Code version 3.2, rules version 3.2.2.62618 Rule breakdown below pts rule name description ---- ---------------------- -------------------------------------------------- X-Virus-Scanned: ClamAV version 0.94.2, clamav-milter version 0.94.2 on oss.sgi.com X-Virus-Status: Clean On Tue, 3 May 2011 at 10:51, Dave Chinner wrote: > Can you run an event trace of all the XFS events during a find for > me? Don't do it over the entire subset of the filesystem - only You mean "event tracing", as in Documentation/trace/events.txt. For that I will have to enable CONFIG_FTRACE and CONFIG_FUNCTION_TRACER and probably others, right? Looking at http://lwn.net/Articles/341899, I see CONFIG_EVENT_TRACING and the way to enable event tracing for "all events in fs/xfs" would be: echo 1 > /sys/kernel/debug/tracing/events/xfs/enable > 100,000 inodes is sufficient (i.e. kill the find once the xfs inode > cache slab reaches 100k inodes. While still running the event trace, > can you then drop the caches (echo 3 > /proc/sys/vm/drop_caches) and > check that the xfs inode cache is emptied? If it isn't emptied, drop > caches again to see if that empties it. If you coul dthen post the > event trace, I might be able to see what is going strange with the > shrinker and/or reclaim. Will try to do all that. I wonder why nobody else is affected by this. Because nobody else runs powerpc or UP any more? I'm sure other people's filesystems are way bigger than mine, with much more inodes to cache... Thanks for your time, Christian. -- BOFH excuse #136: Daemons loose in system.