Received: with ECARTIS (v1.0.0; list xfs); Wed, 13 Aug 2008 15:04:58 -0700 (PDT) X-Spam-Checker-Version: SpamAssassin 3.3.0-r574664 (2007-09-11) on oss.sgi.com X-Spam-Level: X-Spam-Status: No, score=-2.0 required=5.0 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.3.0-r574664 Received: from cuda.sgi.com ([192.48.176.15]) by oss.sgi.com (8.12.11.20060308/8.12.11/SuSE Linux 0.7) with ESMTP id m7DM4P4d031023 for ; Wed, 13 Aug 2008 15:04:27 -0700 X-ASG-Debug-ID: 1218665141-733d00d50000-NocioJ X-Barracuda-URL: http://cuda.sgi.com:80/cgi-bin/mark.cgi Received: from ishtar.tlinx.org (localhost [127.0.0.1]) by cuda.sgi.com (Spam Firewall) with ESMTP id 57E8419D15B2 for ; Wed, 13 Aug 2008 15:05:42 -0700 (PDT) Received: from ishtar.tlinx.org (ishtar.tlinx.org [64.81.245.74]) by cuda.sgi.com with ESMTP id pj4PdFmDDd5hFayC for ; Wed, 13 Aug 2008 15:05:42 -0700 (PDT) Received: from [192.168.3.11] (Athena [192.168.3.11]) by ishtar.tlinx.org (8.14.1/8.12.10/SuSE Linux 0.7) with ESMTP id m7DM5Eux018105; Wed, 13 Aug 2008 15:05:15 -0700 Message-ID: <48A35A99.1080300@tlinx.org> Date: Wed, 13 Aug 2008 15:05:13 -0700 From: Linda Walsh User-Agent: Thunderbird 2.0.0.16 (Windows/20080708) MIME-Version: 1.0 To: Dave Chinner , xfs-oss CC: LKML , Eric Sandeen X-ASG-Orig-Subj: Re: XFS Lock debugging noise or real problem? Subject: Re: XFS Lock debugging noise or real problem? References: <48A093A7.40606@tlinx.org> <48A09CA9.9080705@sandeen.net> <48A0F686.2090700@tlinx.org> <48A0F9FC.1070805@sandeen.net> <48A20E9E.9090100@tlinx.org> <20080813005852.GW6119@disturbed> In-Reply-To: <20080813005852.GW6119@disturbed> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Barracuda-Connect: ishtar.tlinx.org[64.81.245.74] X-Barracuda-Start-Time: 1218665143 X-Barracuda-Bayes: INNOCENT GLOBAL 0.0022 1.0000 -2.0067 X-Barracuda-Virus-Scanned: by cuda.sgi.com at sgi.com X-Barracuda-Spam-Score: -2.01 X-Barracuda-Spam-Status: No, SCORE=-2.01 using per-user scores of TAG_LEVEL=2.0 QUARANTINE_LEVEL=1000.0 KILL_LEVEL=2.1 tests= X-Barracuda-Spam-Report: Code version 3.2, rules version 3.2.1.2577 Rule breakdown below pts rule name description ---- ---------------------- -------------------------------------------------- X-Virus-Scanned: ClamAV 0.91.2/6021/Wed Feb 27 15:55:48 2008 on oss.sgi.com X-Virus-Status: Clean X-archive-position: 17520 X-ecartis-version: Ecartis v1.0.0 Sender: xfs-bounce@oss.sgi.com Errors-to: xfs-bounce@oss.sgi.com X-original-sender: xfs@tlinx.org Precedence: bulk X-list: xfs Dave Chinner wrote: > Once again, > "a problem with the generic code inverting the normal lock order". > > This one cannot deadlock, though, because by definition > any inode on the unused list is, well, unused and hence we can't be > holding a reference to it... ---- This is great, maybe...but what do you mean by "generic"? Is this generic in the FS layer such that we'd see this with all FS types? Or is it generic "XFS" code that only happens with various, "application", (user) coded applications that use locking in a correct, but non-standard order? Some of these messages come out of utilities that I wouldn't think would be using kernel-locking, 'explicitly', at all (gnu-sort). If the generic code didn't invert lock orders from the "norm", could these errors be deleted? Is it code that all resides in the kernel and could be made consistent or is it also user-level (or glibc) apps that are using locks in strange, but correct ways? I'd *like* to keep lock provability 'on' -- but I don't want to waste people's time chasing after non-problems and so far I've seen at least 3 different locking sequences that all appear to be harmless. The problem with false positives is that it will either force the user to ignore (or turn off) the validation code, or generate periodic noise when these things arise... Isn't it generally considered pretty 'bad' to generate so many false positives -- or is lock-proving only for for "lock debugging" -- and not to be used except on development or test systems?