Received: with ECARTIS (v1.0.0; list xfs); Fri, 22 Aug 2008 05:37:17 -0700 (PDT) X-Spam-Checker-Version: SpamAssassin 3.3.0-r574664 (2007-09-11) on oss.sgi.com X-Spam-Level: X-Spam-Status: No, score=-0.3 required=5.0 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.3.0-r574664 Received: from cuda.sgi.com ([192.48.176.15]) by oss.sgi.com (8.12.11.20060308/8.12.11/SuSE Linux 0.7) with ESMTP id m7MCbCSr024139 for ; Fri, 22 Aug 2008 05:37:13 -0700 X-ASG-Debug-ID: 1219408712-0c9402ce0000-NocioJ X-Barracuda-URL: http://cuda.sgi.com:80/cgi-bin/mark.cgi Received: from mail.ukfsn.org (localhost [127.0.0.1]) by cuda.sgi.com (Spam Firewall) with ESMTP id D7B0F1A2F112 for ; Fri, 22 Aug 2008 05:38:32 -0700 (PDT) Received: from mail.ukfsn.org (mail.ukfsn.org [77.75.108.10]) by cuda.sgi.com with ESMTP id 002HP3cxoyzNaOw5 for ; Fri, 22 Aug 2008 05:38:32 -0700 (PDT) Received: from localhost (smtp-filter.ukfsn.org [192.168.54.205]) by mail.ukfsn.org (Postfix) with ESMTP id 8325CDEF5A; Fri, 22 Aug 2008 13:38:34 +0100 (BST) Received: from mail.ukfsn.org ([192.168.54.25]) by localhost (smtp-filter.ukfsn.org [192.168.54.205]) (amavisd-new, port 10024) with ESMTP id hewcE-f8XFoA; Fri, 22 Aug 2008 12:02:04 +0100 (BST) Received: from elm.dgreaves.com (78-32-229-233.no-dns-yet.enta.net [78.32.229.233]) by mail.ukfsn.org (Postfix) with ESMTP id 53203DECF8; Fri, 22 Aug 2008 13:38:34 +0100 (BST) Received: from ash.dgreaves.com ([10.0.0.90]) by elm.dgreaves.com with esmtp (Exim 4.62) (envelope-from ) id 1KWVuR-00043b-JK; Fri, 22 Aug 2008 13:38:31 +0100 Message-ID: <48AEB347.2000200@dgreaves.com> Date: Fri, 22 Aug 2008 13:38:31 +0100 From: David Greaves User-Agent: Mozilla-Thunderbird 2.0.0.16 (X11/20080724) MIME-Version: 1.0 To: "Rafael J. Wysocki" CC: "'linux-kernel@vger.kernel.org'" , xfs@oss.sgi.com, linux-fsdevel@vger.kernel.org, Dave Chinner , Andrew Morton X-ASG-Orig-Subj: Re: Regression? 2.6.27-rc3 segfault on cold boot; not on warm boot. Subject: Re: Regression? 2.6.27-rc3 segfault on cold boot; not on warm boot. References: <48AD3921.5090709@dgreaves.com> <200808212026.17590.rjw@sisk.pl> In-Reply-To: <200808212026.17590.rjw@sisk.pl> X-Enigmail-Version: 0.95.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit X-Barracuda-Connect: mail.ukfsn.org[77.75.108.10] X-Barracuda-Start-Time: 1219408713 X-Barracuda-Bayes: INNOCENT GLOBAL 0.0000 1.0000 -2.0210 X-Barracuda-Virus-Scanned: by cuda.sgi.com at sgi.com X-Barracuda-Spam-Score: -2.02 X-Barracuda-Spam-Status: No, SCORE=-2.02 using per-user scores of TAG_LEVEL=2.0 QUARANTINE_LEVEL=1000.0 KILL_LEVEL=2.1 tests= X-Barracuda-Spam-Report: Code version 3.2, rules version 3.2.1.3389 Rule breakdown below pts rule name description ---- ---------------------- -------------------------------------------------- X-Virus-Scanned: ClamAV 0.91.2/8072/Fri Aug 22 02:20:03 2008 on oss.sgi.com X-Virus-Status: Clean X-archive-position: 17676 X-ecartis-version: Ecartis v1.0.0 Sender: xfs-bounce@oss.sgi.com Errors-to: xfs-bounce@oss.sgi.com X-original-sender: david@dgreaves.com Precedence: bulk X-list: xfs Rafael J. Wysocki wrote: > [Adding CCs] > > [The issue is probably present in 2.6.26 too] > > On Thursday, 21 of August 2008, David Greaves wrote: >> I have a desktop system that has started having problems booting up in the morning. >> >> It appears to just happen on more recent kernels. >> I was having unrelated CDROM problems with a driver in an old kernel and decided >> to test 2.6.27-rcX >> The CDROM problem is fine now. >> >> However I started having problems on -rc1. I found that the machine was hanging >> soon after booting and needed a reboot. After a reboot it would work fine for >> the rest of the day. >> When -rc3 came out I tried that and the problem still appears to be there. >> >> The normal process is now to boot to single-user, ctrl-alt-sysreq-SUB and then >> reboot to multi-user. This isn't ideal. >> >> >> If I cold boot 2.6.25.3 the problem doesn't occur. >> I will try different versions over the next few days. As promised, I tried 2.6.26.3 this morning and didn't have the problem. I will try again a few times to confirm. Nb Although the log had an XFS failure, I think the real issue is the segfaults; I think XFS is a casuality, not a cause. Th problem almost always results in an XFS crash at some point - but usually preceded by many segfaults in random binaries. David