Received: (from majordomo@localhost)
	by oss.sgi.com (8.11.2/8.11.3) id g0VNJvq12638
	for linux-xfs-outgoing; Thu, 31 Jan 2002 15:19:57 -0800
Received: from rj.sgi.com (rj.sgi.com [204.94.215.100])
	by oss.sgi.com (8.11.2/8.11.3) with SMTP id g0VNJod12613
	for ; Thu, 31 Jan 2002 15:19:50 -0800
Received: from zeus-e8.americas.sgi.com (zeus-e8.americas.sgi.com [128.162.8.103])
	by rj.sgi.com (8.11.4/8.11.4/linux-outbound_gateway-1.1) with ESMTP id g0VMJeY15453
	for ; Thu, 31 Jan 2002 14:19:40 -0800
Received: from daisy-e185.americas.sgi.com (daisy-e185.americas.sgi.com [128.162.185.214])
	by zeus-e8.americas.sgi.com (SGI-SGI-8.9.3/americas-smart-nospam1.1) with ESMTP id QAA29362;
	Thu, 31 Jan 2002 16:18:24 -0600 (CST)
Received: from jen.americas.sgi.com (jen.americas.sgi.com [128.162.187.49])
	by daisy-e185.americas.sgi.com (SGI-8.9.3/SGI-server-1.7) with ESMTP id QAA44982;
	Thu, 31 Jan 2002 16:18:24 -0600 (CST)
Received: by jen.americas.sgi.com (8.11.6/SGI-client-1.7) id g0VMG6r13622;
	Thu, 31 Jan 2002 16:16:06 -0600
Subject: Re: nfsd lockups with xfs during SPEC SFS testing
From: Steve Lord
To: "HABBINGA,ERIK ""(HP-Loveland,ex1)"
Cc: "'linux-xfs@oss.sgi.com'"
In-Reply-To:
References:
Content-Type: text/plain
Content-Transfer-Encoding: 7bit
X-Mailer: Evolution/1.0.2
Date: 31 Jan 2002 16:16:05 -0600
Message-Id: <1012515365.26363.274.camel@jen.americas.sgi.com>
Mime-Version: 1.0
Sender: owner-linux-xfs@oss.sgi.com
Precedence: bulk
Status: O
Content-Length: 1159
Lines: 31

On Thu, 2002-01-31 at 11:59, HABBINGA,ERIK (HP-Loveland,ex1) wrote:
> I'm running linux 2.4.17 with a version of XFS downloaded via CVS on Jan
> 30th. When I run the SPEC SFS NFS test against this kernel, nfsd stops
> responding after awhile. I captured the state of all of the system
> processes via magic sysrq, and found 24 nfsd processes locked up in various
> stages of the nfsd_lookup code:
>
> - 20 of them were locked up in the fh_lock call before lookup_one_len in
> nfsd_lookup().
> - 2 processes were locked up in the _pagebuf_grab_lock call inside
> _pagebuf_find_lockable_buffer().
> - 2 processes were locked up in the pagebuf_iowait() call in
> pagebuf_iostart()
>
> Any ideas on what may be wrong, and how I can help debug and solve this
> problem? I've attached the call traces for the locked up nfsd processes. I
> can provide vmlinux and System.map for this kernel to help debugging.

Is this a regression, i.e. did it used to work? And can you say when
on the 30th?

Thanks

  Steve

--

Steve Lord                                      voice: +1-651-683-3511
Principal Engineer, Filesystem Software         email: lord@sgi.com