Received: (from majordomo@localhost) by oss.sgi.com (8.11.2/8.11.3) id g0V0FOv29038 for linux-xfs-outgoing; Wed, 30 Jan 2002 16:15:24 -0800 Received: from sgi.com (sgi-too.SGI.COM [204.94.211.39]) by oss.sgi.com (8.11.2/8.11.3) with SMTP id g0V0FHd29009 for ; Wed, 30 Jan 2002 16:15:17 -0800 Received: from zeus-e8.americas.sgi.com (zeus-e8.americas.sgi.com [128.162.8.103]) by sgi.com (980327.SGI.8.8.8-aspam/980304.SGI-aspam: SGI does not authorize the use of its proprietary systems or networks for unsolicited or bulk email from the Internet.) via ESMTP id PAA07835 for ; Wed, 30 Jan 2002 15:12:58 -0800 (PST) mail_from (lord@sgi.com) Received: from daisy-e185.americas.sgi.com (daisy-e185.americas.sgi.com [128.162.185.214]) by zeus-e8.americas.sgi.com (SGI-SGI-8.9.3/americas-smart-nospam1.1) with ESMTP id RAA14486; Wed, 30 Jan 2002 17:13:57 -0600 (CST) Received: from jen.americas.sgi.com (jen.americas.sgi.com [128.162.187.49]) by daisy-e185.americas.sgi.com (SGI-8.9.3/SGI-server-1.7) with ESMTP id RAA16054; Wed, 30 Jan 2002 17:13:57 -0600 (CST) Received: by jen.americas.sgi.com (8.11.6/SGI-client-1.7) id g0UNBnG27522; Wed, 30 Jan 2002 17:11:49 -0600 Subject: Re: How long should an xfs_freeze take? From: Steve Lord To: Chris Pascoe Cc: linux-xfs@oss.sgi.com In-Reply-To: <010601c1a9e0$f4210e20$47426682@csee.uq.edu.au> References: <023501c1a953$f059a0f0$47426682@csee.uq.edu.au> <3C57F97A.7000400@sgi.com> <010601c1a9e0$f4210e20$47426682@csee.uq.edu.au> Content-Type: text/plain Content-Transfer-Encoding: 7bit X-Mailer: Evolution/1.0.2 Date: 30 Jan 2002 17:11:49 -0600 Message-Id: <1012432309.26380.415.camel@jen.americas.sgi.com> Mime-Version: 1.0 Sender: owner-linux-xfs@oss.sgi.com Precedence: bulk Status: O Content-Length: 2050 Lines: 50 On Wed, 2002-01-30 at 16:51, Chris Pascoe wrote: > > How long freeze takes depends on how much data is dirty in the > > filesystem, it should take a similar time to an unmount. However, > > unless this was a very large and dirty filesystem this feels > > like a long time - although it was all system time, > > so something was going on. > > 6 hours is a long time for an unmount! The volume is ~70GB, and had no > activity on it before I tried to create the snapshots - the machine had just > been rebooted before some attempts. Whoa! it was early this morning, I missed a few digits when I read the elapsed time! Yes, that is a leetle on the long side. There is a complex loop in the xfs_syncsub function, not sure yet how it can get stuck though. Steve > > > The spot you kept seeing on the stack does not make much sense, vn_count > is > > basically an atomic_read. > > It just seems that's the place I've hit the break key the most. A closer > examination of my console log suggests it isn't stuck in vn_count, there are > a few occurances of it being somewhere else in xfs_iflush_all - so maybe > it's got itself tangled in a loop for a few hours? > > 0xe95c3b88 0xc01a1a63 xfs_iflush_all+0xc3 > (0xf6d1f000, 0x1, 0xf6d1f000, 0xc, 0xc039ce80) > 0xe95c3b88 0xc01a1ab9 xfs_iflush_all+0x119 > (0xf6d1f000, 0x1, 0xf6d1f000, 0xc, 0xc039ce80) > 0xe95c3b88 0xc01a1ac3 xfs_iflush_all+0x123 > (0xf6d1f000, 0x1, 0xf6d1f000, 0xc, 0xc039ce80) > > > How repeatable is this? > > The problem was persistent across reboots - I rebooted to install a > different LVM version at some stage (move to 1.0.2, from 1.0.1), and it > occurred five times in a row after that. I just rebooted now to say that it > still happens - but, alas - it doesn't want to any more. I'll try again at > some random times throughout the day. > > Chris -- Steve Lord voice: +1-651-683-3511 Principal Engineer, Filesystem Software email: lord@sgi.com