Received: with ECARTIS (v1.0.0; list xfs); Sun, 23 Sep 2007 21:18:03 -0700 (PDT) X-Spam-Checker-Version: SpamAssassin 3.2.0-pre1-r499012 (2007-01-23) on oss.sgi.com X-Spam-Level: X-Spam-Status: No, score=-1.3 required=5.0 tests=AWL,BAYES_00 autolearn=ham version=3.2.0-pre1-r499012 Received: from larry.melbourne.sgi.com (larry.melbourne.sgi.com [134.14.52.130]) by oss.sgi.com (8.12.10/8.12.10/SuSE Linux 0.7) with SMTP id l8O4HtQ3004118 for ; Sun, 23 Sep 2007 21:17:57 -0700 Received: from pc-bnaujok.melbourne.sgi.com (pc-bnaujok.melbourne.sgi.com [134.14.55.58]) by larry.melbourne.sgi.com (950413.SGI.8.6.12/950213.SGI.AUTOCF) via ESMTP id OAA06991; Mon, 24 Sep 2007 14:17:53 +1000 To: "Eric Sandeen" , xfs-oss Subject: Re: something very strange w/ filestreams... From: "Barry Naujok" Organization: SGI Content-Type: text/plain; format=flowed; delsp=yes; charset=iso-8859-15 MIME-Version: 1.0 References: <46F49C80.60007@sandeen.net> Content-Transfer-Encoding: 7bit Date: Mon, 24 Sep 2007 14:22:17 +1000 Message-ID: In-Reply-To: <46F49C80.60007@sandeen.net> User-Agent: Opera Mail/9.10 (Win32) X-Virus-Scanned: ClamAV version 0.90, clamav-milter version devel-120207 on oss.sgi.com X-Virus-Status: Clean X-archive-position: 13055 X-ecartis-version: Ecartis v1.0.0 Sender: xfs-bounce@oss.sgi.com Errors-to: xfs-bounce@oss.sgi.com X-original-sender: bnaujok@sgi.com Precedence: bulk X-list: xfs On Sat, 22 Sep 2007 14:39:28 +1000, Eric Sandeen wrote: > if I do: > > for I in 173 174 178; do ./check $I; done > > it's not terribly interesting, things seem to go ok, just normal > filestreams failures ;-) > > if I do: > > ./check 173 174 178 > > things go very badly; the very first repair in 178 finds a horribly > corrupted filesystem, and repair tips over (memory appears corrupted, as > witnessed by): > >> xfs_repair: zone calloc failed (, 572662388 bytes): Cannot allocate >> memory > > hm, no zone name, length of 0x22222274? > > I already provided a metadump image to Barry, but I wonder why the > timing(?) seems to make a difference here... first sign of things going > awry in repair is: > > Phase 2 - using internal log > - zero log... > - scan filesystem freespace and inode maps... > bad length 131072 for agf 0, should be 4096 > bad length # 131072 for agi 0, should be 4096 > would reset bad agf for ag 0 > would reset bad agi for ag 0 > .... > > not sure what's going on here, but it only seems to happen if I do those > 2 filestreams test immediately before 178... > > oh, and this is over LVM, just for fun. Eric, you have this patch installed don't you? http://oss.sgi.com/archives/xfs/2007-07/msg00139.html