Message-ID: <3D5A3174.1A91A496@up.ac.za>
Date: Wed, 14 Aug 2002 12:31:16 +0200
From: Paul Schutte
To: XFS mailing list
Subject: Problem repairing filesystem

Background:
----------------
I ran an FTP server on a 333 MHz Pentium II with 256 MB RAM, using the 2.4.9-31-xfs kernel.
The machine used 4 x 120 GB IDE drives in a RAID 5 array on an Adaptec 2400 hardware RAID controller. There is a 4 GB root partition and a roughly 320 GB data partition.

One of the drives failed and the machine crashed. We replaced the drive and rebuilt the array.

I booted from a CD that I created a while ago with 2.4.19-pre9-20020604 and mounted an NFS root partition with all the XFS tools on it.

We ran xfs_repair (version 2.2.1) on the root partition of the RAID array. A lot of the files have the dreaded zero problem, but apart from that it is mountable and usable.

The problem:
------------------
We ran xfs_repair on the 320 GB partition. After about 15 minutes xfs_repair died with 'Terminated' printed on the console.

dmesg reveals:

Out of Memory: Killed process 269 (xfs_repair).

I recreated the swap partition, activated it, and ran xfs_repair again.

--snip--
Phase 6 - check inode connectivity...
        - resetting contents of realtime bitmap and summary inodes
        - ensuring existence of lost+found directory
        - traversing filesystem starting at / ...
        - traversal finished ...
        - traversing all unattached subtrees ...

fatal error -- can't read block 0 for directory inode 2097749

When you mount the filesystem, it is empty (except for lost+found, which is also empty).

The output of xfs_repair is large, about 300k bzip2'ed. It would be best if interested parties download it:

http://www2.up.ac.za/paul/xfs_repair.out.bz2
http://www2.up.ac.za/paul/dmesg.out.bz2

Questions:
--------------
Have I lost the 320 GB partition, or does someone still have a trick up their sleeve?

Would it be possible to make xfs_repair use a lot less memory?

My guess is that the filesystem got its final blow from xfs_repair exiting prematurely.

Any suggestions are welcome.

Paul Schutte