xfs
[Top] [All Lists]

BUG 799518 - panic machine when mounting filesystems and xfs_repair core

To: nb@xxxxxxx
Subject: BUG 799518 - panic machine when mounting filesystems and xfs_repair core dump
From: pv@xxxxxxxxxxxxx (nelsond@xxxxxxxxxxxx)
Date: Sun, 20 Aug 2000 19:42:12 -0700 (PDT)
Cc: ananth@xxxxxxxxxxxxxxxxxxxx, linux-xfs@xxxxxxxxxxx
Reply-to: sgi.bugs.xfs@xxxxxxxxxxxxxxxxx
Sender: owner-linux-xfs@xxxxxxxxxxx
Webexec: webpvsubmit,PvProjectIncident
Webpv: harrier.corp.sgi.com
View Incident: 
http://co-op.engr.sgi.com/BugWorks/code/bwxquery.cgi?search=Search&wlong=1&view_type=Bug&wi=799518

Submitter : nelsond                   Submitter Domain : engr               
Assigned Engineer : nb                Assigned Domain : sgi.com             
Assigned Group : xfs-linux            Category : software                   
Customer Reported : F                 Priority : 2                          
Project : xfs-linux                   Status : open                         
Description :
I'm running the 2.4.0-test5-xfs code from bonnie from Aug 20th.

I've run into a panic when mounting my filesystems.  I have 
30x 40GB raid luns.  On thursday and friday I was hitting the 
bug where the filesystems would not mount without doing the xfs_repair on them.

I put the new kernel source on, and recreated the xfs user utilities.
I ran xfs_repair on my filesystems, and some of them had errors and caused 
xfs_repair
to core dump.

Here is the stack trace from the xfs_repair core dump:
[root@tigershark 100gb_30_lun_load]# gdb xfs_repair core
GNU gdb 19991004
Copyright 1998 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for details.
This GDB was configured as "i386-redhat-linux"...
Core was generated by `xfs_repair /dev/scsi/host3/bus0/target0/lun3/part10'.
Program terminated with signal 6, Aborted.
Reading symbols from /lib/libc.so.6...done.
Reading symbols from /lib/ld-linux.so.2...done.
#0  0x4003ad41 in __kill () from /lib/libc.so.6
(gdb) where
#0  0x4003ad41 in __kill () from /lib/libc.so.6
#1  0x4003a9b6 in raise (sig=6) at ../sysdeps/posix/raise.c:27
#2  0x4003c0d8 in abort () at ../sysdeps/generic/abort.c:88
#3  0x40034bae in __assert_fail () at assert.c:59
#4  0x8094bfb in xfs_iflush_fork (ip=0x8151ba8, dip=0x8151c88, whichfork=0, 
bp=0x8151c70) at util.c:336
#5  0x8095223 in libxfs_iflush_int (ip=0x8151ba8, bp=0x8151c70) at util.c:443
#6  0x80899a0 in libxfs_trans_log_inode (tp=0x814dbb8, ip=0x8151ba8, flags=1) 
at rdwr.c:456
#7  0x8074e5a in mk_root_dir (mp=0xbffffa44) at phase6.c:703
#8  0x807e11b in phase6 (mp=0xbffffa44) at phase6.c:3780
#9  0x8086c77 in main (argc=2, argv=0xbffffd14) at xfs_repair.c:487
(gdb) quit
======================================================================

After the xfs_repair, I tried to mount the filesystems, and got the following 
panic. 

===========================================================================

Start mounting filesystem: sd(8,74)
XFS: corrupted root inode

0x84a: Root inode 128 is not a directory
Unable to handle kernel NULL pointer dereference at virtual address 0000001c
 printing eip:
c01b7b1c
*pde = 00000000

Entering kdb (0xdb594000) on processor 1 Panic: Oops
due to panic @ 0xc01b7b1c
eax = 0x00000000 ebx = 0xdb94dee0 ecx = 0xc1ef6400 edx = 0xdb94def8 
esi = 0xdb94dee0 edi = 0xdb595bb0 esp = 0xdb595b1c eip = 0xc01b7b1c 
ebp = 0xdb595b30  ss = 0x00000018  cs = 0x00000010 eflags = 0x00010246 
 ds = 0xdb940018  es = 0x00000018 origeax = 0xffffffff &regs = 0xdb595ae8
[1]kdb> bt
    EBP       EIP         Function(args)
0xdb595b30 0xc01b7b1c vn_count+0xc (0xe2c085a0, 0xe1f64ce0, 0xdb595eb4, 0x0, 
0xc03d87e0)
                               kernel .text 0xc0100000 0xc01b7b10 0xc01b7b20
0xdb595ec8 0xc01b6a2a linvfs_read_super+0x1c2 (0xe2f55400, 0x0, 0x0)
                               kernel .text 0xc0100000 0xc01b6868 0xc01b6b04
0xdb595ee8 0xc01389f5 read_super+0x105 (0x84a, 0xde8b4fa0, 0xc0358768, 0x0, 
0x0)                               kernel .text 0xc0100000 0xc01388f0 0xc0138a54
0xdb595f38 0xc0138c1b get_sb_bdev+0x15b (0xc0358768, 0xdb57a000, 0x0, 0x0)
                               kernel .text 0xc0100000 0xc0138ac0 0xc0138c70
0xdb595f88 0xc0139836 do_mount+0x1a2 (0xdb57a000, 0xf7755000, 0xdce4a000, 
0xc0ed0000, 0x0)
                               kernel .text 0xc0100000 0xc0139694 0xc0139940
0xdb595fbc 0xc01399e7 sys_mount+0xa7 (0x8059778, 0x80597a8, 0x80597d8, 
0xc0ed0000, 0x0)                               kernel .text 0xc0100000 
0xc0139940 0xc0139a58
           0xc010a660 system_call+0x34
                               kernel .text 0xc0100000 0xc010a62c 0xc010a664
[1]kdb> cpu 0

Entering kdb (0xe2f90000) on processor 0 due to cpu switch
[0]kdb> bt
    EBP       EIP         Function(args)
           0xc0267b4b stext_lock+0x77f
                               kernel .text.lock 0xc02673cc 0xc02673cc 
0xc026d5e00xe2f91ed8 0xc0119ca1 schedule+0x6c9
                               kernel .text 0xc0100000 0xc01195d8 0xc0119e90
           0xc01347c2 __wait_on_buffer+0xb2 (0xe17f7180)
                               kernel .text 0xc0100000 0xc0134710 0xc01347e8
0xe2f91f1c 0xc0128b2d waitfor_one_page+0x1d (0xc18ded50)
                               kernel .text 0xc0100000 0xc0128b10 0xc0128b54
0xe2f91f38 0xc0128bb7 do_buffer_fdatasync+0x63 (0xf79c0640, 0x0, 0xffffffff, 
0xc0128b10, 0xf79c0640)
                               kernel .text 0xc0100000 0xc0128b54 0xc0128c04
0xe2f91f6c 0xc0128c32 generic_buffer_fdatasync+0x2e (0xf79c0640, 0x0, 
0xffffffff)                               kernel .text 0xc0100000 0xc0128c04 
0xc0128c40
0xe2f91f98 0xc0151212 ext2_sync_file+0x52 (0xf7c0d4e0, 0xf762f0e0, 0x0)
                               kernel .text 0xc0100000 0xc01511c0 0xc01512c0
0xe2f91fbc 0xc0134bb8 sys_fsync+0x54 (0x1, 0xbffff028, 0x0, 0xbffff050, 
0x8058b58)                               kernel .text 0xc0100000 0xc0134b64 
0xc0134bdc
           0xc010a660 system_call+0x34
                               kernel .text 0xc0100000 0xc010a62c 0xc010a664
[0]kdb> cpu 2

Entering kdb (0xf7fbc000) on processor 2 due to cpu switch
[2]kdb> bt
    EBP       EIP         Function(args)
0xf7fbdfa4 0xc0108880 default_idle+0x30
                               kernel .text 0xc0100000 0xc0108850 0xc0108888
           0xc01088f2 cpu_idle+0x42
                               kernel .text 0xc0100000 0xc01088b0 0xc0108908
0xf7fbdfc0 0xc0377219 start_secondary+0x21
                               kernel .text.init 0xc0372000 0xc03771f8 
0xc0377220[2]kdb> cpu 3

Entering kdb (0xf7fba000) on processor 3 due to cpu switch
[3]kdb> bt
    EBP       EIP         Function(args)
0xf7fbbfa4 0xc0108880 default_idle+0x30
                               kernel .text 0xc0100000 0xc0108850 0xc0108888
           0xc01088f2 cpu_idle+0x42
                               kernel .text 0xc0100000 0xc01088b0 0xc0108908
0xf7fbbfc0 0xc0377219 start_secondary+0x21
                               kernel .text.init 0xc0372000 0xc03771f8 
0xc0377220

<Prev in Thread] Current Thread [Next in Thread>
  • BUG 799518 - panic machine when mounting filesystems and xfs_repair core dump, nelsond@xxxxxxxxxxxx <=