RE: Kernel Oops'

To: <linux-xfs@xxxxxxxxxxx>
Subject: RE: Kernel Oops'
From: "Hengesbach, Jeff (US EL)" <jeff.hengesbach@xxxxxxxxxxx>
Date: Fri, 9 Nov 2001 15:36:06 -0500
Sender: owner-linux-xfs@xxxxxxxxxxx
Thread-index: AcFpKy8EnUdvvUMwQwyz37VES9KueAAMeoug
Thread-topic: Kernel Oops'
OK - thanks to Eric's help things appear to be back to normal now (if
there is such a thing).  The install CD I have is the first one that was
put out - I didn't find xfs_repair/xfs_check on it.  Luckily I have some
space on my /boot partition (mounted to a temp location) where I copied
them to (actually all the xfs_* progs, as some of the others are
required).  xfs_check found a few "link count mismatch.....disconnected
inode" errors (I have one written down in case there is interest - lots
of typing) - I actually have/had another directory that was causing me
the same problem from a few weeks back.

xfs_repair -n basically found the same things: "entry .... references
non-existent inode.... would have cleared inode...." (I wrote one of
these monsters down also in case there is interest)

It didn't appear that it wanted to destroy the fs, and these were junk
directories/files anyway, so I ran xfs_repair on the partition, which
dumped a few small files into lost+found and allowed me to remove the
previously offending directories & files after the system booted normally.
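
The sequence described above (read-only check, dry-run repair, then the real repair) can be sketched as a small helper that only builds the command lines. This is an illustrative sketch, not something taken from the original report: the function name `repair_plan` and the device path `/dev/hdaX` are placeholders, and the commands are meant to be run against an unmounted filesystem, e.g. after booting from other media as was done here.

```python
def repair_plan(device):
    """Return the check-then-repair commands in the order used above.

    `device` is a placeholder for the XFS partition; nothing is executed here.
    """
    return [
        ["xfs_check", device],         # read-only consistency check
        ["xfs_repair", "-n", device],  # no-modify mode: only reports what it would do
        ["xfs_repair", device],        # actual repair; orphans may land in lost+found
    ]


if __name__ == "__main__":
    # /dev/hdaX is a made-up example path, not the partition from this report.
    for cmd in repair_plan("/dev/hdaX"):
        print(" ".join(cmd))
```

Running the no-modify pass first gives a chance to judge how invasive the repair would be before anything is written to disk.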

The big question now is, what caused this in the first place ??????

My hat's off to you (Eric) and all the other XFS
developers/contributors. I've been using linux+XFS on all my linux
installs since before the first installer came out (over 1 year ago
now??) - and have not experienced an issue before this.

Much Thanks,

Jeff Hengesbach
Vantico Inc.
USA Unix Manager
517.324.1581
517.351.9003(Fax)


-----Original Message-----
From: Hengesbach, Jeff (US EL) 
Sent: Friday, November 09, 2001 9:34 AM
To: linux-xfs@xxxxxxxxxxx
Subject: Kernel Oops'


I'm having a kernel Oops occur when trying to rm or ls the directory:
/tmp/orbit-hengeje1
I can mv it to a different name.
After the mv it's automatically recreated, and I can then rm, ls, etc.
the recreated dir just fine.
However, I cannot remove the renamed directory - it gets an oops
every time.

The first rm -fr /tmp/orbit-hengeje1 will complete, but with the message:
"Can't remove directory ... - (not empty)"
The second try of the command will hang and generate the Oops in the
syslog.
I guess it's trying to tell me - If at first you don't succeed, don't try
again ;-)


Kernel versions and Oops'(direct from syslog):

This is cvs checkout from oss on the 6th or 7th of November:
Linux version 2.4.14-xfs (root@Dexter) (gcc version 2.96 20000731 (Red
Hat Linux 7.1 2.96-85)) #2 Wed Nov 7 10:04:17 EST 2001
Nov  9 07:56:09 localhost kernel: Unable to handle kernel NULL pointer
dereference at virtual address 00000152
Nov  9 07:56:09 localhost kernel:  printing eip:
Nov  9 07:56:09 localhost kernel: c01ad5dd
Nov  9 07:56:09 localhost kernel: *pde = 00000000
Nov  9 07:56:09 localhost kernel: Oops: 0000
Nov  9 07:56:09 localhost kernel: CPU:    0
Nov  9 07:56:10 localhost kernel: EIP:    0010:[xfs_iget+269/352]    Not
tainted
Nov  9 07:56:10 localhost kernel: EIP:    0010:[<c01ad5dd>]    Not
tainted
Nov  9 07:56:10 localhost kernel: EFLAGS: 00210246
Nov  9 07:56:10 localhost kernel: eax: 00000000   ebx: ffffffe8   ecx:
00000000   edx: c028d1e0
Nov  9 07:56:10 localhost kernel: esi: c076fcb4   edi: c028d1e0   ebp:
c076fca0   esp: d0b57e10
Nov  9 07:56:10 localhost kernel: ds: 0018   es: 0018   ss: 0018
Nov  9 07:56:10 localhost kernel: Process ls (pid: 9872,
stackpage=d0b57000)
Nov  9 07:56:10 localhost kernel: Stack: 00000000 00000000 c1654000
0000004b 0303151b d3ffe2d4 c01c21b7 c1654000
Nov  9 07:56:10 localhost kernel:        00000000 0303151b 00000000
00000000 d0b57e9c 00000000 00000000 00000000
Nov  9 07:56:10 localhost kernel:        00000008 00000018 000001f5
00000306 d3ffe2ec d3ffe2d4 00000008 d4079460
Nov  9 07:56:10 localhost kernel: Call Trace:
[xfs_dir_lookup_int+295/704] [xfs_lookup+151/272]
[linvfs_lookup+104/192] [real_lookup+79/192] [link_path_walk+1267/1808]
Nov  9 07:56:10 localhost kernel: Call Trace: [<c01c21b7>] [<c01c68d7>]
[<c01cf568>] [<c0137c0f>] [<c0138253>]
Nov  9 07:56:10 localhost kernel:    [getname+94/160]
[__user_walk+51/80] [sys_lstat64+20/112] [error_code+52/60]
[system_call+51/56]
Nov  9 07:56:10 localhost kernel:    [<c013795e>] [<c01387f3>]
[<c0135a14>] [<c0106ddc>] [<c0106ceb>]
Nov  9 07:56:10 localhost kernel:
Nov  9 07:56:10 localhost kernel: Code: 66 83 bb 6a 01 00 00 00 75 1a 0f
b7 83 50 01 00 00 25 f7 ff

************************************************************************
*******************************************

This is the linus kernel with the xfs patch from oss.sgi.com:
Linux version 2.4.7-xfs (root@Dexter) (gcc version 2.96 20000731 (Red
Hat Linux 7.1 2.96-85)) #2 Wed Oct 31 09:26:08 EST 2001
Nov  9 08:04:24 localhost kernel: Unable to handle kernel NULL pointer
dereference at virtual address 00000152
Nov  9 08:04:24 localhost kernel:  printing eip:
Nov  9 08:04:24 localhost kernel: c01aef5d
Nov  9 08:04:24 localhost kernel: *pde = 00000000
Nov  9 08:04:24 localhost kernel: Oops: 0000
Nov  9 08:04:24 localhost kernel: CPU:    0
Nov  9 08:04:24 localhost kernel: EIP:    0010:[xfs_iget+253/336]
Nov  9 08:04:24 localhost kernel: EIP:    0010:[<c01aef5d>]
Nov  9 08:04:24 localhost kernel: EFLAGS: 00010246
Nov  9 08:04:24 localhost kernel: eax: d661ab60   ebx: ffffffe8   ecx:
d78f4d68   edx: c02a6be0
Nov  9 08:04:24 localhost kernel: esi: 0303151b   edi: 00000000   ebp:
d661ab4c   esp: d4ce9e04
Nov  9 08:04:24 localhost kernel: ds: 0018   es: 0018   ss: 0018
Nov  9 08:04:24 localhost kernel: Process rm (pid: 1331,
stackpage=d4ce9000)
Nov  9 08:04:24 localhost kernel: Stack: 00000000 00000000 c17f0000
0000011c 0303151b d4cca964 c01c4467 c17f0000
Nov  9 08:04:24 localhost kernel:        00000000 0303151b 00000000
00000000 d4ce9e90 00000000 00000000 00000000
Nov  9 08:04:24 localhost kernel:        00000008 00000018 d4ccbaac
00000000 d4cca97c d4cca964 00000008 d4c4cfe0
Nov  9 08:04:24 localhost kernel: Call Trace:
[xfs_dir_lookup_int+295/704] [xfs_lookup+151/272]
[linvfs_lookup+104/192] [xfs_access+47/64] [real_lookup+79/192]
[path_walk+1425/2016] [zap_page_range+305/608]
Nov  9 08:04:24 localhost kernel: Call Trace: [<c01c4467>] [<c01c8d57>]
[<c01d1a58>] [<c01c7a7f>] [<c013a72f>] [<c013ae41>] [<c0120001>]
Nov  9 08:04:24 localhost kernel:        [__user_walk+58/96]
[sys_lstat64+19/112] [system_call+51/56]
Nov  9 08:04:24 localhost kernel:        [<c013b44a>] [<c0138503>]
[<c0106cdb>]
Nov  9 08:04:24 localhost kernel:
Nov  9 08:04:24 localhost kernel: Code: 66 83 bb 6a 01 00 00 00 75 1a 0f
b7 83 50 01 00 00 25 f7 ff

************************************************************************
***********************************************

This is the linus kernel with the xfs patch from oss.sgi.com:
Linux version 2.4.9-xfs (root@Dexter) (gcc version 2.96 20000731 (Red
Hat Linux 7.1 2.96-85)) #3 Wed Nov 7 09:58:40 EST 2001
Nov  9 08:07:28 localhost kernel: Unable to handle kernel NULL pointer
dereference at virtual address 00000152
Nov  9 08:07:28 localhost kernel:  printing eip:
Nov  9 08:07:28 localhost kernel: c01aef7d
Nov  9 08:07:28 localhost kernel: *pde = 00000000
Nov  9 08:07:28 localhost kernel: Oops: 0000
Nov  9 08:07:28 localhost kernel: CPU:    0
Nov  9 08:07:28 localhost kernel: EIP:    0010:[xfs_iget+253/336]
Nov  9 08:07:28 localhost kernel: EIP:    0010:[<c01aef7d>]
Nov  9 08:07:28 localhost kernel: EFLAGS: 00010246
Nov  9 08:07:28 localhost kernel: eax: d4defb00   ebx: ffffffe8   ecx:
c02a4b70   edx: c02a7300
Nov  9 08:07:28 localhost kernel: esi: 0303151b   edi: 00000000   ebp:
d4defaec   esp: d4e39e04
Nov  9 08:07:28 localhost kernel: ds: 0018   es: 0018   ss: 0018
Nov  9 08:07:28 localhost kernel: Process ls (pid: 1265,
stackpage=d4e39000)
Nov  9 08:07:28 localhost kernel: Stack: 00000000 00000000 c16a0000
00000000 0303151b d4e164c8 c01c4567 c16a0000
Nov  9 08:07:28 localhost kernel:        00000000 0303151b 00000000
00000000 d4e39e90 00000000 00000000 00000000
Nov  9 08:07:28 localhost kernel:        00000008 00000018 d4e47cec
00000000 d4e164e0 d4e164c8 00000008 d63d8ce0
Nov  9 08:07:28 localhost kernel: Call Trace:
[xfs_dir_lookup_int+295/704] [xfs_lookup+151/272]
[linvfs_lookup+104/192] [xfs_access+47/64] [real_lookup+79/192]
Nov  9 08:07:28 localhost kernel: Call Trace: [<c01c4567>] [<c01c8b97>]
[<c01d1588>] [<c01c789f>] [<c013a68f>]
Nov  9 08:07:28 localhost kernel:    [path_walk+1425/2016]
[__user_walk+58/96] [sys_lstat64+19/112] [system_call+51/56]
Nov  9 08:07:28 localhost kernel:    [<c013ada1>] [<c013b3aa>]
[<c0138463>] [<c0106ceb>]
Nov  9 08:07:28 localhost kernel:
Nov  9 08:07:28 localhost kernel: Code: 66 83 bb 6a 01 00 00 00 75 1a 0f
b7 83 50 01 00 00 25 f7 ff
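
All three traces fault at the same address (00000152) with the same code bytes and the same ebx (ffffffe8). Decoding the first instruction by hand, `66 83 bb 6a 01 00 00 00` reads as `cmpw $0x0,0x16a(%ebx)`, assuming the Code: bytes start at the faulting EIP, as 2.4 kernels print them. The short sketch below just checks the arithmetic: ebx plus the 0x16a displacement wraps around 32 bits to the reported address. The helper names are made up for illustration, and this only shows which register held the bad value, not why.

```python
def faulting_address(base, displacement, bits=32):
    """Effective address of base+displacement with wraparound at `bits` bits."""
    return (base + displacement) % (1 << bits)


def to_signed(value, bits=32):
    """Interpret an unsigned register value as a two's-complement integer."""
    return value - (1 << bits) if value >> (bits - 1) else value


ebx = 0xFFFFFFE8   # ebx as printed in all three oops dumps
disp = 0x16A       # displacement from the bytes 6a 01 00 00 in the Code: line

print(hex(faulting_address(ebx, disp)))  # 0x152
print(to_signed(ebx))                    # -24
```

So ebx held -24 rather than a usable pointer when xfs_iget dereferenced it, which is consistent with the "NULL pointer dereference at virtual address 00000152" line in each dump.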

Hardware is a Dell Latitude C600 Laptop 1GHz P3 384MB RAM 18GB disk.

Thanks,

Jeff Hengesbach
Vantico Inc.
USA Unix Manager
517.324.1581
517.351.9003(Fax)

