netdev
[Top] [All Lists]

OOPS in tcp_input.c..? (stock redhat, 2.4.2-2)

To: netdev@xxxxxxxxxxx
Subject: OOPS in tcp_input.c..? (stock redhat, 2.4.2-2)
From: Ray Pitmon <rp@xxxxxxxxxxxxxxxxxx>
Date: Sat, 14 Jul 2001 15:18:57 -0500 (CDT)
Cc: rp@xxxxxxxxxxxxxxxxxx
Sender: owner-netdev@xxxxxxxxxxx
Hi,

First time posting here, please let me know if this is right, or if more info 
is needed.

I installed RedHat 7.1 on two similar systems, with the same results.  At a 
random time, maybe after a day or 2, maybe hours, I would get a kernel panic.  
It started after I started serving up a website (using apache, all static HTML) 
that produces ~30k/sec in traffic.  It was a stock 7.1 install, with all the 
errata applied.  One of the machines had an uptime of a month or so, producing 
~10k/sec web traffic, but then died a couple days after adding this additional 
load.

The machines are:
Dell Optiplex GX1, 640 MB RAM, P3-733 (coppermine)
Dell Optiplex GX1, 640 MB RAM, P3-550 (katmai)

(please send any replies to my email as well.. thx, -ray)

ksymoops output:

[root@web4 /tmp]# cat ksym-out
ksymoops 2.4.0 on i686 2.4.2-2.  Options used
     -v /boot/vmlinux-2.4.2-2 (specified)
     -k /proc/ksyms (default)
     -l /proc/modules (default)
     -o /lib/modules/2.4.2-2/ (default)
     -m /boot/System.map (specified)

Warning (compare_maps): ksyms_base symbol __VERSIONED_SYMBOL(shmem_file_setup) 
not found in vmlinux.  Ignoring ksyms_base entry
Warning (compare_maps): mismatch on symbol partition_name  , ksyms_base says 
c01af860, vmlinux says c0153510.  Ignoring ksyms_base entry
Warning (compare_maps): mismatch on symbol usb_devfs_handle  , usbcore says 
e088e1a0, /lib/modules/2.4.2-2/kernel/drivers/usb/usbcore.o says e088dcc0.  
Ignoring /lib/modules/2.4.2-2/kernel/drivers/usb/usbcore.o entry
Unable to handle kernel NULL pointer dereference at virtual address 00000044
c01d549c
Oops: 0000
CPU:    0
EIP:    0010:[<c01d549c>]
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010246
eax: 0063a570   ebx: d12fcbf4   ecx: 00000001   edx: 00000000
esi: 00000000   edi: d12fcac0   ebp: 00000003   esp: c026fcd8
ds: 0018   es: 0018   ss: 0018
Process swapper (pid: 0, stackpage=c026f000)
Stack: 0000000c d12fcbf4 00000000 d12fcac0 0000010e c01d61b6 d12fcac0 d12fcbf4
       00000000 00000002 7e04a897 d12fcac0 7e04a897 0000010e 00000002 c01d6ac5
       d12fcac0 7e04a897 00000002 0000010e 00000002 7e04a897 c1afad00 d12fcbf4
Call Trace: [<c01d61b6>] [<c01d6ac5>] [<c01d98f9>] [<c01d8b0c>] [<c01e249d>] 
[<c01b93fc>] [<c01dfcc1>]
       [<c01dfb1b>] [<c01e00bd>] [<c01c104e>] [<c01ca6eb>] [<c01ca5f0>] 
[<c01c12db>] [<c01ca5f0>] [<c01c1312>]
       [<c01ca26b>] [<c01ca5f0>] [<c01ca8e3>] [<c01ca760>] [<c01c12db>] 
[<c01ca760>] [<c01c1312>] [<c01bbf03>]
       [<c01ca5bb>] [<c01ca760>] [<c01bc26a>] [<c010a30a>] [<c0119a8b>] 
[<c010a4bf>] [<c0107240>] [<c0107240>]
       [<c01090c4>] [<c0107240>] [<c0107240>] [<c0100018>] [<c0107263>] 
[<c01072e2>] [<c0105000>] [<c0100191>]
Code: 2b 42 44 ba 01 00 00 00 3b 83 84 00 00 00 0f 47 f2 85 f6 b8

>>EIP; c01d549c <tcp_time_to_recover+6c/1d0>   <=====
Trace; c01d61b6 <tcp_fastretrans_alert+376/4a0>
Trace; c01d6ac5 <tcp_ack+1d5/2d0>
Trace; c01d98f9 <tcp_rcv_state_process+2d9/8b0>
Trace; c01d8b0c <tcp_rcv_established+11c/7a0>
Trace; c01e249d <tcp_child_process+4d/a0>
Trace; c01b93fc <skb_checksum+4c/2b0>
Trace; c01dfcc1 <tcp_v4_do_rcv+121/170>
Trace; c01dfb1b <tcp_v4_checksum_init+6b/f0>
Trace; c01e00bd <tcp_v4_rcv+3ad/5e0>
Trace; c01c104e <nf_iterate+2e/80>
Trace; c01ca6eb <ip_local_deliver_finish+fb/170>
Trace; c01ca5f0 <ip_local_deliver_finish+0/170>
Trace; c01c12db <nf_hook_slow+ab/130>
Trace; c01ca5f0 <ip_local_deliver_finish+0/170>
Trace; c01c1312 <nf_hook_slow+e2/130>
Trace; c01ca26b <ip_local_deliver+1ab/1c0>
Trace; c01ca5f0 <ip_local_deliver_finish+0/170>
Trace; c01ca8e3 <ip_rcv_finish+183/1b0>
Trace; c01ca760 <ip_rcv_finish+0/1b0>
Trace; c01c12db <nf_hook_slow+ab/130>
Trace; c01ca760 <ip_rcv_finish+0/1b0>
Trace; c01c1312 <nf_hook_slow+e2/130>
Trace; c01bbf03 <netif_rx+73/e0>
Trace; c01ca5bb <ip_rcv+33b/370>
Trace; c01ca760 <ip_rcv_finish+0/1b0>
Trace; c01bc26a <net_rx_action+1aa/270>
Trace; c010a30a <handle_IRQ_event+3a/70>
Trace; c0119a8b <do_softirq+5b/80>
Trace; c010a4bf <do_IRQ+9f/b0>
Trace; c0107240 <default_idle+0/30>
Trace; c0107240 <default_idle+0/30>
Trace; c01090c4 <ret_from_intr+0/20>
Trace; c0107240 <default_idle+0/30>
Trace; c0107240 <default_idle+0/30>
Trace; c0100018 <startup_32+18/139>
Trace; c0107263 <default_idle+23/30>
Trace; c01072e2 <cpu_idle+52/70>
Trace; c0105000 <empty_bad_page+0/1000>
Trace; c0100191 <L6+0/2>
Code;  c01d549c <tcp_time_to_recover+6c/1d0>
00000000 <_EIP>:
Code;  c01d549c <tcp_time_to_recover+6c/1d0>   <=====
   0:   2b 42 44                  sub    0x44(%edx),%eax   <=====
Code;  c01d549f <tcp_time_to_recover+6f/1d0>
   3:   ba 01 00 00 00            mov    $0x1,%edx
Code;  c01d54a4 <tcp_time_to_recover+74/1d0>
   8:   3b 83 84 00 00 00         cmp    0x84(%ebx),%eax
Code;  c01d54aa <tcp_time_to_recover+7a/1d0>
   e:   0f 47 f2                  cmova  %edx,%esi
Code;  c01d54ad <tcp_time_to_recover+7d/1d0>
  11:   85 f6                     test   %esi,%esi
Code;  c01d54af <tcp_time_to_recover+7f/1d0>
  13:   b8 00 00 00 00            mov    $0x0,%eax

Kernel panic: Aiee, killing interrupt handler!

3 warnings issued.  Results may not be reliable.

<Prev in Thread] Current Thread [Next in Thread>