Hi,
First time posting here, please let me know if this is right, or if more info
is needed.
I installed RedHat 7.1 on two similar systems, with the same results. At a
random time, maybe after a day or 2, maybe hours, I would get a kernel panic.
It started after I started serving up a website (using apache, all static HTML)
that produces ~30k/sec in traffic. It was a stock 7.1 install, with all the
errata applied. One of the machines had an uptime of a month or so, producing
~10k/sec web traffic, but then died a couple days after adding this additional
load.
The machines are:
Dell Optiplex GX1, 640 MB RAM, P3-733 (coppermine)
Dell Optiplex GX1, 640 MB RAM, P3-550 (katmai)
(please send any replies to my email as well.. thx, -ray)
ksymoops output:
[root@web4 /tmp]# cat ksym-out
ksymoops 2.4.0 on i686 2.4.2-2. Options used
-v /boot/vmlinux-2.4.2-2 (specified)
-k /proc/ksyms (default)
-l /proc/modules (default)
-o /lib/modules/2.4.2-2/ (default)
-m /boot/System.map (specified)
Warning (compare_maps): ksyms_base symbol __VERSIONED_SYMBOL(shmem_file_setup)
not found in vmlinux. Ignoring ksyms_base entry
Warning (compare_maps): mismatch on symbol partition_name , ksyms_base says
c01af860, vmlinux says c0153510. Ignoring ksyms_base entry
Warning (compare_maps): mismatch on symbol usb_devfs_handle , usbcore says
e088e1a0, /lib/modules/2.4.2-2/kernel/drivers/usb/usbcore.o says e088dcc0.
Ignoring /lib/modules/2.4.2-2/kernel/drivers/usb/usbcore.o entry
Unable to handle kernel NULL pointer dereference at virtual address 00000044
c01d549c
Oops: 0000
CPU: 0
EIP: 0010:[<c01d549c>]
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010246
eax: 0063a570 ebx: d12fcbf4 ecx: 00000001 edx: 00000000
esi: 00000000 edi: d12fcac0 ebp: 00000003 esp: c026fcd8
ds: 0018 es: 0018 ss: 0018
Process swapper (pid: 0, stackpage=c026f000)
Stack: 0000000c d12fcbf4 00000000 d12fcac0 0000010e c01d61b6 d12fcac0 d12fcbf4
00000000 00000002 7e04a897 d12fcac0 7e04a897 0000010e 00000002 c01d6ac5
d12fcac0 7e04a897 00000002 0000010e 00000002 7e04a897 c1afad00 d12fcbf4
Call Trace: [<c01d61b6>] [<c01d6ac5>] [<c01d98f9>] [<c01d8b0c>] [<c01e249d>]
[<c01b93fc>] [<c01dfcc1>]
[<c01dfb1b>] [<c01e00bd>] [<c01c104e>] [<c01ca6eb>] [<c01ca5f0>]
[<c01c12db>] [<c01ca5f0>] [<c01c1312>]
[<c01ca26b>] [<c01ca5f0>] [<c01ca8e3>] [<c01ca760>] [<c01c12db>]
[<c01ca760>] [<c01c1312>] [<c01bbf03>]
[<c01ca5bb>] [<c01ca760>] [<c01bc26a>] [<c010a30a>] [<c0119a8b>]
[<c010a4bf>] [<c0107240>] [<c0107240>]
[<c01090c4>] [<c0107240>] [<c0107240>] [<c0100018>] [<c0107263>]
[<c01072e2>] [<c0105000>] [<c0100191>]
Code: 2b 42 44 ba 01 00 00 00 3b 83 84 00 00 00 0f 47 f2 85 f6 b8
>>EIP; c01d549c <tcp_time_to_recover+6c/1d0> <=====
Trace; c01d61b6 <tcp_fastretrans_alert+376/4a0>
Trace; c01d6ac5 <tcp_ack+1d5/2d0>
Trace; c01d98f9 <tcp_rcv_state_process+2d9/8b0>
Trace; c01d8b0c <tcp_rcv_established+11c/7a0>
Trace; c01e249d <tcp_child_process+4d/a0>
Trace; c01b93fc <skb_checksum+4c/2b0>
Trace; c01dfcc1 <tcp_v4_do_rcv+121/170>
Trace; c01dfb1b <tcp_v4_checksum_init+6b/f0>
Trace; c01e00bd <tcp_v4_rcv+3ad/5e0>
Trace; c01c104e <nf_iterate+2e/80>
Trace; c01ca6eb <ip_local_deliver_finish+fb/170>
Trace; c01ca5f0 <ip_local_deliver_finish+0/170>
Trace; c01c12db <nf_hook_slow+ab/130>
Trace; c01ca5f0 <ip_local_deliver_finish+0/170>
Trace; c01c1312 <nf_hook_slow+e2/130>
Trace; c01ca26b <ip_local_deliver+1ab/1c0>
Trace; c01ca5f0 <ip_local_deliver_finish+0/170>
Trace; c01ca8e3 <ip_rcv_finish+183/1b0>
Trace; c01ca760 <ip_rcv_finish+0/1b0>
Trace; c01c12db <nf_hook_slow+ab/130>
Trace; c01ca760 <ip_rcv_finish+0/1b0>
Trace; c01c1312 <nf_hook_slow+e2/130>
Trace; c01bbf03 <netif_rx+73/e0>
Trace; c01ca5bb <ip_rcv+33b/370>
Trace; c01ca760 <ip_rcv_finish+0/1b0>
Trace; c01bc26a <net_rx_action+1aa/270>
Trace; c010a30a <handle_IRQ_event+3a/70>
Trace; c0119a8b <do_softirq+5b/80>
Trace; c010a4bf <do_IRQ+9f/b0>
Trace; c0107240 <default_idle+0/30>
Trace; c0107240 <default_idle+0/30>
Trace; c01090c4 <ret_from_intr+0/20>
Trace; c0107240 <default_idle+0/30>
Trace; c0107240 <default_idle+0/30>
Trace; c0100018 <startup_32+18/139>
Trace; c0107263 <default_idle+23/30>
Trace; c01072e2 <cpu_idle+52/70>
Trace; c0105000 <empty_bad_page+0/1000>
Trace; c0100191 <L6+0/2>
Code; c01d549c <tcp_time_to_recover+6c/1d0>
00000000 <_EIP>:
Code; c01d549c <tcp_time_to_recover+6c/1d0> <=====
0: 2b 42 44 sub 0x44(%edx),%eax <=====
Code; c01d549f <tcp_time_to_recover+6f/1d0>
3: ba 01 00 00 00 mov $0x1,%edx
Code; c01d54a4 <tcp_time_to_recover+74/1d0>
8: 3b 83 84 00 00 00 cmp 0x84(%ebx),%eax
Code; c01d54aa <tcp_time_to_recover+7a/1d0>
e: 0f 47 f2 cmova %edx,%esi
Code; c01d54ad <tcp_time_to_recover+7d/1d0>
11: 85 f6 test %esi,%esi
Code; c01d54af <tcp_time_to_recover+7f/1d0>
13: b8 00 00 00 00 mov $0x0,%eax
Kernel panic: Aiee, killing interrupt handler!
3 warnings issued. Results may not be reliable.
|