[Top] [All Lists]

Re: Problems with 2.4.19-pre9

To: Steve Lord <lord@xxxxxxx>
Subject: Re: Problems with 2.4.19-pre9
From: Paul Schutte <paul@xxxxxxxx>
Date: Fri, 31 May 2002 19:11:27 +0200
Cc: Florin Iucha <florin@xxxxxxxxx>, linux-xfs@xxxxxxxxxxx
References: <20020531115832.GA1401@iucha.net> <3CF76A46.3FEC1553@up.ac.za> <1022856397.27212.1.camel@jen.americas.sgi.com>
Sender: owner-linux-xfs@xxxxxxxxxxx
My prevoius observations were made on 2.4.18 which I checked out on 24 May.
The 2.4.19-pre9 looks to be a bit worse than 2.4.18.
The machine was unable to complete the bootup.

I attach the whole capture file.
It looks like everthing is stuck in schedule.
It is odd that no filesystems was mounted yet, at this point.
The machine has frozen cold.

I will now try hang.patch and see how it goes.


Steve Lord wrote:

> On Fri, 2002-05-31 at 07:19, Paul Schutte wrote:
> > Changes that went in on  May  24th caused the problems.
> > I checked the tree out on the morning of 24 May and Steve dropped a few
> > changes to get XFS out of the irq business later that day.
> > I checked the tree out again before I went home.
> > The first one works perfectly, the second one shows the behaviour described.
> > I checked with kdb to see what happends when everythings pauses.
> >
> > keventd is shown to be running when I do a ps.
> > It is busy doing  "schedule"
> > The cp that shows the pause behaviour are also in "schedule".
> > It looks like same kind of deadlock that resolves it self after a while.
> > It takes between 5 to 30 seconds for the cp to continue.
> >
> > Hope this helps.
> >
> > Paul
> >
> Paul, which compiler do you use? The problem seems to go away for people
> who switch away from that debian compiler. Also if possible, during the
> pause, can you look at the stack for pagebuf_io_CPU0 and report where
> that is?
> Thanks
>    Steve
> --
> Steve Lord                                      voice: +1-651-683-3511
> Principal Engineer, Filesystem Software         email: lord@xxxxxxx
Linux version 2.4.19-pre9-20020531i (root@xxxxxxxxxxxxxx) (gcc version 2.95.4 
20011002 (Debian prerelease)) #3 Fri May 31 18:37:55 SAST 2002
BIOS-provided physical RAM map:
 BIOS-e820: 0000000000000000 - 000000000009e400 (usable)
 BIOS-e820: 000000000009e400 - 00000000000a0000 (reserved)
 BIOS-e820: 00000000000e8000 - 0000000000100000 (reserved)
 BIOS-e820: 0000000000100000 - 000000003fff0000 (usable)
 BIOS-e820: 000000003fff0000 - 000000003ffffc00 (ACPI data)
 BIOS-e820: 000000003ffffc00 - 0000000040000000 (ACPI NVS)
 BIOS-e820: 00000000fec00000 - 00000000fec10000 (reserved)
 BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved)
 BIOS-e820: 00000000fff00000 - 0000000100000000 (reserved)
127MB HIGHMEM available.
896MB LOWMEM available.
On node 0 totalpages: 262128
zone(0): 4096 pages.
zone(1): 225280 pages.
zone(2): 32752 pages.
Kernel command line: auto BOOT_IMAGE=Linux rw root=3002 console=ttyS0, 
Initializing CPU#0
Detected 498.757 MHz processor.
Console: colour VGA+ 80x25
Calibrating delay loop... 996.14 BogoMIPS
Memory: 1032356k/1048512k available (1707k kernel code, 15764k reserved, 1011k 
data, 464k init, 131008k highmem)
kdb version 2.1 by Scott Lurndal, Keith Owens. Copyright SGI, All Rights 
Dentry cache hash table entries: 131072 (order: 8, 1048576 bytes)
Inode cache hash table entries: 65536 (order: 7, 524288 bytes)
Mount-cache hash table entries: 16384 (order: 5, 131072 bytes)
Buffer-cache hash table entries: 65536 (order: 6, 262144 bytes)
Page-cache hash table entries: 262144 (order: 8, 1048576 bytes)
CPU: L1 I cache: 16K, L1 D cache: 16K
CPU: L2 cache: 512K
CPU serial number disabled.
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#0.
CPU: Intel Pentium III (Katmai) stepping 02
Enabling fast FPU save and restore... done.
Enabling unmasked SIMD FPU exception support... done.
Checking 'hlt' instruction... OK.
POSIX conformance testing by UNIFIX
mtrr: v1.40 (20010327) Richard Gooch (rgooch@xxxxxxxxxxxxx)
mtrr: detected mtrr type: Intel
PCI: PCI BIOS revision 2.10 entry at 0xfdaf0, last bus=1
PCI: Using configuration type 1
PCI: Probing PCI hardware
Unknown bridge resource 0: assuming transparent
Unknown bridge resource 1: assuming transparent
Unknown bridge resource 2: assuming transparent
PCI: Using IRQ router PIIX [8086/7110] at 00:12.0
PCI: Cannot allocate resource region 4 of device 00:12.1
Limiting direct PCI/PCI transfers.
isapnp: Scanning for PnP cards...
isapnp: No Plug & Play device found
Linux NET4.0 for Linux 2.4
Based upon Swansea University Computer Society NET3.039
Initializing RT netlink socket
Starting kswapd
allocated 32 pages and 32 bhs reserved for the highmem bounces
Installing knfsd (copyright (C) 1996 okir@xxxxxxxxxxxx).
SGI XFS with no debug enabled
pty: 256 Unix98 ptys configured
Serial driver version 5.05c (2001-07-08) with MANY_PORTS SHARE_IRQ SERIAL_PCI 
ISAPNP enabled
ttyS00 at 0x03f8 (irq = 4) is a 16550A
Uniform Multi-Platform E-IDE driver Revision: 6.31
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
PIIX4: IDE controller on PCI bus 00 dev 91
PIIX4: detected chipset, but driver not compiled in!
PIIX4: chipset revision 1
PIIX4: not 100% native mode: will probe irqs later
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
hda: ATAPI 40X CD-ROM drive, 128kB Cache
Uniform CD-ROM driver Revision: 3.12
Floppy drive(s): fd0 is 1.44M
FDC 0 is a National Semiconductor PC87306
RAMDISK driver initialized: 16 RAM disks of 4096K size 1024 blocksize
loop: loaded (max 8 devices)
PCI: Assigned IRQ 5 for device 00:0e.1
DAC960: ***** DAC960 RAID Driver Version 2.4.11 of 11 October 2001 *****
DAC960: Copyright 1998-2001 by Leonard N. Zubkoff <lnz@xxxxxxxxxxxxx>

Entering kdb (current=0xc03a8000, pid 0) due to Keyboard Entry
kdb> bt
    EBP       EIP         Function(args)
0xc03a9fd0 0xc0106f26 default_idle+0x26 (0x0, 0x9ea00, 0xc0105000)
                               kernel .text 0xc0100000 0xc0106f00 0xc0106f30
0xc03a9fe4 0xc0106f99 cpu_idle+0x41
                               kernel .text 0xc0100000 0xc0106f58 0xc0106fac
0xc03a9fec 0xc010502a stext+0x2a (0xc041f780)
                               kernel .text 0xc0100000 0xc0105000 0xc0105030
0xc03a9ff8 0xc03aa807 start_kernel+0x157
                               kernel .text.init 0xc03aa000 0xc03aa6b0 
kdb> ps
Task Addr  Pid      Parent   [*] cpu  State Thread     Command
0xc1c14000 00000001 00000000  1  000  stop  0xc1c14270 swapper
0xf7ef8000 00000002 00000001  1  000  stop  0xf7ef8270 keventd
0xf7ef4000 00000003 00000001  1  000  stop  0xf7ef4270 ksoftirqd_CPU0
0xf7ef2000 00000004 00000001  1  000  stop  0xf7ef2270 kswapd
0xf7ece000 00000005 00000001  1  000  stop  0xf7ece270 bdflush
0xf7ecc000 00000006 00000001  1  000  stop  0xf7ecc270 kupdated
0xf7ec6000 00000007 00000001  1  000  stop  0xf7ec6270 pagebufd
0xf7ec4000 00000008 00000001  1  000  stop  0xf7ec4270 pagebuf_io_CPU0
kdb> btp 1
    EBP       EIP         Function(args)
0xc1c15e3c 0xc01130b6 schedule+0x2ca (0xf7e88000, 0xc1c15e90, 0x0, 0xc1c14000, 
                               kernel .text 0xc0100000 0xc0112dec 0xc01130e0
           0xc011333e wait_for_completion+0x72 (0xf7e88120, 0xf7e88000, 0x292, 
0x0, 0x0)
                               kernel .text 0xc0100000 0xc01132cc 0xc011335c
0xc1c15ea0 0xc0218314 DAC960_ExecuteCommand+0x68 (0xf7e88120, 0xf7e88000, 0x1, 
                               kernel .text 0xc0100000 0xc02182ac 0xc0218320
0xc1c15eb8 0xc0218385 DAC960_V1_ExecuteType3+0x65 (0xf7e88000, 0x53, 
0xf7e892fc, 0xf7e88000, 0xf7e88000)
                               kernel .text 0xc0100000 0xc0218320 0xc02183ac
0xc1c15fa0 0xc0218e44 DAC960_V1_ReadControllerConfiguration+0x20 (0xf7e88000, 
                               kernel .text 0xc0100000 0xc0218e24 0xc0219310
0xc1c15fb0 0xc021b308 DAC960_InitializeController+0x10 (0xf7e88000, 0xc041d038, 
0xc03a9fcc, 0x0)
                               kernel .text 0xc0100000 0xc021b2f8 0xc021b3b4
0xc1c15fc8 0xc021b5e6 DAC960_Initialize+0x66 (0x10f00)
                               kernel .text 0xc0100000 0xc021b580 0xc021b62c
0xc1c15fd4 0xc03aa819 do_initcalls+0xd (0xc1c15fec)
                               kernel .text.init 0xc03aa000 0xc03aa80c 
0xc1c15fdc 0xc03aa862 do_basic_setup+0x32 (0x10f00, 0xc03a9fcc)
                               kernel .text.init 0xc03aa000 0xc03aa830 
           0xc010503a init+0xa
                               kernel .text 0xc0100000 0xc0105030 0xc0105140
           0xc010724c kernel_thread+0x28
more> btp 2
Only 'q' or 'Q' are processed at more prompt, input ignored
                               kernel .text 0xc0100000 0xc0107224 0xc010725c
kdb> btp 2
    EBP       EIP         Function(args)
0xf7ef9fa8 0xc01130b6 schedule+0x2ca (0x700, 0xc1c15fb4, 0xc03c2bb8, 
0xf7ef8574, 0xf7ef8564)
                               kernel .text 0xc0100000 0xc0112dec 0xc01130e0
           0xc0120a85 context_thread+0xfd
                               kernel .text 0xc0100000 0xc0120988 0xc0120b30
           0xc010724c kernel_thread+0x28
                               kernel .text 0xc0100000 0xc0107224 0xc010725c
kdb> btp 3
    EBP       EIP         Function(args)
0xf7ef5fdc 0xc01130b6 schedule+0x2ca (0x10f00, 0xc1c15fa0)
                               kernel .text 0xc0100000 0xc0112dec 0xc01130e0
           0xc0119984 ksoftirqd+0x74
                               kernel .text 0xc0100000 0xc0119910 0xc01199c4
           0xc010724c kernel_thread+0x28
                               kernel .text 0xc0100000 0xc0107224 0xc010725c
kdb> btp 4
    EBP       EIP         Function(args)
0xf7ef3fc8 0xc01130b6 schedule+0x2ca (0x10f00, 0xc1c15fa4, 0x0, 0x0, 0xf7ef2000)
                               kernel .text 0xc0100000 0xc0112dec 0xc01130e0
           0xc012b0cb kswapd+0x7f
                               kernel .text 0xc0100000 0xc012b04c 0xc012b110
           0xc010724c kernel_thread+0x28
                               kernel .text 0xc0100000 0xc0107224 0xc010725c
kdb> btp 5
    EBP       EIP         Function(args)
0xf7ecffbc 0xc01130b6 schedule+0x2ca (0x1f4, 0x0, 0xf7ece000, 0xc0392e4c, 
                               kernel .text 0xc0100000 0xc0112dec 0xc01130e0
           0xc0113399 interruptible_sleep_on+0x3d (0x10f00, 0xc1c15fa8, 
                               kernel .text 0xc0100000 0xc011335c 0xc01133ac
           0xc013463a bdflush+0xa6
                               kernel .text 0xc0100000 0xc0134594 0xc013463c
           0xc010724c kernel_thread+0x28
                               kernel .text 0xc0100000 0xc0107224 0xc010725c
kdb> btp 6
    EBP       EIP         Function(args)
0xf7ecdfb0 0xc01130b6 schedule+0x2ca (0xf7ecdfc4, 0xf7ecc000, 0xc02b2877, 
0xc042744c, 0xc042744c)
                               kernel .text 0xc0100000 0xc0112dec 0xc01130e0
           0xc0112dbb schedule_timeout+0x73 (0x10f00, 0xc1c15f9c, 0xc03c2c00)
                               kernel .text 0xc0100000 0xc0112d48 0xc0112ddc
0xf7ecdfec 0xc01346b4 kupdate+0x78
                               kernel .text 0xc0100000 0xc013463c 0xc0134738
           0xc010724c kernel_thread+0x28
                               kernel .text 0xc0100000 0xc0107224 0xc010725c
kdb> btp 7
    EBP       EIP         Function(args)
0xf7ec7f94 0xc01130b6 schedule+0x2ca (0x700, 0x0, 0xf7ec6000, 0xc039809c, 
                               kernel .text 0xc0100000 0xc0112dec 0xc01130e0
           0xc0113399 interruptible_sleep_on+0x3d (0xf7ec7fd8, 0xf7ec7fd8, 
0x700, 0xc1c15f50, 0xc1c1e3e0)
                               kernel .text 0xc0100000 0xc011335c 0xc01133ac
           0xc01d92fa pagebuf_daemon+0x9a
                               kernel .text 0xc0100000 0xc01d9260 0xc01d945c
           0xc010724c kernel_thread+0x28
                               kernel .text 0xc0100000 0xc0107224 0xc010725c
kdb> btp 8
    EBP       EIP         Function(args)
0xf7ec5fb0 0xc01130b6 schedule+0x2ca (0x700, 0xc1c15f50, 0x0, 0xf7ec5fdc, 0x0)
                               kernel .text 0xc0100000 0xc0112dec 0xc01130e0
           0xc01d91c9 pagebuf_iodone_daemon+0x109
                               kernel .text 0xc0100000 0xc01d90c0 0xc01d9230
           0xc010724c kernel_thread+0x28
                               kernel .text 0xc0100000 0xc0107224 0xc010725c
<Prev in Thread] Current Thread [Next in Thread>