xfs
[Top] [All Lists]

Re: Problems with 2.4.19-pre9

To: Steve Lord <lord@xxxxxxx>, Florin Iucha <florin@xxxxxxxxx>, linux-xfs@xxxxxxxxxxx
Subject: Re: Problems with 2.4.19-pre9
From: Paul Schutte <paul@xxxxxxxx>
Date: Fri, 31 May 2002 19:35:13 +0200
References: <20020531115832.GA1401@iucha.net> <3CF76A46.3FEC1553@up.ac.za> <1022856397.27212.1.camel@jen.americas.sgi.com> <3CF7AEBF.CAB8CCFB@up.ac.za>
Sender: owner-linux-xfs@xxxxxxxxxxx
hang.patch does exactly the same.

Do you need a stack trace of this as well ?

Paul

Paul Schutte wrote:

> My prevoius observations were made on 2.4.18 which I checked out on 24 May.
> The 2.4.19-pre9 looks to be a bit worse than 2.4.18.
> The machine was unable to complete the bootup.
>
> I attach the whole capture file.
> It looks like everthing is stuck in schedule.
> It is odd that no filesystems was mounted yet, at this point.
> The machine has frozen cold.
>
> I will now try hang.patch and see how it goes.
>
> Paul
>
> Steve Lord wrote:
>
> > On Fri, 2002-05-31 at 07:19, Paul Schutte wrote:
> > > Changes that went in on  May  24th caused the problems.
> > > I checked the tree out on the morning of 24 May and Steve dropped a few
> > > changes to get XFS out of the irq business later that day.
> > > I checked the tree out again before I went home.
> > > The first one works perfectly, the second one shows the behaviour 
> > > described.
> > > I checked with kdb to see what happends when everythings pauses.
> > >
> > > keventd is shown to be running when I do a ps.
> > > It is busy doing  "schedule"
> > > The cp that shows the pause behaviour are also in "schedule".
> > > It looks like same kind of deadlock that resolves it self after a while.
> > > It takes between 5 to 30 seconds for the cp to continue.
> > >
> > > Hope this helps.
> > >
> > > Paul
> > >
> >
> > Paul, which compiler do you use? The problem seems to go away for people
> > who switch away from that debian compiler. Also if possible, during the
> > pause, can you look at the stack for pagebuf_io_CPU0 and report where
> > that is?
> >
> > Thanks
> >
> >    Steve
> >
> > --
> >
> > Steve Lord                                      voice: +1-651-683-3511
> > Principal Engineer, Filesystem Software         email: lord@xxxxxxx
>
>   ------------------------------------------------------------------------
> Linux version 2.4.19-pre9-20020531i (root@xxxxxxxxxxxxxx) (gcc version 2.95.4 
> 20011002 (Debian prerelease)) #3 Fri May 31 18:37:55 SAST 2002
> BIOS-provided physical RAM map:
>  BIOS-e820: 0000000000000000 - 000000000009e400 (usable)
>  BIOS-e820: 000000000009e400 - 00000000000a0000 (reserved)
>  BIOS-e820: 00000000000e8000 - 0000000000100000 (reserved)
>  BIOS-e820: 0000000000100000 - 000000003fff0000 (usable)
>  BIOS-e820: 000000003fff0000 - 000000003ffffc00 (ACPI data)
>  BIOS-e820: 000000003ffffc00 - 0000000040000000 (ACPI NVS)
>  BIOS-e820: 00000000fec00000 - 00000000fec10000 (reserved)
>  BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved)
>  BIOS-e820: 00000000fff00000 - 0000000100000000 (reserved)
> 127MB HIGHMEM available.
> 896MB LOWMEM available.
> On node 0 totalpages: 262128
> zone(0): 4096 pages.
> zone(1): 225280 pages.
> zone(2): 32752 pages.
> Kernel command line: auto BOOT_IMAGE=Linux rw root=3002 console=ttyS0, 
> console=tty1
> Initializing CPU#0
> Detected 498.757 MHz processor.
> Console: colour VGA+ 80x25
> Calibrating delay loop... 996.14 BogoMIPS
> Memory: 1032356k/1048512k available (1707k kernel code, 15764k reserved, 
> 1011k data, 464k init, 131008k highmem)
> kdb version 2.1 by Scott Lurndal, Keith Owens. Copyright SGI, All Rights 
> Reserved
> Dentry cache hash table entries: 131072 (order: 8, 1048576 bytes)
> Inode cache hash table entries: 65536 (order: 7, 524288 bytes)
> Mount-cache hash table entries: 16384 (order: 5, 131072 bytes)
> Buffer-cache hash table entries: 65536 (order: 6, 262144 bytes)
> Page-cache hash table entries: 262144 (order: 8, 1048576 bytes)
> CPU: L1 I cache: 16K, L1 D cache: 16K
> CPU: L2 cache: 512K
> CPU serial number disabled.
> Intel machine check architecture supported.
> Intel machine check reporting enabled on CPU#0.
> CPU: Intel Pentium III (Katmai) stepping 02
> Enabling fast FPU save and restore... done.
> Enabling unmasked SIMD FPU exception support... done.
> Checking 'hlt' instruction... OK.
> POSIX conformance testing by UNIFIX
> mtrr: v1.40 (20010327) Richard Gooch (rgooch@xxxxxxxxxxxxx)
> mtrr: detected mtrr type: Intel
> PCI: PCI BIOS revision 2.10 entry at 0xfdaf0, last bus=1
> PCI: Using configuration type 1
> PCI: Probing PCI hardware
> Unknown bridge resource 0: assuming transparent
> Unknown bridge resource 1: assuming transparent
> Unknown bridge resource 2: assuming transparent
> PCI: Using IRQ router PIIX [8086/7110] at 00:12.0
> PCI: Cannot allocate resource region 4 of device 00:12.1
> Limiting direct PCI/PCI transfers.
> isapnp: Scanning for PnP cards...
> isapnp: No Plug & Play device found
> Linux NET4.0 for Linux 2.4
> Based upon Swansea University Computer Society NET3.039
> Initializing RT netlink socket
> Starting kswapd
> allocated 32 pages and 32 bhs reserved for the highmem bounces
> Installing knfsd (copyright (C) 1996 okir@xxxxxxxxxxxx).
> SGI XFS with no debug enabled
> pty: 256 Unix98 ptys configured
> Serial driver version 5.05c (2001-07-08) with MANY_PORTS SHARE_IRQ SERIAL_PCI 
> ISAPNP enabled
> ttyS00 at 0x03f8 (irq = 4) is a 16550A
> Uniform Multi-Platform E-IDE driver Revision: 6.31
> ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
> PIIX4: IDE controller on PCI bus 00 dev 91
> PIIX4: detected chipset, but driver not compiled in!
> PIIX4: chipset revision 1
> PIIX4: not 100% native mode: will probe irqs later
> hda: NEC CD-ROM DRIVE:282, ATAPI CD/DVD-ROM drive
> ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
> hda: ATAPI 40X CD-ROM drive, 128kB Cache
> Uniform CD-ROM driver Revision: 3.12
> Floppy drive(s): fd0 is 1.44M
> FDC 0 is a National Semiconductor PC87306
> RAMDISK driver initialized: 16 RAM disks of 4096K size 1024 blocksize
> loop: loaded (max 8 devices)
> PCI: Assigned IRQ 5 for device 00:0e.1
> DAC960: ***** DAC960 RAID Driver Version 2.4.11 of 11 October 2001 *****
> DAC960: Copyright 1998-2001 by Leonard N. Zubkoff <lnz@xxxxxxxxxxxxx>
>
> Entering kdb (current=0xc03a8000, pid 0) due to Keyboard Entry
> kdb> bt
>     EBP       EIP         Function(args)
> 0xc03a9fd0 0xc0106f26 default_idle+0x26 (0x0, 0x9ea00, 0xc0105000)
>                                kernel .text 0xc0100000 0xc0106f00 0xc0106f30
> 0xc03a9fe4 0xc0106f99 cpu_idle+0x41
>                                kernel .text 0xc0100000 0xc0106f58 0xc0106fac
> 0xc03a9fec 0xc010502a stext+0x2a (0xc041f780)
>                                kernel .text 0xc0100000 0xc0105000 0xc0105030
> 0xc03a9ff8 0xc03aa807 start_kernel+0x157
>                                kernel .text.init 0xc03aa000 0xc03aa6b0 
> 0xc03aa80c
> kdb> ps
> Task Addr  Pid      Parent   [*] cpu  State Thread     Command
> 0xc1c14000 00000001 00000000  1  000  stop  0xc1c14270 swapper
> 0xf7ef8000 00000002 00000001  1  000  stop  0xf7ef8270 keventd
> 0xf7ef4000 00000003 00000001  1  000  stop  0xf7ef4270 ksoftirqd_CPU0
> 0xf7ef2000 00000004 00000001  1  000  stop  0xf7ef2270 kswapd
> 0xf7ece000 00000005 00000001  1  000  stop  0xf7ece270 bdflush
> 0xf7ecc000 00000006 00000001  1  000  stop  0xf7ecc270 kupdated
> 0xf7ec6000 00000007 00000001  1  000  stop  0xf7ec6270 pagebufd
> 0xf7ec4000 00000008 00000001  1  000  stop  0xf7ec4270 pagebuf_io_CPU0
> kdb> btp 1
>     EBP       EIP         Function(args)
> 0xc1c15e3c 0xc01130b6 schedule+0x2ca (0xf7e88000, 0xc1c15e90, 0x0, 
> 0xc1c14000, 0x0)
>                                kernel .text 0xc0100000 0xc0112dec 0xc01130e0
>            0xc011333e wait_for_completion+0x72 (0xf7e88120, 0xf7e88000, 
> 0x292, 0x0, 0x0)
>                                kernel .text 0xc0100000 0xc01132cc 0xc011335c
> 0xc1c15ea0 0xc0218314 DAC960_ExecuteCommand+0x68 (0xf7e88120, 0xf7e88000, 
> 0x1, 0x53000000)
>                                kernel .text 0xc0100000 0xc02182ac 0xc0218320
> 0xc1c15eb8 0xc0218385 DAC960_V1_ExecuteType3+0x65 (0xf7e88000, 0x53, 
> 0xf7e892fc, 0xf7e88000, 0xf7e88000)
>                                kernel .text 0xc0100000 0xc0218320 0xc02183ac
> 0xc1c15fa0 0xc0218e44 DAC960_V1_ReadControllerConfiguration+0x20 (0xf7e88000, 
> 0xc041d038)
>                                kernel .text 0xc0100000 0xc0218e24 0xc0219310
> 0xc1c15fb0 0xc021b308 DAC960_InitializeController+0x10 (0xf7e88000, 
> 0xc041d038, 0xc03a9fcc, 0x0)
>                                kernel .text 0xc0100000 0xc021b2f8 0xc021b3b4
> 0xc1c15fc8 0xc021b5e6 DAC960_Initialize+0x66 (0x10f00)
>                                kernel .text 0xc0100000 0xc021b580 0xc021b62c
> 0xc1c15fd4 0xc03aa819 do_initcalls+0xd (0xc1c15fec)
>                                kernel .text.init 0xc03aa000 0xc03aa80c 
> 0xc03aa830
> 0xc1c15fdc 0xc03aa862 do_basic_setup+0x32 (0x10f00, 0xc03a9fcc)
>                                kernel .text.init 0xc03aa000 0xc03aa830 
> 0xc03aa870
>            0xc010503a init+0xa
>                                kernel .text 0xc0100000 0xc0105030 0xc0105140
>            0xc010724c kernel_thread+0x28
> more> btp 2
> Only 'q' or 'Q' are processed at more prompt, input ignored
>                                kernel .text 0xc0100000 0xc0107224 0xc010725c
> kdb> btp 2
>     EBP       EIP         Function(args)
> 0xf7ef9fa8 0xc01130b6 schedule+0x2ca (0x700, 0xc1c15fb4, 0xc03c2bb8, 
> 0xf7ef8574, 0xf7ef8564)
>                                kernel .text 0xc0100000 0xc0112dec 0xc01130e0
>            0xc0120a85 context_thread+0xfd
>                                kernel .text 0xc0100000 0xc0120988 0xc0120b30
>            0xc010724c kernel_thread+0x28
>                                kernel .text 0xc0100000 0xc0107224 0xc010725c
> kdb> btp 3
>     EBP       EIP         Function(args)
> 0xf7ef5fdc 0xc01130b6 schedule+0x2ca (0x10f00, 0xc1c15fa0)
>                                kernel .text 0xc0100000 0xc0112dec 0xc01130e0
>            0xc0119984 ksoftirqd+0x74
>                                kernel .text 0xc0100000 0xc0119910 0xc01199c4
>            0xc010724c kernel_thread+0x28
>                                kernel .text 0xc0100000 0xc0107224 0xc010725c
> kdb> btp 4
>     EBP       EIP         Function(args)
> 0xf7ef3fc8 0xc01130b6 schedule+0x2ca (0x10f00, 0xc1c15fa4, 0x0, 0x0, 
> 0xf7ef2000)
>                                kernel .text 0xc0100000 0xc0112dec 0xc01130e0
>            0xc012b0cb kswapd+0x7f
>                                kernel .text 0xc0100000 0xc012b04c 0xc012b110
>            0xc010724c kernel_thread+0x28
>                                kernel .text 0xc0100000 0xc0107224 0xc010725c
> kdb> btp 5
>     EBP       EIP         Function(args)
> 0xf7ecffbc 0xc01130b6 schedule+0x2ca (0x1f4, 0x0, 0xf7ece000, 0xc0392e4c, 
> 0xc0392e4c)
>                                kernel .text 0xc0100000 0xc0112dec 0xc01130e0
>            0xc0113399 interruptible_sleep_on+0x3d (0x10f00, 0xc1c15fa8, 
> 0xc03c2c00)
>                                kernel .text 0xc0100000 0xc011335c 0xc01133ac
>            0xc013463a bdflush+0xa6
>                                kernel .text 0xc0100000 0xc0134594 0xc013463c
>            0xc010724c kernel_thread+0x28
>                                kernel .text 0xc0100000 0xc0107224 0xc010725c
> kdb> btp 6
>     EBP       EIP         Function(args)
> 0xf7ecdfb0 0xc01130b6 schedule+0x2ca (0xf7ecdfc4, 0xf7ecc000, 0xc02b2877, 
> 0xc042744c, 0xc042744c)
>                                kernel .text 0xc0100000 0xc0112dec 0xc01130e0
>            0xc0112dbb schedule_timeout+0x73 (0x10f00, 0xc1c15f9c, 0xc03c2c00)
>                                kernel .text 0xc0100000 0xc0112d48 0xc0112ddc
> 0xf7ecdfec 0xc01346b4 kupdate+0x78
>                                kernel .text 0xc0100000 0xc013463c 0xc0134738
>            0xc010724c kernel_thread+0x28
>                                kernel .text 0xc0100000 0xc0107224 0xc010725c
> kdb> btp 7
>     EBP       EIP         Function(args)
> 0xf7ec7f94 0xc01130b6 schedule+0x2ca (0x700, 0x0, 0xf7ec6000, 0xc039809c, 
> 0xc039809c)
>                                kernel .text 0xc0100000 0xc0112dec 0xc01130e0
>            0xc0113399 interruptible_sleep_on+0x3d (0xf7ec7fd8, 0xf7ec7fd8, 
> 0x700, 0xc1c15f50, 0xc1c1e3e0)
>                                kernel .text 0xc0100000 0xc011335c 0xc01133ac
>            0xc01d92fa pagebuf_daemon+0x9a
>                                kernel .text 0xc0100000 0xc01d9260 0xc01d945c
>            0xc010724c kernel_thread+0x28
>                                kernel .text 0xc0100000 0xc0107224 0xc010725c
> kdb> btp 8
>     EBP       EIP         Function(args)
> 0xf7ec5fb0 0xc01130b6 schedule+0x2ca (0x700, 0xc1c15f50, 0x0, 0xf7ec5fdc, 0x0)
>                                kernel .text 0xc0100000 0xc0112dec 0xc01130e0
>            0xc01d91c9 pagebuf_iodone_daemon+0x109
>                                kernel .text 0xc0100000 0xc01d90c0 0xc01d9230
>            0xc010724c kernel_thread+0x28
>                                kernel .text 0xc0100000 0xc0107224 0xc010725c
> kdb>


<Prev in Thread] Current Thread [Next in Thread>