[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: XFS NFS server Oops
Steve +,
Thanks for the patch (vnode.patch) and sorry for the delay, we've had a
week or so of stability on this server so haven't had chance to install
the patched kernel.
However, managed to fit it into a regular maintenance period yesterday,
though I soon ran into (what I believe was) the bug reported by Dave
Alden and fixed by you/Eric Sandeen (thanks!) of NFS hangs
(page_buf_io.c) (though it was not quite as reproducible as it seemed to
be for Dave) - anyway updated to the latest CVS tree (as of ~ 12:00GMT
13th March) + the patch you sent me last week and so far so good! As
I indicated before I have seen upto 14 days between crashes so its a
bit early to tell if its fixed my problem but at least its run for
>24hrs now without any noticeable bad effects.
Regards and thanks.
Ian Hardy
On 06 Mar 2002 13:12:19 -0600 Steve Lord wrote:
>
> On Mon, 2002-02-04 at 07:30, Ian D. Hardy wrote:
> > Hi,
> >
> > Anyone any ideas on the following Oops (processed with ksymoops 2.4.3). It is
> > from a NFS server (Dual 1Ghz Supermicro LE, 1Gbyte RAM, 40Gbyte Maxtor IDE
> > system disk, Zero-D/GForce RI Fibrechannel to IDE hardware RAID-5 500Gbyte
> > disk unit). It is running the Linux 2.4.17-xfs kernel taken as a CVS image
> > on 27th January. The main area of disk it is serving is on the HW RAID unit,
> > which is the only XFS filesystem on the system. The system had been up
> > for just over 3 days when it crashed.
> >
> > I reported a very similar failure a few weeks ago, at that time running a
> > 2.4.9 based kernel, Steve Lord suggested that we tried the latest CVS image
> > as this had fixed some memory alloacation problems.
> >
> > The machine is essentially an NFS fileserver to a computational cluster. Though
> > of possible interest is the 'save' process that was running on one of the
> > processes, this is the Legato Networker backup client process (which was
> > performing a full backup of the XFS filesystem at the time). I don't think
> > this is significant as I was seeing these crashes (at ~4 to 12 day intervals)
> > with the 2.4.9 kernel not dependant upon a 'save' session running.
> >
> >
>
> Ian, can you try the attached patch against a current cvs kernel and see
> if it helps at all.
>
> Steve
>
> --
>
> Steve Lord voice: +1-651-683-3511
> Principal Engineer, Filesystem Software email: lord@sgi.com
>
> --------------------------------------------------------------------------------
> Name: vnode.patch
> vnode.patch Type: Plain Text (text/plain)
> Encoding: quoted-printable
--
/////////////Technical Coordination, Research Services////////////////////
Ian Hardy Tel: 023 80 593577
Computing Services Mobile: 0709 2127503
Southampton University email: idh@soton.ac.uk
Southampton S017 1BJ, UK. i.d.hardy@soton.ac.uk
\\'BUGS: The notion of errors is ill-defined' (IRIX man page for netstat)\