[Top] [All Lists]

mkfs.xfs pagefault when removed storage during operation

To: xfs@xxxxxxxxxxx
Subject: mkfs.xfs pagefault when removed storage during operation
From: Ajeet Yadav <ajeet.yadav.77@xxxxxxxxx>
Date: Tue, 1 Feb 2011 20:06:29 +0900
Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:date:message-id:subject:from:to :content-type; bh=PoqIYE46eehhhNQZepHJ+mVODBjYogpn0DRO8V1LJwM=; b=XiW13yIDHK4avXX+81RnvnyH52sZgfYgSJz2/CHPPoxpjLjyS2kiFa4Mx/bohNQwJs E65PkXgIZIqK18SXBb1oopbJfzPTqbnVi19Djv1+bBy4rPb0Hl7Q37Kzj2lP8Khmh8jd jF4SqlRdMAngekKX3idT/LZzPpCEp2yO8PoUQ=
Domainkey-signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:date:message-id:subject:from:to:content-type; b=rwOInyQKRvpw6hcyCZ2fBC+fQhrAL3xwgTzbJ2yAQDT92RgXMUFuJ/FDRLc8U953ri TKKcD4TeXujyIuck3rE99T+HvdkveFTclhlQbU3jdHMiL0NDkc6sMtzDME6oMnKTAt8Q 6OTAyf4QV+pd1MTSiBQSccHo8xIIZxUTnC0RI=
We are testing mkfs.xfs and xfs_repair stability to look for crashes
and other issues specially with removable devices.
And unfortunately crashes does occur.
Code inspection shows in most cases the caller does not handle
libxfs_readbuf() for error cases i.e when return value = NULL.

Now I need your suggestion.
We should fix all such cases or the simplest way is to exit... if
read() or write() fails with EIO errorno in libxfs_readbufr() and
Fortunately these function already support exit, if we use flag
LIBXFS_EXIT_ON_FAILURE, LIBXFS_B_EXIT but they are used selectively.

The current problem is related to function libxfs_trans_read_buf()

       bp = libxfs_readbuf(dev, blkno, len, flags);
        fprintf(stderr, "trans_read_buf buffer %p, transaction %p\n", bp, tp);
        xfs_buf_item_init(bp, tp->t_mountp);
        bip = XFS_BUF_FSPRIVATE(bp, xfs_buf_log_item_t *);
        bip->bli_recur = 0;
        xfs_trans_add_item(tp, (xfs_log_item_t *)bip);

        /* initialise b_fsprivate2 so we can find it incore */
        XFS_BUF_SET_FSPRIVATE2(bp, tp);
        *bpp = bp;
        return 0;

if  libxfs_readbuf() fails due to device removal or other error, bp = NULL.
In function xfs_buf_item_init(bp, tp->t_mountp) as soon as bp is
dereferenced occurs

mkfs.xfs: unhandled page fault (11) at 0x00000070, code 0x017

<Prev in Thread] Current Thread [Next in Thread>