[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

use-after-free in xfs_bawrite()




Using Linus's current tree plus all the -mm gunk I get a fairly easy oops
running dbench on XFS on SMP with CONFIG_DEBUG_PAGEALLOC=y:

Program received signal SIGEMT, Emulation trap.
0xc0282c9d in xfs_iflush (ip=0xc284a004, flags=2) at fs/xfs/pagebuf/page_buf.h:397
397             if (!pb || atomic_read(&pb->pb_io_remaining))
(gdb) p pb
$1 = (page_buf_t *) 0xc98d1004
(gdb) p *pb
Cannot access memory at address 0xc98d1004
(gdb) bt
#0  0xc0282c9d in xfs_iflush (ip=0xc284a004, flags=2) at fs/xfs/pagebuf/page_buf.h:397
#1  0xc0283dee in xfs_inode_item_push (iip=0xc2849004) at fs/xfs/xfs_inode_item.c:882
#2  0xc0294ddf in xfs_trans_push_ail (mp=0xceb87004, threshold_lsn=21474846993) at fs/xfs/xfs_trans_ail.c:170
#3  0xc0287622 in xlog_grant_push_ail (mp=0xceb87004, need_bytes=492072) at fs/xfs/xfs_log.c:1390
#4  0xc028652c in xfs_log_reserve (mp=0xceb87004, unit_bytes=157880, cnt=3, ticket=0xc9451038, client=105, 
    flags=2) at fs/xfs/xfs_log.c:461
#5  0xc0293afd in xfs_trans_reserve (tp=0xc9451004, blocks=41, logspace=157880, rtextents=0, flags=4, logcount=3)
    at fs/xfs/xfs_trans.c:275
#6  0xc029c340 in xfs_mkdir (dir_bdp=0xc0c3e024, dentry=0xca964004, vap=0xc790fec4, vpp=0xc790fec0, credp=0x0)
    at fs/xfs/xfs_vnodeops.c:2878
#7  0xc02a687c in linvfs_mknod (dir=0xc0c3f024, dentry=0xca964004, mode=16832, rdev=0)
    at fs/xfs/linux/xfs_iops.c:136
#8  0xc02a6a4f in linvfs_mkdir (dir=0xc0c3f024, dentry=0xca964004, mode=448) at fs/xfs/linux/xfs_iops.c:190
#9  0xc016d838 in vfs_mkdir (dir=0xc0c3f024, dentry=0xca964004, mode=448) at fs/namei.c:1510
#10 0xc016d901 in sys_mkdir (pathname=0xbffff2e7 "CLIENTS/CLIENT16/~DMTMP/SEED", mode=448) at fs/namei.c:1537

The memory at 0xc98d1004 has been unmapped.

The oops is on the xfs_iflush() -> xfs_bawrite() -> pagebuf_run_queues() path.

It looks to me like pagebuf_iostart() has called pagebuf_iorequest() which called
_pagebuf_iodone() which called pagebuf_iodone() whuich threw away the pagebuf.

If this is vaguely correct then this part of pagebuf_iostart():

	/* Wait for I/O if we are not an async request */
	if ((status == 0) && (flags & PBF_ASYNC) == 0) {

also needs attention...


The below quick patch fixes it up.  But it also causes zillions of dentries
and inodes to be leaked for some reason.  Consider it a technology
demonstration!

XFS has waaaaay too much inlining btw ;)

Seems that dbench is not XFS's favourite benchmark.  How come?  Do I need
more logbufs?



 fs/xfs/xfs_buf.h |    2 ++
 1 files changed, 2 insertions(+)

diff -puN fs/xfs/xfs_buf.h~a fs/xfs/xfs_buf.h
--- 25/fs/xfs/xfs_buf.h~a	2003-08-02 00:53:59.000000000 -0700
+++ 25-akpm/fs/xfs/xfs_buf.h	2003-08-02 00:56:03.000000000 -0700
@@ -220,8 +220,10 @@ static inline int	xfs_bawrite(void *mp, 
 	bp->pb_fspriv3 = mp;
 	bp->pb_strat = xfs_bdstrat_cb;
 	xfs_buf_undelay(bp);
+	atomic_inc(&bp->pb_hold);
 	if ((ret = pagebuf_iostart(bp, PBF_WRITE | PBF_ASYNC)) == 0)
 		pagebuf_run_queues(bp);
+	pagebuf_rele(bp);
 	return ret;
 }
 

_