Re: Error during bulk removal of files

To: Subranshu Patel <spatel.ml@xxxxxxxxx>
Subject: Re: Error during bulk removal of files
From: Mark Tinguely <tinguely@xxxxxxx>
Date: Fri, 10 May 2013 11:40:16 -0500
Cc: xfs@xxxxxxxxxxx
On 05/10/13 11:17, Subranshu Patel wrote:

<reformatted> Call Trace:

 [<ffffffffa056c492>] xlog_wait+0x72/0x90 [xfs]
 [<ffffffff81060250>] ? default_wake_function+0x0/0x20
 [<ffffffffa056e08b>] xlog_grant_log_space+0x3ab/0x520 [xfs]
 [<ffffffffa0582d1a>] ? kmem_zone_zalloc+0x3a/0x50 [xfs]
 [<ffffffff8127f3ac>] ? random32+0x1c/0x20
 [<ffffffffa057d201>] ? xfs_trans_ail_push+0x21/0x80 [xfs]
 [<ffffffffa056e2e6>] xfs_log_reserve+0xe6/0x140 [xfs]
 [<ffffffffa057bad0>] xfs_trans_reserve+0xa0/0x210 [xfs]
 [<ffffffffa055f0a3>] xfs_fs_log_dummy+0x43/0x90 [xfs]
 [<ffffffffa056c9c4>] ? xfs_log_need_covered+0x94/0xd0 [xfs]
 [<ffffffffa0591831>] xfs_sync_worker+0x81/0x90 [xfs]
 [<ffffffffa059171e>] xfssyncd+0x17e/0x210 [xfs]
 [<ffffffffa05915a0>] ? xfssyncd+0x0/0x210 [xfs]
 [<ffffffff81091d66>] kthread+0x96/0xa0
 [<ffffffff8100c14a>] child_rip+0xa/0x20
 [<ffffffff81091cd0>] ? kthread+0x0/0xa0
 [<ffffffff8100c140>] ? child_rip+0x0/0x20

The sync worker ran out of log grant space while trying to write out the dummy record.

Something is not getting moved off the AIL (Active Item List). This could be a symptom of another problem; for example, a lock that is not being released could lead to depleted log space. What else is happening on the filesystem?
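
To illustrate why one stuck item can deplete log space, here is a rough toy model in Python. It is not XFS code: the ToyLog class, its methods, and the sizes are all hypothetical, and it assumes the simplest possible rule, namely that space is only reclaimed from the tail and that a locked item at the tail blocks everything behind it.

```python
from collections import deque


class ToyLog:
    """Toy circular log (hypothetical, not the XFS implementation).

    Space is reclaimed only by writing back the oldest item on the list.
    """

    def __init__(self, size):
        self.size = size
        self.ail = deque()  # (item_name, nbytes), oldest first

    def free_space(self):
        return self.size - sum(nbytes for _, nbytes in self.ail)

    def reserve(self, name, nbytes):
        """Return True if the reservation fits, False if the caller would wait."""
        if nbytes > self.free_space():
            return False
        self.ail.append((name, nbytes))
        return True

    def push(self, locked=()):
        """Write back items from the tail, stopping at the first locked item."""
        while self.ail and self.ail[0][0] not in locked:
            self.ail.popleft()


def run(locked):
    """Make 20 reservations of 10 bytes each, pushing the tail every 5th one.

    Returns how many reservations could not be granted.
    """
    log = ToyLog(size=100)
    stalled = 0
    for i in range(20):
        if not log.reserve(f"item{i}", 10):
            stalled += 1
        if i % 5 == 4:
            log.push(locked=locked)
    return stalled
```

In this sketch run(()) never stalls, because every push empties the list. With run({"item0"}) the oldest item can never be written back, the list fills the whole log, and every later reservation fails, which is roughly the situation the trace suggests.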


The XFS filesystem size is 14TB (LVM), kernel version 2.6.32

The machine has 96GB of memory.

Please let me know if this is a known issue and has already been
fixed in a later kernel version.



