[PATCH] xfs: logsunit rounding causes iclog corruption/crash

Mark Tinguely tinguely at sgi.com
Sun Mar 3 13:04:39 CST 2013


On 03/02/13 17:05, Dave Chinner wrote:
> On Sat, Mar 02, 2013 at 02:14:47PM -0600, Mark Tinguely wrote:
>> When the iclog buffer size and log stripe unit are both defined and
>> the log stripe unit is less the log buffer size then the buffer is
>> rounded up to the log stripe unit size during the xlog_sync().
>>
>> This rounding can exceed the iclog buffer length and in xlog_data_pack():
>>   1) Cause corruption inside the iclog buffer because there will not be
>>      enough space for the headers in the front of the iclog buffer for
>>      the rounding.
>>   2) Cause corruption in memory that follows the iclog buffer when
>>      stamping the lsn in each of the rounded blocks.
>>   3) If CONFIG_XFS_DEBUG is defined will cause a crash in xlog_verify_iclog().
>>   4) Cause page fault crash if the memory after the buffer is not mapped.
>>
>> This has been found in XFS versions at least as far back as
>> Linux 2.6.32.
>>
>> This patch forces the iclog buffer to be a multiple of the log stripe
>> unit when they are both defined.
>>
>> Example:
>>    # mkfs.xfs -l su=192k -f /dev/sda2
>
> $ sudo mkfs.xfs -l sunit=192k /dev/vdb
> Specify log sunit in 512-byte blocks, no size suffix
> ....
> $
>
>>    # mount -o logbsize=256k /dev/sda3 /scratch
>>    # io such as fsstress in /scratch will immediately crash a debug xfs
>>      kernel and most like a non-debug xfs kernel.
>
> It's definitely not an immediate crash. xfstests runs for several
> minutes (including through fsstress workloads) before it finally
> falls over. I'd suggest a targetted xfstest is needed for this...
>
>> Index: b/fs/xfs/xfs_super.c
>> ===================================================================
>> --- a/fs/xfs/xfs_super.c
>> +++ b/fs/xfs/xfs_super.c
>> @@ -1357,11 +1357,23 @@ xfs_finish_flags(
>>   		if (mp->m_logbsize<= 0&&
>>   		mp->m_sb.sb_logsunit>  XLOG_BIG_RECORD_BSIZE) {
>>   			mp->m_logbsize = mp->m_sb.sb_logsunit;
>> -		} else if (mp->m_logbsize>  0&&
>> -			   mp->m_logbsize<  mp->m_sb.sb_logsunit) {
>> -			xfs_warn(mp,
>> +		} else if (mp->m_logbsize>  0) {
>> +			if (mp->m_logbsize>  mp->m_sb.sb_logsunit) {
>> +				int	logbsize;
>> +				/* round up to the next multiple of logsunit */
>> +				logbsize = roundup(mp->m_logbsize,
>> +							 mp->m_sb.sb_logsunit);
>> +				if (logbsize>  XLOG_MAX_RECORD_BSIZE)
>> +					/* buffer size too large. round down. */
>> +					logbsize -= mp->m_sb.sb_logsunit;

eeks:
				if (logbsize != mp->m_logbsize) {
>> +				xfs_warn(mp, "log bufsize rounded from %d to %d",
>> +					 mp->m_logbsize, logbsize);
>> +				mp->m_logbsize = logbsize;
				}

>> +			} else if (mp->m_logbsize<  mp->m_sb.sb_logsunit) {
>> +				xfs_warn(mp,
>>   		"logbuf size must be greater than or equal to log stripe size");
>> -			return XFS_ERROR(EINVAL);
>> +				return XFS_ERROR(EINVAL);
>> +			}
>
> If the user has specified an invalid log buffer size, then reject it
> with:
>
> logbsize XXX is not an integer multiple of the log stripe unit YYY
>
> Rounding means that the user isn't getting what they want and they
> may not realise it. If they make a mistake, they should be informed
> and forced to fix it before going any further.
>
> Cheers,
>
> Dave.

The code already silently changes the log blocksize if
mp->m_sb.sb_logsunit > mp->m_logbsize.

IMO, it should fix it not a multiple too.

--Mark.



More information about the xfs mailing list