file preallocation without unwritten flag being set
sandeen at sandeen.net
Wed May 13 00:08:48 CDT 2009
p v wrote:
> I want to avoid any metadata modifications while doing O_DIRECT reads
> (the fs is mounted with noatime). Right now I am doing it mostly for
> testing - I am seeing a performance degradation going from raw to xfs
> on a 10TB filesystem - probably due to my application but I am trying
> to narrow it down so I am starting with running randomio benchmark on
> raw - then 10TB file, then 10 1TB files, then 100 100GB files, ...
you may want to try the inode64 mount option so the allocator is free to
roam your whole 10T ...
> But in general certain applications can definitely take care of the
> preallocated space (db, FB haystack, ...).
Ok, so it sounds like you do understand the implications and you want to
be able to write into prealloc space without any metadata updates as
they are converted to initialized extents... :)
> What they require is
> minimal fragmentation so they would prefer to preallocate the space
> (fill the whole fs with contigous files) and then maintain in-files
> app specific metadata (such as valid offsets of initialized data,
> ...). What I would really like is to have vxfs equivalent of setext
> options -
> setext -r <reservation> -f chggsize
> And on top of that I would really love to have is vxfs equivalent of
> "nomtime" mount option. Then with O_DIRECT I have raw-like
> With the unwritten mkfs option I could get the setext semantics. So
> what's the trick (before I dive into the xfs layout)? I am guessing
> that there is no equivalent for nomtime option?
well, the unwritten=0 option did get removed:
TBH I'm not entirely sure why.
The unwritten flag is per-filesystem not per-file; you can still clear
that feature bit:
#define XFS_SB_VERSION_EXTFLGBIT 0x1000
by using xfs_db in -x expert mode to rewrite every superblock's
"versionnum" without that bit set.
The xfs_db "version" command will give you a more textual representation
of what is actually set before & after.
You could script the sb rewrites...
For what it's worth, your xfs_db tricks below to preallocate seem a bit
This should suffice:
xfs_io -f /hay/foo
xfs_io> resvsp 0 1024g
xfs_io> truncate 1024g
Oh and you're right, there's no "nomtime" option AFAIK.
> Peter Vajgel
More information about the xfs