On Wed, Dec 19, 2012 at 05:17:45PM -0500, Joe Landman wrote:
> Hi folks:
> I've been using sparse files on xfs for a while, and every now and
> then have the pleasure of copying them. Using coreutils cp
> (standard linux cp) usually winds up with the utility seeking
> through the entire file (yes, even with with the --sparse=* options
> It seems to me that the code is blissfully unware of the file
> extents, and its sparse implementation amounts to a read, a check to
> see if it needs to write it as a hole, write it, and the next seek.
> Iterate until done.
> Here's my question. Is there a way to (easily) programmatically
> hand cp (or any other utility) something akin to the output of
> xfs_bmap, and thus save it potentially *gargantuan* amounts of
> seeking over known zero regions? File sparsity in these cases are
> from 80-99% in some cases (fills of 1-20%) for multi GB/TB sized
> Pointers appreciated. I am looking at the copy routines in
> coreutils now, looking to see if we can increase its intelligence
> somewhat w.r.t. sparse files.
Here's a good overview of the state of play:
And what you really want is a version of cp that supports these:
$ man lseek
Seeking file data and holes
Since version 3.1, Linux supports the following additional
values for whence:
Adjust the file offset to the next location in the
file greater than or equal to offset containing data.
If offset points to data, then the file offset is set
Adjust the file offset to the next hole in the file
greater than or equal to offset. If offset points
into the middle of a hole, then the file offset is
set to offset. If there is no hole past offset, then
the file offset is adjusted to the end of the file
(i.e., there is an implicit hole at the end of any
I'm pretty sure coreutils support is in the pipeline right now....