xfs
[Top] [All Lists]

hardlinking and deleting milions of small files

To: "xfs@xxxxxxxxxxx" <xfs@xxxxxxxxxxx>
Subject: hardlinking and deleting milions of small files
From: Arkadiusz MiÅkiewicz <arekm@xxxxxxxx>
Date: Sun, 24 Jul 2016 14:38:10 +0200
Delivered-to: xfs@xxxxxxxxxxx
Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=maven.pl; s=maven; h=from:to:subject:date:user-agent:mime-version :content-transfer-encoding:message-id; bh=oshwm8PPYOje1sMTamQWxjzHYLQ7/eRfohEPakQzMpU=; b=XKCz7CAV5A3+JrcQdmXxyDx3b/8Q1WvXiuUmZhi02KqRqC/XW/uJ7o6by3PXnWApMy 03pyXGeRd1X2aaE+/vtJ2VHt6XZaRn9FEU67f5sv0PwMi//D3nYWPT4yvsv1O2gzBKCK FSCou6K8eokPDZ2Zy6ZYfIQNq5f9BGQ6F96EA=
User-agent: KMail/1.13.7 (Linux/4.7.0-rc7-00092-g47ef4ad; KDE/4.14.21; x86_64; ; )
Hello.

I'm using rsnapshot to backup big servers (like 5TB fs, 25 000 000 inodes, 
small files - mailboxes in form of maildirs, so each mail is a separate file). 
Backup server - kernel 4.6.3, V4 xfs filesystems.

cp -al for that amount takes about 1.5 day.
rm -rf of hardlinked copy takes another 1.5 day

(and toons of ram for these operations; causing OOM until recent kernels made 
reclaim better, so no more OOM)


Now the weird part - similar operations on ext4 finish in matter of hours.


Are there any possibilities for xfs to improve in these areas? 

From irc #xfs from few months ago the conclusion was that xfs isn't best in 
such operations.

ps. Didn't do scientific comparison (I'm just viewing backup logs of two 
similar mail servers (similar hardware, similar storage size) being backed up 
to single backup server onto two partitions - one with xfs and one with ext4 
on it))
-- 
Arkadiusz MiÅkiewicz, arekm / ( maven.pl | pld-linux.org )

<Prev in Thread] Current Thread [Next in Thread>