xfs
[Top] [All Lists]

Re: readdir() ordering guarantees on XFS

To: xfs@xxxxxxxxxxx
Subject: Re: readdir() ordering guarantees on XFS
From: dizzy <dizzy@xxxxxxxxx>
Date: Tue, 10 Jun 2008 11:20:27 +0300
In-reply-to: <20080610035547.GZ10720@disturbed>
References: <200806061634.13990.dizzy@xxxxxxxxx> <20080610035547.GZ10720@disturbed>
Sender: xfs-bounce@xxxxxxxxxxx
User-agent: KMail/1.9.9
On Tuesday 10 June 2008 06:55:47 Dave Chinner wrote:
> On Fri, Jun 06, 2008 at 04:34:13PM +0300, dizzy wrote:
> > Hello
> >
> > POSIX leaves unspecified the order of getting the entries with readdir().
> > This is normal since different filesystems may implement their own
> > techniques to organize entries in a directory (linear, hash, various
> > search trees, etc).
> >
> > But if I can makes sure that several Linux machines will have the same FS
> > (ie XFS), mount options and same kernels can assume that traversing the
> > same file hierarchy structure (that is a file structure with the exact
> > same directories and files as names, structure, attributes, except maybe
> > "ctime" which we can't really control in Linux) can I expect that
> > traversing using readdir() will give me the entries in the exact same
> > order?
>
> No. For speed I suggest sorting the inode stat() calls in ascending
> inode number order before issuing them. 

But this does not solve the main requirement, that is the files traversed on 
the multiple Linux machines have to be sent in the same order (not sure if I 
have specified this in the original mesage, sorry if not). For now I'm 
sorting them lexicographically which is pretty slow. Sorting them by inode 
would not give them in the same order.

> Also, perhaps you should 
> look at:
>
> http://oss.oracle.com/~mason/acp/
>
> To see if you can use similar techniques to speed directory
> traversal.

Funny that you mention acp. We have benchmarked simple "tar" reading and "acp" 
reading of directory structures and on XFS "tar" reading is faster (but not 
on ext3), here are some results (reading a linux kernel tree after a flush of 
the cache by "tar"-ing a huge ammount of data, double the memory size):
- xfs: acp: 1m32s, tar: 1m12s
- ext3: acp: 0m1.5s, tar: 0m2.8s

Although in the test ext3 seems to be much faster than XFS overall in reading, 
it isn't so in writing so we will stick with XFS as it's fast enough for 
reading and fast for writing. Anyway that is another topic.

We still have that ordering issues tho from the original message :)

-- 
Mihai RUSU                                      Email: dizzy@xxxxxxxxx
                        "Linux is obsolete" -- AST


<Prev in Thread] Current Thread [Next in Thread>