
Re: XFS performance tracking and regression monitoring

To: Mark Goodwin <markgw@xxxxxxx>
Subject: Re: XFS performance tracking and regression monitoring
From: Dave Chinner <david@xxxxxxxxxxxxx>
Date: Fri, 24 Oct 2008 14:54:11 +1100
Cc: xfs-oss <xfs@xxxxxxxxxxx>
In-reply-to: <490108E6.7060502@xxxxxxx>
Mail-followup-to: Mark Goodwin <markgw@xxxxxxx>, xfs-oss <xfs@xxxxxxxxxxx>
References: <490108E6.7060502@xxxxxxx>
User-agent: Mutt/1.5.18 (2008-05-17)
On Fri, Oct 24, 2008 at 09:29:42AM +1000, Mark Goodwin wrote:
> We're about to deploy a system+jbod dedicated for performance
> regression tracking. The idea is to build the XFS dev branch
> nightly, run a bunch of self contained benchmarks, and generate
> a progressive daily report - date on the X-axis, with (perhaps)
> wallclock runtime on the y-axis.

Wallclock runtime is not indicative of relative performance
for many benchmarks. For example, dbench runs for a fixed time and
then gives a throughput number as its output. It's the throughput
you want to compare.....

> The aim is to track relative XFS performance on a daily basis
> for various workloads on identical h/w. If each workload runs for
> approx the same duration, the reports can all share the same
> generic y-axis. The long term trend should have a positive
> gradient.

If you are measuring walltime, then you should see a negative
gradient as an indication of improvement....
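
One way to get a shared y-axis despite the two kinds of metric is to plot each result relative to a baseline run, flipping the ratio for "lower is better" numbers. A rough sketch only; `relative_improvement` and the sample figures are made up for illustration and are not taken from any existing harness:

```python
# Hypothetical helper: express a result relative to a baseline so that a
# positive number always means "better", whichever way the raw metric runs.

def relative_improvement(baseline, value, higher_is_better):
    """Return the fractional improvement of value over baseline."""
    if baseline <= 0 or value <= 0:
        raise ValueError("metrics must be positive")
    if higher_is_better:
        # Throughput-style metric (e.g. MB/s): more is better.
        return value / baseline - 1.0
    # Walltime-style metric (seconds): less is better, so invert the ratio.
    return baseline / value - 1.0


# Illustrative numbers only: throughput rising from 200 to 220 MB/s,
# and walltime falling from 120 s to 100 s. Both come out positive.
print(relative_improvement(200.0, 220.0, higher_is_better=True))
print(relative_improvement(120.0, 100.0, higher_is_better=False))
```

With this normalisation a positive gradient means improvement for both throughput and walltime benchmarks.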

> Regressions can be date correlated with commits.

For the benchmarks to be useful as regression tests, the
harness really needs to be profiling and gathering statistics at the
same time so that we might be able to determine what caused the
regression.
> Comments, benchmark suggestions?

The usual set - bonnie++, postmark, ffsb, fio, sio, etc.

Then some artificial tests that stress scalability, like the speed
of creating 1m small files with long names in a directory, the speed
of a cold cache read of the directory, the speed of a hot-cache read
of the directory, time to stat all the files (cold and hot cache),
time to remove all the files, etc. And then how well it scales
as you do this with more threads and directories in parallel...
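
A single-threaded sketch of that kind of directory test might look like the following. It only covers the hot-cache cases; the cold-cache numbers would need the filesystem unmounted and remounted (or the caches dropped) between phases, which is left out here. The function names, the 200-character name length and the 1000-file default are arbitrary choices for illustration:

```python
import os
import tempfile
import time


def timed(fn):
    """Run fn() and return (elapsed seconds, fn's return value)."""
    start = time.perf_counter()
    result = fn()
    return time.perf_counter() - start, result


def long_name(i, length=200):
    # Zero-padded index plus filler, kept under the usual 255-byte limit.
    return f"{i:08d}_".ljust(length, "x")


def dir_benchmark(directory, nfiles):
    """Time create, readdir, stat and remove of nfiles small files."""
    names = [long_name(i) for i in range(nfiles)]

    def create():
        for n in names:
            with open(os.path.join(directory, n), "wb") as f:
                f.write(b"x")

    def read_dir():
        return len(os.listdir(directory))

    def stat_all():
        for n in names:
            os.stat(os.path.join(directory, n))

    def remove():
        for n in names:
            os.unlink(os.path.join(directory, n))

    results = {}
    results["create"], _ = timed(create)
    results["readdir_hot"], count = timed(read_dir)
    results["stat_hot"], _ = timed(stat_all)
    results["remove"], _ = timed(remove)
    results["entries_seen"] = count
    return results


if __name__ == "__main__":
    # A temporary directory stands in for a directory on the XFS volume.
    with tempfile.TemporaryDirectory() as d:
        print(dir_benchmark(d, 1000))
```

For the scaling runs, the same function could be called from several threads or processes, each with its own directory, and the per-phase times compared as the count goes up.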

> Anyone already running this?
> Know of a test harness and/or report generator?

Perhaps you might want to look more closely at FFSB - it has a
fairly interesting automated test harness. e.g. it was used to
produce these:


And you can probably set up custom workloads to cover all the things
that the standard benchmarks do.....


Dave Chinner
