On 6/6/2013 10:50 PM, Dave Chinner wrote:
> Hence I suggest just waiting for it to complete - if you have
> millions of inodes it could hours to complete, so just let it go
> On Thu, Jun 06, 2013 at 07:59:34PM -0700, Adam Brenner wrote:
> I do not mind high CPU usage or slow performance while the mount point
> is active, but having it unavailable is not an option we would like
> stick with. I suspect running quota check on 14TB would take roughly a
> full work day and then some.
This 14TB XFS is the mpc-data1 workspace (/w1/$USER), correct? User
compute cluster jobs access this filesystem 24x7, correct? So to allow
for the remount quotacheck to complete you will need to have users move
jobs to the 40TB mpc-data2 workspace, or another temporary workspace,
until /w1 is accessible again. Or, their jobs simply must wait until
quotacheck completes. Or?
As Dave said there's no way around the initial quotacheck. If you
provide the file count Dave requested he may be able to provide a rough
estimate of completion time. This may benefit your outage planning.
I.e. it make complete much sooner than you currently guess, or it may
actually take substantially longer. I learned long ago that when devs
and experienced users on this mailing list ask for information, there's
a good reason. Thus it's best to answer all questions. You skipped
FWIW, I just checked mount(8), the XFS FAQ, User Guide, and Training
Labs, and I could not find a caveat or warning WRT mount time when
enabling quotas on existing filesystems. Dave, might this be suitable
for the FAQ? I'd guess many users have no idea the entire filesystem
must be walked when quota is enabled, since most enable it on freshly
made XFS. Could we add to the end of this something like below?
"A check is run at mount after enabling quotas. If you already have
many files this will take some time: multiple hours with millions of
files. During this time the filesystem is unavailable."
I don't believe I ever got an account, so I don't think I can make
edits. If this is a bad suggestion, maybe that's why I have no acct. ;)