[Top] [All Lists]

XFS Freezing occasionally hangs

To: xfs@xxxxxxxxxxx
Subject: XFS Freezing occasionally hangs
From: Ryan Campbell <ryan.campbell@xxxxxxxxx>
Date: Thu, 24 Jan 2013 09:38:26 -0600
Delivered-to: xfs@xxxxxxxxxxx
Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:x-received:date:message-id:subject:from:to :content-type; bh=jKox6fuAYW8QJI7c0fI+5+rIqToErCZaJeYtr57JfZ0=; b=f61OCuoE+CwJwVhOClciR20UIX7JqaSLzSnhPy1SxeWleZlt19gztCTI9M2QUq0h/s jJx0htJqhGC+6qoRgzbnQGIY39g4g1W/27z+Ii6URw5SZbLaU6aQuEWlW6s88hMD5FXa shR6JAtJ/a9PGqCDXb/UceVt9fUbinJ+SAKx22O/Cr6ni5mgxl4qVjcxzdeUpVyebUsO 927KjEp2vR79deFefjZ7Qh+HyluOJA7UGb2yHvp94EAcaZNvdyD1PXGtGgMso+XC6wDc qsGeZCbgnMmgEZVgimDPGAWT0h0z7GAoLU805iDKmvd6Vp3A8oJclmPEaZ7jdeXAPIJk ikmg==
We use XFS on EC2 EBS running on Arch Linux (kernel 3.6.7-1-ARCH). Before we take an EC2 snapshot, we call xfs_freeze on the mount so that we can have a consistent snapshot.

On instances running for around 30 days, we see the XFS freeze hang, resulting in a skyrocketing load (but not CPU). We can't kill the xfs_freeze process. The only route available is usually to force a reboot of the server without cleaning unmounting.

There are backtraces in dmesg complaining both of hung xfs processes, and hung java processes. https://gist.github.com/3d46830eac52df44d30f

Is this a known issue? We call xfs_freeze quite often on thousands of XFS volumes, so I wouldn't be surprised if we have encountered an edge case considering how often we do this.

Any help appreciated. I'm recampbell on #xfs/freenode.

<Prev in Thread] Current Thread [Next in Thread>