[XFS] Any process to a particular XFS device hung in D state forever.

To: xfs@xxxxxxxxxxx
Subject: [XFS] Any process to a particular XFS device hung in D state forever.
From: Hugo Kuo <hugo@xxxxxxxxxxxxxx>
Date: Tue, 19 Apr 2016 17:56:19 +0800
Cc: Darrell Bishop <darrell@xxxxxxxxxxxxxx>
Hi XFS team,

We have hit a problem frequently over the past three weeks. Our daemons store data, along with associated xattrs, on XFS partitions.

The disk seems to stop responding: every process accessing it ends up in D state and cannot be killed at all.
  • It happens on several disks, seemingly at random.
  • A reboot seems to solve the problem temporarily.
  • All disks are multipath devices.
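
For anyone who wants to check for the same symptom on their own node, the processes stuck in D state can be listed straight from /proc. This is only a minimal sketch and not something taken from our deployment; the function names `parse_stat` and `d_state_processes` are made up for illustration, and it assumes a Linux /proc layout.

```python
import os


def parse_stat(text):
    """Return (pid, comm, state) from the contents of /proc/<pid>/stat."""
    # comm is wrapped in parentheses and may itself contain spaces or ')',
    # so split on the last ')' instead of splitting on whitespace.
    head, _, tail = text.rpartition(")")
    pid_str, _, comm = head.partition("(")
    return int(pid_str), comm, tail.split()[0]


def d_state_processes(proc_root="/proc"):
    """Return a list of (pid, comm) for processes in uninterruptible sleep."""
    if not os.path.isdir(proc_root):
        return []
    found = []
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue
        try:
            with open(os.path.join(proc_root, entry, "stat")) as fh:
                pid, comm, state = parse_stat(fh.read())
        except (OSError, ValueError, IndexError):
            # The process exited while we were scanning, or the line was odd.
            continue
        if state == "D":
            found.append((pid, comm))
    return found


if __name__ == "__main__":
    for pid, comm in d_state_processes():
        print(pid, comm)
```

Reading /proc/<pid>/stat does not touch the affected filesystem, so the scan itself should not block on the hung disk.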

At first I suspected a corrupted or failing disk, but smartctl shows no sign of a bad disk, and a reboot makes the problem go away.

  • Any process accessing this disk is blocked, even a simple $ls. [Kernel log]
  • I tested the disk by reading blocks directly with $dd. It works fine, with no errors in dmesg.
  • The `xfs_repair -n` output for a problematic mount point: [xfs_repair -n]. It is still running.
  • Kernel: Linux node9 2.6.32-573.8.1.el6.x86_64 #1 SMP Tue Nov 10 18:01:38 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux
  • OS: CentOS release 6.5 (Final)
  • XFS: xfsprogs.x86_64 3.1.1-14.el6

The $ls command behaves in an interesting way:

* This completes within 1 second and writes the result to the file test.d864: $ls /srv/node/d864/tmp > test.d864
* This hangs: $ls /srv/node/d864/tmp
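
One possible reason for the difference, stated here as an assumption rather than a confirmed diagnosis: on CentOS, interactive shells usually alias ls to `ls --color=auto`, so listing to a terminal also stats each entry, while the redirected run only has to read the directory. The sketch below separates those two steps so they can be tried one at a time. The helper names `list_names` and `stat_entries` are hypothetical.

```python
import os
import sys


def list_names(path):
    """Read directory entries only; no per-file inode lookup is requested."""
    return sorted(os.listdir(path))


def stat_entries(path, names):
    """lstat every entry, which forces each inode to be looked up."""
    inodes = {}
    for name in names:
        inodes[name] = os.lstat(os.path.join(path, name)).st_ino
    return inodes


if __name__ == "__main__" and len(sys.argv) > 1:
    target = sys.argv[1]
    names = list_names(target)
    print("readdir ok:", len(names), "entries")
    # Stat one entry at a time so the last name printed shows where it stalls.
    for name in names:
        print("lstat", name, flush=True)
        stat_entries(target, [name])
```

If the directory read returns and one of the lstat calls never does, that would point at inode lookup rather than directory reading. Be aware that running this against a hung mount will most likely leave the script itself stuck in D state.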

Inline image 1

I suspect there's something wrong with imap. Is there a known bug?

Thanks // Hugo
