xfs
[Top] [All Lists]

Re: exception Emask 0x0 SAct 0x1 / SErr 0x0 action 0x2 frozen

To: Justin Piszcz <jpiszcz@xxxxxxxxxxxxxxx>
Subject: Re: exception Emask 0x0 SAct 0x1 / SErr 0x0 action 0x2 frozen
From: Alan Cox <alan@xxxxxxxxxxxxxxxxxxx>
Date: Fri, 10 Oct 2008 20:27:51 +0100
Cc: Tejun Heo <tj@xxxxxxxxxx>, "Mr. James W. Laferriere" <babydr@xxxxxxxxxxxxxxxx>, Tom Mortensen <tmmlkml@xxxxxxxxx>, Bill Davidsen <davidsen@xxxxxxx>, Gwendal Grignou <gwendal@xxxxxxxxxx>, Brian Rademacher <rad@xxxxxxxxxxxx>, linux-ide@xxxxxxxxxxxxxxx, linux-raid maillist <linux-raid@xxxxxxxxxxxxxxx>, Linux Kernel Maillist <linux-kernel@xxxxxxxxxxxxxxx>, Bruce Allen <ballen@xxxxxxxxxxxxxxxxxxxx>, smartmontools-support@xxxxxxxxxxxxxxxxxxxxx, xfs@xxxxxxxxxxx
In-reply-to: <alpine.DEB.1.10.0810101505420.12937@xxxxxxxxxxxxxxxx>
Organization: Red Hat UK Cyf., Amberley Place, 107-111 Peascod Street, Windsor, Berkshire, SL4 1TE, Y Deyrnas Gyfunol. Cofrestrwyd yng Nghymru a Lloegr o'r rhif cofrestru 3798903
References: <3E6D0C24877245C3A5A327129A7B4CA7@909927SOSLA> <alpine.DEB.1.10.0809220828490.23159@xxxxxxxxxxxxxxxx> <5419A02543384E69AF2BA82FF6E66C14@909927SOSLA> <alpine.DEB.1.10.0809220903400.23159@xxxxxxxxxxxxxxxx> <4B1ABD0393EF40FFB0A12242FE8356AF@909927SOSLA> <alpine.DEB.1.10.0809220915470.23159@xxxxxxxxxxxxxxxx> <alpine.DEB.1.10.0809220926240.23159@xxxxxxxxxxxxxxxx> <e7510f760809231114y6cbaf13aqf7c63bd48a73c978@xxxxxxxxxxxxxx> <48DBAB81.8050900@xxxxxxx> <48E08E2D.6050905@xxxxxxxxxx> <a52a95e30809301347s70b57ebfh280a6871329afd72@xxxxxxxxxxxxxx> <alpine.DEB.1.10.0809301715280.14600@xxxxxxxxxxxxxxxx> <alpine.LNX.2.00.0809301946020.16014@xxxxxxxxxxxxxxxxxxxxxxxxx> <alpine.DEB.1.10.0810010403500.16900@xxxxxxxxxxxxxxxx> <alpine.DEB.1.10.0810010710520.22435@xxxxxxxxxxxxxxxx> <48E6D481.7050407@xxxxxxxxxx> <alpine.DEB.1.10.0810040410170.14969@xxxxxxxxxxxxxxxx> <alpine.DEB.1.10.0810101505420.12937@xxxxxxxxxxxxxxxx>
> [420781.333189] ata6.00: cmd b0/d8:00:00:4f:c2/00:00:00:00:00/00 tag 0
>                               ^^ ^^(b0/d8)^^ ^^(4f:c2)
> [420781.333190]          res 40/00:ff:00:00:00/00:00:00:00:00/00 Emask 0x4
> (timeout)                    ^^ 40:00:ff
> [420781.333194] ata6.00: status: { DRDY }

They are the send and received task file (command blocks) for the failed
command.

> [420781.333200] ata6: hard resetting link
> [420781.638589] ata6: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
> [420781.662166] ata6.00: configured for UDMA/133
> [420781.662166] ata6: EH complete
> 
> (at the time there was little to no I/O occuring on this block device, but 
> disks on the raid5 volume were being accessed at the time, so there was 
> system activity, mainly disk reads 300-500KiB/s over ethernet)
> 
>   Nick's(?) problem:
> 
> > Nick
> > 
> > ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
> > ata1.00: cmd ea/00:00:00:00:00/00:00:00:00:00/a0 tag 0
>                 ^^ ^^ (ea/00) vs. (b0/d8) - mind are always the same (FYI)
> >         res 40/00:00:00:00:00/00:00:00:00:00/a0 Emask 0x4 (timeout)
>                ^^ ^^ (40:00: but no ff)
> 
>            The rest of the messages are the same.  Is there any correlation
>            that can be made here?  When this happens to others, is it
>            always the same codes as shown above or do they change?  If they
>            do not change, how come they vary between users who have this
>            problem?
> 
> > ata1.00: status: { DRDY }
> > ata1: soft resetting link
> > ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
> > ata1.00: configured for UDMA/133
> > ata1: EH complete
> > sd 1:0:0:0: [sda] 2930277168 512-byte hardware sectors (1500302 MB)
> > sd 1:0:0:0: [sda] Write Protect is off
> > sd 1:0:0:0: [sda] Mode Sense: 00 3a 00 00
> > sd 1:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't
> > support DPO or FUA
> 
> Can anything be said about these errors, can we classify them into groups?
> Or are they just random? It does not appear to happen more or less with 
> one filesystem or another either, one guy is using ext3, I am using XFS-- 
> certainly something much deeper..
> 
> Justin.
> --
> To unsubscribe from this list: send the line "unsubscribe linux-ide" in
> the body of a message to majordomo@xxxxxxxxxxxxxxx
> More majordomo info at  http://vger.kernel.org/majordomo-info.html


-- 
--
        "Alan, I'm getting a bit worried about you."
                                -- Linus Torvalds

<Prev in Thread] Current Thread [Next in Thread>