| To: | Tejun Heo <htejun@xxxxxxxxx> |
|---|---|
| Subject: | Re: [linux-lvm] 2.6.22-rc5 XFS fails after hibernate/resume |
| From: | David Greaves <david@xxxxxxxxxxxx> |
| Date: | Thu, 21 Jun 2007 19:06:29 +0100 |
| Cc: | David Chinner <dgc@xxxxxxx>, David Robinson <zxvdr.au@xxxxxxxxx>, LVM general discussion and development <linux-lvm@xxxxxxxxxx>, "'linux-kernel@xxxxxxxxxxxxxxx'" <linux-kernel@xxxxxxxxxxxxxxx>, xfs@xxxxxxxxxxx, linux-pm <linux-pm@xxxxxxxxxxxxxx>, LinuxRaid <linux-raid@xxxxxxxxxxxxxxx>, "Rafael J. Wysocki" <rjw@xxxxxxx> |
| In-reply-to: | <4678DF56.1020903@gmail.com> |
| References: | <46744065.6060605@dgreaves.com> <4674645F.5000906@gmail.com> <46751D37.5020608@dgreaves.com> <4676390E.6010202@dgreaves.com> <20070618145007.GE85884050@sgi.com> <4676D97E.4000403@dgreaves.com> <4677A0C7.4000306@dgreaves.com> <4677A596.7090404@gmail.com> <4677E496.3080506@dgreaves.com> <4678DF56.1020903@gmail.com> |
| Sender: | xfs-bounce@xxxxxxxxxxx |
| User-agent: | Mozilla-Thunderbird 2.0.0.0 (X11/20070601) |
been away, back now... Tejun Heo wrote: David Greaves wrote:Having retained this new configuration for a couple of days now I haven't had any problems.Tejun Heo wrote:How reproducible is the problem? Does the problem go away or occur more often if you change the drive you write the memory image to?I don't think there should be activity on the sda drive during resume itself. This is good but not really ideal since / isn't mirrored anymore :( I was trying to say that I don't actually see any errors being handled in normal operation.Of course, I don't understand why it *should* EH during resume, it doesn't during boot or normal operation... I'm not sure if you are saying that these PHY RDY events are normally handled quietly (which would explain it). yes - but the erroring device which is not being used is on the same controller as the device with the in-use resume partition.It's really weird tho. The PHY RDY status changed events are coming from the device which is NOT used while resuming and it's before any actual PM events are triggered. Your kernel just boots, swsusp realizes it's resuming and tries to read memory image from the swap device. yes While reading, the disk controller raises consecutive PHY readiness changed interrupts. EH recovers them alright but the end result seems to indicate that the loaded image is corrupt. Yes, that's consistent with what I'm seeing. When I move the swap/resume partition to a different controller (ie when I broke the / mirror and used the freed space) the problem seems to go away. I am seeing messages in dmesg though: ATA: abnormal status 0xD0 on port 0xf881e0c7 ATA: abnormal status 0xD0 on port 0xf881e0c7 ATA: abnormal status 0xD0 on port 0xf881e0c7 ATA: abnormal status 0xD0 on port 0xf881e0c7 ATA: abnormal status 0xD0 on port 0xf881e0c7 ata1.00: configured for UDMA/100 ata2.00: revalidation failed (errno=-2) ata2: failed to recover some devices, retrying in 5 secs sd 0:0:0:0: [sda] 390721968 512-byte hardware sectors (200050 MB) sd 0:0:0:0: resuming sd 0:0:0:0: [sda] Starting disk ATA: abnormal status 0x7F on port 0x00019807 ATA: abnormal status 0x7F on port 0x00019007 ATA: abnormal status 0x7F on port 0x00019007 ATA: abnormal status 0x7F on port 0x00019807 ata2: SATA link up 1.5 Gbps (SStatus 113 SControl 310) ATA: abnormal status 0xD0 on port 0xf881e0c7 ATA: abnormal status 0xD0 on port 0xf881e0c7 ATA: abnormal status 0xD0 on port 0xf881e0c7 ATA: abnormal status 0xD0 on port 0xf881e0c7 ATA: abnormal status 0xD0 on port 0xf881e0c7 ata1.00: configured for UDMA/100 ata2.00: revalidation failed (errno=-2) ata2: failed to recover some devices, retrying in 5 secs So, there's no device suspend/resume code involved at all. The kernel just booted and is trying to read data from the drive. Please try with only the first drive attached and see what happens. That's kinda hard; swap and root are on different drives... Does it help that although the errors above appear, the system seems OK when I just use the other controller? I have to be cautious what I do with this machine as it's the wife's active desktop box <grin>. David |
| Previous by Date: | Re: is this thing on..., Eric Sandeen |
|---|---|
| Next by Date: | Re: XFS_IOC_RESVSP64 for swap files, Nathan Scott |
| Previous by Thread: | Re: [linux-lvm] 2.6.22-rc5 XFS fails after hibernate/resume, David Greaves |
| Next by Thread: | Re: [linux-lvm] 2.6.22-rc5 XFS fails after hibernate/resume, David Greaves |
| Indexes: | [Date] [Thread] [Top] [All Lists] |