|To:||David Chinner <dgc@xxxxxxx>|
|Subject:||Re: 2.6.22-rc3 hibernate(?) fails totally - regression (xfs on raid6)|
|From:||David Greaves <david@xxxxxxxxxxxx>|
|Date:||Thu, 07 Jun 2007 14:59:58 +0100|
|Cc:||Tejun Heo <htejun@xxxxxxxxx>, Linus Torvalds <torvalds@xxxxxxxxxxxxxxxxxxxx>, "Rafael J. Wysocki" <rjw@xxxxxxx>, xfs@xxxxxxxxxxx, "'linux-kernel@xxxxxxxxxxxxxxx'" <linux-kernel@xxxxxxxxxxxxxxx>, linux-pm <linux-pm@xxxxxxxxxxxxxx>, Neil Brown <neilb@xxxxxxx>|
|References:||<firstname.lastname@example.org> <46609FAD.email@example.com> <firstname.lastname@example.org> <4661EFBB.email@example.com> <alpine.LFD.firstname.lastname@example.org> <4662D852.email@example.com> <firstname.lastname@example.org> <46668EE0.email@example.com> <46679D56.firstname.lastname@example.org> <4667DE2D.email@example.com> <20070607110708.GS86004887@sgi.com>|
|User-agent:||Mozilla-Thunderbird 188.8.131.52 (X11/20070601)|
David Chinner wrote:
On Thu, Jun 07, 2007 at 11:30:05AM +0100, David Greaves wrote:Tejun Heo wrote:Hello,
Seems like a good next step...
It's a rather full 1.2Tb raid6 array - can't reformat it - sorry :)Does the lockup happen with a different filesystem on the md device? Or if you can't test that, does any other XFS filesystem you have show the same problem?
I only noticed the problem when I umounted the fs during tests to prevent corruption - and it worked. I'm doing a sync each time it hibernates (see below) and a couple of paranoia xfs_repairs haven't shown any problems.
I do have another xfs filesystem on /dev/hdb2 (mentioned when I noticed the md/XFS correlation). It doesn't seem to have/cause any problems.
If it is xfs that is causing the problem, what happens if you remount read-only instead of unmounting before shutting down?
Yes, I'm happy to try these tests. nb, the hibernate script is: ethtool -s eth0 wol g sync echo platform > /sys/power/disk echo disk > /sys/power/state
So there has always been a sync before any hibernate.
cu:~# mount -oremount,ro /huge
/dev/hda2 on / type xfs (rw)
proc on /proc type proc (rw)
sysfs on /sys type sysfs (rw)
usbfs on /proc/bus/usb type usbfs (rw)
tmpfs on /dev/shm type tmpfs (rw)
devpts on /dev/pts type devpts (rw,gid=5,mode=620)
nfsd on /proc/fs/nfsd type nfsd (rw)
/dev/hda1 on /boot type ext3 (rw)
/dev/md0 on /huge type xfs (ro)
/dev/hdb2 on /scratch type xfs (rw)
tmpfs on /dev type tmpfs (rw,size=10M,mode=0755)
rpc_pipefs on /var/lib/nfs/rpc_pipefs type rpc_pipefs (rw)
cu:(pid2862,port1022) on /net type nfs (intr,rw,port=1022,toplvl,map=/usr/share/am-utils/amd.net,noac)
elm:/space on /amd/elm/root/space type nfs (rw,vers=3,proto=tcp)
elm:/space-backup on /amd/elm/root/space-backup type nfs (rw,vers=3,proto=tcp)
elm:/usr/src on /amd/elm/root/usr/src type nfs (rw,vers=3,proto=tcp)
[this works and resumes]
cu:~# mount -oremount,rw /huge cu:~# /usr/net/bin/hibernate [this works and resumes too !]
cu:~# touch /huge/tst cu:~# /usr/net/bin/hibernate [but this doesn't even hibernate]
Nb the screen looks like this: http://www.dgreaves.com/pub/2.6.21-rc4-ptched-suspend-failure.jpg whether it hangs on suspend or resume.
So I wouldn't say it *is* XFS at fault - but there certainly seems to be an interaction...
At least it's easily reproducible :) Shame about the sysrq
I can think of other permutations of freeze/ro/writing tests but I'm just thrashing really. Happy for you to tell me what to try next ...
|<Prev in Thread]||Current Thread||[Next in Thread>|
|Previous by Date:||Re: XFS internal error xfs_da_do_buf(2) at line 2087 of file fs/xfs/xfs_da_btree.c. Caller 0xc01b00bd, David Chinner|
|Next by Date:||Re: 2.6.22-rc3 hibernate(?) fails totally - regression (xfs on raid6), David Greaves|
|Previous by Thread:||Re: 2.6.22-rc3 hibernate(?) fails totally - regression (xfs on raid6), David Chinner|
|Next by Thread:||Re: 2.6.22-rc3 hibernate(?) fails totally - regression (xfs on raid6), David Chinner|
|Indexes:||[Date] [Thread] [Top] [All Lists]|