On 05/06/2013 02:28 PM, Dave Chinner wrote:
On Mon, May 06, 2013 at 10:14:22AM +0200, Bernd Schubert wrote:
And another protection fault, this time with 3.9.0. Always happens
on one of the servers. It's ECC memory, so I don't suspect a faulty
memory bank. Going to fsck now.
Isn't that a bit much overhead? And I can't provide /proc/meminfo and others,
as this issue causes a kernel panic a few traces later.
[303340.514052] general protection fault: 0000 [#1] SMP DEBUG_PAGEALLOC
[303340.517913] Modules linked in: fhgfs(O) fhgfs_client_opentk(O)
Kernel tainted with out-of-tree modules. Can you reproduce the
problem without them?
The modules are unused, as this is the server side. I disabled client
packages now and will re-run. But I really think that we should look for
memory/list corruption outside of fhgfs. It is also very unlikely that only
xfs would always suffer, as ext4 is also running for the fhgfs metadata.
Also, it took from Friday evening till this morning to run into the
crash, so the next occurrence might take some time. And I think tracing
xfs is out of the question, as I need the disk space to store data (the
client side is running our stress test suite).