XFS on CoRAID errors with SMB

Jon Marshall jon at campbell-lange.net
Mon Nov 28 07:55:18 CST 2011


Hi,

We have recently experienced what appear to be XFS filesystem errors on
a samba share. The actual filesystem resides on a network attached
storage device, a Coraid. The attached server locked up totally, and we
forced to hard reset it.

I have the following trace from the kernel logs:

[6128798.051868] smbd: page allocation failure. order:4, mode:0xc0d0
[6128798.051872] Pid: 16908, comm: smbd Not tainted 2.6.32-5-amd64 #1
[6128798.051874] Call Trace:
[6128798.051882]  [<ffffffff810ba5d6>] ? __alloc_pages_nodemask+0x592/0x5f4
[6128798.051885]  [<ffffffff810b959c>] ? __get_free_pages+0x9/0x46
[6128798.051889]  [<ffffffff810e7ea1>] ? __kmalloc+0x3f/0x141
[6128798.051893]  [<ffffffff8110672c>] ? getxattr+0x89/0x117
[6128798.051896]  [<ffffffff810e5b65>] ? virt_to_head_page+0x9/0x2a
[6128798.051899]  [<ffffffff810f9bc4>] ? user_path_at+0x52/0x79
[6128798.051919]  [<ffffffffa0297b17>] ? xfs_xattr_put_listent+0x0/0xe5 [xfs]
[6128798.051922]  [<ffffffff810e5b65>] ? virt_to_head_page+0x9/0x2a
[6128798.051925]  [<ffffffff8118ddcb>] ? _atomic_dec_and_lock+0x33/0x50
[6128798.051928]  [<ffffffff811068b4>] ? sys_getxattr+0x45/0x60
[6128798.051931]  [<ffffffff81010b42>] ? system_call_fastpath+0x16/0x1b

smbd seems to throw these errors for about 15 minutes, then sshd starts
throwing errors and shortly after the system became unresponsive.

Just wondering if anyone had any experience of similar results, with XFS
on a CoRAID device or XFS SMB shares?

Thanks
Jon

-- 
Jon Marshall
Technical Officer
jon at campbell-lange.net
.
Campbell-Lange Workshop
www.campbell-lange.net
0207 6311 555
3 Tottenham Street London W1T 2AF
Registered in England No. 04551928




More information about the xfs mailing list