xfs
[Top] [All Lists]

Re: need help how to debug xfs crash issue xfs_iunlink_remove: xfs_inoto

To: Eric Sandeen <sandeen@xxxxxxxxxxx>
Subject: Re: need help how to debug xfs crash issue xfs_iunlink_remove: xfs_inotobp() returned error 22
From: 符永涛 <yongtaofu@xxxxxxxxx>
Date: Wed, 10 Apr 2013 13:34:39 +0800
Cc: "xfs@xxxxxxxxxxx" <xfs@xxxxxxxxxxx>
Delivered-to: xfs@xxxxxxxxxxx
Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:x-received:in-reply-to:references:date:message-id :subject:from:to:cc:content-type; bh=juYyLYC7Avmw394UGtf77ApLkg9yG6BxOt9fo1W2Jeo=; b=WgZC3WuIgKHG1m8xJbeX3yWPlBCC2fvYn3rgtwVzAqBEiKKctndzBxpcLRsmWPJGIx Q09YYMBkT9vbYXZeZlugE6zw5gzkzxlPwwG6jk/Ls0tYFV+jb6DScjpa2WhlyWpyKuGS 6fr7k0V6msDw3dnhiScDgacJT/HLA6DAvKl0lz2JylC4jersT1ACWG45W9OWpNKro625 Zx/V4wOrYXO7eFKszLZsMYiMU+o3VkrqWYzGQIQA066u/1XhWkJdE/pTS+bY1cc4AUQI VqnaOeur8+i9yOHYfOPyId5b32eEJBFMVjMybUqMom5KSnVkCo60lzqse+lfIFrgtS/+ 6t3Q==
In-reply-to: <51644B87.60400@xxxxxxxxxxx>
References: <CADFMGuJm5bPPwbbUtYwrCVDL23KExJTw_-VRX2UEEdZjo+i5oA@xxxxxxxxxxxxxx> <51642E5E.3040403@xxxxxxxxxxx> <CADFMGuL7968v6L-3=j3FY3YYjeA_XH1CyuLgnL88u-abxiwHvg@xxxxxxxxxxxxxx> <51644B87.60400@xxxxxxxxxxx>
Here's the file info in lost+found:

[ lost+found]# pwd
/mnt/xfsd/lost+found
[ lost+found]# ls -l
总用量 4
---------T 1 root root 0 2月  28 15:42 3097
---------T 1 root root 0 2月  28 15:16 6169
[root@xxxxxxxxxxxx lost+found]# sudo getfattr -m . -d -e hex 6169
[root@xxxxxxxxxxxx lost+found]# sudo getfattr -m . -d -e hex 3097
# file: 3097
trusted.afr.ec-data-client-2=0x000000000000000000000000
trusted.afr.ec-data-client-3=0x000000000000000000000000
trusted.afr.ec-data1-client-2=0x000000000000000000000000
trusted.afr.ec-data1-client-3=0x000000000000000000000000
trusted.gfid=0x2bb701d327c44bb0af78d69e89f192a4
trusted.glusterfs.dht.linkto=0x65632d64617461312d7265706c69636174652d3400
trusted.glusterfs.quota.b8e8b3ef-0268-40af-93b6-257c4c7ef17a.contri=0x0000000004249000


It seems they're some link files for glusterfs dht xlator.

Thank you.


2013/4/10 Eric Sandeen <sandeen@xxxxxxxxxxx>
On 4/9/13 10:18 AM, 符永涛 wrote:
> The servers are back to service now and It's hard to run xfs_repair. It always happen bellow is the xfs_repair log when it happens on another server several days ago.

...

> 第二步
> repair的log
>
> sh-4.1$ sudo xfs_repair /dev/glustervg/glusterlv
> Phase 1 - find and verify superblock…
> Phase 2 - using internal log
>         - zero log…
>         - scan filesystem freespace and inode maps…
> agi unlinked bucket 0 is 4046848 in ag 0 (inode=4046848)
> agi unlinked bucket 5 is 2340485 in ag 0 (inode=2340485)
> agi unlinked bucket 6 is 2326854 in ag 0 (inode=2326854)
> agi unlinked bucket 8 is 1802120 in ag 0 (inode=1802120)
> agi unlinked bucket 14 is 495566 in ag 0 (inode=495566)
> agi unlinked bucket 16 is 5899536 in ag 0 (inode=5899536)
> agi unlinked bucket 19 is 4008211 in ag 0 (inode=4008211)
> agi unlinked bucket 21 is 4906965 in ag 0 (inode=4906965)
> agi unlinked bucket 23 is 2022231 in ag 0 (inode=2022231)
> agi unlinked bucket 24 is 1626200 in ag 0 (inode=1626200)
> agi unlinked bucket 25 is 938585 in ag 0 (inode=938585)
> agi unlinked bucket 30 is 4226526 in ag 0 (inode=4226526)
> agi unlinked bucket 34 is 4108962 in ag 0 (inode=4108962)
> agi unlinked bucket 37 is 1740389 in ag 0 (inode=1740389)
> agi unlinked bucket 39 is 247399 in ag 0 (inode=247399)
> agi unlinked bucket 40 is 6237864 in ag 0 (inode=6237864)
> agi unlinked bucket 43 is 3404331 in ag 0 (inode=3404331)
> agi unlinked bucket 45 is 2092717 in ag 0 (inode=2092717)
> agi unlinked bucket 48 is 4041008 in ag 0 (inode=4041008)
> agi unlinked bucket 50 is 1459762 in ag 0 (inode=1459762)
> agi unlinked bucket 56 is 852024 in ag 0 (inode=852024)

If this machine is still around in similar state, can you do a

# find /path/to/mount -inum $INODE_NUMBER

for the inode numbers above, and see what files they are?
That might give us a clue about what operations were happening
to them.  Dumping the gluster xattrs on those files
might also be interesting.  Just guesses here, but it'd be a
little more data.

(if this is an old repair, maybe doing the same for your most
recent incident would be best)

Thanks,
-Eric

>         - found root in ode chunk
> Phase 3 - for each AG…
>         - scan and clear agi unlinked lists…
>         - process known inodes and perform inode discovery…
>         - agno = 0
> 7f8220be6700: Badness in key lookup (length)
> bp=(bno 123696, len 16384 bytes) key=(bno 123696, len 8192 bytes)

(FWIW the above warnings look like an xfs_repair bug, not related)

-Eric




--
符永涛
<Prev in Thread] Current Thread [Next in Thread>