Re: Storage server, hung tasks and tracebacks

To: Stan Hoeppner <stan@xxxxxxxxxxxxxxxxx>
Subject: Re: Storage server, hung tasks and tracebacks
From: Brian Candler <B.Candler@xxxxxxxxx>
Date: Thu, 3 May 2012 21:41:57 +0100
Cc: xfs@xxxxxxxxxxx
In-reply-to: <4FA27EF8.6040002@xxxxxxxxxxxxxxxxx>
References: <20120502184450.GA2557@xxxxxxxx> <4FA27EF8.6040002@xxxxxxxxxxxxxxxxx>
User-agent: Mutt/1.5.21 (2010-09-15)
On Thu, May 03, 2012 at 07:50:00AM -0500, Stan Hoeppner wrote:
> > Any other suggestions (and of course interpretation of the kernel call
> > tracebacks) would be much appreciated.
> 
> Which mainboards are these Brian?  Make/model?

Tyan S5510, with Intel Xeon CPU E31225 @ 3.10GHz

Upgraded to BIOS 1.05a and iKVM 3.00

> Make/model/count of all add in cards?

1 x LSI SAS9201-16i
1 x LSI SAS9211-8i
1 x Intel X520-DA2 dual 10G NIC

although the 10G link wasn't being used for the most recent tests.

> Make/model of PSU?

Will have to check, I think it may be this one:
http://www.xcase.co.uk/XCASE-Power-Supply-p/psu-dolphin-900..htm

> Make/model of chassis?

http://www.xcase.co.uk/24-bay-Hotswap-rackmount-chassis-norco-RPC-4224-p/case-xcase-rm424.htm

The drives are 24 x ST3000DM001 (I was hoping to get low-power Hitachi
drives, but they weren't available at the time).

> I'll sleuth around and see what I can find.  Could be some obscure
> expansion card interaction.  Could be undersized PSUs or lack of
> backplanes spread evenly across the 12v rails of a multi-rail PSU, etc, etc.

Much appreciated.

However, last night I rebooted one box (the one which wouldn't let me ssh
in) and then upgraded it to Ubuntu 12.04.  It has been running a couple of
concurrent bonnie++ instances for over 24 hours without a hitch.

So maybe the mpt2sas driver version is what makes the difference:

[dmesg from Ubuntu 11.10]
    mpt2sas version 08.100.00.02

[dmesg from Ubuntu 12.04]
    mpt2sas version 10.100.00.00
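
For anyone wanting to compare the driver version across several boxes, a
minimal Python sketch along these lines could pull it out of saved dmesg
text.  It assumes the banner has the form quoted above ("mpt2sas version
N.N.N.N"); the function name mpt2sas_version is purely illustrative and not
part of any tool.

```python
import re

# Assumption: the driver banner looks like the lines quoted above,
# e.g. "mpt2sas version 08.100.00.02".
VERSION_RE = re.compile(r"mpt2sas version (\d+(?:\.\d+)+)")


def mpt2sas_version(dmesg_text):
    """Return the first mpt2sas version found as a tuple of ints, or None."""
    match = VERSION_RE.search(dmesg_text)
    if match is None:
        return None
    # Tuples of ints compare component by component, so 8.x sorts before 10.x.
    return tuple(int(part) for part in match.group(1).split("."))


# The two banners reported in this message.
old = mpt2sas_version("mpt2sas version 08.100.00.02")
new = mpt2sas_version("mpt2sas version 10.100.00.00")
print(old, new, old < new)
```

Comparing integer tuples rather than the raw strings avoids depending on
zero padding in the version fields.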

Regards,

Brian.
