[Top] [All Lists]


To: xfs@xxxxxxxxxxx
Subject: Re: XFS: Internal error XFS_WANT_CORRUPTED_RETURN
From: Stan Hoeppner <stan@xxxxxxxxxxxxxxxxx>
Date: Fri, 13 Dec 2013 03:46:14 -0600
Delivered-to: xfs@xxxxxxxxxxx
In-reply-to: <2A0A637F-7ED6-4743-8791-E57E22306139@xxxxxxxxxxxxxxxxx>
References: <20131211172725.GA4606@xxxxxxxxxx> <68DD7157-6ACE-4548-A466-C1EBD31B6DEB@xxxxxxxxxxxxxxxxx> <20131211185746.GA11861@xxxxxxxxxx> <2A0A637F-7ED6-4743-8791-E57E22306139@xxxxxxxxxxxxxxxxx>
Reply-to: stan@xxxxxxxxxxxxxxxxx
User-agent: Mozilla/5.0 (Windows NT 5.1; rv:24.0) Gecko/20100101 Thunderbird/24.2.0
On 12/11/2013 6:19 PM, Chris Murphy wrote:
> I suspect we've only just begun to see the myriad ways in which SSDs
> could fail. I ran across this article earlier today: 
> http://techreport.com/review/25681/the-ssd-endurance-experiment-testing-data-retention-at-300tb
>  What I thought was eye opening was a hashed file failing multiple
> times in a row with *different* hash values, being allowed to rest
> unpowered for five days and then passing. Eeek. Talk about a great
> setup for a lot of weird transient problems with that kind of
> reversal. What I can't tell is if there were read errors report to
> the SATA driver, or if (different) bad data from a particular page
> was sent to the driver.

The drive that exhibited this problem, the Samsung 840, is (one of) the
first on the market to use triple level cell NAND.  The drive is
marketed at consumers only.  The anomaly occurred after 100 TB of
writes, well beyond what is expected for a consumer drive.  After the
anomaly occurred the drive ran flawlessly up to 300 TB.

The rest of the drives, including the Samsung 840 Pro, use two cell MLC
NAND, and none of them have shown problems in their testing.  They've
been flawless.  So I disagree with your statement "we've only just begun
to see the myriad ways in which SSDs could fail".

What we have here is what we've always had.  A manufacturer using a
bleeding edge technology didn't have all the bugs identified and fixed
with the first rev of the product.  This isn't a problem with SSDs in
general, but one manufacturer, one new drive model, using a brand new
NAND type.


<Prev in Thread] Current Thread [Next in Thread>