xfs
[Top] [All Lists]

Re: [PATCH v2 5/5] dax: handle media errors in dax_do_io

To: Dave Chinner <david@xxxxxxxxxxxxx>
Subject: Re: [PATCH v2 5/5] dax: handle media errors in dax_do_io
From: Dan Williams <dan.j.williams@xxxxxxxxx>
Date: Tue, 26 Apr 2016 07:59:10 -0700
Cc: "Verma, Vishal L" <vishal.l.verma@xxxxxxxxx>, "linux-block@xxxxxxxxxxxxxxx" <linux-block@xxxxxxxxxxxxxxx>, "jack@xxxxxxx" <jack@xxxxxxx>, "axboe@xxxxxx" <axboe@xxxxxx>, "linux-nvdimm@xxxxxxxxxxx" <linux-nvdimm@xxxxxxxxxxx>, "linux-kernel@xxxxxxxxxxxxxxx" <linux-kernel@xxxxxxxxxxxxxxx>, "xfs@xxxxxxxxxxx" <xfs@xxxxxxxxxxx>, "hch@xxxxxxxxxxxxx" <hch@xxxxxxxxxxxxx>, "linux-mm@xxxxxxxxx" <linux-mm@xxxxxxxxx>, "Wilcox, Matthew R" <matthew.r.wilcox@xxxxxxxxx>, "linux-fsdevel@xxxxxxxxxxxxxxx" <linux-fsdevel@xxxxxxxxxxxxxxx>, "akpm@xxxxxxxxxxxxxxxxxxxx" <akpm@xxxxxxxxxxxxxxxxxxxx>, "linux-ext4@xxxxxxxxxxxxxxx" <linux-ext4@xxxxxxxxxxxxxxx>, "viro@xxxxxxxxxxxxxxxxxx" <viro@xxxxxxxxxxxxxxxxxx>
Delivered-to: xfs@xxxxxxxxxxx
Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=intel-com.20150623.gappssmtp.com; s=20150623; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc; bh=oySleQSKSj+xRzPPwX/zcw4vyKkbunLUqrdEzjsWFHk=; b=MCtGOioRlvpS88BuuEAT8F5Ai8d7PaI9NkTbHpiP8p+v1fES+32csX8nkNnbtE1ddS 6yW48ZwM2wr9pRuUjjlIAM0jtlrYYE3Qp7t2ZhBe6aFe4QPxk8fE9Zk9L+HK/4/SHRM9 YVtGk0I6JKXhLEdaPs0yWUNoHMn2gxRytv/BF+pAVXbNb3RvABW8yjeOWS6D4eUpFILf i1oM945U9UT+iR+TVQdKIHNUl8E5aIH6AMEiyec5kqEFMGNPHRh+vSl3qy9WJ3M9yYwD g7ZlFqLs3tYw/9w5TKpWqlNhA5A7vssRiGK0kHackP5HEYkmudKZak64y8e2bM5PCuX6 H6Kw==
In-reply-to: <20160426082711.GC26977@dastard>
References: <20160420205923.GA24797@xxxxxxxxxxxxx> <1461434916.3695.7.camel@xxxxxxxxx> <20160425083114.GA27556@xxxxxxxxxxxxx> <1461604476.3106.12.camel@xxxxxxxxx> <20160425232552.GD18496@dastard> <CAPcyv4i6iwm1iY2mQ5yRbYfRexQroUX_R0B-db4ROU837fratw@xxxxxxxxxxxxxx> <20160426001157.GE18496@dastard> <CAPcyv4i0qnCrzsTQT-v84OhnhjmVBFJ8gKoyu6XkuUwH0babfQ@xxxxxxxxxxxxxx> <20160426025645.GG18496@dastard> <CAPcyv4hg6O3nvD7aXuFm_GAB-1GJxqfNn=RZswj47COa9bVygA@xxxxxxxxxxxxxx> <20160426082711.GC26977@dastard>
On Tue, Apr 26, 2016 at 1:27 AM, Dave Chinner <david@xxxxxxxxxxxxx> wrote:
> On Mon, Apr 25, 2016 at 09:18:42PM -0700, Dan Williams wrote:
[..]
> It seems to me you are focussing on code/technologies that exist
> today instead of trying to define an architecture that is more
> optimal for pmem storage systems. Yes, working code is great, but if
> you can't tell people how things like robust error handling and
> redundancy are going to work in future then it's going to take
> forever for everyone else to handle such errors robustly through the
> storage stack...

Precisely because higher order redundancy is built on top this baseline.

MD-RAID can't do it's error recovery if we don't have -EIO and
clear-error-on-write.  On the other hand, you're absolutely right that
we have a gaping hole on top of the SIGBUS recovery model, and don't
have a kernel layer we can interpose on top of DAX to provide some
semblance of redundancy.

In the meantime, a handful of applications with a team of full-time
site-reliability-engineers may be able to plug in external redundancy
infrastructure on top of what is defined in these patches.  For
everyone else, the hard problem, we need to do a lot more thinking
about a trap and recover solution.

<Prev in Thread] Current Thread [Next in Thread>