
Re: pcp updates: pmdaproc, cgroups, books

To: Nathan Scott <nathans@xxxxxxxxxx>
Subject: Re: pcp updates: pmdaproc, cgroups, books
From: "Frank Ch. Eigler" <fche@xxxxxxxxxx>
Date: Fri, 9 Jan 2015 09:55:01 -0500
Cc: pcp developers <pcp@xxxxxxxxxxx>
Hi -


> ----- Original Message -----
> > [...]
> > Yes, except that pmval is limited to a single metric per invocation.
> > A single "pminfo cgroup" run can exercise all metrics, and "pminfo
> > cgroup cgroup cgroup ..." can endurance-test all metrics, not just the
> > one that we found/fixed this leak in, so it's more forward-looking.
> 
> This is trivially tackled as in test qa/957 and results in a better
> test - that one even points out which cluster the pmdalinux bugs it
> finds are in (done that way on purpose, to save diagnosis time)!

Valgrind is fine, but does not provide sufficient coverage to catch
all instances of this kind of problem.


> [...]  Either I'm missing something, or you're missing what is
> happening in proc_refresh - it doesn't matter how many PMIDs get
> thrown at a single fetch, it will refresh each cluster only once per
> fetch...  so it has to be the fetch batching that triggers the leak,
> right?

Did you run the test I posted?  I included results (taken before and
after the fd-leak patch) that speak for themselves, and that don't
rely on speculation about which internal behaviors should or should
not have triggered the bug.
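
For readers who want to reproduce this kind of before/after
measurement, here is a minimal sketch of an fd-growth check.  It is
not the test posted earlier in the thread; it assumes a Linux /proc
filesystem, and the names count_open_fds and fd_growth are
hypothetical helpers made up for illustration.  The idea is just to
count a process's open file descriptors, run a workload repeatedly,
and count again.

```python
import os


def count_open_fds(pid="self"):
    """Count open file descriptors of a process via /proc (Linux only).

    pid may be a numeric process id or "self" for the calling process.
    Listing the directory briefly opens one extra descriptor, but that
    affects every measurement equally, so differences stay meaningful.
    """
    return len(os.listdir(f"/proc/{pid}/fd"))


def fd_growth(workload, rounds=100, pid="self"):
    """Run workload() `rounds` times; return the change in fd count of pid."""
    before = count_open_fds(pid)
    for _ in range(rounds):
        workload()
    return count_open_fds(pid) - before


if __name__ == "__main__":
    # Illustrative workloads only: one keeps descriptors open, one does not.
    held = []

    def leaky():
        held.append(os.open(os.devnull, os.O_RDONLY))

    def clean():
        os.close(os.open(os.devnull, os.O_RDONLY))

    print("leaky growth:", fd_growth(leaky, rounds=10))
    print("clean growth:", fd_growth(clean, rounds=10))

    for fd in held:
        os.close(fd)
```

To watch a daemon rather than the script itself, pid would be the
process id of the agent under test and workload would issue the
repeated fetches (for instance by invoking pminfo on the cgroup
metrics through subprocess).  Growth that scales with the number of
rounds points at a leak; a flat count after the patch indicates the
fix holds.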


- FChE
