pcp
[Top] [All Lists]

Re: [pcp] Updates

To: Nathan Scott <nscott@xxxxxxxxxx>
Subject: Re: [pcp] Updates
From: Mark Goodwin <goodwinos@xxxxxxxxx>
Date: Mon, 16 Feb 2009 10:52:41 +1100
Cc: Martin Hicks <mort@xxxxxxxx>, pcp@xxxxxxxxxxx
Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from :user-agent:mime-version:to:cc:subject:references:in-reply-to :content-type:content-transfer-encoding; bh=zrsJrKDJByh2zd0F6SChm8Fy39PyF4UH51gdNscAQOc=; b=qvr+FqPznva+10+UvxPl9sTPSUoHuvPyhoR9d5w6WTZq07g61VwJn2xzj3KYbpoXkN jyiFRIQG86C8dlhOYXTGAIVwQj13Kr6es6ebiutF7h2dmwglOFYiFbUFnrzoErZoX71V 1Hc+ZMTesOD50totHDhgTSrdnL1mvCdYkaG1k=
Domainkey-signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:user-agent:mime-version:to:cc:subject :references:in-reply-to:content-type:content-transfer-encoding; b=qasvdKF7nR0ymCRwu/PgwX84fsieCZZu6Wg4xw5nSCjkFmed9JsNcn96S9GbJWu1r7 cOcmesdDH3mAq0ktpDv+VDNDMTd+4IV54TeaCr0DiGDaPsAZiug4u3bxOFtPpJX3sjon Z6aSnEc/TbVZ+V7Zv9ZetTSD3zcM5xnaqAuTo=
In-reply-to: <1234734630.4120.10.camel@xxxxxxxxxxxxxxxxxx>
References: <20090213202902.GV20819@xxxxxxxxxxxxxxxxxxxxxxxxx> <1234567127.4234.8.camel@xxxxxxxxxxxxxxxxxx> <20090213234200.GD7466@xxxxxxxxxxxxxxxxx> <1234734630.4120.10.camel@xxxxxxxxxxxxxxxxxx>
User-agent: Thunderbird 2.0.0.19 (X11/20090105)
Nathan Scott wrote:
On Fri, 2009-02-13 at 18:42 -0500, Martin Hicks wrote:
On Sat, Feb 14, 2009 at 10:18:47AM +1100, Nathan Scott wrote:
On Fri, 2009-02-13 at 14:29 -0600, Martin Hicks wrote:
I've created my own tree on oss.  I'll be pushing patches here
before I commit them to the real git tree, to give the community
time to review.
Good stuff... will have a look soon.  At first glance, I was looking
for a pmdacluster / pmclusterd man page ... do those exist anywhere
yet?
Unfortunately not.

I'll write up a couple of man pages, once I've finished my resume ;-)

OK, that'd help in trying to understand this beast.  Maybe a
diagram or two as well, after having sniffed around the code
briefly. :)

There is an internal (to SGI) presentation related to this;
perhaps Martin could convince his management to publish it
under pcp-web. Basically, the cluster PMDA does instance
domain aggregation for two or more client hosts running
pmclusterd, which runs in local context mode.


I've left the initscript for pmclusterd (which is the client "push"
daemon) out of /etc/init.d, because it is fairly rare that anyone will
use it.

pmclusterd could be installed by default but definitely should
be chkconfig off by default. The cluster PMDA can be installed
by default (but not configured or run by default, as in ./Install).


It probably also needs some loving to be useful on a machine other than
Altix ICE.

It will work correctly on generic clusters, but will require
some manual config. Compute nodes and head nodes on ICE clusters
have topologically generated hostnames, and hence pmclusterd and
the cluster PMDA can be auto-configured. Basically, pmclusterd
on each compute node needs to know the hostname of it's head node,
which is where the cluster PMDA is running - on ICE, the head
node's host name is guessable from the compute node's hostname.

Hmm.  If its a rare / non-generic solution we should probably
not install it by default?  Hard to make the call without some
docs on what it is trying to achieve though.

I'm particularly curious about the patch below that I got from
Michael Newton.  I presume it is a real bugfix, but I don't
understand this area of the code yet...

That patch looks good to me, BTW.

Yes I reviewed that patch back in December  - it's
a bit hairy but I think it's fine.

Cheers
-- Mark

<Prev in Thread] Current Thread [Next in Thread>