From owner-failsafe@oss.sgi.com Thu Jul 27 14:48:51 2000 Received: by oss.sgi.com id ; Thu, 27 Jul 2000 14:48:41 -0700 Received: from pyongsan.compgen.com ([158.155.0.1]:22795 "EHLO gw1.compgen.com") by oss.sgi.com with ESMTP id ; Thu, 27 Jul 2000 14:48:32 -0700 Received: from uxeric.compgen.com (root@uxeric.compgen.com [158.155.4.32]) by gw1.compgen.com (8.8.7/8.8.7) with ESMTP id RAA19468 for ; Thu, 27 Jul 2000 17:48:24 -0400 Received: (from eric@localhost) by uxeric.compgen.com (8.9.3/8.9.3) id RAA21965; Thu, 27 Jul 2000 17:48:24 -0400 From: "Eric Z. Ayers" MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Message-ID: <14720.44583.485879.72937@gargle.gargle.HOWL> Date: Thu, 27 Jul 2000 17:48:23 -0400 (EDT) To: failsafe@oss.sgi.com Subject: Installed Linux Failsafe today X-Mailer: VM 6.72 under 21.1 (patch 8) "Bryce Canyon" XEmacs Lucid Reply-To: Eric.Ayers@compgen.com X-Face: (3Y\Z;G!Ce[Q\WBgGFLgcaL%v[kJ'@9s`Qn1<)EEL5tSW7IDvX[{APQ5]eY}uF}%qbD[-@N !5]S!%o0*DbAB?~o%tca^?3@zU~"fQ@MTiClP>w%`Y8oG&6|:>2F=bhnf2>bPedqw-.T>U-BaI`F>1 QY@?oGJ0.lV?b@0HgvaOt>=0,/@,=(kE"J++vO?K"3ve@,"sunF0HnU|h&|:}%|P6%BohO_*mAHJ#g EHc;_'bXG|kCLMSF`:/O_F0fuJ:j2^C\NJ(:$izN@mbXQo(IL,BO.7<]wT?:5.A<$C Sender: owner-failsafe@oss.sgi.com Precedence: bulk Return-Path: X-Orcpt: rfc822;failsafe-outgoing Hello, I am developing a hardware platform that will include: 2 X86 boxes Internal IDE disk in each box Shared SCSI (AHA2940 cards) with (software) RAID1 Hopefully some sort of journaling FS when it becomes available Right now I have a single 3c59x card in each machine, but I will have dual ethernet in the final config (probabaly Intel nics) serial cable interconnect for heartbeat Yesterday, I finally got some test hardware in place and I've installed Red Hat 6.2 in the IDE drive on each node. Notably, I installed without X windows server support, but the X libraries are there. I suppose I could install the X server and KDE, but my goal is to use a minimal OS installation. On the first box, I installed the failsafe binaries with 'fsinstall', and then 'fsinstall server' and 'fsinstall client' I made sure that ulimit -c 100000 was in effect. When I try to launch the GUI, I see the following: [root@dru1a /tmp]# /usr/bin/fstask current locale is not supported in X11, locale is set to CX locale modifiers are not supported, using defaultWarning: translation table syntax error: Unknown keysym name: osfActivate Warning: ... found while parsing ':osfActivate: ManagerParentActivate()' Warning: translation table syntax error: Unknown keysym name: osfCancel Warning: ... found while parsing ':osfCancel: ManagerParentCancel()' Warning: translation table syntax error: Unknown keysym name: osfSelect Warning: ... found while parsing ':osfSelect: ManagerGadgetSelect()' ... SIGSEGV received at bffff094 in /lib/libc.so.6. Processing terminated Writing stack trace to javacore6408.txt ... [root@dru1a /tmp]# ls -la core ls: core: No such file or directory [root@dru1a /tmp]# cat javacore6408.txt Fri Jan 2 02:50:45 1998 SIGSEGV received at bffff094 in /lib/libc.so.6. Processing terminated jre full version "JDK 1.1.8 IBM build l118-20000515 (JIT enabled: jitc)" Operating Environment --------------------- Host : dru1a. OS Level : 2.2.14-5.0.#1 Tue Mar 7 21:07:39 EST 2000 glibc Version : 2.1.3 No. of Procs : 1 Memory Info: total: used: free: shared: buffers: cached: Mem: 130990080 126160896 4829184 14827520 54972416 41476096 Swap: 134144000 5046272 129097728 MemTotal: 127920 kB MemFree: 4716 kB MemShared: 14480 kB Buffers: 53684 kB Cached: 40504 kB BigTotal: 0 kB BigFree: 0 kB SwapTotal: 131000 kB SwapFree: 126072 kB dump crashed ------------------------------------------------------------------------------ I got rid of the keysym errors by copying XKeysymDB from another machine to /usr/lib/X11/, but the java 'core dump' continues. ------------------------------------------------------------------------------ Here's the list of rpms I installed: [root@dru1a /tmp]# rpm -qa | grep sysadm sysadm_base-server-1.3.2-6 sysadm_base-lib-1.3.2-6 sysadm_base-tcpmux-1.3.2-6 sysadm_failsafe-server-0.1-4 sysadm_base-client-1.3.2-6 sysadm_failsafe-client-0.1-4 [root@dru1a /tmp]# rpm -qa | grep failsafe failsafe-0.1-2891 sysadm_failsafe-server-0.1-4 sysadm_failsafe-client-0.1-4 [root@dru1a /tmp]# rpm -qa | grep ci ci-0.1-4354 [root@dru1a /tmp]# rpm -qa | grep cas cas-0.1-4885 ------------------------------------------------------------------------------ [root@dru1a /tmp]# ls /etc/config/ privileges [root@dru1a /tmp]# cat /etc/config/privileges on [root@dru1a /tmp]# ------------------------------------------------------------------------------ Any suggestions? Regards, -Eric. -- Eric Z. Ayers Phone: +1 404-705-2864 Computer Generation Incorporated FAX: +1 404-705-2805 Building G, 4th Floor, 5775 Peachtree-Dunwoody Rd., Atlanta, GA 30342 USA eric@compgen.com From owner-failsafe@oss.sgi.com Thu Jul 27 16:48:41 2000 Received: by oss.sgi.com id ; Thu, 27 Jul 2000 16:48:31 -0700 Received: from deliverator.sgi.com ([204.94.214.10]:31 "EHLO deliverator.sgi.com") by oss.sgi.com with ESMTP id ; Thu, 27 Jul 2000 16:48:09 -0700 Received: from nodin.corp.sgi.com (nodin.corp.sgi.com [192.26.51.193]) by deliverator.sgi.com (980309.SGI.8.8.8-aspam-6.2/980310.SGI-aspam) via ESMTP id QAA28408 for ; Thu, 27 Jul 2000 16:40:44 -0700 (PDT) mail_from (rusty@rlyeh.engr.sgi.com) Received: from rlyeh.engr.sgi.com (rlyeh.engr.sgi.com [163.154.5.94]) by nodin.corp.sgi.com (980427.SGI.8.8.8/980728.SGI.AUTOCF) via ESMTP id QAA50375 for ; Thu, 27 Jul 2000 16:47:46 -0700 (PDT) Received: (from rusty@localhost) by rlyeh.engr.sgi.com (SGI-8.9.3/8.9.3) id QAA16311; Thu, 27 Jul 2000 16:44:23 -0700 (PDT) From: "Rusty Ballinger" Message-Id: <10007271644.ZM116181@rlyeh.engr.sgi.com> Date: Thu, 27 Jul 2000 16:44:23 -0700 In-Reply-To: Padmanabhan Sreenivasan "[Fwd: Installed Linux Failsafe today]" (Jul 27, 2:53pm) References: <3980AF4D.F12AEE06@engr.sgi.com> X-Face: #)4}U4e`O6YEe%oBzE}>ycmT!Xt?Myiqo~|p3Wh'UuQ[N7)&4\4?8:1n)bmPX]b@#k94%!VojpODdmk:sCr1b\-aXD&P:wjBqupMB:ag6}BwVseJZM@K{$E|0J9}&,Rpdg{&N4/Y8&PTm6>|r[,gI2T*qN!`AZhl>Bdy7JR`dDvP(/pz.}?Q@dg':mlV`RX51Z_ZG?Gta|Q!iA[MaOh Reply-To: rusty@sgi.com X-Mailer: Z-Mail (3.2.3 08feb96 MediaMail) To: Eric.Ayers@compgen.com Subject: Re: [Fwd: Installed Linux Failsafe today] Cc: paddy@rlyeh.engr.sgi.com, failsafe@oss.sgi.com Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Sender: owner-failsafe@oss.sgi.com Precedence: bulk Return-Path: X-Orcpt: rfc822;failsafe-outgoing > Yesterday, I finally got some test hardware in place and I've > installed Red Hat 6.2 in the IDE drive on each node. Notably, I > installed without X windows server support, but the X libraries are > there. I suppose I could install the X server and KDE, but my goal is > to use a minimal OS installation. You're running the java client on one of the server machines, but you don't have an X server running? I'm pretty sure that won't work! You can run the GUI from another machine and connect to the servers; on the GUI machine, you will need to install IBMJava118-JRE (IBM's Java runtime) sysadm_base-client sysadm_failsafe-client Then when you run fstask on the client machine, you should get a login dialog which lets you connect to your servers. --Rusty From owner-failsafe@oss.sgi.com Fri Jul 28 05:08:47 2000 Received: by oss.sgi.com id ; Fri, 28 Jul 2000 05:08:37 -0700 Received: from 255.255.255.255.in-addr.de ([212.8.197.242]:2067 "HELO 255.255.255.255.in-addr.de") by oss.sgi.com with SMTP id ; Fri, 28 Jul 2000 05:08:21 -0700 Received: (qmail 16321 invoked from network); 28 Jul 2000 12:06:21 -0000 Received: from unknown (HELO hermes.marowsky-bree.de) (127.0.0.1) by 127.0.0.1 with SMTP; 28 Jul 2000 12:06:21 -0000 Received: by hermes.marowsky-bree.de (Postfix, from userid 500) id A2161AD591; Fri, 28 Jul 2000 13:34:54 +0200 (CEST) Date: Fri, 28 Jul 2000 13:34:54 +0200 From: Lars Marowsky-Bree To: Eric.Ayers@compgen.com Cc: failsafe@oss.sgi.com Subject: Re: Installed Linux Failsafe today Message-ID: <20000728133454.J1346@marowsky-bree.de> References: <14720.44583.485879.72937@gargle.gargle.HOWL> Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: 8bit User-Agent: Mutt/1.2.3i In-Reply-To: <14720.44583.485879.72937@gargle.gargle.HOWL>; from "Eric Z. Ayers" on 2000-07-27T17:48:23 X-Ctuhulu: HASTUR Sender: owner-failsafe@oss.sgi.com Precedence: bulk Return-Path: X-Orcpt: rfc822;failsafe-outgoing On 2000-07-27T17:48:23, "Eric Z. Ayers" said: > I am developing a hardware platform that will include: > > 2 X86 boxes > Internal IDE disk in each box > Shared SCSI (AHA2940 cards) with (software) RAID1 > Hopefully some sort of journaling FS when it becomes available > Right now I have a single 3c59x card in each machine, but I will > have dual ethernet in the final config (probabaly Intel nics) You may wish to have separate NICs though - my personal estimate is that if one port fails on a dual port card, the other one is likely to fail too. > serial cable interconnect for heartbeat Failsafe cannot currently use this, and I am unaware of the plan to make use of serial interconnect. > Yesterday, I finally got some test hardware in place and I've > installed Red Hat 6.2 in the IDE drive on each node. Good, thanks for testing this! So far, Failsafe has been mainly tested on SuSE Linux, but of course we want to support all platforms. > On the first box, I installed the failsafe binaries with 'fsinstall', > and then 'fsinstall server' and 'fsinstall client' I assume you mean "guiinstall" here? > I made sure that ulimit -c 100000 was in effect. > When I try to launch the GUI, I see the following: You can only launch the GUI directly from X or via X forwarding (using ssh/telnet). However, you do not have to run the GUI on the server, but you should be able to run the GUI on any client and have it connect to the proper server. Be aware that this is currently not encrypted and will transmit your root password over the network in plaintext! (You may wish to use a secured network or forward the GUI over ssh) Sincerely, Lars Marowsky-Brée Development HA -- Perfection is our goal, excellence will be tolerated. -- J. Yahl From owner-failsafe@oss.sgi.com Fri Jul 28 06:16:37 2000 Received: by oss.sgi.com id ; Fri, 28 Jul 2000 06:16:28 -0700 Received: from pyongsan.compgen.com ([158.155.0.1]:5901 "EHLO gw1.compgen.com") by oss.sgi.com with ESMTP id ; Fri, 28 Jul 2000 06:16:19 -0700 Received: from uxeric.compgen.com (root@uxeric.compgen.com [158.155.4.32]) by gw1.compgen.com (8.8.7/8.8.7) with ESMTP id JAA17473; Fri, 28 Jul 2000 09:15:56 -0400 Received: (from eric@localhost) by uxeric.compgen.com (8.9.3/8.9.3) id JAA26912; Fri, 28 Jul 2000 09:15:56 -0400 From: "Eric Z. Ayers" MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Message-ID: <14721.34699.333611.737926@gargle.gargle.HOWL> Date: Fri, 28 Jul 2000 09:15:55 -0400 (EDT) To: rusty@sgi.com Cc: Eric.Ayers@compgen.com, paddy@rlyeh.engr.sgi.com, failsafe@oss.sgi.com Subject: Re: [Fwd: Installed Linux Failsafe today] In-Reply-To: <10007271644.ZM116181@rlyeh.engr.sgi.com> References: <3980AF4D.F12AEE06@engr.sgi.com> <10007271644.ZM116181@rlyeh.engr.sgi.com> X-Mailer: VM 6.72 under 21.1 (patch 8) "Bryce Canyon" XEmacs Lucid Reply-To: Eric.Ayers@compgen.com X-Face: (3Y\Z;G!Ce[Q\WBgGFLgcaL%v[kJ'@9s`Qn1<)EEL5tSW7IDvX[{APQ5]eY}uF}%qbD[-@N !5]S!%o0*DbAB?~o%tca^?3@zU~"fQ@MTiClP>w%`Y8oG&6|:>2F=bhnf2>bPedqw-.T>U-BaI`F>1 QY@?oGJ0.lV?b@0HgvaOt>=0,/@,=(kE"J++vO?K"3ve@,"sunF0HnU|h&|:}%|P6%BohO_*mAHJ#g EHc;_'bXG|kCLMSF`:/O_F0fuJ:j2^C\NJ(:$izN@mbXQo(IL,BO.7<]wT?:5.A<$C Sender: owner-failsafe@oss.sgi.com Precedence: bulk Return-Path: X-Orcpt: rfc822;failsafe-outgoing Hello Rusty, Thanks for the reply. I'm sitting at my workstation. I do have all of those packages installed on the machine 'dru1a' where I was first trying. [root@dru1a failsafe]# rpm -i ./IBMJava118-JRE-1.1.8-3.0.i386.rpm ./sysadm_base-client-1.3.2-6.i386.rpm ./sysadm_failsafe-client-0.1-4.i386.rpm package IBMJava118-JRE-1.1.8-3.0 is already installed package sysadm_base-client-1.3.2-6 is already installed package sysadm_failsafe-client-0.1-4 is already installed I exported my display to my workstation and when I tried to run it I got the crash. But you are right. If I install the client software on my workstation I get this nice GUI and I can connect. -Eric. Rusty Ballinger writes: > > Yesterday, I finally got some test hardware in place and I've > > installed Red Hat 6.2 in the IDE drive on each node. Notably, I > > installed without X windows server support, but the X libraries are > > there. I suppose I could install the X server and KDE, but my goal is > > to use a minimal OS installation. > > You're running the java client on one of the server machines, but > you don't have an X server running? I'm pretty sure that won't > work! You can run the GUI from another machine and connect to the > servers; on the GUI machine, you will need to install > > IBMJava118-JRE (IBM's Java runtime) > sysadm_base-client > sysadm_failsafe-client > > Then when you run fstask on the client machine, you should get a > login dialog which lets you connect to your servers. > > --Rusty From owner-failsafe@oss.sgi.com Fri Jul 28 07:09:47 2000 Received: by oss.sgi.com id ; Fri, 28 Jul 2000 07:09:38 -0700 Received: from pyongsan.compgen.com ([158.155.0.1]:55820 "EHLO gw1.compgen.com") by oss.sgi.com with ESMTP id ; Fri, 28 Jul 2000 07:09:19 -0700 Received: from uxeric.compgen.com (root@uxeric.compgen.com [158.155.4.32]) by gw1.compgen.com (8.8.7/8.8.7) with ESMTP id KAA25552; Fri, 28 Jul 2000 10:09:15 -0400 Received: (from eric@localhost) by uxeric.compgen.com (8.9.3/8.9.3) id KAA27174; Fri, 28 Jul 2000 10:09:15 -0400 From: "Eric Z. Ayers" MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Message-ID: <14721.37898.612050.770034@gargle.gargle.HOWL> Date: Fri, 28 Jul 2000 10:09:14 -0400 (EDT) To: Lars Marowsky-Bree Cc: failsafe@oss.sgi.com Subject: Re: Installed Linux Failsafe today In-Reply-To: <20000728133454.J1346@marowsky-bree.de> References: <14720.44583.485879.72937@gargle.gargle.HOWL> <20000728133454.J1346@marowsky-bree.de> X-Mailer: VM 6.72 under 21.1 (patch 8) "Bryce Canyon" XEmacs Lucid Reply-To: Eric.Ayers@compgen.com X-Face: (3Y\Z;G!Ce[Q\WBgGFLgcaL%v[kJ'@9s`Qn1<)EEL5tSW7IDvX[{APQ5]eY}uF}%qbD[-@N !5]S!%o0*DbAB?~o%tca^?3@zU~"fQ@MTiClP>w%`Y8oG&6|:>2F=bhnf2>bPedqw-.T>U-BaI`F>1 QY@?oGJ0.lV?b@0HgvaOt>=0,/@,=(kE"J++vO?K"3ve@,"sunF0HnU|h&|:}%|P6%BohO_*mAHJ#g EHc;_'bXG|kCLMSF`:/O_F0fuJ:j2^C\NJ(:$izN@mbXQo(IL,BO.7<]wT?:5.A<$C Sender: owner-failsafe@oss.sgi.com Precedence: bulk Return-Path: X-Orcpt: rfc822;failsafe-outgoing Hello Lars, Thank you too for the reply. Lars Marowsky-Bree writes: > On 2000-07-27T17:48:23, > "Eric Z. Ayers" said: > > > I am developing a hardware platform that will include: > > > > 2 X86 boxes > > Internal IDE disk in each box > > Shared SCSI (AHA2940 cards) with (software) RAID1 > > Hopefully some sort of journaling FS when it becomes available > > Right now I have a single 3c59x card in each machine, but I will > > have dual ethernet in the final config (probabaly Intel nics) > > You may wish to have separate NICs though - my personal estimate is that if > one port fails on a dual port card, the other one is likely to fail > too. They will both be built into the motherboard. We just had a Sun box with 1 port on a 4 port card to go bad. There doesn't seem to be a lot of shared hardware between the two ports as far as I can tell. > > serial cable interconnect for heartbeat > > Failsafe cannot currently use this, and I am unaware of the plan to make use > of serial interconnect. OK - I got the idea that a null modem cable going between the two boxes would be the best heartbeat medium - but that is probably from reading about the 'heartbeat' project at linux-ha.org. Is there any reason why a null modem cable running ppp wouldn't be a good path for heartbeat/control in a 2 node setup? ... > > On the first box, I installed the failsafe binaries with 'fsinstall', > > and then 'fsinstall server' and 'fsinstall client' > > I assume you mean "guiinstall" here? yes. I don't completely understand why I can't export the DISPLAY variable and run the client, but I'll keep plugging along with what i have. --------------------------------------------------------------------------- I've hooked up the 2nd ethernet NICs and put a crossover cable between the two. I'm assuming that the "system controller port" is SGI specific hardware that I don't have. I will have some similar hardware. We're looking at a hardware platform that has an "EMB" port. Maybe once I get the demo box in with the EMB port and the failsafe source code is released, we could see if it would be possible to support it under failsafe. So I just un-checked "Set Reset Parameters for node dru1a" and kept going. I got the error: Task failed. setuid root access to "/usr/lib/sysadm/privbin" denied because it is writable by group or others. use the chmod(1) command fix access. (sic) [cgi@dru1a sysadm]$ ls -lad privbin drwxrwxr-x 2 root root 2048 Jan 2 00:54 privbin OK, so I fixed it. [root@dru1a sysadm]# chmod -w privbin [root@dru1a sysadm]# ls -lad privbin dr-xr-xr-x 2 root root 2048 Jan 2 00:54 privbin I'm fwding to you becuase I thought you might be able to fix this in the rpm or the install scripts. -Eric. From owner-failsafe@oss.sgi.com Fri Jul 28 07:25:37 2000 Received: by oss.sgi.com id ; Fri, 28 Jul 2000 07:25:28 -0700 Received: from pyongsan.compgen.com ([158.155.0.1]:20496 "EHLO gw1.compgen.com") by oss.sgi.com with ESMTP id ; Fri, 28 Jul 2000 07:25:12 -0700 Received: from uxeric.compgen.com (root@uxeric.compgen.com [158.155.4.32]) by gw1.compgen.com (8.8.7/8.8.7) with ESMTP id KAA27414 for ; Fri, 28 Jul 2000 10:25:08 -0400 Received: (from eric@localhost) by uxeric.compgen.com (8.9.3/8.9.3) id KAA27188; Fri, 28 Jul 2000 10:25:08 -0400 From: "Eric Z. Ayers" MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Message-ID: <14721.38852.621813.613105@gargle.gargle.HOWL> Date: Fri, 28 Jul 2000 10:25:08 -0400 (EDT) To: failsafe@oss.sgi.com Subject: more install notes under RH6.2 X-Mailer: VM 6.72 under 21.1 (patch 8) "Bryce Canyon" XEmacs Lucid Reply-To: Eric.Ayers@compgen.com X-Face: (3Y\Z;G!Ce[Q\WBgGFLgcaL%v[kJ'@9s`Qn1<)EEL5tSW7IDvX[{APQ5]eY}uF}%qbD[-@N !5]S!%o0*DbAB?~o%tca^?3@zU~"fQ@MTiClP>w%`Y8oG&6|:>2F=bhnf2>bPedqw-.T>U-BaI`F>1 QY@?oGJ0.lV?b@0HgvaOt>=0,/@,=(kE"J++vO?K"3ve@,"sunF0HnU|h&|:}%|P6%BohO_*mAHJ#g EHc;_'bXG|kCLMSF`:/O_F0fuJ:j2^C\NJ(:$izN@mbXQo(IL,BO.7<]wT?:5.A<$C Sender: owner-failsafe@oss.sgi.com Precedence: bulk Return-Path: X-Orcpt: rfc822;failsafe-outgoing I'm assuming you guys are interested in these kinds of notes. Please let me know if you'd rather me not mail these kinds of things out to the list: Under 'Define Cluster', I chose Notify Administrator --> By email the mailer defaults to '/usr/sbin/Mail' which isn't right for Red Hat My system has /usr/bin/Mail... I'm assuming that's the right one to use. Well nuts, I have to go dork with some hardware right now... more later. -Eric. From owner-failsafe@oss.sgi.com Fri Jul 28 10:30:00 2000 Received: by oss.sgi.com id ; Fri, 28 Jul 2000 10:29:40 -0700 Received: from pyongsan.compgen.com ([158.155.0.1]:2059 "EHLO gw1.compgen.com") by oss.sgi.com with ESMTP id ; Fri, 28 Jul 2000 10:29:10 -0700 Received: from uxeric.compgen.com (root@uxeric.compgen.com [158.155.4.32]) by gw1.compgen.com (8.8.7/8.8.7) with ESMTP id NAA16637 for ; Fri, 28 Jul 2000 13:29:06 -0400 Received: (from eric@localhost) by uxeric.compgen.com (8.9.3/8.9.3) id NAA29507; Fri, 28 Jul 2000 13:29:06 -0400 From: "Eric Z. Ayers" MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Message-ID: <14721.49889.872093.449784@gargle.gargle.HOWL> Date: Fri, 28 Jul 2000 13:29:05 -0400 (EDT) To: failsafe@oss.sgi.com Subject: more install notes under RH6.2 In-Reply-To: <14721.38852.621813.613105@gargle.gargle.HOWL> References: <14721.38852.621813.613105@gargle.gargle.HOWL> X-Mailer: VM 6.72 under 21.1 (patch 8) "Bryce Canyon" XEmacs Lucid Reply-To: Eric.Ayers@compgen.com X-Face: (3Y\Z;G!Ce[Q\WBgGFLgcaL%v[kJ'@9s`Qn1<)EEL5tSW7IDvX[{APQ5]eY}uF}%qbD[-@N !5]S!%o0*DbAB?~o%tca^?3@zU~"fQ@MTiClP>w%`Y8oG&6|:>2F=bhnf2>bPedqw-.T>U-BaI`F>1 QY@?oGJ0.lV?b@0HgvaOt>=0,/@,=(kE"J++vO?K"3ve@,"sunF0HnU|h&|:}%|P6%BohO_*mAHJ#g EHc;_'bXG|kCLMSF`:/O_F0fuJ:j2^C\NJ(:$izN@mbXQo(IL,BO.7<]wT?:5.A<$C Sender: owner-failsafe@oss.sgi.com Precedence: bulk Return-Path: X-Orcpt: rfc822;failsafe-outgoing Another permissions problem I ran into when trying to test Network Connectivity: $ cd /usr/lib/sysadm/privbin; $ ls -l ClusterDiags -rwxrwxr-x 1 root root 79 Jan 2 00:54 ClusterDiags (the write bit needs to be off of this executable) Once I fixed that, it tells me that: Cluster Diagnostics have not been implemented in this release. ------------------------------------------------------------------------------ >From the GUI, I added my nodes to my cluster definition, and then chose: Fix or Upgrade Cluster Nodes --> Start Failsafe HA Services --> Start It only started the first node, not the second. (No, I didn't select the node from the optional drop down list) I the went back and start it specifically on the second node (this time, I did select the node from the drop down list), and then the gui updated the status of the second node from 'Inactive' to 'OK'. No big deal, I just wondered why it didn't come up on both nodes the first time. I attached a portion of /var/log/messages below. There are some ugly looking messages about: Stale CDB handle. CI_IPCERR_NOSERVER, cms ipc: ipcclnt_connect() failed, file /var/cluster/ha/comm/cmsd-ipc_dru1a .Check if the cmsd daemon is running. from the time I attempted to start the cluster. ------------------------------------------------------------------------------ OK, now I'm at the point where I want to define my resources. >From FailSafe Manager: --> Resources & Resource Types --> Define a New Resource I get a dialog: Create a new Resource Definition It looks like I don't have any default resource types defined. All I've got in the 'Resource Type' drop down list is 'template'. The admin guide says there are some pre-defined resource types that look handy. Do you have any definitions already made up for linux for: IP Address filesystem I'll also need a RAID resource. I guess I need to crack open the Programming guide. I'll have to do it eventually for my application anyway. ----------------------------------------------------------------------------- Just for yucks, I took a look at /var/cluster/ha/log. It looks like one of the files is growing quite a bit! [root@dru1a log]# ls -l total 23669 -rw-r--r-- 1 root root 23892291 Jan 2 22:43 cad_log -rw-r--r-- 1 root root 5922 Jan 2 22:36 cli_dru1a -rw------- 1 root root 127777 Jan 2 22:36 cmond_log -rw-r--r-- 1 root root 6173 Jan 2 22:24 cmsd_dru1a -rw-r--r-- 1 root root 4782 Jan 2 22:15 crsd_dru1a -rw-r--r-- 1 root root 8435 Jan 2 22:25 failsafe_dru1a -rw------- 1 root root 50639 Jan 2 22:16 fs2d_log -rw-r--r-- 1 root root 36799 Jan 2 22:43 gcd_dru1a -rw-r--r-- 1 root root 1153 Jan 2 22:16 srmd_dru1a I take it there is some kind of debugging turned on in the 'cad' deamon?!? I only configured a 100MB var partition, so it looks like I'll only be able to run my cluster for a few days :-) Regards, -Eric. --- (/var/log/messages excerpt) Jan 2 22:10:10 dru1a PAM_pwdb[1593]: (su) session closed for user root Jan 2 22:10:14 dru1a runpriv[1598]: Running privilege ClusterDiags for user root. Jan 2 22:10:59 dru1a runpriv[1604]: Running privilege ClusterDiags for user root. Jan 2 22:11:05 dru1a runpriv[1605]: Running privilege ClusterDiags for user root. Jan 2 22:13:08 dru1a runpriv[1620]: Running privilege ClusterDiags for user root. Jan 2 22:14:40 dru1a runpriv[1631]: Running privilege haParamsModify for user root. Jan 2 22:14:40 dru1a cli[1631]: < E config 0> CI_ERR_INVAL, Internal error: inte rnal argument is invalid : Internal error no nodes in cluster Jan 2 22:14:41 dru1a cli[1631]: < E config 0> CI_ERR_INVAL, CLI private command: failed (Internal error no nodes in cluster) Jan 2 22:14:53 dru1a runpriv[1637]: Running privilege clusterAddMachine for user roo t. Jan 2 22:14:56 dru1a cmond[537]: Notification can not be processed , local machine and cluster name is not known. Jan 2 22:14:56 dru1a cmond[537]: Local machine belongs to cluster dru. Jan 2 22:14:56 dru1a cmond[537]: Local machine name is dru1a. Jan 2 22:15:02 dru1a cmond[537]: Stale CDB handle. Jan 2 22:15:02 dru1a crsd[549]: < N log 0> Additional crsd logs can be found in /var/cluster/ha/log/crsd_dru1a. Jan 2 22:15:21 dru1a runpriv[1692]: Running privilege haActivate for user root. Jan 2 22:15:21 dru1a cmond[537]: New process ha_cmsd pid 1702 Jan 2 22:15:21 dru1a cmond[537]: New process ha_gcd pid 1703 Jan 2 22:15:21 dru1a cmond[537]: New process ha_srmd pid 1704 Jan 2 22:15:21 dru1a cmond[537]: New process ha_fsd pid 1706 Jan 2 22:15:22 dru1a ha_cmsd[1702]: < N log 0> Additional ha_cmsd logs can be fo und in /var/cluster/ha/log/cmsd_dru1a. Jan 2 22:15:22 dru1a ha_gcd[1703]: < N log 0> Additional ha_gcd logs can be foun d in /var/cluster/ha/log/gcd_dru1a. Jan 2 22:15:22 dru1a ha_cmsd[1702]: < N cms 0> ha_cmsd restarted. Jan 2 22:15:22 dru1a ha_fsd[1706]: < N log 0> Additional ha_fsd logs can be foun d in /var/cluster/ha/log/failsafe_dru1a. Jan 2 22:15:22 dru1a ha_fsd[1706]: < N fsd 0> /usr/cluster/bin/ha_fsd is running as foreground process Jan 2 22:15:23 dru1a ha_srmd[1704]: < N log 0> Additional ha_srmd logs can be fo und in /var/cluster/ha/log/srmd_dru1a. Jan 2 22:15:23 dru1a ha_cmsd[1702]: < N log 0> Additional ha_cmsd logs can be fo und in /var/cluster/ha/log/cmsd_dru1a. Jan 2 22:15:23 dru1a ha_gcd[1703]: < N log 0> Additional ha_gcd logs can be foun d in /var/cluster/ha/log/gcd_dru1a. Jan 2 22:15:23 dru1a ha_gcd[1703]: < N gcd 0> My node name = dru1a. Jan 2 22:15:23 dru1a ha_gcd[1703]: < E cms 0> CI_IPCERR_NOSERVER, cms ipc: ipccl nt_connect() failed, file /var/cluster/ha/comm/cmsd-ipc_dru1a .Check if the cmsd daem on is running. Jan 2 22:15:24 dru1a ha_gcd[1703]: < E cms 0> CI_IPCERR_NOSERVER, cms ipc: ipccl nt_connect() failed, file /var/cluster/ha/comm/cmsd-ipc_dru1a .Check if the cmsd daem on is running. Jan 2 22:15:26 dru1a ha_cmsd[1702]: < N cms 0> Confirmed Membership: sqn 1 G_sqn = 1, ack false node dru1a [1] : UP incarnation 1 age 1:0 node dru1b [2] : DOWN* incarnation 0 age 0:0 Jan 2 22:15:27 dru1a ha_gcd[1703]: < N gcd 0> My nodeid = 1 [0x1]. Jan 2 22:15:46 dru1a ha_srmd[1733]: < N srm 2> SRM ready to accept clients Jan 2 22:16:30 dru1a ha_fsd[1706]: < N fsd 0> FailSafe initialization complete - - Move to state: UP Jan 2 22:24:07 dru1a runpriv[1746]: Running privilege haActivate for user root. Jan 2 22:24:08 dru1a ha_cmsd[1717]: < N log 1> Additional ha_cmsd logs can be fo und in /var/cluster/ha/log/cmsd_dru1a. Jan 2 22:24:38 dru1a ha_cmsd[1702]: < N cms 0> Node dru1b id 2 added/enabled. Jan 2 22:24:41 dru1a ha_cmsd[1702]: < N cms 0> Confirmed Membership: sqn 2 G_sqn = 2, ack false node dru1a [1] : UP incarnation 1 age 2:0 node dru1b [2] : UP inc arnation 1 age 1:0 From owner-failsafe@oss.sgi.com Fri Jul 28 11:26:02 2000 Received: by oss.sgi.com id ; Fri, 28 Jul 2000 11:25:52 -0700 Received: from pyongsan.compgen.com ([158.155.0.1]:38413 "EHLO gw1.compgen.com") by oss.sgi.com with ESMTP id ; Fri, 28 Jul 2000 11:25:29 -0700 Received: from uxeric.compgen.com (root@uxeric.compgen.com [158.155.4.32]) by gw1.compgen.com (8.8.7/8.8.7) with ESMTP id OAA18330 for ; Fri, 28 Jul 2000 14:16:16 -0400 Received: (from eric@localhost) by uxeric.compgen.com (8.9.3/8.9.3) id OAA29796; Fri, 28 Jul 2000 14:16:16 -0400 From: "Eric Z. Ayers" Message-ID: <14721.52720.506800.847847@gargle.gargle.HOWL> Date: Fri, 28 Jul 2000 14:16:16 -0400 (EDT) MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit To: failsafe@oss.sgi.com Subject: FS_Prog_Guide, MANPATH X-Mailer: VM 6.72 under 21.1 (patch 8) "Bryce Canyon" XEmacs Lucid Reply-To: Eric.Ayers@compgen.com X-Face: (3Y\Z;G!Ce[Q\WBgGFLgcaL%v[kJ'@9s`Qn1<)EEL5tSW7IDvX[{APQ5]eY}uF}%qbD[-@N !5]S!%o0*DbAB?~o%tca^?3@zU~"fQ@MTiClP>w%`Y8oG&6|:>2F=bhnf2>bPedqw-.T>U-BaI`F>1 QY@?oGJ0.lV?b@0HgvaOt>=0,/@,=(kE"J++vO?K"3ve@,"sunF0HnU|h&|:}%|P6%BohO_*mAHJ#g EHc;_'bXG|kCLMSF`:/O_F0fuJ:j2^C\NJ(:$izN@mbXQo(IL,BO.7<]wT?:5.A<$C Sender: owner-failsafe@oss.sgi.com Precedence: bulk Return-Path: X-Orcpt: rfc822;failsafe-outgoing On RH Linux 6.2, the man pages aren't visible by default. /usr/share/man has to be added to MANPATH in /etc/profile (at least, that's what I did) -Eric. From owner-failsafe@oss.sgi.com Fri Jul 28 11:31:02 2000 Received: by oss.sgi.com id ; Fri, 28 Jul 2000 11:30:42 -0700 Received: from pyongsan.compgen.com ([158.155.0.1]:52494 "EHLO gw1.compgen.com") by oss.sgi.com with ESMTP id ; Fri, 28 Jul 2000 11:30:28 -0700 Received: from uxeric.compgen.com (root@uxeric.compgen.com [158.155.4.32]) by gw1.compgen.com (8.8.7/8.8.7) with ESMTP id OAA18221 for ; Fri, 28 Jul 2000 14:07:23 -0400 Received: (from eric@localhost) by uxeric.compgen.com (8.9.3/8.9.3) id OAA29789; Fri, 28 Jul 2000 14:07:23 -0400 From: "Eric Z. Ayers" MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Message-ID: <14721.52186.955592.208904@gargle.gargle.HOWL> Date: Fri, 28 Jul 2000 14:07:22 -0400 (EDT) To: failsafe@oss.sgi.com Subject: FS_Prog_Guide X-Mailer: VM 6.72 under 21.1 (patch 8) "Bryce Canyon" XEmacs Lucid Reply-To: Eric.Ayers@compgen.com X-Face: (3Y\Z;G!Ce[Q\WBgGFLgcaL%v[kJ'@9s`Qn1<)EEL5tSW7IDvX[{APQ5]eY}uF}%qbD[-@N !5]S!%o0*DbAB?~o%tca^?3@zU~"fQ@MTiClP>w%`Y8oG&6|:>2F=bhnf2>bPedqw-.T>U-BaI`F>1 QY@?oGJ0.lV?b@0HgvaOt>=0,/@,=(kE"J++vO?K"3ve@,"sunF0HnU|h&|:}%|P6%BohO_*mAHJ#g EHc;_'bXG|kCLMSF`:/O_F0fuJ:j2^C\NJ(:$izN@mbXQo(IL,BO.7<]wT?:5.A<$C Sender: owner-failsafe@oss.sgi.com Precedence: bulk Return-Path: X-Orcpt: rfc822;failsafe-outgoing I'm reading the programming guide now. Maybe I don't have everything installed right, but some of the paths seem incorrect: Table 1-3 on page 24 states: "/usr/share/failsafe/resource_types/template : Directory that contains the action template scripts" On my system, they seem to be in: /var/cluster/ha/resource_types I think that the other paths in Table 1-3 need to be checked. -Eric. From owner-failsafe@oss.sgi.com Fri Jul 28 11:37:42 2000 Received: by oss.sgi.com id ; Fri, 28 Jul 2000 11:37:32 -0700 Received: from pneumatic-tube.sgi.com ([204.94.214.22]:9775 "EHLO pneumatic-tube.sgi.com") by oss.sgi.com with ESMTP id ; Fri, 28 Jul 2000 11:37:21 -0700 Received: from miku.engr.sgi.com (miku.engr.sgi.com [163.154.34.25]) by pneumatic-tube.sgi.com (980327.SGI.8.8.8-aspam/980310.SGI-aspam) via ESMTP id LAA04826 for ; Fri, 28 Jul 2000 11:43:18 -0700 (PDT) mail_from (vasa@engr.sgi.com) Received: from engr.sgi.com (catapult.engr.sgi.com [192.26.80.17]) by miku.engr.sgi.com (SGI-8.9.3/8.9.3) with ESMTP id LAA67435; Fri, 28 Jul 2000 11:35:39 -0700 (PDT) Message-ID: <3981D27A.E64E0B3A@engr.sgi.com> Date: Fri, 28 Jul 2000 11:35:38 -0700 From: Mayank Vasa Organization: Silicon Graphics Inc. X-Mailer: Mozilla 4.7C-SGI [en] (X11; I; IRIX 6.5-ALPHA-1276552520 IP32) X-Accept-Language: en MIME-Version: 1.0 To: Lars Marowsky-Bree CC: Eric.Ayers@compgen.com, failsafe@oss.sgi.com Subject: Re: Installed Linux Failsafe today References: <14720.44583.485879.72937@gargle.gargle.HOWL> <20000728133454.J1346@marowsky-bree.de> Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Sender: owner-failsafe@oss.sgi.com Precedence: bulk Return-Path: X-Orcpt: rfc822;failsafe-outgoing Lars Marowsky-Bree wrote: > > On 2000-07-27T17:48:23, > "Eric Z. Ayers" said: > > > I am developing a hardware platform that will include: > > > > 2 X86 boxes > > Internal IDE disk in each box > > Shared SCSI (AHA2940 cards) with (software) RAID1 > > Hopefully some sort of journaling FS when it becomes available > > Right now I have a single 3c59x card in each machine, but I will > > have dual ethernet in the final config (probabaly Intel nics) > > You may wish to have separate NICs though - my personal estimate is that if > one port fails on a dual port card, the other one is likely to fail too. > > > serial cable interconnect for heartbeat > > Failsafe cannot currently use this, and I am unaware of the plan to make use > of serial interconnect. > > > Yesterday, I finally got some test hardware in place and I've > > installed Red Hat 6.2 in the IDE drive on each node. > > Good, thanks for testing this! So far, Failsafe has been mainly tested on SuSE > Linux, but of course we want to support all platforms. > Testing is being done on both the OSes on our side. We have 2 Redhat clusters inhouse on which we are testing in addition to 2 SuSE clusters. [snip] -- Mayank Vasa Linux FailSafe Team. From owner-failsafe@oss.sgi.com Fri Jul 28 11:48:03 2000 Received: by oss.sgi.com id ; Fri, 28 Jul 2000 11:47:43 -0700 Received: from pneumatic-tube.sgi.com ([204.94.214.22]:62511 "EHLO pneumatic-tube.sgi.com") by oss.sgi.com with ESMTP id ; Fri, 28 Jul 2000 11:47:38 -0700 Received: from miku.engr.sgi.com (miku.engr.sgi.com [163.154.34.25]) by pneumatic-tube.sgi.com (980327.SGI.8.8.8-aspam/980310.SGI-aspam) via ESMTP id LAA01609 for ; Fri, 28 Jul 2000 11:53:34 -0700 (PDT) mail_from (vasa@engr.sgi.com) Received: from engr.sgi.com (catapult.engr.sgi.com [192.26.80.17]) by miku.engr.sgi.com (SGI-8.9.3/8.9.3) with ESMTP id LAA71721; Fri, 28 Jul 2000 11:46:02 -0700 (PDT) Message-ID: <3981D4EA.DA489EEF@engr.sgi.com> Date: Fri, 28 Jul 2000 11:46:02 -0700 From: Mayank Vasa Organization: Silicon Graphics Inc. X-Mailer: Mozilla 4.7C-SGI [en] (X11; I; IRIX 6.5-ALPHA-1276552520 IP32) X-Accept-Language: en MIME-Version: 1.0 To: Eric.Ayers@compgen.com CC: failsafe@oss.sgi.com Subject: Re: [Fwd: Installed Linux Failsafe today] References: <3980AF4D.F12AEE06@engr.sgi.com> <10007271644.ZM116181@rlyeh.engr.sgi.com> <14721.34699.333611.737926@gargle.gargle.HOWL> Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Sender: owner-failsafe@oss.sgi.com Precedence: bulk Return-Path: X-Orcpt: rfc822;failsafe-outgoing "Eric Z. Ayers" wrote: > > Hello Rusty, > > Thanks for the reply. > > I'm sitting at my workstation. I do have all of those packages > installed on the machine 'dru1a' where I was first trying. > > [root@dru1a failsafe]# rpm -i ./IBMJava118-JRE-1.1.8-3.0.i386.rpm ./sysadm_base-client-1.3.2-6.i386.rpm ./sysadm_failsafe-client-0.1-4.i386.rpm > package IBMJava118-JRE-1.1.8-3.0 is already installed > package sysadm_base-client-1.3.2-6 is already installed > package sysadm_failsafe-client-0.1-4 is already installed > > I exported my display to my workstation and when I tried to run it I > got the crash. > > But you are right. If I install the client software on my workstation > I get this nice GUI and I can connect. > > -Eric. > That's strange. That's how I test, I export the display of the test machine in the lab to my desktop in my office. By any chance, are you missing any essential X-libs? -- Mayank Vasa Linux FailSafe Team. From owner-failsafe@oss.sgi.com Fri Jul 28 11:54:03 2000 Received: by oss.sgi.com id ; Fri, 28 Jul 2000 11:53:43 -0700 Received: from pyongsan.compgen.com ([158.155.0.1]:43794 "EHLO gw1.compgen.com") by oss.sgi.com with ESMTP id ; Fri, 28 Jul 2000 11:53:28 -0700 Received: from uxeric.compgen.com (root@uxeric.compgen.com [158.155.4.32]) by gw1.compgen.com (8.8.7/8.8.7) with ESMTP id OAA21046; Fri, 28 Jul 2000 14:53:24 -0400 Received: (from eric@localhost) by uxeric.compgen.com (8.9.3/8.9.3) id OAA29976; Fri, 28 Jul 2000 14:53:24 -0400 From: "Eric Z. Ayers" MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Message-ID: <14721.54948.773464.331804@gargle.gargle.HOWL> Date: Fri, 28 Jul 2000 14:53:24 -0400 (EDT) To: Mayank Vasa Cc: Eric.Ayers@compgen.com, failsafe@oss.sgi.com Subject: Re: [Fwd: Installed Linux Failsafe today] In-Reply-To: <3981D4EA.DA489EEF@engr.sgi.com> References: <3980AF4D.F12AEE06@engr.sgi.com> <10007271644.ZM116181@rlyeh.engr.sgi.com> <14721.34699.333611.737926@gargle.gargle.HOWL> <3981D4EA.DA489EEF@engr.sgi.com> X-Mailer: VM 6.72 under 21.1 (patch 8) "Bryce Canyon" XEmacs Lucid Reply-To: Eric.Ayers@compgen.com X-Face: (3Y\Z;G!Ce[Q\WBgGFLgcaL%v[kJ'@9s`Qn1<)EEL5tSW7IDvX[{APQ5]eY}uF}%qbD[-@N !5]S!%o0*DbAB?~o%tca^?3@zU~"fQ@MTiClP>w%`Y8oG&6|:>2F=bhnf2>bPedqw-.T>U-BaI`F>1 QY@?oGJ0.lV?b@0HgvaOt>=0,/@,=(kE"J++vO?K"3ve@,"sunF0HnU|h&|:}%|P6%BohO_*mAHJ#g EHc;_'bXG|kCLMSF`:/O_F0fuJ:j2^C\NJ(:$izN@mbXQo(IL,BO.7<]wT?:5.A<$C Sender: owner-failsafe@oss.sgi.com Precedence: bulk Return-Path: X-Orcpt: rfc822;failsafe-outgoing Mayank Vasa writes: > "Eric Z. Ayers" wrote: > > I'm sitting at my workstation. I do have all of those packages > > installed on the machine 'dru1a' where I was first trying. ... > > But you are right. If I install the client software on my workstation > > I get this nice GUI and I can connect. > > > > -Eric. > > > > That's strange. That's how I test, I export the display of the test > machine in the lab to my desktop in my office. By any chance, are you > missing any essential X-libs? That is hard to say. I'm not getting a unresolved symbol error or can't find libXXXX.so error. It could be that the 'locale' error has something to do with it. current locale is not supported in X11, locale is set to C X locale modifiers are not supported, using default I do not get these errors when I run the client from my local box. My guess is that some files needed for i18n support didn't get installed when I left out the X server. I SEEM to have a full complement of X11 Libraries. [root@dru1a lib]# pwd /usr/X11R6/lib [root@dru1a lib]# ls -l total 2083 drwxr-xr-x 2 root root 1024 Jan 2 03:00 X11 lrwxrwxrwx 1 root root 13 Jan 1 22:28 libICE.so.6 -> libICE.so.6.3 -rwxr-xr-x 1 root root 95790 Mar 6 2000 libICE.so.6.3 lrwxrwxrwx 1 root root 14 Jan 1 22:28 libPEX5.so.6 -> libPEX5.so.6.0 -rwxr-xr-x 1 root root 259072 Mar 6 2000 libPEX5.so.6.0 lrwxrwxrwx 1 root root 12 Jan 1 22:28 libSM.so.6 -> libSM.so.6.0 -rwxr-xr-x 1 root root 40684 Mar 6 2000 libSM.so.6.0 lrwxrwxrwx 1 root root 13 Jan 1 22:28 libX11.so.6 -> libX11.so.6.1 -rwxr-xr-x 1 root root 799870 Mar 6 2000 libX11.so.6.1 lrwxrwxrwx 1 root root 13 Jan 1 22:28 libXIE.so.6 -> libXIE.so.6.0 -rwxr-xr-x 1 root root 54537 Mar 6 2000 libXIE.so.6.0 lrwxrwxrwx 1 root root 13 Jan 1 22:28 libXaw.so.6 -> libXaw.so.6.1 -rwxr-xr-x 1 root root 272797 Mar 6 2000 libXaw.so.6.1 lrwxrwxrwx 1 root root 14 Jan 1 22:28 libXext.so.6 -> libXext.so.6.3 -rwxr-xr-x 1 root root 54591 Mar 6 2000 libXext.so.6.3 lrwxrwxrwx 1 root root 12 Jan 1 22:28 libXi.so.6 -> libXi.so.6.0 -rwxr-xr-x 1 root root 36567 Mar 6 2000 libXi.so.6.0 lrwxrwxrwx 1 root root 13 Jan 1 22:28 libXmu.so.6 -> libXmu.so.6.0 -rwxr-xr-x 1 root root 91196 Mar 6 2000 libXmu.so.6.0 lrwxrwxrwx 1 root root 12 Jan 1 22:28 libXp.so.6 -> libXp.so.6.2 -rwxr-xr-x 1 root root 34337 Mar 6 2000 libXp.so.6.2 lrwxrwxrwx 1 root root 12 Jan 1 22:28 libXt.so.6 -> libXt.so.6.0 -rwxr-xr-x 1 root root 345760 Mar 6 2000 libXt.so.6.0 lrwxrwxrwx 1 root root 14 Jan 1 22:28 libXtst.so.6 -> libXtst.so.6.1 -rwxr-xr-x 1 root root 22020 Mar 6 2000 libXtst.so.6.1 From owner-failsafe@oss.sgi.com Fri Jul 28 12:07:03 2000 Received: by oss.sgi.com id ; Fri, 28 Jul 2000 12:06:53 -0700 Received: from pyongsan.compgen.com ([158.155.0.1]:9733 "EHLO gw1.compgen.com") by oss.sgi.com with ESMTP id ; Fri, 28 Jul 2000 12:06:29 -0700 Received: from uxeric.compgen.com (root@uxeric.compgen.com [158.155.4.32]) by gw1.compgen.com (8.8.7/8.8.7) with ESMTP id PAA22136; Fri, 28 Jul 2000 15:05:58 -0400 Received: (from eric@localhost) by uxeric.compgen.com (8.9.3/8.9.3) id PAA30109; Fri, 28 Jul 2000 15:05:58 -0400 From: "Eric Z. Ayers" MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Message-ID: <14721.55701.784293.128746@gargle.gargle.HOWL> Date: Fri, 28 Jul 2000 15:05:57 -0400 (EDT) To: linuxfailsafe@lists.tummy.com, failsafe@oss.sgi.com Subject: Missing man pages X-Mailer: VM 6.72 under 21.1 (patch 8) "Bryce Canyon" XEmacs Lucid Reply-To: Eric.Ayers@compgen.com X-Face: (3Y\Z;G!Ce[Q\WBgGFLgcaL%v[kJ'@9s`Qn1<)EEL5tSW7IDvX[{APQ5]eY}uF}%qbD[-@N !5]S!%o0*DbAB?~o%tca^?3@zU~"fQ@MTiClP>w%`Y8oG&6|:>2F=bhnf2>bPedqw-.T>U-BaI`F>1 QY@?oGJ0.lV?b@0HgvaOt>=0,/@,=(kE"J++vO?K"3ve@,"sunF0HnU|h&|:}%|P6%BohO_*mAHJ#g EHc;_'bXG|kCLMSF`:/O_F0fuJ:j2^C\NJ(:$izN@mbXQo(IL,BO.7<]wT?:5.A<$C Sender: owner-failsafe@oss.sgi.com Precedence: bulk Return-Path: X-Orcpt: rfc822;failsafe-outgoing Hello again, I'm reading through the Failsafe programming guide now. In section 2.3 (page 30) it says to read some man pages that aren't here: cdbd ha_exec2 ha_fsd ha_ifd ha_ifadmin ha_macconfig2 ha_statd2 I can guess that the statd2 might be included in the NFS package. Here's the list of man pages I have installed in /usr/share/man/ cbeutil.1m cdbBackup.1m cdbRestore.1m cdbutil.1m cluster_mgr.1m cmond.1m crsd.1m failsafe.1m fs2d.1m haStatus.1m ha_cilog.1m ha_cmsd.1m ha_gcd.1m ha_srmd.1m Sorry, I have been using the 'failsafe@oss.sgi.com' list and I guess I should be using the 'linuxfailsafe@lists.tummy.com', shouldn't I? -Eric. From owner-failsafe@oss.sgi.com Fri Jul 28 12:26:04 2000 Received: by oss.sgi.com id ; Fri, 28 Jul 2000 12:25:54 -0700 Received: from pyongsan.compgen.com ([158.155.0.1]:7690 "EHLO gw1.compgen.com") by oss.sgi.com with ESMTP id ; Fri, 28 Jul 2000 12:25:33 -0700 Received: from uxeric.compgen.com (root@uxeric.compgen.com [158.155.4.32]) by gw1.compgen.com (8.8.7/8.8.7) with ESMTP id PAA24722 for ; Fri, 28 Jul 2000 15:25:30 -0400 Received: (from eric@localhost) by uxeric.compgen.com (8.9.3/8.9.3) id PAA30123; Fri, 28 Jul 2000 15:25:30 -0400 From: "Eric Z. Ayers" MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Message-ID: <14721.56874.395254.185177@gargle.gargle.HOWL> Date: Fri, 28 Jul 2000 15:25:30 -0400 (EDT) To: failsafe@oss.sgi.com Subject: haStatus X-Mailer: VM 6.72 under 21.1 (patch 8) "Bryce Canyon" XEmacs Lucid Reply-To: Eric.Ayers@compgen.com X-Face: (3Y\Z;G!Ce[Q\WBgGFLgcaL%v[kJ'@9s`Qn1<)EEL5tSW7IDvX[{APQ5]eY}uF}%qbD[-@N !5]S!%o0*DbAB?~o%tca^?3@zU~"fQ@MTiClP>w%`Y8oG&6|:>2F=bhnf2>bPedqw-.T>U-BaI`F>1 QY@?oGJ0.lV?b@0HgvaOt>=0,/@,=(kE"J++vO?K"3ve@,"sunF0HnU|h&|:}%|P6%BohO_*mAHJ#g EHc;_'bXG|kCLMSF`:/O_F0fuJ:j2^C\NJ(:$izN@mbXQo(IL,BO.7<]wT?:5.A<$C Sender: owner-failsafe@oss.sgi.com Precedence: bulk Return-Path: X-Orcpt: rfc822;failsafe-outgoing FYI folks, I wouldn't keep mailing with all these percieved problems if I wasn't very encouraged by the failsafe product. I really hope that I can build our new product around it. The haStatus command looks like one of the tools I am used to having, but it isn't in PATH. I hunted for it, and it's in /var/cluster/cmgr-scripts/. Will it always be there, tucked out of the way, or should I add this to my PATH settings? -Eric. From owner-failsafe@oss.sgi.com Fri Jul 28 15:58:17 2000 Received: by oss.sgi.com id ; Fri, 28 Jul 2000 15:57:57 -0700 Received: from pneumatic-tube.sgi.com ([204.94.214.22]:25671 "EHLO pneumatic-tube.sgi.com") by oss.sgi.com with ESMTP id ; Fri, 28 Jul 2000 15:57:49 -0700 Received: from rapture.engr.sgi.com (rapture.engr.sgi.com [163.154.5.98]) by pneumatic-tube.sgi.com (980327.SGI.8.8.8-aspam/980310.SGI-aspam) via ESMTP id QAA06240 for ; Fri, 28 Jul 2000 16:03:38 -0700 (PDT) mail_from (rcu@rapture.engr.sgi.com) Received: (from rcu@localhost) by rapture.engr.sgi.com (SGI-8.9.3/8.9.3) id PAA02634; Fri, 28 Jul 2000 15:55:16 -0700 (PDT) From: rcu@rapture.engr.sgi.com (R. Underwood) Message-Id: <10007281555.ZM2548@rapture.engr.sgi.com> Date: Fri, 28 Jul 2000 15:55:15 -0700 In-Reply-To: "Eric Z. Ayers" "FS_Prog_Guide" (Jul 28, 2:07pm) References: <14721.52186.955592.208904@gargle.gargle.HOWL> X-Mailer: Z-Mail (3.2.3 08feb96 MediaMail) To: Eric.Ayers@compgen.com Subject: various FailSafe bugs Cc: failsafe@oss.sgi.com Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Sender: owner-failsafe@oss.sgi.com Precedence: bulk Return-Path: X-Orcpt: rfc822;failsafe-outgoing Eric, Just wanted to thank you for your review of the code and finding of several bugs. This kind of active participation will make Linux FailSafe a success and useful to the whole Linux community. Keep the great feedback coming! :) R -- R. Underwood, Engineering Manager / Desktop & Sysadm Software From owner-failsafe@oss.sgi.com Mon Jul 31 14:09:11 2000 Received: by oss.sgi.com id ; Mon, 31 Jul 2000 14:09:01 -0700 Received: from pyongsan.compgen.com ([158.155.0.1]:45840 "EHLO gw1.compgen.com") by oss.sgi.com with ESMTP id ; Mon, 31 Jul 2000 14:08:46 -0700 Received: from uxeric.compgen.com (root@uxeric.compgen.com [158.155.4.32]) by gw1.compgen.com (8.8.7/8.8.7) with ESMTP id RAA07323 for ; Mon, 31 Jul 2000 17:08:45 -0400 Received: (from eric@localhost) by uxeric.compgen.com (8.9.3/8.9.3) id RAA24299; Mon, 31 Jul 2000 17:08:45 -0400 From: "Eric Z. Ayers" MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Message-ID: <14725.60124.785560.180889@gargle.gargle.HOWL> Date: Mon, 31 Jul 2000 17:08:44 -0400 (EDT) To: failsafe@oss.sgi.com Subject: Problem with GUI starting solved X-Mailer: VM 6.72 under 21.1 (patch 8) "Bryce Canyon" XEmacs Lucid Reply-To: Eric.Ayers@compgen.com X-Face: (3Y\Z;G!Ce[Q\WBgGFLgcaL%v[kJ'@9s`Qn1<)EEL5tSW7IDvX[{APQ5]eY}uF}%qbD[-@N !5]S!%o0*DbAB?~o%tca^?3@zU~"fQ@MTiClP>w%`Y8oG&6|:>2F=bhnf2>bPedqw-.T>U-BaI`F>1 QY@?oGJ0.lV?b@0HgvaOt>=0,/@,=(kE"J++vO?K"3ve@,"sunF0HnU|h&|:}%|P6%BohO_*mAHJ#g EHc;_'bXG|kCLMSF`:/O_F0fuJ:j2^C\NJ(:$izN@mbXQo(IL,BO.7<]wT?:5.A<$C Sender: owner-failsafe@oss.sgi.com Precedence: bulk Return-Path: X-Orcpt: rfc822;failsafe-outgoing Thanks folks - I got the new v.8 RPMS and and working with them now. ---------------------------------------------------------------------- I reported last week that I couldn't run the GUI and export the display to my workstation. In addition to the XFree86-libs RPM, I needed to install the main XFree86 RPM, (but not an XServer or fonts). Now that I've done that, I can run the failsafe GUI on the node dru1a and export it to my workstation. ---------------------------------------------------------------------- Great! I see a template for IP_Address (but that's the only one besides template :-( I defined a new resource of type IP_Address. (I called it "158.155.8.170") Then back at the failsafe manager main window I tried -->Resource & Resource Types -->Modify an existing resource definition I can select "IP_Address" but the second drop down list comes up blank. I know that the resource is defined because I get an error if I try to add it again (a message like "... already defined") ---------------------------------------------------------------------- Unfortunately, now I am stuck. This seems related to the problem above. After defining my failover policy, and my resource group, I'm now trying to add a resource to the group and having some trouble using the GUI. >From the Failsafe Manager -->Failover Policies & Resource Groups -->Add/Remove Resources in Resource Groups I select my resource group name, then select the resource type 'IP_Address' but the drop down list isn't filled in.