Re: [Linux-HA] monitor multiple nodes using single node

2010-02-09 Thread Andrew Beekhof
On Tue, Feb 9, 2010 at 5:21 AM, Qwerty-1  wrote:
>
> Hi,
>
> Thanks. Can all the nodes send their status to a single node? I am working
> on an HA N+1 architecture where a LoadBalancer watches the status of all N
> running nodes. If any node fails, the LoadBalancer should know about it and
> take appropriate steps.
> Please advise.

Please read some of the documentation.  In particular:

   http://www.clusterlabs.org/doc/en-US/Pacemaker/1.0/html/Pacemaker_Explained/s-intro-redundancy.html

and

   http://www.clusterlabs.org/doc/en-US/Pacemaker/1.0/html/Pacemaker_Explained/s-intro-architecture.html
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


Re: [Linux-HA] [Pacemaker] Announce: Hawk (HA Web Konsole) 0.2.0

2010-02-09 Thread Darren.Mansell
Hi Tim. Thanks for this project, it seems to be exactly what we're
looking for.

I've installed it (it required spawn-fcgi too on SLES11 64) but I just
get a blank page. I've looked at the page source and the divs have
style="display: none". Not sure why that's happening, can you think of
anything?

Regards
Darren Mansell

-----Original Message-----
From: Tim Serong [mailto:tser...@novell.com] 
Sent: 09 February 2010 05:44
To: linux-ha@lists.linux-ha.org; pacema...@oss.clusterlabs.org
Subject: [Pacemaker] Announce: Hawk (HA Web Konsole) 0.2.0

Greetings All,

This is to announce version 0.2.0 of Hawk, a web-based GUI for
Pacemaker HA clusters.  The major item of note for this version
is that we now have reasonable feature parity with crm_mon, and
there are SLES/openSUSE packages available from the openSUSE
Build Service:

http://software.opensuse.org/search?baseproject=ALL&p=1&q=hawk

There is also a wiki page up at http://clusterlabs.org/wiki/Hawk
that gives a brief overview of the project, and tells you how
to get the source from Mercurial, if you don't want to (or can't)
use the above packages.

As before, please direct comments, feedback, questions etc.
to tser...@novell.com and/or the Pacemaker mailing list.

Thanks for listening,

Tim


-- 
Tim Serong 
Senior Clustering Engineer, Novell Inc.




___
Pacemaker mailing list
pacema...@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker


Re: [Linux-HA] [Pacemaker] Announce: Hawk (HA Web Konsole) 0.2.0

2010-02-09 Thread Tim Serong
On 2/9/2010 at 09:15 PM,  wrote: 
> Hi Tim. Thanks for this project, it seems to be exactly what we're 
> looking for. 

Well, I certainly hope so :)

> I've installed it (it required spawn-fcgi too on SLES11 64) but I just 
> get a blank page. I've looked at the page source and the divs have 
> style="display: none". Not sure why that's happening, can you think of 
> anything? 

style="display: none" is used in two cases; one is for unexpanded
children of a collapsible panel (but the header will still be visible).
The other is if it thinks it can't see any useful information from
cibadmin, in which case the expected behaviour would be an error message
of some description.

Can you please tell me:

 - What version of Pacemaker you're running
 - If you run "cibadmin -Ql | grep cluster-infrastructure", do you
   see any output?  If so, what?

Thanks,

Tim



Re: [Linux-HA] monitor multiple nodes using single node

2010-02-09 Thread Qwerty-1

Hi,

Thanks, I have already gone through this doc. It explains that an N+1
architecture can be defined with Pacemaker, but it does not cover how to
implement it. Please share any configuration information or other
documentation that describes how to implement an N+1 architecture where one
node takes care of N running nodes.

Thanks in Advance





Re: [Linux-HA] monitor multiple nodes using single node

2010-02-09 Thread Andrew Beekhof
On Tue, Feb 9, 2010 at 12:23 PM, Qwerty-1  wrote:
>
> Hi,
>
> Thanks, I have already gone through this doc. It explains that an N+1
> architecture can be defined with Pacemaker, but it does not cover how to
> implement it. Please share any configuration information or other
> documentation that describes how to implement an N+1 architecture where one
> node takes care of N running nodes.

That's how the cluster works.
Every node monitors its own resources, and the nodes elect one node to
co-ordinate the actions of the cluster.
This node gets told when any resource fails anywhere.

There is nothing to implement; just tell the cluster where the resources
can and cannot run. In your case, that is their "home" machine and the
shared backup.
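
As a rough illustration of the placement rules described above, here is a
minimal shell sketch that prints crm shell location constraints for an N+1
layout. The resource names (web1, web2), the node names (node1, node2,
backup), the constraint ids and the scores 200 and 100 are all made-up
placeholders, not taken from any real configuration. Each resource prefers
its home node, may fall back to the shared backup, and is kept off the other
active nodes.

```shell
#!/bin/sh
# Print crm shell "location" constraints for an N+1 layout.
# All resource and node names used here are hypothetical placeholders.
# Usage: n_plus_one_constraints BACKUP_NODE RESOURCE:HOME_NODE...
n_plus_one_constraints() {
    backup=$1
    shift

    # Collect the home nodes of all resources.
    all_homes=""
    for pair in "$@"; do
        all_homes="$all_homes ${pair#*:}"
    done

    for pair in "$@"; do
        rsc=${pair%%:*}
        home=${pair#*:}
        # Prefer the home node; fall back to the shared backup.
        echo "location ${rsc}-home ${rsc} 200: ${home}"
        echo "location ${rsc}-backup ${rsc} 100: ${backup}"
        # Never run on another resource's home node.
        for other in $all_homes; do
            if [ "$other" != "$home" ]; then
                echo "location ${rsc}-not-${other} ${rsc} -inf: ${other}"
            fi
        done
    done
}

# Example: two active nodes plus one shared backup.
n_plus_one_constraints backup web1:node1 web2:node2
```

The printed lines are meant to be reviewed and adapted before they go into a
cluster configuration; with only two or three resources it is just as easy
to write the constraints by hand.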


Re: [Linux-HA] MySQL RA with replication awareness

2010-02-09 Thread Marian Marinov
Hello again,
I haven't finished the full port of my script, but most of the work is already done:

http://hydra.azilian.net/gitweb/?p=linux-ha/.git;a=blob_plain;f=mysql-replica;hb=HEAD

I still have some tests to finish and I have one problem with which I would 
like some help.

I implemented the replication awareness within the notify section of the RA 
where I have all the information about the state of the cluster.

However, this information is missing in the other states of the RA, which 
makes the development a little bit slow :)

So my problem is: how do I find the master_host IP? I need the IP because I 
want to skip name resolution for replication. Currently I do that by adding 
the hostname to /etc/hosts and looking it up every time I do a demote.

Is there a better way?

I'll continue development and I hope to finish it by the end of the week.

Best regards,
Marian


On Wednesday 03 February 2010 00:51:18 Marian Marinov wrote:
> This Friday I'll polish the code and make it more general. Then I'll post
>  the enhancement bugzilla.
> 
> Regards,
> Marian
> 
> On Tuesday 02 February 2010 11:49:49 Florian Haas wrote:
> > Hello,
> >
> > yes please! Could you file an enhancement bugzilla, attach your updated
> > RA version (or a patch), and let us know the bug ID?
> >
> > Thanks!
> > Cheers,
> > Florian
> >
> > On 2010-02-01 15:34, Marian Marinov wrote:
> > > I'm currently running my cluster with my modified version of the mysql
> > > ra.
> > >
> > > Do you want to see my changes?
> > >
> > > Regards,
> > > Marian
> > >
> > > On Monday 01 February 2010 14:52:04 Florian Haas wrote:
> > >> I am currently working on extending the MySQL RA in this way, and will
> > >> merge this some time after the impending resource agents release is
> > >> out.
> > >>
> > >> Cheers,
> > >> Florian
> > >>
> > >> On 2010-02-01 11:32, Marian Marinov wrote:
> > >>> Hello,
> > >>> I have a cluster of 2 nodes, on which I have MySQL with Master/Slave
> > >>> replication.
> > >>>
> > >>> What I want to do is, use a mysql ra which will know which server is
> > >>> master and which is slave so I can have a Master/Slave primitive in
> > >>> my crm configuration which will monitor those.
> > >>>
> > >>> So far I haven't found such an RA. Is there one?
> > >>>
> > >>> My approach to this problem is to use the current mysql ra and to
> > >>> modify it so it becomes replication aware. What I did was to add
> > >>> notify functionality and a few more OCF variables.

-- 
Best regards,
Marian Marinov



[Linux-HA] resource-agents??

2010-02-09 Thread Ilo Lorusso
Hi,

I've got a resource agent that works 100% on one machine.
If I run it through ocf-tester, I get the following output:

/usr/sbin/ocf-tester -n post1 /usr/lib/ocf/resource.d/heartbeat/postfix; echo $?
 Beginning tests for /usr/lib/ocf/resource.d/heartbeat/postfix...
 * Your agent does not support the notify action (optional)
 * Your agent does not support the demote action (optional)
 * Your agent does not support the promote action (optional)
 * Your agent does not support master/slave (optional)
 0


When I copy it to another machine and try to test it with ocf-tester, I get
the following error:
/usr/sbin/ocf-tester -n post1 /usr/lib/ocf/resource.d/heartbeat/postfix; echo $?
* rc=1: Validation failed.  Did you supply enough options with -o ?
 Aborting tests


Why would this be the case? Both machines are set up identically,
running the same versions of heartbeat and pacemaker:

heartbeat-libs-3.0.1-1.el5
heartbeat-3.0.1-1.el5

pacemaker-libs-1.0.7-2.el5
pacemaker-1.0.7-2.el5



How could I debug this further, or get more detail out of ocf-tester? I don't
see much in the logs.