tile media is lost, with RELP things received but not written is lost)
Really, the only way to not lose something is to have an application-level
acknowledgement that's only sent after the data is safe on redundant non-volatile
media.
David Lang
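A minimal sketch of the acknowledge-after-durable-write idea described above. It assumes a single local file stands in for the "redundant non-volatile media"; a real setup would also need the data replicated before acknowledging. The function name `store_then_ack` and the `"ACK"` return value are hypothetical and are not part of rsyslog or RELP.

```python
import os
import tempfile


def store_then_ack(record: bytes, path: str) -> str:
    """Append a record, force it to stable storage, and only then acknowledge."""
    with open(path, "ab") as f:
        f.write(record + b"\n")
        f.flush()              # push Python's buffer to the OS
        os.fsync(f.fileno())   # ask the OS to push its cache to the device
    # Only at this point is it reasonable to tell the sender to forget the record.
    return "ACK"


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as d:
        log_path = os.path.join(d, "messages.log")
        print(store_then_ack(b"test message", log_path))
```

The point of the ordering is that a sender which has not yet seen the acknowledgement still holds the message and can resend it after a crash.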
_
e hash
mode to just sourceip instead of sourceip-sourceport.
If you have things set up per your config, check that both nodes are online and
have the resource active. At that point, both systems should get some of the
traffic.
David Lang
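To illustrate the difference between the two hash modes mentioned above, here is a rough sketch of how a flow could be mapped to a node. It is an assumption-laden model, not the kernel's code: `zlib.crc32` stands in for the real hash function, `owning_node` is a hypothetical helper, and the sender address is made up.

```python
import zlib


def owning_node(src_ip: str, src_port: int, total_nodes: int, mode: str) -> int:
    """Return the 1-based node number that would answer this flow.

    zlib.crc32 is only a stand-in for the hash the kernel actually uses.
    """
    if mode == "sourceip":
        key = src_ip
    elif mode == "sourceip-sourceport":
        key = f"{src_ip}:{src_port}"
    else:
        raise ValueError(f"unknown hash mode: {mode}")
    return zlib.crc32(key.encode()) % total_nodes + 1


if __name__ == "__main__":
    # Same sender, different source ports (hypothetical values).
    for port in (40001, 40002, 40003):
        print(port,
              owning_node("10.114.90.17", port, 2, "sourceip"),
              owning_node("10.114.90.17", port, 2, "sourceip-sourceport"))
```

With `sourceip` every packet from one sender lands on the same node regardless of port; with `sourceip-sourceport` a single sender's connections may be spread over both nodes.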
he hash details :-) This is 'good enough'
for load balancing syslog messages.
David Lang
vagrant@sendervm:~$ sudo crm configure show
node receivervm
node sendervm
primitive p_vip ocf:heartbeat:IPaddr2 \
params ip="10.114.90.251" cidr_netmask="24" nic="eth1"
On Thu, 19 Sep 2013, Jakob Curdes wrote:
Am 19.09.2013 11:49, schrieb David Lang:
On Thu, 19 Sep 2013, Jakob Curdes wrote:
That's the direction we started, but apparently the centos
pacemaker/corosync packages don't look at the corosync.conf file, they
expect to extract everyth
On Thu, 19 Sep 2013, Jakob Curdes wrote:
Am 19.09.2013 01:21, schrieb David Lang:
token_retransmits_before_loss_const="10" join="60" consensus="4800"
rrp_mode="none" transport="udpu">
ttl="1"
I have
ll if this is a pacemaker or corosync problem (so I can drop one of the
lists from the thread)?
David Lang
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
not, what else do I need to do to disable multicast and just use udpu?
join="60" consensus="4800" rrp_mode="none" transport="udpu">
David Lang
__
Why is corosync replacing heartbeat? it seems like there are a number of things
(like multi-datacenter operation) that have worked for heartbeat for years that
corosync lists as new or upcoming features.
David Lang
On Mon, 1 Jul 2013, Digimer wrote:
On 07/01/2013 07:06 AM, Parkirat wrote
in situation.
If you have shared nothing HA nodes, the odds are that you really aren't
damaging much, the worst case tends to be the dup IP issue, and if one is off
the network, that really doesn't matter.
David Lang
d of problems when running over a virtual machine (ESXi
>> 5.0)?
>
> I'd suspect a scheduler issue. The VM is probably starved, hence
> that long delays. You should check the vmware docs or forums.
I've seen similar logs with real hardware when the system is overheating and
t
On Wed, 22 Aug 2012, Jon Heese wrote:
> On 21 Aug 2012, at 17:29, David Lang wrote:
>> On Tue, 21 Aug 2012, Jon Heese wrote:
>>> Feel free to keep discussing alternatives, but I am not at liberty to
>>> change this system from the current Heartbeat/Pacemaker/CRM\
>
On Tue, 21 Aug 2012, Jon Heese wrote:
>> On 21 Aug 2012, at 22:06, David Lang wrote:
>> Well haproxy does all that out of the box, no tricks or tweakery
>> required...
>>
>> Another thing I like about haproxy is that it's unnervingly fast;
>> start/
On Tue, 21 Aug 2012, Lars Marowsky-Bree wrote:
> On 2012-08-21T13:16:29, David Lang wrote:
>
>> with ldirectord you have an extra network hop, and you have all your
>> traffic going through one system. This is a scalability bottleneck as
>> well as being a sepa
CLUSTERIP is a very strong contender.
It's especially good as a simple migration from active/passive. You just add a
third box to the cluster and change the configuration so that instead of your
VIP moving from one box to another, it's instead shared between the systems.
This is a much s
y
> isn't) - haproxy is incredibly easy in comparison.
I still have most systems using the version 1 style haresources config. It's
great for doing the simple failover scenario easily.
David Lang
ay to do it for load sharing in one network.
ANYCAST is the approach you would use to distribute the work across different
datacenters.
ANYCAST has severe limitations on what you can do with it, but CLUSTERIP is far
more flexible and can work in just about any local active/active problem.
Da
to have the same IP on both systems at the same time, you need to use something
like CLUSTERIP
David Lang
On Mon, 20 Aug 2012, Yount, William D wrote:
> Date: Mon, 20 Aug 2012 14:19:59 -0700
> From: "Yount, William D"
> Reply-To: General Linux-HA mailing list
> To: Ge
On Fri, 8 Jun 2012, Lars Ellenberg wrote:
> On Fri, Jun 08, 2012 at 02:07:17PM -0700, David Lang wrote:
>> I just updated one of my systems to the 3.4 kernel and findif appears to be
>> failing (generic error)
>
>
> And your version of the resource agents is?
> Did you
I just updated one of my systems to the 3.4 kernel and findif appears to be
failing (generic error)
how can I test it from the command line?
David Lang
nodes is down before you restore the comms.
actually, I believe that what happens is that both nodes stop the resource, and
then one of the nodes starts it.
this solves the dup-IP problem because starting the resource re-sends the
appropriate ARP packets to clean up the network.
David Lang
, shut down heartbeat on the
new box and copy the file over (or update the value in the file) on the new box.
David Lang
On Fri, 29 Jul 2011, mike wrote:
> Date: Fri, 29 Jul 2011 16:06:25 -0300
> From: mike
> Reply-To: General Linux-HA mailing list
> To: General Linux-HA mailing lis
tems because I use MailTo to send failover
messages, if sendmail stalls attempting to send the message (including failed
DNS lookups because the network is down), a graceful failover can take a long
time to take place.
David Lang
On Thu, 12 Aug 2010, Dejan Muhamedagic wrote:
> On Wed, Aug 11, 2010 at 05:22:56PM -0700, David Lang wrote:
>> On Thu, 12 Aug 2010, Dejan Muhamedagic wrote:
>>
>>> On Wed, Aug 11, 2010 at 03:59:34PM -0700, David Lang wrote:
>>>> On Thu, 12 Aug 2010, Dejan Muham
On Thu, 12 Aug 2010, Dejan Muhamedagic wrote:
> On Wed, Aug 11, 2010 at 03:59:34PM -0700, David Lang wrote:
>> On Thu, 12 Aug 2010, Dejan Muhamedagic wrote:
>>
>>> On Wed, Aug 11, 2010 at 02:44:36PM -0700, David Lang wrote:
>> I currently manage over a hundred
>
On Thu, 12 Aug 2010, Dejan Muhamedagic wrote:
> On Wed, Aug 11, 2010 at 02:44:36PM -0700, David Lang wrote:
>>> haresources2cib.py is obsolete and probably produces a bad
>>> cib.xml. The recommended way is to create a configuration using
>>> the crm shell.
>>
> some help from the community.
the problem isn't heartbeat v1, and it's not even the v1 style config.
the problem he's having is simply figuring out what heartbeat is doing. the
logs aren't showing anything (including failures), which makes it incredibly
difficult to troub
on.
when it does the failover, does something further then show up in the ha-log
file?
David Lang
at version 3.0.2 is doing that Igor is experiencing, and the inability to
take a simple config and convert it to the new format, it is sounding like it
may be time to fork.
David Lang
have one set of files that showed what is happening, what was different
about that run.
I sympathise with the problems you are having with your old system, but it
becoming slower is almost certainly not related to heartbeat.
David Lang
On Tue, 10 Aug 2010, Igor Chudov wrote:
> On Tue, Aug 10, 2010 at 7:05 PM, David Lang
> wrote:
>> On Tue, 10 Aug 2010, Igor Chudov wrote:
>>
>>> On Tue, Aug 10, 2010 at 6:41 PM, David Lang
>>> wrote:
>>>> On Tue, 10 Aug 2010, Igor Chudov wrote:
>
On Tue, 10 Aug 2010, Igor Chudov wrote:
> On Tue, Aug 10, 2010 at 6:41 PM, David Lang
> wrote:
>> On Tue, 10 Aug 2010, Igor Chudov wrote:
>>
>>> Guys, I have a bit of clarification. In an attempt to avoid the timing
>>> issues, an hour ago I tried adding a con
ion) heartbeat sits extra long when it first boots and
doesn't hear anything, just in case the switch is preventing it from seeing
another system that's up.
David Lang
> pfs-srv3:
>
>
> Aug 10 18:04:41 pfs-srv3 logd: [955]: WARN: Core dumps could be lost
> if multipl
the other box shutdown, not that the other box has instructed it to shutdown.
this is looking to me like something external to heartbeat is killing the
process.
David Lang
On Tue, 10 Aug 2010, Igor Chudov wrote:
> Date: Tue, 10 Aug 2010 17:57:26 -0500
> From: Igor Chudov
> Reply-T
with the older version I am using I get a better error message than I
saw you post, which let me find it more clearly.
David Lang
could you re-post the files (log files, ha.cf and haresources from each box)
David Lang
On Tue, 10 Aug 2010, Igor Chudov wrote:
> Date: Tue, 10 Aug 2010 15:23:44 -0500
> From: Igor Chudov
> Reply-To: General Linux-HA mailing list
> To: General Linux-HA mailing list
> Subject
es on it may get detected as a special case and ignored, but it's
doing you no good and there's some possibility of it confusing the system, so
try removing it.
David Lang
On Tue, 10 Aug 2010, Igor Chudov wrote:
> Date: Tue, 10 Aug 2010 14:55:25 -0500
> From: Igor Chudov
>
On Tue, 10 Aug 2010, Igor Chudov wrote:
> Dmitri, you are right.
>
> In any case the name change did nothing.
did it eliminate the error from the log? does the log say anything else after
that point?
David Lang
> They are still refuse to take over when rebooted simultaneou
On Tue, 10 Aug 2010, Igor Chudov wrote:
On Tue, Aug 10, 2010 at 12:51 PM, David Lang
wrote:
one problem I see in ha-log-2.txt is the lines
Aug 10 10:38:06 pfs-srv4 ResourceManager[1241]: [1253]: ERROR: Cannot locate
resource script
Aug 10 10:38:06 pfs-srv4 req_resource[1236]: [1256]: debug
: dont_ask:
nice_failback: yes
given that this only happens on one machine, either you don't have heartbeat
installed the same way on both machines, or you don't have the same haresources
file
can you please verify that haresources is _identical_ on both machines?
David Lang
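One way to check that two copies of haresources really are identical is to compare digests. This is only a sketch: it assumes the file from the other node has already been copied to a local path, and `file_digest` and `identical` are hypothetical helper names.

```python
import hashlib


def file_digest(path: str) -> str:
    """Return the SHA-256 hex digest of a file's bytes."""
    with open(path, "rb") as f:
        return hashlib.sha256(f.read()).hexdigest()


def identical(path_a: str, path_b: str) -> bool:
    """True when both files have the same SHA-256 digest."""
    return file_digest(path_a) == file_digest(path_b)
```

Comparing digests rather than eyeballing the files catches differences that are easy to miss, such as trailing whitespace or a changed line ending.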
On T
they do recover, but if the network outage is longer than your deadtime, the
inactive box has decided the other one is dead and become active, so when the
network is restored you are in a splitbrain situation, and that is what you are
recovering from.
David Lang
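The timing argument above reduces to one comparison. The sketch below is a simplification that assumes the standby takes over as soon as the outage exceeds deadtime; `after_outage` is a hypothetical name and the numbers in the usage lines are illustrative.

```python
def after_outage(outage_seconds: float, deadtime_seconds: float) -> str:
    """Describe the cluster state at the moment the network comes back."""
    if outage_seconds > deadtime_seconds:
        # The standby gave up waiting and took over the resources
        # while the original active node kept running them.
        return "split-brain: both nodes active"
    return "normal: heartbeats resumed before deadtime expired"


if __name__ == "__main__":
    print(after_outage(3, 10))
    print(after_outage(30, 10))
```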
>>
>> Thanks,
>>
ha-log should give you a detailed picture of what each box is thinking as they
startup. I've always been able to track down the problem with that info for my
systems.
David Lang
On Mon, 9 Aug 2010, Igor Chudov wrote:
> Pushkar, I will be at work tomorrow (took a couple of days off) an
does ha-log show on pfs-srv3?
David Lang
On Mon, 9 Aug 2010, Nick Calvert wrote:
> Date: Mon, 9 Aug 2010 21:10:13 +0100
> From: Nick Calvert
> Reply-To: General Linux-HA mailing list
> To: General Linux-HA mailing list
> Subject: Re: [Linux-HA] Heartbeat does not take over i
On Tue, 4 May 2010, Cameron Smith wrote:
> On Tue, May 4, 2010 at 11:26 AM, Michael Schwartzkopff
> wrote:
>
>> Am Dienstag, 4. Mai 2010 20:19:02 schrieb Cameron Smith:
>>> On Tue, May 4, 2010 at 10:17 AM, David Lang
>>>
>>> wrote:
>>>>
On Tue, 4 May 2010, Cameron Smith wrote:
> On Tue, May 4, 2010 at 10:17 AM, David Lang
> wrote:
>
>> On Tue, 4 May 2010, Cameron Smith wrote:
>>
>>> I am currently using Heartbeat to manage http, mysql and a DRBD device
>>> between two nodes.
>>>
ich would let it receive mail at both the real IP
addresses and the VIPs).
I have several dozen clusters with postfix on them, and it works just fine
without needing to know anything about heartbeat.
David Lang
On Mon, 15 Feb 2010, Dejan Muhamedagic wrote:
> On Thu, Feb 11, 2010 at 10:04:04AM -0800, David Lang wrote:
>> on my old systems that are still running heartbeat 1.x there is a hb_reset
>> command that moves all resources to the node they are configured to start on
>> (assu
.x configs as they do the job so I don't need the
complexity of the 2.x crm stuff)
David Lang
nough detail to know if this
would be a factor or not.
David Lang
On Mon, 11 Jan 2010, Javen Wu wrote:
Can we change the limitation 32 nodes of COROSYNC to 64 ?
Thanks
Javen
2010/1/11 Alain.Moulle
Hi,
I wonder what is for now the *real* limit in the number
of nodes in a HA cluster with Pace
able.
what I am currently using is
keepalive 2
deadtime 10
udpport 1107
bcast eth0 eth1 eth2 eth3 eth4 eth5 eth6 eth7
node primary
node backup
debugfile /var/log/ha-debug
logfile /var/log/ha-log
auto_failback off
apiauth cl_status gid=haclient
this is using udp broadcast instead of unicast
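For reading a configuration like the one above programmatically, a small parser is enough. This is a sketch under assumptions: it treats each line as a directive name followed by whitespace-separated arguments and strips `#` comments; `parse_ha_cf` is a hypothetical helper, not something heartbeat ships.

```python
from collections import defaultdict

HA_CF = """\
keepalive 2
deadtime 10
udpport 1107
bcast eth0 eth1 eth2 eth3 eth4 eth5 eth6 eth7
node primary
node backup
debugfile /var/log/ha-debug
logfile /var/log/ha-log
auto_failback off
apiauth cl_status gid=haclient
"""


def parse_ha_cf(text: str) -> dict:
    """Group directive arguments by directive name, skipping blanks and # comments."""
    directives = defaultdict(list)
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()
        if not line:
            continue
        name, *args = line.split()
        directives[name].append(args)  # repeated directives (e.g. node) accumulate
    return dict(directives)


if __name__ == "__main__":
    cfg = parse_ha_cf(HA_CF)
    keepalive = int(cfg["keepalive"][0][0])
    deadtime = int(cfg["deadtime"][0][0])
    nodes = [args[0] for args in cfg["node"]]
    print(nodes, deadtime // keepalive)
```

With these values, deadtime spans roughly five keepalive intervals, so several consecutive heartbeats have to go missing on all eight broadcast interfaces before the peer is declared dead.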
you
at messages that aren't
appropriate for the cluster, which is what you are seeing.
David Lang
> Thanks,
>
> Dejan
>
>> heartbeat[4514]: 2009/07/07_09:11:35 ERROR: MSG[4] : [src=dcwvm-drbdnode-1]
>> heartbeat[4514]: 2009/07/07_09:11:35 ERROR: MSG[5] :
>> [(1)src
do the rules on the ASA allow the real interfaces of the boxes to get to the
Internet, or only the VIP?
David Lang
On Fri, 8 May 2009, Ryan Thomson wrote:
> Date: Fri, 8 May 2009 10:48:14 -0700
> From: Ryan Thomson
> Reply-To: General Linux-HA mailing list
> To: General Linux-HA
newer (2.x) versions of heartbeat.
David Lang
Is there a way to do this?
Thanks,
ster think about using Linux Virtual Server. It also
integrates nicely into heartbeat.
CLUSTERIP and LVS serve very different purposes.
CLUSTERIP spreads the load across different servers in that heartbeat cluster.
LVS spreads the load across different servers outside that heartbeat cluster.
r
misconfigurations.
David Lang
Brandon
Dejan Muhamedagic wrote:
Hi,
You need to find a more appropriate forum: heartbeat isn't
haproxy.
Thanks,
Dejan
On Fri, Jan 16, 2009 at 07:23:13PM +0530, kiran sarvabhotla wrote:
Hi all,
I tried to configure a 2 node cluster (courtesy
http:
n with stop multiple times.
David Lang
--
Eric Robinson
-Original Message-
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of Imran Chaudhry
Sent: Friday, November 28, 2008 9:36 AM
To: linux-ha@lists.linux-ha.org
Subject: [Linux-HA] Rename an arbitrary file on failover (a
h machines
2. start the resource manually with the command line that heartbeat would use
for example
/etc/ha.d/resource.d/IPaddr 192.168.111.1 start
David Lang
Thanks!
Leon
=
Leon Miller-Out
President, Singlebrook Technology, Inc
p 607-330-1493
f 607-697-0457
516
0
Which seems to suggest that heartbeat at least thinks everything is OK.
on your boxes running heartbeat just do an ifconfig and you should see eth0:0
and eth0:1 with your VIPs on them.
The 2 real web servers are set up like this:
what do your load balancers look like?
David Lang
/etc
is to have the clients point at the individual boxes. the
NTP protocol is designed to deal with multiple sources so it doesn't need to
work through the VIP.
David Lang
Thanks! Phil
LNXPNTP1: primary system, eth0 = 10.255.213.232
CHARNTP1: backup system, eth0 = 10.255.213.233
Virtu
y need to be defined in the haresources file where it can be
legitimate for them to be different, but there are not many cases like this.
David Lang
upgrades, kernel upgrades,
reboots, etc) that no single box solution can match, no matter how much more you
spend on the hardware.
David Lang
On Fri, 28 Mar 2008, Luis Motta Campos wrote:
David Lang wrote:
haresources is the V1-style configuration
cibadmin is the V2-style configuration
V2 is significantly more powerful than V1, but also more complicated. if
you don't need any of the V2 features you can use V1.
I'm not
(Kernel 2.6.18-53.1.14.el5),
and (I believe) V2-style configuration.
haresources is the V1-style configuration
cibadmin is the V2-style configuration
V2 is significantly more powerful than V1, but also more complicated. if you
don't need any of the V2 features you can
On Wed, 13 Feb 2008, Andrew Beekhof wrote:
On Feb 12, 2008, at 7:23 PM, David Lang wrote:
so does this mean that those of us who use systems that don't run X and
don't have windows boxes nearby shouldn't use heartbeat V2?
that's what it sounds like you guys are saying.
nts to be
running a version that gets patches, this is a useful option.
David Lang
On Wed, 13 Feb 2008, Andrew Beekhof wrote:
Date: Wed, 13 Feb 2008 08:23:08 +0100
From: Andrew Beekhof <[EMAIL PROTECTED]>
Reply-To: General Linux-HA mailing list
To: General Linux-HA mailing list
Subject: Re
so does this mean that those of us who use systems that don't run X and don't
have windows boxes nearby shouldn't use heartbeat V2?
that's what it sounds like you guys are saying.
David Lang
On Tue, 12 Feb 2008, Dan Gahlinger wrote:
Date: Tue, 12 Feb 2008 13:14:26 -0500
apps that don't handle the loss of connection cleanly and the app has
required manual restarting, but never any problems with the AIX systems
themselves.
David Lang
t anyone who really cares (like the person
asking the question) should go back to the early archives to find the details.
David Lang
en tested that if we did both everything that people were
running into was handled, and it's worked ever since.
David Lang
everything.
David Lang
going to be used for multiple purposes then splitting it off is a good
idea (and I would suggest that it may be appropriate to split the GUI from the
core CRM as well).
I will only start to get worried if I start to see interface changes taking
place that aren
st of interrupting
connections low enough that it hasn't been a high priority)
David Lang
Perhaps a good 'distribution' is pfsense, which packages it all
(FreeBSD+PF+CARP+more) including a web interface. There is plenty of
documentation on the web available for such a setup...
- Joris
-
hat do you mean when you say you need to manage ssh sessions? if you mean they
go through the firewall, then the iptables stuff should fix this. if you mean
that people connect to the firewall itself and you want the ssh session to
failover to the backup, that's not possibl
't bother with the conntrack replication yet (once I get a
chance to experiment with it I may find it's trivial to set up and start using
it everywhere, but it hasn't been important enough for me to do so yet)
David Lang
On Mon, 16 Jul 2007, Lars Marowsky-Bree wrote:
On 2007-07-14T20:52:43, Alan Robertson <[EMAIL PROTECTED]> wrote:
a random collection of mostly old cast-off computers. They range
between 300mhz and 2.4 ghz, and have disks ranging from
5200 to 7200 RPM with significantly dif
On Mon, 16 Jul 2007, Andrew Beekhof wrote:
On 7/16/07, David Lang <[EMAIL PROTECTED]> wrote:
On Mon, 16 Jul 2007, Andrew Beekhof wrote:
> Lets call a spade a spade shall we...
>
> This is a thinly veiled put-down of the people who have been doing
Alan's
> job for t
On Mon, 16 Jul 2007, Lars Marowsky-Bree wrote:
On 2007-07-16T12:20:42, David Lang <[EMAIL PROTECTED]> wrote:
releasing substandard packages. Particularly given that I am the author of
the majority of the HAv2 code and therefore have arguably the most interest
in its quality.
the inf
kernel has gone with each distro patching the base
version and distributing slightly different things, but all named the same
version numbers is not the way I would want to see this project go. please try
to avoid this.
David Lang
e-mails.
this isn't a fatal problem, but it is an annoyance (I've had the shutting down
on the inactive box in a pair generate MailTo messages from both boxes, causing
management to freak out)
are there enough 'oneshot' type things that it is worth adding
On Mon, 16 Jul 2007, Lars Marowsky-Bree wrote:
On 2007-07-15T14:58:47, David Lang <[EMAIL PROTECTED]> wrote:
it would be great if something like the auto-build system could feed into a
cluster like this for automated testing (I don't know if there would be
enough testing with nig
0+ Athlons (both CPU's in one case will probably be the same)
3. a limited number of 2u dual Athlon 2000+ systems with SCSI RAID controllers
(I'm short on drives, but can come up with a handful of 10k, and possibly a
couple 15K drives)
any interest from anyone for this?
David Lang
On Fri, 6 Jul 2007, Andrew Beekhof wrote:
Date: Fri, 6 Jul 2007 21:40:13 +0200
From: Andrew Beekhof <[EMAIL PROTECTED]>
Reply-To: General Linux-HA mailing list
To: General Linux-HA mailing list
Subject: Re: [Linux-HA] Late heartbeats with heartbeat 2.0.8
On 7/6/07, David Lang &
c in the
normal release locations?
David Lang
Since I am running
FreeBSD I can obviously not use binary packages for linux distributions.
On 7/6/07, Lars Marowsky-Bree <[EMAIL PROTECTED]> wrote:
>
> On 2007-07-05T12:00:04, Matt Wilder <[EMAIL PROTECTED]> wrote:
>
>
at he hasn't spoken up
since then.
David Lang
83 matches