[DRBD-user] data corruption scenario in ha cluster

2011-02-23 Thread Mia Lueng
a to drbd disk and then hangup. Then how can B know that the data will be corrupt and do not start as primary DRBD. Best Regards Mia Lueng ___ drbd-user mailing list drbd-user@lists.linbit.com http://lists.linbit.com/mailman/listinfo/drbd-user

[DRBD-user] use drbd on wan

2011-09-20 Thread Mia Lueng
HI All: In our case, we use drbd to sync oracle db data in two node across the wan. Since the net bandwidth is slow(<2M), the i/o performance is pretty poor in primary node because i/o operation will be pending to wait the data transmission. So we have considered the following solution: On secon

Re: [DRBD-user] use drbd on wan

2011-09-20 Thread Mia Lueng
aster don't block while the data is shipped to the secondary? > There's obviously a trade-off with coherency on the secondary but you > don't have the latency. > > Sent from my iPhone > > On 20 Sep 2011, at 18:18, "Mia Lueng" wrote: > >> HI Al

Re: [DRBD-user] use drbd on wan

2011-09-21 Thread Mia Lueng
in rhel6,(we uset sles11 sp1). Is there another way to merge the snapshot? 2011/9/21 Lionel Sausin : > [Sorry, seems like my first post didn't make it to the list: resending] > > Dear Mia Lueng, > > We've been in this very situation for 6 month so I think I can anwser

Re: [DRBD-user] A problem about oracle on drbd

2012-07-03 Thread Mia Lueng
I should give some additional information. We build a two-node cluster with drbd. The following is the configuration. resource drbd1 { protocol A; on host41 { device /dev/drbd0 minor 0; disk /dev/vgdrbd/oracle; address

[DRBD-user] drbd hangup on rhel6

2012-07-06 Thread Mia Lueng
I built a two node cluster using drbd. After several drbdadm primary/secondary operations, the drbd status turn to Unconnected and drbdadm disconnect operation hangup and the following message occurs in dmesg. Oct 20 11:32:26 cgsl42 kernel: INFO: task cqueue:3267 blocked for more than 120 seco

[DRBD-user] how can i reduce the time doing initial full sync

2012-07-31 Thread Mia Lueng
Hi All: I have a two storage with 10T capacibility. When I doing initial full sync ,it will take a whole night. How can I reduce the time doing initial full sync. Thanks ___ drbd-user mailing list drbd-user@lists.linbit.com http://lists.linbit.com/mai

[DRBD-user] drbd read/write performance

2012-08-15 Thread Mia Lueng
Hi All: I setup a drbd device on a storage . Its write performance can reach to 300MB/s tested by dd. But when I setup a drbd device on it , use dd to test its write performance (peer node does not connect). The test result is only 40MB/s. drbd version is 8.3.13. And the read performance is also

[DRBD-user] drbd 8.3.13 hang up in rhel6.2

2012-08-22 Thread Mia Lueng
Hi All: I set a clsuter based on drbd and do the following test: On Node A, I set drbd as Primary, and run a mysql database on it and some load. and run the following script while [ 0 ] ; do drbdadm connect drbd0 >/dev/null 2>&1 sleep 10; done On node B, I do the follwing test to simulate

[DRBD-user] oracle on drbd failed

2012-08-25 Thread Mia Lueng
Hi All: I built a cluster to protect oracle database. The oracle db file stored on the drbd(8.3.13) device using protocol A. But sometime oracle can not be failover when the primary node is down. Here is the testing step 1. node A, B, A is primary node, B is secondary node. oracle run on node

[DRBD-user] why my mail be treated as a reply

2012-08-25 Thread Mia Lueng
??? ___ drbd-user mailing list drbd-user@lists.linbit.com http://lists.linbit.com/mailman/listinfo/drbd-user

Re: [DRBD-user] oracle on drbd failed

2012-08-26 Thread Mia Lueng
ocol A. Oracle > expects that data is commited to the disk, which is in the RAM of your > node A. I really have no idea in which situation protocol A can be > useful. > > Cheers, Felix > > Am 2012-08-26 05:10, schrieb Mia Lueng: >> >> Hi All: >> >> I bui

Re: [DRBD-user] oracle on drbd failed

2012-08-28 Thread Mia Lueng
another error occurs in oracle : ORA-00214: control file '/oradata/orcl/control01.ctl' version 79111 inconsistent with file '/oradata/flash_recovery_area/orcl/control02.ctl' version 79104 Is it sure that protocol A can not keep the oracle's data integration ? 2012/8

Re: [DRBD-user] oracle on drbd failed

2012-09-02 Thread Mia Lueng
I use drbd_trace to trace drbd write operation when running oracle, it show info like this; block drbd0: /root/rpmbuild/BUILD/drbd-8.3.13/drbd/drbd_main.c:2152: drbd0_worker [5323] data >>> Barrier (barrier 435610040) block drbd0: /root/rpmbuild/BUILD/drbd-8.3.13/drbd/drbd_receiver.c:5005: drbd0_

Re: [DRBD-user] oracle on drbd failed

2012-09-03 Thread Mia Lueng
ed proxy yet. I just test this on local lan environment. If the test pass, we will install it on WAN enviroment. 2012/9/3 Lars Ellenberg : > On Sun, Aug 26, 2012 at 11:10:44AM +0800, Mia Lueng wrote: >> Hi All: >> >> I built a cluster to protect oracle database. The or

[DRBD-user] can drbd reuse the transaction buffer after broken netlink recover?

2013-03-17 Thread Mia Lueng
Hi : Since drbd primary/secondary will do quick sync after the broken netlink recover, the data on secondary is inconsistent during the quick sync. Can I save the transaction buffer to disk and replay the transaction after the netlink recover? This can provide the secondary node is always be UpToDa

[DRBD-user] umount costs lots of time in drbd 8.4.3

2013-05-08 Thread Mia Lueng
in drbd 8.4.3,I do the following test: [root@kvm3 drbd.d]# drbdadm dump drbd0 # resource drbd0 on kvm3: not ignored, not stacked # defined at /etc/drbd.d/drbd0.res:1 resource drbd0 { on kvm3 { device /dev/drbd0 minor 0; disk /dev/vg_kvm3/drbd0; met

Re: [DRBD-user] umount costs lots of time in drbd 8.4.3

2013-05-08 Thread Mia Lueng
# sysctl -a|grep dirty vm.dirty_background_ratio = 10 vm.dirty_background_bytes = 0 vm.dirty_ratio = 20 vm.dirty_bytes = 0 vm.dirty_writeback_centisecs = 500 vm.dirty_expire_centisecs = 3000 bandwidth is 100M bps 2013/5/9 Lars Ellenberg > On Thu, May 09, 2013 at 12:16:56AM +0800, Mia Lu

[DRBD-user] iozone test block on drbd

2013-05-21 Thread Mia Lueng
drbd configure : [root@kvm3 ~]# drbdadm dump drbd0 # resource drbd0 on kvm3: not ignored, not stacked # defined at /etc/drbd.d/drbd0.res:1 resource drbd0 { on kvm3 { device /dev/drbd0 minor 0; disk /dev/vg_kvm3/drbd0; meta-diskinternal;

[DRBD-user] iozone test block on drbd

2013-05-21 Thread Mia Lueng
drbd configure : [root@kvm3 ~]# drbdadm dump drbd0 # resource drbd0 on kvm3: not ignored, not stacked # defined at /etc/drbd.d/drbd0.res:1 resource drbd0 { on kvm3 { device /dev/drbd0 minor 0; disk /dev/vg_kvm3/drbd0; meta-diskinternal;

Re: [DRBD-user] umount costs lots of time in drbd 8.4.3

2013-05-21 Thread Mia Lueng
I have 16G RAM in this server. Using a low dirty configuration may lead to a pool I/O performance? 2013/5/14 Lars Ellenberg > On Thu, May 09, 2013 at 10:33:16AM +0800, Mia Lueng wrote: > > # sysctl -a|grep dirty > > vm.dirty_background_ratio = 10 > > vm.dirty

Re: [DRBD-user] iozone test block on drbd

2013-05-22 Thread Mia Lueng
ke_function+0x0/0x40 [] kjournald2+0xb8/0x220 [jbd2] [] ? autoremove_wake_function+0x0/0x40 [] ? kjournald2+0x0/0x220 [jbd2] [] kthread+0x96/0xa0 [] child_rip+0xa/0x20 [] ? kthread+0x0/0xa0 [] ? child_rip+0x0/0x20 2013/5/21 Mia Lueng > drbd configure : > > [root@kvm3 ~]# drbdadm du

Re: [DRBD-user] umount costs lots of time in drbd 8.4.3

2013-06-23 Thread Mia Lueng
I think disconnecting drbd before umount and connecting it after umount is a good idea to avoid this. 2013/6/6 Lars Ellenberg > On Tue, May 21, 2013 at 11:52:58PM +0800, Mia Lueng wrote: > > I have 16G RAM in this server. Using a low dirty configuration may lead > >

[DRBD-user] problem using lv as drbd backend device

2013-10-06 Thread Mia Lueng
I have built a drbd cluster. The storage setting is like the following: backend LV--->drbd0--->pv-->vg-->userlv That means I create a drbd device on a LV, and create a volume group on drbd device again. In /etc/lvm/lvm.conf, I add a filter so that pvscan do not probe for the backend LV. This wo

Re: [DRBD-user] problem using lv as drbd backend device

2013-10-07 Thread Mia Lueng
I find a solution : 1. drbdadm sh-ll-dev drbd0 find drbd0's backend lv 2. map lv to dm-x , ls /sys/block/dm-x/holdersto find the frontend lv 3. dmsetup remove -f $frontlv 2013/10/7 Digimer > On 06/10/13 23:26, Mia Lueng wrote: > > I have built a drbd cluster. The stor

[DRBD-user] data mismatch when primary/secondary are both up2date

2013-11-29 Thread Mia Lueng
In my cluster(node1/node2) with drbd, the state in /proc/drbd is primary/secondary up2date/up2date, but when I change primary to node2 , the file that existed on node1 can not be found on node2. Then I do "drbdadm verify drbd0" to verify and resync the data, node2's data returned to be OK. I am w

Re: [DRBD-user] drbdadm verify stalled

2013-12-04 Thread Mia Lueng
http://www.novell.com/support/kb/doc.php?id=7009306 2013/12/4 Matthieu Lejeune > Hi, > > I was on the DRBD 8.3.13-2 on Debian Wheezy > > I have upgraded my debian on Jessie. > I'm using DRBD 8.4.3 now > I have push my config files to the new version. > > And the verify is ok > > Every 1.0s:

[DRBD-user] drbd sync rate slow down during resync

2013-12-31 Thread Mia Lueng
Hi all: Happy new year! I use drbd 8.3.15 now and I find a problem. The drbd sync rate slow down during resync . The initial sync rate will be up to 100MB/s(in gigabyte lan) and become slower and slower during resync . At the end , it will reach to 20MB/s. But after I disconnect it and connect a

[DRBD-user] writer order on secondary site

2015-12-22 Thread Mia Lueng
Hi: I'm just wondering how secondary handle the write ordering when a same block is written twice on primary. Application submits these updates: X, Y, Z. They may or may not be to the same block. If they are to the same block, then the application, file system or other layer already makes sure (or

Re: [DRBD-user] writer order on secondary site

2015-12-23 Thread Mia Lueng
completely. 2015-12-23 12:52 GMT+08:00 Digimer : > On 22/12/15 11:11 AM, Mia Lueng wrote: >> Hi: >> I'm just wondering how secondary handle the write ordering when a same >> block is written twice on primary. >> >> Application submits these updates: X, Y, Z. >&

Re: [DRBD-user] writer order on secondary site

2015-12-26 Thread Mia Lueng
How does epoch works? I have examined the source code. when primary recieves a bio with bi_size=0 or bio num > MAX_EPOCH_SIZE or idle time exceeds limit , there will be a P_BARRIER packet triggered? 2015-12-24 23:49 GMT+08:00 Lars Ellenberg : > On Wed, Dec 23, 2015 at 12:11:51AM +080

[DRBD-user] oracle stop timeout while drbd resync

2016-08-30 Thread Mia Lueng
Hi: I have a cluster with four drbd devices. I found oracle stopped timeout while drbd is in resync state. oracle is blocked like following: oracle6869 6844 0.0 0.0 71424 12616 ?S16:28 00:00:00 pipe_wait /oracle/app/oracle/dbhome_1/bin/sqlplus @/tmp/ora_ommbb_shutdown.sql oracle

Re: [DRBD-user] oracle stop timeout while drbd resync

2016-08-31 Thread Mia Lueng
Yes, Oracle & drbd is running under pacemaker just in primary/secondary mode. I stopped the oracle resource during DRBD is resyncing and the oracle hangup 2016-08-31 14:38 GMT+08:00 Igor Cicimov : > > > On Wed, Aug 31, 2016 at 3:49 PM, Mia Lueng wrote: >> >> Hi: >

Re: [DRBD-user] oracle stop timeout while drbd resync

2016-08-31 Thread Mia Lueng
e70d0 R11: 0206 R12: 0001553e0f80 R13: 7f38ac571c60 R14: 7fffc9ee77c0 R15: 7fffc9ee77e0 ORIG_RAX: 00d1 CS: 0033 SS: 002b 2016-09-01 7:48 GMT+08:00 Igor Cicimov : > > > On Thu, Sep 1, 2016 at 9:02 AM, Igor Cicimov > wrote: >> >>

[DRBD-user] how about read a block that not return to upper application in protocol C?

2016-09-04 Thread Mia Lueng
Hi All: In protocol C, a bio will return to upper application(execute bi_endio()) when local bio is completed and recieve the data ack packet from peer. But if a write request to block N was submitted and written to local disk, but not received the data ack from peer, a read request to the same