On 02.12.2011 00:32, Andreas Kurz wrote:
> Hello Lutz,
>
> On 12/01/2011 01:26 PM, Lutz Reinhardt wrote:
>> hi
>>
>> use a simple config:
>>
>> node node1
>> node node2 \
>>         attributes standby="off"
>> primitive res_drbd_cluster_ocfs ocf:linbit:drbd \
>>         params drbd_resource="cluster-ocfs" \
>>         op monitor interval="20" role="Master" timeout="20" \
>>         op monitor interval="30" role="Slave" timeout="20"
>> ms ms_drbd_cluster_ocfs res_drbd_cluster_ocfs \
>>         meta resource-stickiness="100" master-max="2" notify="true"
>> interleave="true" target-role="Started"
>> property $id="cib-bootstrap-options" \
>>         dc-version="1.0.9-74392a28b7f31d7ddc86689598bd23114f58978b" \
>>         cluster-infrastructure="openais" \
>>         expected-quorum-votes="2" \
>>         no-quorum-policy="ignore" \
>>         stonith-enabled="false" \
>>         default-resource-stickiness="1000" \
>>         last-lrm-refresh="1322660182"
>>
>> now it works fine but one thing happens: when i stop
>> "ms_drbd_cluster_ocfs" and start it again, i see every time a fresh
>> resync of one drbd node. it's not happened when i bring up the drbd
>> device first before i start it with "start ms_drbd_cluster_ocfs" - then
>> it works fine!?
>>
>> whats wrong?
>
> First of all you must use fencing for a dual-primary setup ... as you
> have no fencing configured in Pacemaker I conclude (please correct me if
> I'm wrong) that you also skip this part in DRBD config.
>
> Then please share the result of "drbdadm dump all" and "cat /proc/drbd"
> when your cluster is up. You said you see a "fresh" resync ... you mean
> a full resync? Please also provide kernel logs for this.
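Whether a resync is full or only bitmap-based can be read from the kernel log itself: DRBD prints the bitmap size when the backing device is attached and the number of out-of-sync bits when a resync begins. The following is only a minimal sketch in Python, assuming the message formats of DRBD 8.3.7 as they appear in the node1 kernel log of this thread; the function name `classify_resyncs` and the regex names are hypothetical helpers, not part of DRBD or Pacemaker.

```python
import re

# "Began resync as SyncTarget (will sync 1048508 KB [262127 bits set])."
RESYNC_RE = re.compile(
    r"block (?P<dev>drbd\d+): Began resync as (?P<role>SyncTarget|SyncSource) "
    r"\(will sync (?P<kb>\d+) KB \[(?P<bits>\d+) bits set\]\)"
)
# "resync bitmap: bits=262127 words=4096" (printed while attaching the disk)
BITMAP_RE = re.compile(r"block (?P<dev>drbd\d+): resync bitmap: bits=(?P<bits>\d+)")


def classify_resyncs(lines):
    """Return (device, role, kb, is_full) for every resync start in the log."""
    total_bits = {}  # device -> total bitmap size in bits
    found = []
    for line in lines:
        m = BITMAP_RE.search(line)
        if m:
            total_bits[m["dev"]] = int(m["bits"])
            continue
        m = RESYNC_RE.search(line)
        if m:
            bits = int(m["bits"])
            # Treated as a full sync when every bit of the bitmap is out of sync.
            is_full = bits > 0 and bits == total_bits.get(m["dev"])
            found.append((m["dev"], m["role"], int(m["kb"]), is_full))
    return found


# Three lines taken from the node1 kernel log in this thread.
sample = [
    "Dec 2 08:47:14 node1 kernel: [143365.625078] block drbd0: resync bitmap: bits=262127 words=4096",
    "Dec 2 08:47:46 node1 kernel: [143397.603235] block drbd0: Began resync as SyncTarget (will sync 0 KB [0 bits set]).",
    "Dec 2 08:47:48 node1 kernel: [143399.558466] block drbd0: Began resync as SyncTarget (will sync 1048508 KB [262127 bits set]).",
]

for dev, role, kb, is_full in classify_resyncs(sample):
    print(dev, role, kb, "FULL" if is_full else "partial/empty")
```

Feeding it the output of `grep drbd0 /var/log/kern.log` (the log path is an assumption and depends on the distribution) should list one entry per resync start.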
Yes, I mean a full resync, and I do use fencing (see below):

# drbdadm dump all
global {
    minor-count 32;
}

common {
    protocol C;
    net {
        allow-two-primaries;
        after-sb-0pri discard-zero-changes;
        after-sb-1pri discard-secondary;
        after-sb-2pri disconnect;
    }
    disk {
        on-io-error pass_on;
        fencing resource-only;
    }
    syncer {
        rate 12800;
    }
    startup {
        become-primary-on both;
    }
    handlers {
        pri-on-incon-degr "/usr/lib/drbd/notify-pri-on-incon-degr.sh; /usr/lib/drbd/notify-emergency-reboot.sh; echo b > /proc/sysrq-trigger ; reboot -f";
        pri-lost-after-sb "/usr/lib/drbd/notify-pri-lost-after-sb.sh; /usr/lib/drbd/notify-emergency-reboot.sh; echo b > /proc/sysrq-trigger ; reboot -f";
        local-io-error "/usr/lib/drbd/notify-io-error.sh; /usr/lib/drbd/notify-emergency-shutdown.sh; echo o > /proc/sysrq-trigger ; halt -f";
        fence-peer "/usr/lib/drbd/crm-fence-peer.sh --timeout 30 --dc-timeout 40";
        after-resync-target /usr/lib/drbd/crm-unfence-peer.sh;
    }
}

resource cluster-ocfs {
    protocol C;
    floating ipv4 10.60.60.22:7802 {
        device minor 0;
        disk /dev/xencluster/cluster-ocfs;
        meta-disk internal;
    }
    floating ipv4 10.60.60.23:7802 {
        device minor 0;
        disk /dev/xencluster/cluster-ocfs;
        meta-disk internal;
    }
}

resource test-drbd {
    protocol C;
    floating ipv4 10.60.60.22:7804 {
        device minor 31;
        disk /dev/xencluster/test-drbd;
        meta-disk internal;
    }
    floating ipv4 10.60.60.23:7804 {
        device minor 31;
        disk /dev/xencluster/test-drbd;
        meta-disk internal;
    }
}

# cat /proc/drbd
version: 8.3.7 (api:88/proto:86-91)
srcversion: EE47D8BF18AC166BE219757
 0: cs:Connected ro:Primary/Primary ds:UpToDate/UpToDate C r----
    ns:0 nr:0 dw:0 dr:200 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:b oos:0
31: cs:Connected ro:Primary/Primary ds:UpToDate/UpToDate C r----
    ns:0 nr:0 dw:0 dr:152 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:b oos:0

stop ms_drbd_cluster_ocfs:

node1:
Dec 2 08:46:55 node1 kernel: [143347.351875] block drbd0: role( Primary -> Secondary )
Dec 2 08:46:55 node1 kernel: [143347.363566] block drbd0: peer( Primary -> Secondary )
Dec 2 08:46:55 node1 kernel: [143347.526187] block drbd0: sock was shut down by peer
Dec 2 08:46:55 node1 kernel: [143347.526192] block drbd0: peer( Secondary -> Unknown ) conn( Connected -> Disconnecting ) pdsk( UpToDate -> DUnknown )
Dec 2 08:46:55 node1 kernel: [143347.526247] block drbd0: asender terminated
Dec 2 08:46:55 node1 kernel: [143347.526250] block drbd0: Terminating drbd0_asender
Dec 2 08:46:55 node1 kernel: [143347.526376] block drbd0: Connection closed
Dec 2 08:46:55 node1 kernel: [143347.526384] block drbd0: conn( Disconnecting -> StandAlone )
Dec 2 08:46:55 node1 kernel: [143347.526403] block drbd0: receiver terminated
Dec 2 08:46:55 node1 kernel: [143347.526406] block drbd0: Terminating drbd0_receiver
Dec 2 08:46:55 node1 kernel: [143347.526426] block drbd0: disk( UpToDate -> Diskless )
Dec 2 08:46:55 node1 kernel: [143347.526476] block drbd0: drbd_bm_resize called with capacity == 0
Dec 2 08:46:55 node1 kernel: [143347.526482] block drbd0: worker terminated
Dec 2 08:46:55 node1 kernel: [143347.526486] block drbd0: Terminating drbd0_worker

node2:
Dec 2 08:46:55 node2 kernel: [143573.833171] block drbd0: peer( Primary -> Secondary ) pdsk( Inconsistent -> UpToDate )
Dec 2 08:46:55 node2 kernel: [143573.843876] block drbd0: role( Primary -> Secondary )
Dec 2 08:46:55 node2 kernel: [143574.007033] block drbd0: peer( Secondary -> Unknown ) conn( Connected -> TearDown ) pdsk( UpToDate -> DUnknown )
Dec 2 08:46:55 node2 kernel: [143574.007077] block drbd0: asender terminated
Dec 2 08:46:55 node2 kernel: [143574.007082] block drbd0: Terminating drbd0_asender
Dec 2 08:46:55 node2 kernel: [143574.007285] block drbd0: Connection closed
Dec 2 08:46:55 node2 kernel: [143574.007290] block drbd0: conn( TearDown -> Unconnected )
Dec 2 08:46:55 node2 kernel: [143574.007296] block drbd0: receiver terminated
Dec 2 08:46:55 node2 kernel: [143574.007298] block drbd0: Restarting drbd0_receiver
Dec 2 08:46:55 node2 kernel: [143574.007300] block drbd0: receiver (re)started
Dec 2 08:46:55 node2 kernel: [143574.007304] block drbd0: conn( Unconnected -> WFConnection )
Dec 2 08:46:55 node2 kernel: [143574.018028] block drbd0: conn( WFConnection -> Disconnecting )
Dec 2 08:46:55 node2 kernel: [143574.018043] block drbd0: Discarding network configuration.
Dec 2 08:46:55 node2 kernel: [143574.018163] block drbd0: Connection closed
Dec 2 08:46:55 node2 kernel: [143574.018180] block drbd0: conn( Disconnecting -> StandAlone )
Dec 2 08:46:55 node2 kernel: [143574.018302] block drbd0: receiver terminated
Dec 2 08:46:55 node2 kernel: [143574.018306] block drbd0: Terminating drbd0_receiver
Dec 2 08:46:55 node2 kernel: [143574.018392] block drbd0: disk( UpToDate -> Diskless )
Dec 2 08:46:55 node2 kernel: [143574.018501] block drbd0: drbd_bm_resize called with capacity == 0
Dec 2 08:46:55 node2 kernel: [143574.018508] block drbd0: worker terminated
Dec 2 08:46:55 node2 kernel: [143574.018510] block drbd0: Terminating drbd0_worker

start ms_drbd_cluster_ocfs:

node1:
Dec 2 08:47:14 node1 kernel: [143365.606992] block drbd0: Starting worker thread (from cqueue [1898])
Dec 2 08:47:14 node1 kernel: [143365.607353] block drbd0: disk( Diskless -> Attaching )
Dec 2 08:47:14 node1 kernel: [143365.625044] block drbd0: No usable activity log found.
Dec 2 08:47:14 node1 kernel: [143365.625050] block drbd0: Method to ensure write ordering: barrier
Dec 2 08:47:14 node1 kernel: [143365.625056] block drbd0: Backing device's merge_bvec_fn() = ffffffffa00f2c89
Dec 2 08:47:14 node1 kernel: [143365.625059] block drbd0: max_segment_size ( = BIO size ) = 4096
Dec 2 08:47:14 node1 kernel: [143365.625064] block drbd0: drbd_bm_resize called with capacity == 2097016
Dec 2 08:47:14 node1 kernel: [143365.625078] block drbd0: resync bitmap: bits=262127 words=4096
Dec 2 08:47:14 node1 kernel: [143365.625082] block drbd0: size = 1024 MB (1048508 KB)
Dec 2 08:47:14 node1 kernel: [143365.628586] block drbd0: recounting of set bits took additional 0 jiffies
Dec 2 08:47:14 node1 kernel: [143365.628590] block drbd0: 0 KB (0 bits) marked out-of-sync by on disk bit-map.
Dec 2 08:47:14 node1 kernel: [143365.628596] block drbd0: disk( Attaching -> Consistent )
Dec 2 08:47:14 node1 kernel: [143365.633831] block drbd0: conn( StandAlone -> Unconnected )
Dec 2 08:47:14 node1 kernel: [143365.633854] block drbd0: Starting receiver thread (from drbd0_worker [14778])
Dec 2 08:47:14 node1 kernel: [143365.634352] block drbd0: receiver (re)started
Dec 2 08:47:14 node1 kernel: [143365.634359] block drbd0: conn( Unconnected -> WFConnection )
Dec 2 08:47:14 node1 kernel: [143365.850121] block drbd0: helper command: /sbin/drbdadm fence-peer minor-0
Dec 2 08:47:14 node1 kernel: [143365.931181] block drbd0: Handshake successful: Agreed network protocol version 91
Dec 2 08:47:15 node1 crm-fence-peer.sh[14894]: invoked for cluster-ocfs
Dec 2 08:47:45 node1 crm-fence-peer.sh[14894]: Call cib_create failed (-21): The object already exists
Dec 2 08:47:45 node1 crm-fence-peer.sh[14894]: <failed>
Dec 2 08:47:45 node1 crm-fence-peer.sh[14894]: <failed_update id="drbd-fence-by-handler-cluster-ocfs-ms_drbd_cluster_ocfs" object_type="rsc_location" operation="cib_create" reason="The object already exists">
Dec 2 08:47:45 node1 crm-fence-peer.sh[14894]: <rsc_location rsc="ms_drbd_cluster_ocfs" id="drbd-fence-by-handler-cluster-ocfs-ms_drbd_cluster_ocfs">
Dec 2 08:47:45 node1 crm-fence-peer.sh[14894]: <rule role="Master" score="-INFINITY" id="drbd-fence-by-handler-cluster-ocfs-rule-ms_drbd_cluster_ocfs">
Dec 2 08:47:45 node1 crm-fence-peer.sh[14894]: <expression attribute="#uname" operation="ne" value="node1" id="drbd-fence-by-handler-cluster-ocfs-expr-ms_drbd_cluster_ocfs"/>
Dec 2 08:47:45 node1 crm-fence-peer.sh[14894]: </rule>
Dec 2 08:47:45 node1 crm-fence-peer.sh[14894]: </rsc_location>
Dec 2 08:47:45 node1 crm-fence-peer.sh[14894]: </failed_update>
Dec 2 08:47:45 node1 crm-fence-peer.sh[14894]: </failed>
Dec 2 08:47:45 node1 crm-fence-peer.sh[14894]: WARNING DATA INTEGRITY at RISK: could not place the fencing constraint!
Dec 2 08:47:45 node1 kernel: [143397.494413] block drbd0: helper command: /sbin/drbdadm fence-peer minor-0 exit code 1 (0x100)
Dec 2 08:47:45 node1 kernel: [143397.494676] block drbd0: conn( WFConnection -> WFReportParams )
Dec 2 08:47:45 node1 kernel: [143397.494841] block drbd0: Starting asender thread (from drbd0_receiver [14787])
Dec 2 08:47:45 node1 kernel: [143397.494951] block drbd0: data-integrity-alg: <not-used>
Dec 2 08:47:45 node1 kernel: [143397.494964] block drbd0: drbd_sync_handshake:
Dec 2 08:47:45 node1 kernel: [143397.494967] block drbd0: self C25D6F4D22879C5C:0000000000000000:BBD6AD66F9958D13:FE0A927040B5EE53 bits:0 flags:0
Dec 2 08:47:45 node1 kernel: [143397.494971] block drbd0: peer D97B1C1FD6F6A891:C25D6F4D22879C5C:BBD6AD66F9958D13:FE0A927040B5EE53 bits:0 flags:0
Dec 2 08:47:45 node1 kernel: [143397.494977] block drbd0: conn( WFReportParams -> NetworkFailure )
Dec 2 08:47:45 node1 kernel: [143397.494980] block drbd0: uuid_compare()=-1 by rule 50
Dec 2 08:47:45 node1 kernel: [143397.494989] block drbd0: asender terminated
Dec 2 08:47:45 node1 kernel: [143397.494990] block drbd0: drbd_sync_handshake:
Dec 2 08:47:45 node1 kernel: [143397.494994] block drbd0: self C25D6F4D22879C5C:0000000000000000:BBD6AD66F9958D13:FE0A927040B5EE53 bits:0 flags:0
Dec 2 08:47:45 node1 kernel: [143397.494998] block drbd0: peer D97B1C1FD6F6A891:C25D6F4D22879C5C:BBD6AD66F9958D13:FE0A927040B5EE53 bits:0 flags:0
Dec 2 08:47:45 node1 kernel: [143397.495001] block drbd0: uuid_compare()=-1 by rule 50
Dec 2 08:47:45 node1 kernel: [143397.495004] block drbd0: Terminating drbd0_asender
Dec 2 08:47:45 node1 kernel: [143397.495029] block drbd0: Connection closed
Dec 2 08:47:45 node1 kernel: [143397.495066] block drbd0: conn( NetworkFailure -> Unconnected )
Dec 2 08:47:45 node1 kernel: [143397.495071] block drbd0: receiver terminated
Dec 2 08:47:45 node1 kernel: [143397.495073] block drbd0: Restarting drbd0_receiver
Dec 2 08:47:45 node1 kernel: [143397.495076] block drbd0: receiver (re)started
Dec 2 08:47:45 node1 kernel: [143397.495082] block drbd0: conn( Unconnected -> WFConnection )
Dec 2 08:47:46 node1 kernel: [143397.512326] block drbd0: disk( Consistent -> Outdated )
Dec 2 08:47:46 node1 kernel: [143397.598232] block drbd0: Handshake successful: Agreed network protocol version 91
Dec 2 08:47:46 node1 kernel: [143397.598241] block drbd0: conn( WFConnection -> WFReportParams )
Dec 2 08:47:46 node1 kernel: [143397.598264] block drbd0: Starting asender thread (from drbd0_receiver [14787])
Dec 2 08:47:46 node1 kernel: [143397.598415] block drbd0: data-integrity-alg: <not-used>
Dec 2 08:47:46 node1 kernel: [143397.598428] block drbd0: drbd_sync_handshake:
Dec 2 08:47:46 node1 kernel: [143397.598432] block drbd0: self C25D6F4D22879C5C:0000000000000000:BBD6AD66F9958D13:FE0A927040B5EE53 bits:0 flags:0
Dec 2 08:47:46 node1 kernel: [143397.598437] block drbd0: peer D97B1C1FD6F6A891:C25D6F4D22879C5C:BBD6AD66F9958D13:FE0A927040B5EE53 bits:0 flags:0
Dec 2 08:47:46 node1 kernel: [143397.598440] block drbd0: uuid_compare()=-1 by rule 50
Dec 2 08:47:46 node1 kernel: [143397.598446] block drbd0: peer( Unknown -> Primary ) conn( WFReportParams -> WFBitMapT ) pdsk( DUnknown -> UpToDate )
Dec 2 08:47:46 node1 kernel: [143397.599801] block drbd0: conn( WFBitMapT -> WFSyncUUID )
Dec 2 08:47:46 node1 kernel: [143397.601341] block drbd0: helper command: /sbin/drbdadm before-resync-target minor-0
Dec 2 08:47:46 node1 kernel: [143397.603220] block drbd0: helper command: /sbin/drbdadm before-resync-target minor-0 exit code 0 (0x0)
Dec 2 08:47:46 node1 kernel: [143397.603227] block drbd0: conn( WFSyncUUID -> SyncTarget ) disk( Outdated -> Inconsistent )
Dec 2 08:47:46 node1 kernel: [143397.603235] block drbd0: Began resync as SyncTarget (will sync 0 KB [0 bits set]).
Dec 2 08:47:46 node1 kernel: [143398.149623] block drbd0: peer( Primary -> Unknown ) conn( SyncTarget -> Disconnecting ) pdsk( UpToDate -> DUnknown )
Dec 2 08:47:46 node1 kernel: [143398.149814] block drbd0: asender terminated
Dec 2 08:47:46 node1 kernel: [143398.149822] block drbd0: Terminating drbd0_asender
Dec 2 08:47:46 node1 kernel: [143398.150000] block drbd0: Connection closed
Dec 2 08:47:46 node1 kernel: [143398.150033] block drbd0: conn( Disconnecting -> StandAlone )
Dec 2 08:47:46 node1 kernel: [143398.150279] block drbd0: receiver terminated
Dec 2 08:47:46 node1 kernel: [143398.150284] block drbd0: Terminating drbd0_receiver
Dec 2 08:47:46 node1 kernel: [143398.150345] block drbd0: disk( Inconsistent -> Diskless )
Dec 2 08:47:46 node1 kernel: [143398.150451] block drbd0: drbd_bm_resize called with capacity == 0
Dec 2 08:47:46 node1 kernel: [143398.150459] block drbd0: worker terminated
Dec 2 08:47:46 node1 kernel: [143398.150462] block drbd0: Terminating drbd0_worker
Dec 2 08:47:46 node1 kernel: [143398.257848] block drbd0: Starting worker thread (from cqueue [1898])
Dec 2 08:47:46 node1 kernel: [143398.258149] block drbd0: disk( Diskless -> Attaching )
Dec 2 08:47:46 node1 kernel: [143398.266813] block drbd0: No usable activity log found.
Dec 2 08:47:46 node1 kernel: [143398.266819] block drbd0: Method to ensure write ordering: barrier
Dec 2 08:47:46 node1 kernel: [143398.266825] block drbd0: Backing device's merge_bvec_fn() = ffffffffa00f2c89
Dec 2 08:47:46 node1 kernel: [143398.266828] block drbd0: max_segment_size ( = BIO size ) = 4096
Dec 2 08:47:46 node1 kernel: [143398.266834] block drbd0: drbd_bm_resize called with capacity == 2097016
Dec 2 08:47:46 node1 kernel: [143398.266847] block drbd0: resync bitmap: bits=262127 words=4096
Dec 2 08:47:46 node1 kernel: [143398.266852] block drbd0: size = 1024 MB (1048508 KB)
Dec 2 08:47:46 node1 kernel: [143398.267799] block drbd0: recounting of set bits took additional 0 jiffies
Dec 2 08:47:46 node1 kernel: [143398.267803] block drbd0: 0 KB (0 bits) marked out-of-sync by on disk bit-map.
Dec 2 08:47:46 node1 kernel: [143398.267809] block drbd0: disk( Attaching -> Inconsistent )
Dec 2 08:47:46 node1 kernel: [143398.272213] block drbd0: conn( StandAlone -> Unconnected )
Dec 2 08:47:46 node1 kernel: [143398.272230] block drbd0: Starting receiver thread (from drbd0_worker [15524])
Dec 2 08:47:46 node1 kernel: [143398.272260] block drbd0: receiver (re)started
Dec 2 08:47:46 node1 kernel: [143398.272265] block drbd0: conn( Unconnected -> WFConnection )
Dec 2 08:47:47 node1 kernel: [143398.572843] block drbd0: Handshake successful: Agreed network protocol version 91
Dec 2 08:47:47 node1 kernel: [143398.572853] block drbd0: conn( WFConnection -> WFReportParams )
Dec 2 08:47:47 node1 kernel: [143398.573065] block drbd0: Starting asender thread (from drbd0_receiver [15535])
Dec 2 08:47:48 node1 kernel: [143399.548019] block drbd0: data-integrity-alg: <not-used>
Dec 2 08:47:48 node1 kernel: [143399.548191] block drbd0: drbd_sync_handshake:
Dec 2 08:47:48 node1 kernel: [143399.548197] block drbd0: self ADA4AB4A18C972CC:0000000000000000:BBD6AD66F9958D13:FE0A927040B5EE53 bits:0 flags:0
Dec 2 08:47:48 node1 kernel: [143399.548204] block drbd0: peer DF81B74C806482F7:D97B1C1FD6F6A890:ADA4AB4A18C972CD:C25D6F4D22879C5C bits:0 flags:0
Dec 2 08:47:48 node1 kernel: [143399.548209] block drbd0: uuid_compare()=-2 by rule 60
Dec 2 08:47:48 node1 kernel: [143399.548212] block drbd0: Becoming sync target due to disk states.
Dec 2 08:47:48 node1 kernel: [143399.548215] block drbd0: Writing the whole bitmap, full sync required after drbd_sync_handshake.
Dec 2 08:47:48 node1 kernel: [143399.548986] block drbd0: 1024 MB (262127 bits) marked out-of-sync by on disk bit-map.
Dec 2 08:47:48 node1 kernel: [143399.549157] block drbd0: peer( Unknown -> Primary ) conn( WFReportParams -> WFBitMapT ) pdsk( DUnknown -> UpToDate )
Dec 2 08:47:48 node1 kernel: [143399.551323] block drbd0: conn( WFBitMapT -> WFSyncUUID )
Dec 2 08:47:48 node1 kernel: [143399.552817] block drbd0: helper command: /sbin/drbdadm before-resync-target minor-0
Dec 2 08:47:48 node1 kernel: [143399.558452] block drbd0: helper command: /sbin/drbdadm before-resync-target minor-0 exit code 0 (0x0)
Dec 2 08:47:48 node1 kernel: [143399.558459] block drbd0: conn( WFSyncUUID -> SyncTarget )
Dec 2 08:47:48 node1 kernel: [143399.558466] block drbd0: Began resync as SyncTarget (will sync 1048508 KB [262127 bits set]).

node2:
Dec 2 08:47:14 node2 kernel: [143592.106933] block drbd0: Starting worker thread (from cqueue [1840])
Dec 2 08:47:14 node2 kernel: [143592.107136] block drbd0: disk( Diskless -> Attaching )
Dec 2 08:47:14 node2 kernel: [143592.117429] block drbd0: No usable activity log found.
Dec 2 08:47:14 node2 kernel: [143592.117434] block drbd0: Method to ensure write ordering: barrier
Dec 2 08:47:14 node2 kernel: [143592.117440] block drbd0: Backing device's merge_bvec_fn() = ffffffffa010fc89
Dec 2 08:47:14 node2 kernel: [143592.117444] block drbd0: max_segment_size ( = BIO size ) = 4096
Dec 2 08:47:14 node2 kernel: [143592.117449] block drbd0: drbd_bm_resize called with capacity == 2097016
Dec 2 08:47:14 node2 kernel: [143592.117464] block drbd0: resync bitmap: bits=262127 words=4096
Dec 2 08:47:14 node2 kernel: [143592.117472] block drbd0: size = 1024 MB (1048508 KB)
Dec 2 08:47:14 node2 kernel: [143592.123062] block drbd0: recounting of set bits took additional 0 jiffies
Dec 2 08:47:14 node2 kernel: [143592.123067] block drbd0: 0 KB (0 bits) marked out-of-sync by on disk bit-map.
Dec 2 08:47:14 node2 kernel: [143592.123074] block drbd0: disk( Attaching -> Consistent )
Dec 2 08:47:14 node2 kernel: [143592.127792] block drbd0: conn( StandAlone -> Unconnected )
Dec 2 08:47:14 node2 kernel: [143592.127810] block drbd0: Starting receiver thread (from drbd0_worker [15110])
Dec 2 08:47:14 node2 kernel: [143592.127846] block drbd0: receiver (re)started
Dec 2 08:47:14 node2 kernel: [143592.127851] block drbd0: conn( Unconnected -> WFConnection )
Dec 2 08:47:14 node2 kernel: [143592.361076] block drbd0: helper command: /sbin/drbdadm fence-peer minor-0
Dec 2 08:47:14 node2 kernel: [143592.440790] block drbd0: Handshake successful: Agreed network protocol version 91
Dec 2 08:47:15 node2 crm-fence-peer.sh[15223]: invoked for cluster-ocfs
Dec 2 08:47:45 node2 crm-fence-peer.sh[15223]: INFO peer is reachable, my disk is Consistent: placed constraint 'drbd-fence-by-handler-cluster-ocfs-ms_drbd_cluster_ocfs'
Dec 2 08:47:45 node2 kernel: [143623.333511] block drbd0: helper command: /sbin/drbdadm fence-peer minor-0 exit code 4 (0x400)
Dec 2 08:47:45 node2 kernel: [143623.333516] block drbd0: fence-peer helper returned 4 (peer was fenced)
Dec 2 08:47:45 node2 kernel: [143623.333524] block drbd0: role( Secondary -> Primary ) disk( Consistent -> UpToDate ) pdsk( DUnknown -> Outdated )
Dec 2 08:47:45 node2 kernel: [143623.333830] block drbd0: Creating new current UUID
Dec 2 08:47:45 node2 kernel: [143623.334207] block drbd0: conn( WFConnection -> WFReportParams )
Dec 2 08:47:45 node2 kernel: [143623.334226] block drbd0: Starting asender thread (from drbd0_receiver [15121])
Dec 2 08:47:45 node2 kernel: [143623.833678] block drbd0: conn( WFReportParams -> NetworkFailure )
Dec 2 08:47:45 node2 kernel: [143623.833687] block drbd0: asender terminated
Dec 2 08:47:45 node2 kernel: [143623.833691] block drbd0: Terminating drbd0_asender
Dec 2 08:47:45 node2 kernel: [143623.834043] block drbd0: Connection closed
Dec 2 08:47:45 node2 kernel: [143623.834059] block drbd0: conn( NetworkFailure -> Unconnected )
Dec 2 08:47:45 node2 kernel: [143623.834065] block drbd0: receiver terminated
Dec 2 08:47:45 node2 kernel: [143623.834067] block drbd0: Restarting drbd0_receiver
Dec 2 08:47:45 node2 kernel: [143623.834070] block drbd0: receiver (re)started
Dec 2 08:47:45 node2 kernel: [143623.834075] block drbd0: conn( Unconnected -> WFConnection )
Dec 2 08:47:46 node2 kernel: [143624.157421] block drbd0: Handshake successful: Agreed network protocol version 91
Dec 2 08:47:46 node2 kernel: [143624.157429] block drbd0: conn( WFConnection -> WFReportParams )
Dec 2 08:47:46 node2 kernel: [143624.157585] block drbd0: Starting asender thread (from drbd0_receiver [15121])
Dec 2 08:47:46 node2 kernel: [143624.157728] block drbd0: data-integrity-alg: <not-used>
Dec 2 08:47:46 node2 kernel: [143624.157743] block drbd0: drbd_sync_handshake:
Dec 2 08:47:46 node2 kernel: [143624.157747] block drbd0: self D97B1C1FD6F6A891:C25D6F4D22879C5C:BBD6AD66F9958D13:FE0A927040B5EE53 bits:0 flags:0
Dec 2 08:47:46 node2 kernel: [143624.157751] block drbd0: peer C25D6F4D22879C5C:0000000000000000:BBD6AD66F9958D13:FE0A927040B5EE53 bits:0 flags:0
Dec 2 08:47:46 node2 kernel: [143624.157755] block drbd0: uuid_compare()=1 by rule 70
Dec 2 08:47:46 node2 kernel: [143624.157759] block drbd0: peer( Unknown -> Secondary ) conn( WFReportParams -> WFBitMapS )
Dec 2 08:47:46 node2 kernel: [143624.160572] block drbd0: conn( WFBitMapS -> SyncSource ) pdsk( Outdated -> Inconsistent )
Dec 2 08:47:46 node2 kernel: [143624.160580] block drbd0: Began resync as SyncSource (will sync 0 KB [0 bits set]).
Dec 2 08:47:46 node2 kernel: [143624.249923] block drbd0: role( Primary -> Secondary )
Dec 2 08:47:46 node2 kernel: [143624.709497] block drbd0: Resync done (total 1 sec; paused 0 sec; 0 K/sec)
Dec 2 08:47:46 node2 kernel: [143624.709507] block drbd0: conn( SyncSource -> Connected ) pdsk( Inconsistent -> UpToDate )
Dec 2 08:47:46 node2 kernel: [143624.709530] block drbd0: peer( Secondary -> Unknown ) conn( Connected -> TearDown ) pdsk( UpToDate -> DUnknown )
Dec 2 08:47:46 node2 kernel: [143624.709633] block drbd0: conn( TearDown -> Disconnecting )
Dec 2 08:47:46 node2 kernel: [143624.709920] block drbd0: asender terminated
Dec 2 08:47:46 node2 kernel: [143624.709928] block drbd0: Terminating drbd0_asender
Dec 2 08:47:46 node2 kernel: [143624.710210] block drbd0: Connection closed
Dec 2 08:47:46 node2 kernel: [143624.710222] block drbd0: conn( Disconnecting -> StandAlone )
Dec 2 08:47:46 node2 kernel: [143624.710405] block drbd0: receiver terminated
Dec 2 08:47:46 node2 kernel: [143624.710409] block drbd0: Terminating drbd0_receiver
Dec 2 08:47:46 node2 kernel: [143624.710469] block drbd0: disk( UpToDate -> Diskless )
Dec 2 08:47:46 node2 kernel: [143624.710645] block drbd0: drbd_bm_resize called with capacity == 0
Dec 2 08:47:46 node2 kernel: [143624.710653] block drbd0: worker terminated
Dec 2 08:47:46 node2 kernel: [143624.710657] block drbd0: Terminating drbd0_worker
Dec 2 08:47:46 node2 kernel: [143624.819951] block drbd0: Starting worker thread (from cqueue [1840])
Dec 2 08:47:46 node2 kernel: [143624.820182] block drbd0: disk( Diskless -> Attaching )
Dec 2 08:47:46 node2 kernel: [143624.821156] block drbd0: No usable activity log found.
Dec 2 08:47:46 node2 kernel: [143624.821162] block drbd0: Method to ensure write ordering: barrier
Dec 2 08:47:46 node2 kernel: [143624.821168] block drbd0: Backing device's merge_bvec_fn() = ffffffffa010fc89
Dec 2 08:47:46 node2 kernel: [143624.821171] block drbd0: max_segment_size ( = BIO size ) = 4096
Dec 2 08:47:46 node2 kernel: [143624.821176] block drbd0: drbd_bm_resize called with capacity == 2097016
Dec 2 08:47:46 node2 kernel: [143624.821191] block drbd0: resync bitmap: bits=262127 words=4096
Dec 2 08:47:46 node2 kernel: [143624.821195] block drbd0: size = 1024 MB (1048508 KB)
Dec 2 08:47:46 node2 kernel: [143624.821526] block drbd0: recounting of set bits took additional 0 jiffies
Dec 2 08:47:46 node2 kernel: [143624.821530] block drbd0: 0 KB (0 bits) marked out-of-sync by on disk bit-map.
Dec 2 08:47:46 node2 kernel: [143624.821537] block drbd0: disk( Attaching -> Consistent )
Dec 2 08:47:46 node2 kernel: [143624.826011] block drbd0: conn( StandAlone -> Unconnected )
Dec 2 08:47:46 node2 kernel: [143624.826040] block drbd0: Starting receiver thread (from drbd0_worker [15839])
Dec 2 08:47:46 node2 kernel: [143624.826079] block drbd0: receiver (re)started
Dec 2 08:47:46 node2 kernel: [143624.826085] block drbd0: conn( Unconnected -> WFConnection )
Dec 2 08:47:47 node2 kernel: [143625.074611] block drbd0: helper command: /sbin/drbdadm fence-peer minor-0
Dec 2 08:47:47 node2 kernel: [143625.133531] block drbd0: Handshake successful: Agreed network protocol version 91
Dec 2 08:47:48 node2 crm-fence-peer.sh[15957]: invoked for cluster-ocfs
Dec 2 08:47:48 node2 crm-fence-peer.sh[15957]: INFO suitable constraint already placed: 'drbd-fence-by-handler-cluster-ocfs-ms_drbd_cluster_ocfs'
Dec 2 08:47:48 node2 kernel: [143626.109469] block drbd0: helper command: /sbin/drbdadm fence-peer minor-0 exit code 4 (0x400)
Dec 2 08:47:48 node2 kernel: [143626.109473] block drbd0: fence-peer helper returned 4 (peer was fenced)
Dec 2 08:47:48 node2 kernel: [143626.109481] block drbd0: role( Secondary -> Primary ) disk( Consistent -> UpToDate ) pdsk( DUnknown -> Outdated )
Dec 2 08:47:48 node2 kernel: [143626.109833] block drbd0: Creating new current UUID
Dec 2 08:47:48 node2 kernel: [143626.110142] block drbd0: conn( WFConnection -> WFReportParams )
Dec 2 08:47:48 node2 kernel: [143626.110165] block drbd0: Starting asender thread (from drbd0_receiver [15850])
Dec 2 08:47:48 node2 kernel: [143626.110232] block drbd0: data-integrity-alg: <not-used>
Dec 2 08:47:48 node2 kernel: [143626.110246] block drbd0: drbd_sync_handshake:
Dec 2 08:47:48 node2 kernel: [143626.110250] block drbd0: self DF81B74C806482F7:D97B1C1FD6F6A890:ADA4AB4A18C972CD:C25D6F4D22879C5C bits:0 flags:0
Dec 2 08:47:48 node2 kernel: [143626.110254] block drbd0: peer ADA4AB4A18C972CC:0000000000000000:BBD6AD66F9958D13:FE0A927040B5EE53 bits:0 flags:0
Dec 2 08:47:48 node2 kernel: [143626.110258] block drbd0: uuid_compare()=2 by rule 80
Dec 2 08:47:48 node2 kernel: [143626.110260] block drbd0: Becoming sync source due to disk states.
Dec 2 08:47:48 node2 kernel: [143626.110263] block drbd0: Writing the whole bitmap, full sync required after drbd_sync_handshake.
Dec 2 08:47:48 node2 kernel: [143626.110667] block drbd0: 1024 MB (262127 bits) marked out-of-sync by on disk bit-map.
Dec 2 08:47:48 node2 kernel: [143626.110915] block drbd0: peer( Unknown -> Secondary ) conn( WFReportParams -> WFBitMapS ) pdsk( Outdated -> Inconsistent )
Dec 2 08:47:48 node2 kernel: [143626.115046] block drbd0: conn( WFBitMapS -> SyncSource )
Dec 2 08:47:48 node2 kernel: [143626.115057] block drbd0: Began resync as SyncSource (will sync 1048508 KB [262127 bits set]).
Dec 2 08:49:18 node2 kernel: [143716.815795] block drbd0: Resync done (total 90 sec; paused 0 sec; 11648 K/sec)
Dec 2 08:49:18 node2 kernel: [143716.815803] block drbd0: conn( SyncSource -> Connected ) pdsk( Inconsistent -> UpToDate )
Dec 2 08:49:18 node2 kernel: [143716.998813] block drbd0: peer( Secondary -> Primary )

THANKS!

_______________________________________________
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org
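The numbers in the logs above can be cross-checked with a little arithmetic. The sketch below (Python) uses only values printed in this thread; the 4 KB-per-bit granularity is inferred from "1048508 KB [262127 bits set]", and reading the unit-less "rate 12800" as KB/s is an assumption about the DRBD 8.3 default unit.

```python
# Values copied from the kernel log and the drbdadm dump in this thread.
BITMAP_BITS = 262127       # "resync bitmap: bits=262127"
REPORTED_KB = 1048508      # "will sync 1048508 KB [262127 bits set]"
RESYNC_SECONDS = 90        # "Resync done (total 90 sec; ...)"
SYNCER_RATE_KB_S = 12800   # "syncer { rate 12800; }", assumed to mean KB/s

# Each bitmap bit covers REPORTED_KB / BITMAP_BITS kilobytes (inferred from the log).
kb_per_bit = REPORTED_KB // BITMAP_BITS
resync_kb = BITMAP_BITS * kb_per_bit

# Average throughput over the whole resync.
avg_kb_per_s = resync_kb / RESYNC_SECONDS

print(kb_per_bit)                       # KB covered by one bit
print(resync_kb)                        # total KB resynced
print(round(avg_kb_per_s))              # average KB/s
print(avg_kb_per_s < SYNCER_RATE_KB_S)  # stays below the configured rate
```

The average comes out close to the "11648 K/sec" that DRBD reports, so the whole 1 GB device was transferred at roughly the configured syncer rate, which matches a full resync rather than a bitmap-based one.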