Craig, I was starting the first node then letting all of the other nodes join as quick as they could which clearly won't work. It also explains why it worked when I did it manually, I can only do it sequentially myself ;) I had suspected a race condition and it seems I was in the right area :)
Thank you for this, I will alter what I am doing to start sequentially. On Tue, May 12, 2015 at 1:19 AM, Craig Ringer <cr...@2ndquadrant.com> wrote: > > > On 12 May 2015 at 14:33, Wayne E. Seguin <wayneeseg...@gmail.com> wrote: > >> >> > 7. on nodes 1-4 I am doing (adjusted for the nodes IP): >> SELECT bdr.bdr_group_join( >> local_node_name := 'pgbdr1', >> node_external_dsn := 'host=10.244.2.6 port=5432 user=postgres >> dbname=pgbdr', >> join_using_dsn := 'host=10.244.2.2 port=5432 user=postgres >> dbname=pgbdr' >> ); >> > > At a guess you're probably not waiting between joins to ensure that each > new node has finished joining before starting another node join. > > BDR really needs to be enhanced to either support parallel join of > multiple nodes or identify and reject it. > > > -- > Craig Ringer http://www.2ndQuadrant.com/ > PostgreSQL Development, 24x7 Support, Training & Services > -- ~Wayne Wayne E. Seguin wayneeseg...@gmail.com wayneeseguin on irc.freenode.net http://twitter.com/wayneeseguin/ https://github.com/wayneeseguin/