Hello, As Jean said il will be preferable to use http://cassandra-reaper.io
So you don't have to manually manage the consistency of your cassandra ring nor the list of nodes to repair. Le mer. 21 août 2019 à 15:57, Rahul Reddy <rahulreddy1...@gmail.com> a écrit : > Thanks Jean, > > I have dc1 and dc2 existing. added dc3 from dc1 and dc4 from dc2. If I > want to run repair on one node in dc3 from dc1 only is it possible? > > On Wed, Aug 21, 2019, 8:11 AM Jean Carlo <jean.jeancar...@gmail.com> > wrote: > >> Hello Rahul, >> >> To ensure the consistency among the DCs, it is enough to run a repair >> command. >> >> You can do it using http://cassandra-reaper.io/ >> or runing the commande *nodetool repair* with the respectively options >> in every node. >> >> You do not need to count the rows in every DC to ensure cassandra is sync >> amongs DC after you have run the repair. But if you still want to do it, >> use Spark for it. >> >> Jean Carlo >> >> "The best way to predict the future is to invent it" Alan Kay >> >> >> On Wed, Aug 21, 2019 at 1:51 PM Rahul Reddy <rahulreddy1...@gmail.com> >> wrote: >> >>> Yep I did run rebuild on each new node >>> >>> On Wed, Aug 21, 2019, 7:25 AM Stefan Miklosovic < >>> stefan.mikloso...@instaclustr.com> wrote: >>> >>>> Hi Rahul, >>>> >>>> how did you add that dc3 to cluster? The rule of thumb here is to do >>>> rebuild from each node, for example like here >>>> >>>> https://docs.datastax.com/en/archived/cassandra/3.0/cassandra/operations/opsAddDCToCluster.html >>>> >>>> On Wed, 21 Aug 2019 at 12:57, Rahul Reddy <rahulreddy1...@gmail.com> >>>> wrote: >>>> > >>>> > Hi sefan, >>>> > >>>> > I'm adding new DC3 to exiting cluster and see discripencies couple of >>>> millions in Nodetool cfstats in new DC. >>>> > >>>> > My table size is 50gb >>>> > I'm trying to run copy entire table. >>>> > >>>> > Copy table to 'full_tablr.csv' with delimiter ','; >>>> > >>>> > If I run above command from dc3. Does it get the data only from dc3? >>>> > >>>> > >>>> > >>>> > On Wed, Aug 21, 2019, 6:46 AM Stefan Miklosovic < >>>> stefan.mikloso...@instaclustr.com> wrote: >>>> >> >>>> >> Hi Rahul, >>>> >> >>>> >> what is your motivation behind this? Why do you want to make sure the >>>> >> count is same? What is the purpose of that? All you should care about >>>> >> is that Cassandra will return you right results. It was designed from >>>> >> the very bottom to do that for you, you should not be bothered too >>>> >> much about such discrepancies, they will be always there in general. >>>> >> But the important fact is that once queried, you can rest assured it >>>> >> is returned (and consequentially repaired if data not match) as they >>>> >> should. >>>> >> >>>> >> What copy command you are talking about precisely, why you cant use >>>> just repair? >>>> >> >>>> >> On Wed, 21 Aug 2019 at 12:14, Rahul Reddy <rahulreddy1...@gmail.com> >>>> wrote: >>>> >> > >>>> >> > Hello, >>>> >> > >>>> >> > I have 3 datacenters . Want to make sure record count is same in >>>> all dc's . If I run copy command node1 in dc1 does it get the data from >>>> only dc1? Nodetool cfstats I'm seeing discrepancies in partitions count is >>>> it because we didn't run cleanup after adding few nodes and remove them?. >>>> To rule out any discripencies I want to run copy command from 3 DC's and >>>> compare. Please let me know if copy command extracts data from the DC only >>>> I ran it from? >>>> >> >>>> >> --------------------------------------------------------------------- >>>> >> To unsubscribe, e-mail: user-unsubscr...@cassandra.apache.org >>>> >> For additional commands, e-mail: user-h...@cassandra.apache.org >>>> >> >>>> >>>> --------------------------------------------------------------------- >>>> To unsubscribe, e-mail: user-unsubscr...@cassandra.apache.org >>>> For additional commands, e-mail: user-h...@cassandra.apache.org >>>> >>>> -- Cordialement; Ahmed ELJAMI