Re: missing rows while importing data using sstable loader

2016-02-05 Thread Jack Krupansky
I sent a message to DataStax Docs to add this nodetool flush suggestion to the doc for sstableloader. -- Jack Krupansky On Fri, Feb 5, 2016 at 3:35 AM, Romain Hardouin wrote: > > What is the best practise to create sstables? > > When you run a "nodetool flush" Cassandra persists all the memtabl

Re: missing rows while importing data using sstable loader

2016-02-05 Thread Victor Chen
Arindam, What can you share regarding the source from which you are importing data? Is it a separate cassandra cluster? If so, how many nodes and datacenters? What is RF (replication factor) of source cluster? How certain are you that the rows indeed exist in the set of sstables which you are load

Re: missing rows while importing data using sstable loader

2016-02-05 Thread Romain Hardouin
> What is the best practise to create sstables? When you run a "nodetool flush" Cassandra persists all the memtables on disk, i.e. it produces sstables. (You can create sstables by yourself thanks to CQLSSTableWriter, but I don't think it was the point of your question.)

Re: missing rows while importing data using sstable loader

2016-02-01 Thread Arindam Choudhury
What is the best practise to create sstables? On 1 February 2016 at 15:21, Romain Hardouin wrote: > Did you run "nodetool flush" on the source node? If not, the missing rows > could be in memtables. >

Re: missing rows while importing data using sstable loader

2016-02-01 Thread Romain Hardouin
Did you run "nodetool flush" on the source node? If not, the missing rows could be in memtables.

Re: missing rows while importing data using sstable loader

2016-01-29 Thread Jack Krupansky
I agree that there should be more clear doc on exactly how the estimation is calculated. When I inquired about this recently the response was that it should be within about 2% of the actual key count. I started looking at the code, but I ran out of time before I chased down all the subsidiary facto

Re: missing rows while importing data using sstable loader

2016-01-29 Thread Arindam Choudhury
Why in cqlsh when I query "select count(*) from mordor.things_values_meta ;" it says: 4692 But in nodetool cfstats it says Number of keys (estimate): 4720? On 29 January 2016 at 16:25, Arindam Choudhury < arindam.choudh...@ackstorm.com> wrote: > I am counting the rows with "select count(*) from

Re: missing rows while importing data using sstable loader

2016-01-29 Thread Jack Krupansky
And how are you counting the rows? With a query? If, so, what is the query. Using nodetool cfstats (estimated) key count? Or... what? Are the tokens for the missing rows is the same range and a distinct range from the rest of the data in the original cluster? How many nodes in the original cluste

Re: missing rows while importing data using sstable loader

2016-01-29 Thread Arindam Choudhury
I am counting the rows with "select count(*) from mordor.things_values_meta;" I am doing one node cluster to one node cluster for testing. On 29 January 2016 at 16:20, Jack Krupansky wrote: > And how are you counting the rows? With a query? If, so, what is the > query. Using nodetool cfstats (e

Re: missing rows while importing data using sstable loader

2016-01-29 Thread Arindam Choudhury
I will check the output of nodetool cfstats. Its from version 2.1.2 to version 2.1.9. On 29 January 2016 at 16:02, Jack Krupansky wrote: > Are these sstables from an existing Cassandra cluster or generated by a > program? > > If the former, do a nodetool tablestats or cfstats to get the sstable

Re: missing rows while importing data using sstable loader

2016-01-29 Thread Jack Krupansky
Are these sstables from an existing Cassandra cluster or generated by a program? If the former, do a nodetool tablestats or cfstats to get the sstable count and compare it to both the number of sstables that the loader is reading from and the number that end up in the target cluster. What Cassand

Re: missing rows while importing data using sstable loader

2016-01-29 Thread Arindam Choudhury
Hi Romain, The RF was set to 2. I changed it to one. CREATE KEYSPACE mordor WITH replication = {'class' : 'SimpleStrategy', 'replication_factor' : 1} AND durable_writes = true; re-inserted the columns, still missing rows. Regards, Arindam On 29 January 2016 at 15:14, Romain Hardouin wrote:

Re: missing rows while importing data using sstable loader

2016-01-29 Thread Romain Hardouin
Hi, I assume a RF > 1. Right?What is the consistency level you used? cqlsh use ONE by default. Try: cqlsh> CONSISTENCY ALLAnd run your query again. Best,Romain Le Vendredi 29 janvier 2016 13h45, Arindam Choudhury a écrit : Hi Kai, The table schema is: CREATE TABLE mordor.things_value

Re: missing rows while importing data using sstable loader

2016-01-29 Thread Arindam Choudhury
Hi Kai, The table schema is: CREATE TABLE mordor.things_values_meta ( thing_id text, key text, bucket_timestamp timestamp, total_rows counter, PRIMARY KEY ((thing_id, key), bucket_timestamp) ) WITH CLUSTERING ORDER BY (bucket_timestamp ASC) AND bloom_filter_fp_chance = 0.0

Re: missing rows while importing data using sstable loader

2016-01-29 Thread Kai Wang
Arindam, what's the table schema and what does your query to retrieve the rows look like? On Fri, Jan 29, 2016 at 7:33 AM, Arindam Choudhury < arindam.choudh...@ackstorm.com> wrote: > Hi, > > I am importing data to a new cassandra cluster using sstableloader. The > sstableloader runs without any

missing rows while importing data using sstable loader

2016-01-29 Thread Arindam Choudhury
Hi, I am importing data to a new cassandra cluster using sstableloader. The sstableloader runs without any warning or error. But I am missing around 1000 rows. Any feedback will be highly appreciated. Kind Regards, Arindam Choudhury

Re: sstable loader

2015-03-30 Thread Amila Paranawithana
30, 2015 at 12:57 PM, Vanessa Gligor > wrote: > >> Hi, >> >> I used this https://github.com/yukim/cassandra-bulkload-example/ (I have >> modified BulkLoad.java for my needs) for the sstable loader and it works >> ok. You can take a look, maybe it will help you

Re: sstable loader

2015-03-30 Thread Rahul Bhardwaj
M, Vanessa Gligor wrote: > Hi, > > I used this https://github.com/yukim/cassandra-bulkload-example/ (I have > modified BulkLoad.java for my needs) for the sstable loader and it works > ok. You can take a look, maybe it will help you. > > Regards, > Vanessa. > >

Re: sstable loader

2015-03-30 Thread Vanessa Gligor
Hi, I used this https://github.com/yukim/cassandra-bulkload-example/ (I have modified BulkLoad.java for my needs) for the sstable loader and it works ok. You can take a look, maybe it will help you. Regards, Vanessa. On Mon, Mar 30, 2015 at 10:04 AM, Rahul Bhardwaj < rahul.bh

Re: sstable loader

2015-03-30 Thread Rahul Bhardwaj
[1]. > http://amilaparanawithana.blogspot.com/2012/06/bulk-loading-external-data-to-cassandra.html > > Thanks, > > > On Fri, Mar 27, 2015 at 11:40 AM, Rahul Bhardwaj < > rahul.bhard...@indiamart.com> wrote: > >> Hi All, >> >> Can we use sstable loader

Re: sstable loader

2015-03-27 Thread Amila Paranawithana
ard...@indiamart.com> wrote: > Hi All, > > Can we use sstable loader for loading external flat file or csv file. > If yes , kindly share the steps or manual. > > I need to put 40 million data into a table of around 70 columns > > > > Regards: > Rahul Bhardwaj

sstable loader

2015-03-26 Thread Rahul Bhardwaj
Hi All, Can we use sstable loader for loading external flat file or csv file. If yes , kindly share the steps or manual. I need to put 40 million data into a table of around 70 columns Regards: Rahul Bhardwaj -- Follow IndiaMART.com <http://www.indiamart.com> for latest updates o

Re: error on using sstable loader

2015-01-12 Thread Robert Coli
On Mon, Jan 12, 2015 at 4:26 AM, Rahul Bhardwaj < rahul.bhard...@indiamart.com> wrote: > sstableloader -d 162.217.99.217 > /var/lib/cassandra/data/clickstream/business_feed_new > ERROR 17:50:48,218 Unable to initialize MemoryMeter (jamm not specified as > javaagent). This means Cassandra will be

error on using sstable loader

2015-01-12 Thread Rahul Bhardwaj
Hi All, While using bulk loader we are getting this error: sstableloader -d 162.217.99.217 /var/lib/cassandra/data/clickstream/business_feed_new ERROR 17:50:48,218 Unable to initialize MemoryMeter (jamm not specified as javaagent). This means Cassandra will be unable to measure object sizes ac

Re: Issue with SStable loader.

2012-04-11 Thread aaron morton
:56 PM, Rishabh Agrawal wrote: > >> Hello, >> >> I had three node cluster which I converted to 4 node one. Later I >> decommissioned one of them and load balanced the data on remaining 3. I >> removed decommissioned node from ‘seed list’ . I restarted all nodes and >

RE: Issue with SStable loader.

2012-04-10 Thread Rishabh Agrawal
bject: Re: Issue with SStable loader. Did you update the config for sstableloader ? Are their any data files in the data directory pointed to by the sstableloader config ? Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 10/04/2012, at 11: