Re: Effect of rangequeries with RandomPartitioner

2012-07-08 Thread samal
inline resp. On Mon, Jul 9, 2012 at 10:18 AM, prasenjit mukherjee wrote: > Thanks Aaron for your response. Some follow up > questions/assumptions/clarifications : > > 1. With RandomPartitioner, on a given node, are the keys sorted by > their hash_values or original/unhashed keys ? > hash value

Re: Data Modeling- another question

2012-08-24 Thread samal
The first is the better choice: each field can be updated separately (write only). With the second you have to manage the JSON yourself (read first, modify, then write). On Fri, Aug 24, 2012 at 5:45 PM, Roshni Rajagopal < roshni.rajago...@wal-mart.com> wrote: > Hi, > > Suppose I have a column family to associate a us

Re: Data Modeling- another question

2012-08-28 Thread samal
good idea in cassandra. but it is more > complicated for most usage scenarios, so you have to work out if you really > need the extra flexibility. > > > On 24/08/2012 13:54, samal wrote: > > First is better choice, each field can be updated separately (write only). > Second you

Re: Storing counters in the standard column families along with non-counter columns ?

2011-07-10 Thread samal
Yes, maybe in 0.8.2. The current version needs a specific validation class, CounterColumn, for a counter column family (CCF), which only counts [+, -, does not replace], whereas a normal CF simply adds or replaces. On Sun, Jul 10, 2011 at 10:39 PM, Aditya Narayan wrote: > Thanks for info. > > Is there any target version in near
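
The difference described above (counter columns accumulate, normal columns replace) can be sketched with a toy model. This is only an illustration in plain Python; the class and method names are made up for the example and are not part of Cassandra or any client library.

```python
# Toy model only: these classes are illustrative, not a Cassandra API.

class StandardColumnFamily:
    """Regular columns: a write adds the column or replaces its value."""

    def __init__(self):
        self.rows = {}

    def insert(self, key, column, value):
        self.rows.setdefault(key, {})[column] = value


class CounterColumnFamily:
    """Counter columns: a write applies a signed delta and never replaces."""

    def __init__(self):
        self.rows = {}

    def add(self, key, column, delta):
        row = self.rows.setdefault(key, {})
        row[column] = row.get(column, 0) + delta


standard = StandardColumnFamily()
standard.insert("page1", "title", "old")
standard.insert("page1", "title", "new")   # the second write replaces the first

counters = CounterColumnFamily()
counters.add("page1", "hits", 5)
counters.add("page1", "hits", -2)          # the stored value is adjusted, not replaced
```

Keeping the two kinds of column in separate families, as the reply suggests, keeps the two write semantics from mixing within one row.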

Re: Key_Cache @ Row_Cache

2011-07-13 Thread samal
.datastax.com/docs/0.8/operations/cache_tuning#configuring-key-and-row-caches /Samal

Re: One node down but it thinks its fine...

2011-07-13 Thread samal
Check that the seed IP is the same on all nodes; it should not be a loopback IP in a cluster. On Wed, Jul 13, 2011 at 8:40 PM, Ray Slakinski wrote: > One of our nodes, which happens to be the seed thinks its Up and all the > other nodes are down. However all the other nodes thinks the seed is down > instead. The l

Re: CQL + Counters = bad request

2011-07-13 Thread samal
> > >>> cqlsh> UPDATE RouterAggWeekly SET 1310367600 = 1310367600 + 17 WHERE > >>> KEY = '1_20110728_ifoutmulticastpkts'; > >>> Bad Request: line 1:51 no viable alternative at character '+' > I m able to insert it. ___ cqlsh> cqlsh> UPDATE counts SET 1310367600 = 1310367600 +

Re: gossiper problem

2011-07-14 Thread samal
so much memory. > Reducing the max heap size won't solve the problem; I think it will cause more swapping. Data alone does not determine the memory requirement; so do the number of memtables (each CF has a separate memtable) and their size, compaction, caching, and reads. You should upgrade to 0.7 or later. /samal

Re: Range query ordering with CQL JDBC

2011-07-18 Thread samal
I haven't used CQL functionality much, but thrift client I think what I encounter is exactly this problem! > If you want to query over keys, you can index the keys in another CF, get the column names (which are the keys of the other CF), and then query the actual CF with those keys. switch away from the random partitioner.

Re: Cassandra training in Bangalore, India

2011-07-20 Thread samal
As far as I know, there is no such expert training available in India as of now. As Sameer said, there is enough online material available from which you can learn. I have been playing with Cassandra since the beginning. We can plan a meetup/learning session near the Mumbai/Pune region.

Re: Memtables stored in which location

2011-07-21 Thread samal
An SSTable is stored on disk; a memtable is not. A memtable is the in-memory representation of data, which is flushed to create an SSTable on disk. This is the location where SSTables are stored: https://github.com/apache/cassandra/blob/trunk/conf/cassandra.yaml#L71 Whereas the commitlog, which is a backup (log) for memta
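
A rough way to picture the relationship between commitlog, memtable and SSTable is the toy store below. It is a sketch under simplifying assumptions (everything lives in Python lists and dicts, the flush threshold counts keys, and the whole log is dropped on flush), not how Cassandra is actually implemented; `ToyStore` and `flush_threshold` are hypothetical names.

```python
class ToyStore:
    """Simplified write path: commitlog append, memtable update, flush to an SSTable."""

    def __init__(self, flush_threshold=2):
        self.commitlog = []      # stands in for the on-disk append-only log
        self.memtable = {}       # lives in memory only
        self.sstables = []       # stands in for immutable, sorted files on disk
        self.flush_threshold = flush_threshold

    def write(self, key, value):
        self.commitlog.append((key, value))   # durability first
        self.memtable[key] = value            # then the in-memory copy
        if len(self.memtable) >= self.flush_threshold:
            self.flush()

    def flush(self):
        # The memtable is written out sorted by key and then emptied.
        self.sstables.append(dict(sorted(self.memtable.items())))
        self.memtable = {}
        self.commitlog = []                   # flushed data no longer needs replay


store = ToyStore(flush_threshold=2)
store.write("b", 1)
store.write("a", 2)   # reaches the threshold and triggers a flush
store.write("c", 3)   # stays in the memtable and commitlog for now
```

After these three writes the first two keys exist only in the "SSTable", while the third exists only in the memtable and its commitlog entry.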

Re: Memtables stored in which location

2011-07-21 Thread samal
Any ways , some where memtable has to be stored right, like we say memtable data is flushed to create sstable on disk. > Exactly from which location or memory it will be getting from. is it like > an objects streams or like it is storing the values in commitlog. > A Memtable is Cassandra's in-mem

Re: Read process

2011-07-27 Thread samal
from ROW CACHE {if enabled} -->KEY CACHE-->MEMTABLE-->SSTABLE On Wed, Jul 27, 2011 at 1:19 PM, CASSANDRA learner < cassandralear...@gmail.com> wrote: > Hi, > > I am having one doubt regarding reads. The data will be stored in > commitlog,memtable,sstables right.. While reading the data may be ava

Re: Nodetool ring not showing all nodes in cluster

2011-08-02 Thread samal
> "ERROR 08:53:47,678 Internal error processing batch_mutate > java.lang.IllegalStateException: replication factor (3) exceeds number > of endpoints (1)" > You already answered "It always keeps showing only one node and mentions that it is handling 100% of the load." > On Tue, Aug 2, 2011 at 7:2

Re: Nodetool ring not showing all nodes in cluster

2011-08-02 Thread samal
"ERROR 08:53:47,678 Internal error processing batch_mutate >> java.lang.IllegalStateException: replication factor (3) exceeds number >> of endpoints (1)" >> > You already answered > "It always keeps showing only one node and mentions that it is handling > 100% of the load." > Cluster think only on

Re: Do we have any restrictions on the number of column families in a keyspace?

2011-08-03 Thread samal
yes for 0.7 no for 0.8

Re: Sample Cassandra project in Tomcat

2011-08-03 Thread samal
I don't know much about this, may help you.. http://www.codefreun.de/apolloUI/ http://www.codefreun.de/apollo/ On Wed, Aug 3, 2011 at 3:36 PM, CASSANDRA learner < cassandralear...@gmail.com> wrote: > Hiii, > > Can any one pleaze send me any sample application which is (.war) > implemented in j

Re: Installation Exception

2011-08-03 Thread samal
Did you compile the source code? :) You have downloaded the source, not the binary. Try the binary. On Wed, Aug 3, 2011 at 9:14 PM, Eldad Yamin wrote: > Hi, > I'm trying to install Cassandra on Amazon EC2 without success, this is what > I did: > >1. Created new "Small" EC2 instance (this is just for

Re: 5 node cluster - Recommended seed configuration.

2011-08-08 Thread samal
: seeds: node1,node2, autobootstrap=true node4: seeds: node1, node2,autobootstrap=true node5: seeds: node1,node2, autobootstrap=true /Samal On Tue, Aug 9, 2011 at 9:16 AM, Selva Kumar wrote: > We have a 5 node Cassandra cluster. We use version 0.7.4. What is the > recommende

Re: Cassandra Certification

2011-08-14 Thread samal
Does it really make sense? If yes, I think Apache Cassandra Project (ASF) should offer Open Certification. Other entity can offer courses, training materials.

Re: Apache Cassandra Hangout in Mumbai-Pune area (India)

2011-11-13 Thread samal
Let's catch up. I am available in Mumbai. Using C* in a dev env. Would love to share or hear experiences. On Fri, Nov 11, 2011 at 10:25 PM, Adi wrote: > Hey GeekTalks/any other cassandra users around Mumbai/Pune, > > I will be around Mumbai from last week of Nov through Third week of > December. I have

Re: Seeking advice on Schema and Caching

2011-11-15 Thread samal
> > I need to add 'search users' functionality to my application. (The >> trigger for fetching searched items(like google instant search) is made >> when 3 letters have been typed in). >> > >> > For this, I make a CF with String type keys. Each such key is made of >> first 3 letters of a user's na

Re: Seeking advice on Schema and Caching

2011-11-15 Thread samal
& store cols, for all users of that name, containing userId >> inside that row. That way it would have to read multiple rows while user is >> doing a single search. >> >> >> On Wed, Nov 16, 2011 at 10:47 AM, samal wrote: >> >>> >>> > I nee

Re: Seeking advice on Schema and Caching

2011-11-16 Thread samal
n have aditya name adi{ {tya,1} . . } but only few ppl will have name with x or y. On Thu, Nov 17, 2011 at 3:29 AM, Aditya wrote: > Thanks to samal who pointed to look at the composite columns. I am now > using composite columns names containing username+userId & valueless > colum
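
The layout discussed in this thread (row key = first three letters of the name, composite column = rest of the name plus userId, no value) can be sketched roughly as follows. Plain Python containers stand in for the column family; `index_user` and `search` are hypothetical helper names, and the sample users are invented.

```python
from collections import defaultdict


def index_user(index, name, user_id):
    """Store (rest of name, user id) under the row keyed by the first 3 letters."""
    name = name.lower()
    row_key, rest = name[:3], name[3:]
    index[row_key].append((rest, user_id))
    index[row_key].sort()          # columns are kept in comparator order


def search(index, typed):
    """Return (name, user id) pairs whose names start with the typed text."""
    typed = typed.lower()
    row_key, rest = typed[:3], typed[3:]
    return [
        (row_key + suffix, user_id)
        for suffix, user_id in index.get(row_key, [])
        if suffix.startswith(rest)
    ]


index = defaultdict(list)
index_user(index, "aditya", 1)
index_user(index, "adil", 2)
index_user(index, "samal", 3)
```

A search for "adi" reads the single row "adi", and typing further letters only narrows the slice inside that row.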

Re: OutOfMemory Exception during bootstrap

2011-12-04 Thread samal
Lower your heap size if you are testing multiple instances on a single machine. https://github.com/apache/cassandra/blob/trunk/conf/cassandra-env.sh#L64 On Sun, Dec 4, 2011 at 11:08 PM, Harald Falzberger wrote: > Hi, > > I'm trying to set up a test environment with 2 nodes on one physical > machin

Re: Re: Cassandra DataModeling recommendations

2011-12-05 Thread samal
metadata_column:ttl }* } 2. userIdCF:{ *user1:{ id:user1 //*hack : to prevent unwanted behavior one column with no ttl.* cart1:cart1_uuidkey:ttl cart2:ttl cart3:ttl } user2:{ id:user2 cart1:cartX_uuidkey:ttl cart2:cart4:ttl cart3:cartMttl }* } /Samal

Re: node.js library?

2011-12-05 Thread samal
On Mon, Dec 5, 2011 at 7:59 PM, Norman Maurer wrote: > As far as I know its the library that was developed by "rackspace". > > See > https://github.com/racker/node-cassandra-client > *No longer maintained. it is moved as separate project in apache-extras * 2011/12/5 Joe Stein > Hey folks, so

Re: Setting Key Validation Class

2011-12-05 Thread samal
key_validation_class is different from validation_class. Validators are BytesType by default. key_validation_class => key, default_validation_class => column values, comparator => column names. default_validation_class is the global scope of validation_class. On Mon, Dec 5, 2011 at 10:10 PM, Dinusha Dilrukshi

Re: Fwd: information on cassandra

2012-03-24 Thread samal
Cassandra has a distributed architecture, so a single node does not really fit it. It can be used that way, but you lose its benefits. That is OK if you are just playing around; use VMs to learn how the cluster communicates and handles requests. To get full fault tolerance, redundancy and consistency, a minimum of 3 nodes is required. Imp

Re: How to store a list of values?

2012-03-26 Thread samal
I would take a simple approach: create one more CF, "UserSkill", with the same row key as the profile_cf key. In the user_skill CF, add each skill as a column name with a null value. Columns can be added or removed. UserProfile={ '*ben*'={ blah :blah blah :blah blah :blah } } UserSkill={ '*ben*'={
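
The "skill as column name, empty value" idea behaves like a set per row. A minimal sketch, using a plain dict in place of the UserSkill column family (the helper names and the sample skills are assumptions made for the example):

```python
# Each row maps column names (skills) to an empty value; only the names matter.
user_skill = {}


def add_skill(key, skill):
    user_skill.setdefault(key, {})[skill] = None


def remove_skill(key, skill):
    user_skill.get(key, {}).pop(skill, None)


def skills(key):
    # Column names come back sorted, as a name comparator would order them.
    return sorted(user_skill.get(key, {}))


add_skill("ben", "java")
add_skill("ben", "cassandra")
add_skill("ben", "java")        # writing the same column again is harmless
remove_skill("ben", "cobol")    # removing a missing column is a no-op
```

Because the list lives in column names, adding or removing one skill never requires reading the others first.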

Re: How to store a list of values?

2012-03-26 Thread samal
plus it is fully compatible with CQL. SELECT * FROM UserSkill WHERE KEY='ben'; On Mon, Mar 26, 2012 at 9:13 PM, samal wrote: > I would take simple approach. create one other CF "UserSkill" with row > key same as profile_cf key, > In user_skill cf will add skill

Re: How to store a list of values?

2012-03-26 Thread samal
On Mon, Mar 26, 2012 at 9:20 PM, Ben McCann wrote: > Thanks for the reply Samal. > > I did not realize that you could store a column with null value. > values can be null or any value like [default@node] set hus['test']['wowq']='\{de\'.de\;\}\+\^

Re: How to store a list of values?

2012-03-26 Thread samal
he columns for values. You don't need the >>> column-values to hold multiple columns (the super-column principle). So a >>> normal CF would work. >>> >>> >>> 2012/3/26 Ben McCann >>> >>>> Thanks for the reply Samal. I did not

Re: How to store a list of values?

2012-03-26 Thread samal
On Tue, Mar 27, 2012 at 1:47 AM, R. Verlangen wrote: > " but any schema change will break it " > > How do you mean? You don't have to specify the columns in Cassandra so it > should work perfect. Except for the "skill~" is preserverd for your list. > In case skill~ is decided to change to skill

Re: How to store a list of values?

2012-03-27 Thread samal
Yeah, agreed; it only matters for time-bucketed data. On Tue, Mar 27, 2012 at 12:31 PM, R. Verlangen wrote: > That's true, but it does not sound like a real problem to me.. Maybe > someone else can shed some light upon this. > > > 2012/3/27 samal > >> >> >

Re: Cassandra and harddrives

2012-04-25 Thread samal
Each node needs its own HDD to hold its copies of the data; it can't share one with other nodes. On Thu, Apr 26, 2012 at 8:52 AM, Benny Rönnhager < benny.ronnha...@thrutherockies.com> wrote: > Hi! > > I am building a database with several hundred thousands of images. > have just learned that HaProxy is a very goo

timezone time series data model

2012-04-29 Thread samal
any better approach? Thanks, >>>Samal

Re: Data model question, storing Queue Message

2012-04-30 Thread samal
On Mon, Apr 30, 2012 at 4:25 PM, Morgan Segalis wrote: > Hi Aaron, > > Thank you for your answer, I was beginning to think that my question would > never be answered ;-) > > Actually, this is what I was going for, except one thing, instead of > partitioning row per month, I though about partition

Re: Data model question, storing Queue Message

2012-04-30 Thread samal
On Mon, Apr 30, 2012 at 5:52 PM, Morgan Segalis wrote: > Hi Samal, > > Thanks for the TTL feature, I wasn't aware of it's existence. > > Day's partitioning will be less wider than month partitionning (about 30 > times less give or take ;-) ) > Per day i

Re: timezone time series data model

2012-04-30 Thread samal
r of > seconds in a day). > > > On Mon, Apr 30, 2012 at 1:46 AM, samal wrote: > >> Hello List, >> >> I need suggestion/ recommendation on time series data. >> >> I have requirement where users belongs to different timezone and they can >> subscribe

Re: timezone time series data model

2012-04-30 Thread samal
console.log("Modulo i%86400=> ",i%86400); console.log("== "); },2000); Am I doing wrong? On Mon, Apr 30, 2012 at 9:54 PM, Tyler Hobbs wrote: > Correct, that's exactly what I'm saying. > > > On Mon, Apr 30, 2012 at 10:37 AM, sama

Re: timezone time series data model

2012-04-30 Thread samal
pr 30, 2012 at 10:44 PM, Tyler Hobbs wrote: > getTime() returns the number of milliseconds since the epoch, not the > number of seconds: http://www.w3schools.com/jsref/jsref_gettime.asp > > If you divide that number by 1000, it should work. > > > On Mon, Apr 30, 2012 at 11:28 AM, s

Re: timezone time series data model

2012-04-30 Thread samal
hhmm. I will try both. thanks On Mon, Apr 30, 2012 at 11:29 PM, Tyler Hobbs wrote: > Err, sorry, I should have said ts - (ts % 86400). Integer division does > something similar. > > > On Mon, Apr 30, 2012 at 12:39 PM, samal wrote: > >> thanks I didn't noticed.
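
The bucketing arithmetic from this thread, ts - (ts % 86400) on a timestamp in seconds, can be checked with a small sketch. The function names are hypothetical; the millisecond variant covers the JavaScript getTime() case mentioned in the thread, where the value has to be divided by 1000 first.

```python
SECONDS_PER_DAY = 86400


def day_bucket(ts_seconds):
    """Round a Unix timestamp in seconds down to the start of its UTC day."""
    return ts_seconds - (ts_seconds % SECONDS_PER_DAY)


def day_bucket_from_millis(ts_millis):
    """Same bucket for a millisecond timestamp (e.g. from JavaScript getTime())."""
    return day_bucket(ts_millis // 1000)


# 1335830400 is 2012-05-01 00:00:00 UTC.
midnight = 1335830400
one_hour_in = day_bucket(midnight + 3600)
last_second = day_bucket(midnight + 86399)
next_day = day_bucket(midnight + 86400)
from_millis = day_bucket_from_millis((midnight + 3600) * 1000 + 123)
```

Every timestamp inside the same UTC day maps to the same value, which is what makes it usable as a row key for a daily bucket.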

Re: timezone time series data model

2012-05-01 Thread samal
This will work. I tried both, and each gave one unique bucket per day. I just realized that if I sync all clients to one zone, then the date will remain the same for all. A single-zone date will give a materialized view per row. On Mon, Apr 30, 2012 at 11:43 PM, samal wrote: > hhmm. I will try both. thanks > > >

Re: How can I implement 'LIKE operation in SQL' on values while querying a column family in Cassandra

2012-05-15 Thread samal
'=>'abhiskek' 'name_atul'=>'atul' } Here you can do a slice query on the column name and get the desired result. /samal On Tue, May 15, 2012 at 3:29 PM, selam wrote: > Mapreduce jobs may solve your problem for batch processing > > > On Tue, May 15, 2012
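
The slice-on-column-name trick can be pictured as a range lookup over sorted names: a prefix such as "name_a" selects every column whose name starts with it. The sketch below uses Python's bisect on a plain dict; `slice_by_prefix` and the sample row are assumptions for illustration, and it only gives prefix matching, not a general SQL LIKE.

```python
from bisect import bisect_left


def slice_by_prefix(row, prefix):
    """Return (column name, value) pairs whose names start with the prefix."""
    names = sorted(row)                            # comparator order
    start = bisect_left(names, prefix)             # first name >= prefix
    end = bisect_left(names, prefix + "\uffff")    # just past the prefix range
    return [(name, row[name]) for name in names[start:end]]


row = {
    "name_abhishek": "abhishek",
    "name_atul": "atul",
    "name_samal": "samal",
}
matches = slice_by_prefix(row, "name_a")
```

This works because sorted column names that share a prefix sit next to each other, so the lookup is one contiguous slice.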

Re: Composite Column

2012-05-15 Thread samal
I have not used CC, but yes, you can. The model below is not a composite column. It is just a column with a JSON hash value. A column value can be anything you like. Dates inside the value are not indexed. On Tue, May 15, 2012 at 9:27 AM, Abhijit Chanda wrote: > Is it possible to create this data model with the help o

Re: Composite Column

2012-05-15 Thread samal
It is just column with JSON value On Tue, May 15, 2012 at 4:00 PM, samal wrote: > I have not used CC but yes you can. > Below is not composite column. It is not not column with JSON hash value. > Column value can be anything you like. > date inside value are not indexed. > >

Re: Composite Column

2012-05-16 Thread samal
n numbers of column]"* comparator=CompositeType(UTF8Type1,UTF8Type2,...,n) /Samal On Thu, May 17, 2012 at 10:40 AM, Abhijit Chanda wrote: > Aaron, > > Actually Aaron i am looking for a scenario on super columns being replaced > by composite column. > Say this is a data

Re: Composite Column

2012-05-17 Thread samal
dra-sf-2011 and Joe http://www.youtube.com/watch?v=EBjWlH4NPMA , it will help you understand data model. @samalgorai On Thu, May 17, 2012 at 12:29 PM, Abhijit Chanda wrote: > Samal, > > Thanks buddy for interpreting. Now suppose i am inserting data in a column > family using this data mod

Re: RE Ordering counters in Cassandra

2012-05-22 Thread samal
ed from iOS or Android. Key concatenation can also be used to filter more deeply: IOS#safari, android#chrome. Fewer columns will help the reverse index work more efficiently. /Samal On Mon, May 21, 2012 at 11:53 PM, Tamar Fraenkel wrote: > Indeed I took the not delete approach. If time bucket rows

Re: Number of keyspaces

2012-05-22 Thread samal
Not ideally. Cassandra now has global memtable tuning, but each CF still corresponds to memory in RAM. A year-wise CF will be in a read-only state the next year, yet its memtable will still consume RAM. On 22-May-2012 5:01 PM, "Franc Carter" wrote: > On Tue, May 22, 2012 at 9:19 PM, aaron morton wrote: > >> It's

Re: Astyanax Error

2012-05-22 Thread samal
Host not found in client. On 22-May-2012 4:34 PM, "Abhijit Chanda" wrote: > Hi All, > > Can any one suggest me why i am getting this error in Astyanax > NoAvailableHostsException: [host=None(0.0.0.0):0, latency=0(0), > attempts=0] No hosts to borrow from > > > Thanks In Advance > Abhijit >

Re: RE Ordering counters in Cassandra

2012-05-22 Thread samal
iterate over each column -- more precisly: *bunches of columns* > using slices -- and write new columns in the inversed index. > Tamar's data model is made for real time analysis. It's maybe overdesigned > for a daily ranking. > I agree with Samal, you should split your data acro

Re: supercolumns with TTL columns not being compacted correctly

2012-05-22 Thread samal
Data will remain until the next compaction but won't be available. Compaction will delete the old SSTable and create a new one. On 22-May-2012 5:47 PM, "Pieter Callewaert" wrote: > Hi, > > I’ve had my suspicions some months, but I think I am sure about it. > > Data is being written by the SST

Re: Astyanax Error

2012-05-22 Thread samal
Are you able to connect through cli? Can you share your client code? On 22-May-2012 5:59 PM, "Abhijit Chanda" wrote: > Samal, > > > But I am setting up the Host. > > On Tue, May 22, 2012 at 5:30 PM, samal wrote: > >> Host not found in client. >>

Re: Cassandra 0.8.5: Column name mystery in create column family command

2012-05-22 Thread samal
Change your comparator to utf8type. On 22-May-2012 4:32 PM, "Roshan Dawrani" wrote: > Hi, > > I use Cassandra 0.8.5 and am suddenly noticing some strange behavior. I > run a "create column family" command with some column meta-data and it runs > fine, but when I do "describe keyspace", it shows m

Re: Cassandra 0.8.5: Column name mystery in create column family command

2012-05-22 Thread samal
Type > Index Name: ACUserIdIdx > Index Type: KEYS > ---- > > On Tue, May 22, 2012 at 6:16 PM, samal wrote: > >> Change your comparator to utf8type. >> On 22-May-2012 4:32 PM, "Roshan Dawrani" wrote: >> >>> Hi, >>> &

Re: supercolumns with TTL columns not being compacted correctly

2012-05-22 Thread samal
> wrote: > >> Data will not be deleted when those keys appear in other sstables > outside of > >> compaction. This is to prevent obsolete data from appearing again. > >> > >> yuki > >> > >> On Tuesday, May 22, 2012 at 7:37 AM, Pieter Callewa

Re: Query on how to count the total number of rowkeys and columns in them

2012-05-23 Thread samal
get_range /Samal On Thu, May 24, 2012 at 11:52 AM, Prakrati Agrawal < prakrati.agra...@mu-sigma.com> wrote: > Hi > > I am trying to learn Cassandra and I have one doubt. I am using the Thrift > API, to count the number of row keys I am using KeyRange to spe

Re: Cassandra Data Archiving

2012-05-31 Thread samal
cf_bucket _data, and deleting that cf few days later, this will keep cf count fixed. current cf count=n, bucket cf count= b*n using separate cluster old data analytic. /Samal On Fri, Jun 1, 2012 at 9:58 AM, Harshvardhan Ojha < harshvardhan.o...@makemytrip.com> wrote: > Problem s

Re: Adding a new node to Cassandra cluster

2012-06-04 Thread samal
If you use the Thrift API, you have to maintain a lot of low-level code yourself, which is already polished in high-level clients (HLC) such as Hector and pycassa. Also, with an HLC you can easily switch between Thrift and the growing CQL. On Mon, Jun 4, 2012 at 3:00 PM, R. Verlangen wrote: > You might consider using a higher level

Re: about multitenant datamodel

2012-06-05 Thread samal
r may not have liberty to define their own data model. Good for fixed pattern data: logger, hits, geodata. /Samal >> >> > >> On Thu, 31 May 2012 06:44:05 +0900, aaron morton >> wrote: >> >> - Do a lot of keyspaces cause some problems? (If I have 1,000 users,

Re: How to include two nodes in Java code using Hector

2012-06-05 Thread samal
level 2 is used, both nodes must be UP to read and write. It doesn't matter which node you connect to; if your data is present in the cluster, it will be read directly or through the coordinator node. Read the Hector doc: http://hector-client.github.com/hector/build/html/documentation.html /Samal On We
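
Why both nodes must be up at that level follows from simple counting: a request succeeds only if the number of live replicas is at least the number the consistency level requires. A small sketch, assuming the usual quorum formula of (replication factor // 2) + 1; the function names are hypothetical.

```python
def quorum(replication_factor):
    """Number of replicas that make up a quorum."""
    return replication_factor // 2 + 1


def can_satisfy(required_replicas, live_replicas):
    """True if enough replicas are up for the requested consistency level."""
    return live_replicas >= required_replicas


# With two replicas, a quorum is 2, so losing either node blocks quorum requests.
two_node_quorum = quorum(2)
both_up = can_satisfy(two_node_quorum, live_replicas=2)
one_down = can_satisfy(two_node_quorum, live_replicas=1)

# Requiring only one replica still works with a single live node.
one_replica_ok = can_satisfy(1, live_replicas=1)
```

With a replication factor of 3 the quorum is still 2, which is why three nodes tolerate one failure at quorum while two nodes do not.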

Re: Supercolumn behavior on writes

2012-06-13 Thread samal
You can't 'invent' columns on the fly, everything has > to be declared when you declare the column family. >> >> That's incorrect. You can define names on the fly. Validation must be defined when declaring the CF

Re: Supercolumn behavior on writes

2012-06-13 Thread samal
at_event > ( > ac_event_id int PRIMARY KEY, > ac_event_type text, > ev_sev int, > ac_id text, > ac_creation timestamp > ) with compression_parameters:sstable_compression = '' > ; > > -g > > > > > On Wed, Jun 13, 2012 at 9:36 PM, samal > wrote:

Re: Videos of the cassandra summit starting to be posted

2010-08-17 Thread samal gorai
Thanks, Riptano group, for your support in community education. On Tue, Aug 17, 2010 at 11:15 PM, Jeremy Hanna wrote: > The videos of the cassandra summit are starting to be posted, just fyi for > those who were unable to make it out to SF. > > http://www.riptano.com/blog/slides-and-videos-cassandra-

Re: Cassandra summit video downloads?

2010-08-28 Thread samal gorai
http://www.riptano.com/blog/slides-and-videos-cassandra-summit-2010

Re: Riptano Cassandra training in Denver

2010-09-01 Thread samal gorai
It will be gr8. Samal Gorai On Thu, Sep 2, 2010 at 10:46 AM, vineet daniel wrote: > Hi Jonathan > > Any plans of coming to India in future ? > > ___ > Regards > Vineet Daniel > +918106217121 >

Re: servers for cassandra

2010-09-04 Thread samal gorai
conditions. I use 64-bit Linux (Ubuntu) with 4GB RAM, which is more than sufficient to play around. ___ Samal Gorai On Sat, Sep 4, 2010 at 12:05 PM, vineet daniel wrote: > Hi > > I am just curious to know if there is an

Re: 4k keyspaces... Maybe we're doing it wrong?

2010-09-11 Thread samal gorai
Lots of memtables means lots of SSTables, which means lots of disk I/O. On 9/7/10, Benjamin Black wrote: > On Mon, Sep 6, 2010 at 12:41 AM, Janne Jalkanen > wrote: >> >> So if I read this right, using lots of CF's is also a Bad Idea(tm)? >> > > Yes, lots of CFs is bad means lots of CFs is also bad. > --