Re: Stable cassandra version with frozen UDTs

2017-06-26 Thread Ali Akhtar
So, which cassandra version is the most stable / production ready currently? I'm fine with reverting to 2.x if needed. On Mon, Jun 26, 2017 at 8:37 PM, Michael Shuler wrote: > On 06/26/2017 10:17 AM, Vladimir Yudovin wrote: > > > > In terms of tick-tock releases odd releases (e.g. 3.11) are bug

Re: Stable cassandra version with frozen UDTs

2017-06-26 Thread Ali Akhtar
mir Yudovin, > *Winguzone <https://winguzone.com?from=list> - Cloud Cassandra Hosting* > > > On Thu, 22 Jun 2017 10:17:15 -0400 *Ali Akhtar >* wrote > > I'm running cassandra 3.9, but it doesn't seem stable. E.g, one of my > nodes recen

Stable cassandra version with frozen UDTs

2017-06-22 Thread Ali Akhtar
I'm running cassandra 3.9, but it doesn't seem stable. E.g, one of my nodes recently crashed with the message 'org.apache.cassandra.db.commitlog.CommitLogReadHandler$CommitLogReadException: Unexpected error deserializing mutation; saved to /tmp/mutation3976606415170694683dat. This may be caused b

Cassandra cost vs an RDBMS?

2017-06-15 Thread Ali Akhtar
A client recently inquired about the costs of running Cassandra vs a traditional RDBMS like Postgres or Mysql, in the cloud. They are releasing a b2b product similar to Slack, Trello, etc which will have a free tier. And they're concerned about the costs of running it on Cassandra, and whether it

Counter being incremented extra times

2017-04-27 Thread Ali Akhtar
I have the following schema: CREATE TABLE total_volume ( team_id text, channel_id text, volume counter, PRIMARY KEY (team_id, channel_id) ); I've written an integration test, using CassandraUnit, which runs a loop 200 times and executes the query: UPDATE total_volume SET volume = volume + 1 WHER

Deserializing a json string directly to a java class using Jackson?

2017-04-11 Thread Ali Akhtar
I have a table containing a column `foo` which is a string, and is json. I have a class called `Foo` which maps to `foo_json` and can be serialized / deserialized using Jackson. Is it possible to define the column as `private Foo foo` rather than `private String foo` and manually deserializing it

Effective partition key for time series data, which allows range queries?

2017-03-27 Thread Ali Akhtar
I have a use case where the data for individual users is being tracked, and every 15 minutes or so, the data for the past 15 minutes is inserted into the table. The table schema looks like: user id, timestamp, foo, bar, etc. Where foo, bar, etc are the items being tracked, and their values over t

Grouping time series data into blocks of times

2017-03-18 Thread Ali Akhtar
I have a use case where a stream of time series data is coming in. Each item in the stream has a timestamp of when it was sent, and covers the activity that happened within a 5 minute timespan. I need to group the items together into 30 minute blocks of time. E.g, say I receive the following ite

Re: Ye old singleton debate

2017-03-15 Thread Ali Akhtar
+1. Would be awesome if this could be mocked / tested. On Thu, Mar 16, 2017 at 3:47 AM, Edward Capriolo wrote: > This question came up today: > > OK, say you mock, how do you construct a working multi-process > representation of how C* actually works from within a unit test without > running the

Re: Not timing out some queries (Java driver)

2016-12-22 Thread Ali Akhtar
The replication factor is the default - I haven't changed it. Would tweaking it help? On Thu, Dec 22, 2016 at 8:41 PM, Ali Akhtar wrote: > Vladimir, > > I'm receiving a batch of messages which are out of order, and I need to > process those messages in order. > > My

Re: Not timing out some queries (Java driver)

2016-12-22 Thread Ali Akhtar
> > Best regards, Vladimir Yudovin, > *Winguzone <https://winguzone.com?from=list> - Cloud Cassandra Hosting* > > > ---- On Thu, 22 Dec 2016 10:14:14 -0500 *Ali Akhtar >* wrote > > Is it possible to provide these options per query rather than set them >

Re: Not timing out some queries (Java driver)

2016-12-22 Thread Ali Akhtar
a Cluster -> Configuration -> SocketOptions -> setReadTimeoutMillis). > > Not sure if you can (or would want to) set them to "forever", but it's a > starting point. > > On Wed, Dec 21, 2016 at 7:10 PM, Ali Akhtar wrote: > >> I have some queries

Re: Processing time series data in order

2016-12-21 Thread Ali Akhtar
a problem > you can reread data from Kafka. > > -Jesse > > > On Dec 21, 2016, at 7:24 PM, Ali Akhtar wrote: > > > > - I'm receiving a batch of messages to a Kafka topic. > > > > Each message has a timestamp, however the messages can arrive / get > p

Processing time series data in order

2016-12-21 Thread Ali Akhtar
- I'm receiving a batch of messages to a Kafka topic. Each message has a timestamp, however the messages can arrive / get processed out of order. I.e event 1's timestamp could've been a few seconds before event 2, and event 2 could still get processed before event 1. - I know the number of messag

Not timing out some queries (Java driver)

2016-12-21 Thread Ali Akhtar
I have some queries which need to be processed in a consistent manner. I'm setting the consistently level = ALL option on these queries. However, I've noticed that sometimes these queries fail because of a timeout (2 seconds). In my use case, for certain queries, I want them to never time out and

Re: Storing videos in cassandra

2016-11-14 Thread Ali Akhtar
at some point. Please keep that > in mind no-one has any understanding to the intent on your jokes when all > they have is a 2 sentence response that is obviously not meant to be > helpful. > > Jon > > On Nov 14, 2016, at 10:25 AM, Ali Akhtar wrote: > > Excuse me?

Re: Storing videos in cassandra

2016-11-14 Thread Ali Akhtar
n > help on a ton of beginner issues. Making fun of someone for asking similar > beginner questions is not cool at all. Cut it out. > > > > On Nov 14, 2016, at 10:13 AM, Ali Akhtar wrote: > > Another solution could be to print the raw bytes to paper, and write the > pag

Re: Storing videos in cassandra

2016-11-14 Thread Ali Akhtar
Another solution could be to print the raw bytes to paper, and write the page numbers to cassandra. Playback will be challenging with this method however, unless interns are available to transcribe the papers back to a digital format. On Mon, Nov 14, 2016 at 11:06 PM, Ali Akhtar wrote: >

Re: Storing videos in cassandra

2016-11-14 Thread Ali Akhtar
The video can be written to floppy diskettes, and the serial numbers of the diskettes can be written to cassandra. On Mon, Nov 14, 2016 at 11:00 PM, Oskar Kjellin wrote: > The actual video is not stored in Cassandra. You need to use a proper > origin like s3. > > Although you can probably store

Re: Consistency when adding data to collections concurrently?

2016-11-13 Thread Ali Akhtar
ing labels. (Which will be rare, by the way.) On Sun, Nov 13, 2016 at 5:38 PM, DuyHai Doan wrote: > So problem solved! > > On Sun, Nov 13, 2016 at 1:37 PM, Ali Akhtar wrote: > >> Yeah, I am using set (not set though) >> >> On Sun, Nov 13, 2016 at 5:36 PM, DuyHai Doan

Re: Consistency when adding data to collections concurrently?

2016-11-13 Thread Ali Akhtar
= labels + ; > > It does work well with concurrent updates. > > On Sun, Nov 13, 2016 at 1:32 PM, Ali Akhtar wrote: > >> But then how would you query it? You'd need to know all the values of the >> udt, right? >> >> On Sun, Nov 13, 2016 at 5:30 PM, D

Re: Consistency when adding data to collections concurrently?

2016-11-13 Thread Ali Akhtar
But then how would you query it? You'd need to know all the values of the udt, right? On Sun, Nov 13, 2016 at 5:30 PM, DuyHai Doan wrote: > "Also can you make a UDT a clustered key?" --> yes if it's frozen > > On Sun, Nov 13, 2016 at 1:25 PM, Ali Akhtar wr

Re: Consistency when adding data to collections concurrently?

2016-11-13 Thread Ali Akhtar
assandra - > which is what you are doing. What I suggesting > above is appending more columns and not updating existing columns. > > regards > > regards > > On Sat, Nov 12, 2016 at 2:34 AM, Ali Akhtar wrote: > >> I have a table where each record contains a list of

Re: Consistency when adding data to collections concurrently?

2016-11-12 Thread Ali Akhtar
Just to be clear, doing mapper.save() will do an insert rather than an update? On Sat, Nov 12, 2016 at 9:36 PM, Andrew Tolbert wrote: > I believe you are correct that the implementation taking the Set is the > right one to use. > > On Sat, Nov 12, 2016 at 9:44 AM Ali Akhtar wrote

Re: Consistency when adding data to collections concurrently?

2016-11-12 Thread Ali Akhtar
Or it could even take Set as the first bound var: void addLabel(Set label, String id); On Sat, Nov 12, 2016 at 8:41 PM, Ali Akhtar wrote: > Andrew, > > I was thinking about setting up an accessor with that query and a bound > variable ? which binds to the instance being added, e.g

Re: Consistency when adding data to collections concurrently?

2016-11-12 Thread Ali Akhtar
p0 = session.prepare("UPDATE my_table SET labels = > labels + ? where id = ?"); > BoundStatement b0 = p0.bind(*Lists.newArrayList(value)*, 0); > session.execute(b0); > > Thanks, > Andy > > On Sat, Nov 12, 2016 at 9:02 AM, Ali Akhtar wrote: > >> Looks like the tr

Re: Consistency when adding data to collections concurrently?

2016-11-12 Thread Ali Akhtar
Looks like the trick was to use [] around the udt value literal. Any way to do this using the java driver? On Sat, Nov 12, 2016 at 7:58 PM, Ali Akhtar wrote: > Changing the double quotes to single quotes gives: > > UPDATE my_table SET labels = labels + {id: '

Re: Consistency when adding data to collections concurrently?

2016-11-12 Thread Ali Akhtar
Changing the double quotes to single quotes gives: UPDATE my_table SET labels = labels + {id: 'foo'} where id = ''; InvalidRequest: Error from server: code=2200 [Invalid query] message="Invalid user type literal for labels of type list>" On Sat, Nov 12, 2

Re: Consistency when adding data to collections concurrently?

2016-11-12 Thread Ali Akhtar
> appending or prefixing an element to a list. > > > Best regards, Vladimir Yudovin, > > *Winguzone <https://winguzone.com?from=list> - Hosted Cloud > CassandraLaunch your cluster in minutes.* > > > On Sat, 12 Nov 2016 07:57:36 -0500*Ali Akhtar >* wrote

Re: Consistency when adding data to collections concurrently?

2016-11-12 Thread Ali Akhtar
The labels collection is of the type set> , where label is a udt containing: id, name, description , all text fields. On Sat, Nov 12, 2016 at 5:54 PM, Ali Akhtar wrote: > The problem isn't just the update / insert though, right? Don't frozen > entities get overwritten complet

Re: Consistency when adding data to collections concurrently?

2016-11-12 Thread Ali Akhtar
12, 2016 at 5:50 PM, DuyHai Doan wrote: > Maybe you should use my Achilles mapper, which does generates UPDATE > statements on collections and not only INSERT > Le 12 nov. 2016 13:08, "Ali Akhtar" a écrit : > >> I am using the Java Cassandra mapper for all of these cases,

Deadlock in callbacks to async operations (Java)

2016-11-12 Thread Ali Akhtar
At https://datastax.github.io/java-driver/manual/async/ the docs say to not do any blocking operations within the callback of an async operation. This example is given as one that can cause a deadlock: ListenableFuture resultSet = Futures.transform(session, new Function() { public Resu

Re: Consistency when adding data to collections concurrently?

2016-11-12 Thread Ali Akhtar
I am using the Java Cassandra mapper for all of these cases, so my code looks like this: Item myItem = myaccessor.get( itemId ); Mapper mapper = mappingManager.create( Item.class ); myItem.labels.add( newLabel ); mapper.save( myItem ); On Sat, Nov 12, 2016 at 5:06 PM, Ali Akhtar wrote

Re: Consistency when adding data to collections concurrently?

2016-11-12 Thread Ali Akhtar
list. Please note that prepend & append > operations on list do not require this read-delete-write and thus performs > slightly better > > On Sat, Nov 12, 2016 at 11:34 AM, Ali Akhtar wrote: > >> I have a table where each record contains a list of labels. >> >> I h

Re: Consistency when adding data to collections concurrently?

2016-11-12 Thread Ali Akhtar
If I used consistency = ALL both when getting the record, and when saving the record, will that avoid the race condition? On Sat, Nov 12, 2016 at 4:26 PM, Ali Akhtar wrote: > I'm responding to a 3rd party API, so I have no control over sending the > labels together instead of one

Re: Consistency when adding data to collections concurrently?

2016-11-12 Thread Ali Akhtar
l be added to list, without > overwriting old ones. Also consider usage of SET instead of LIST to avoid > duplicates. > > Best regards, Vladimir Yudovin, > > *Winguzone <https://winguzone.com?from=list> - Hosted Cloud > CassandraLaunch your cluster in minutes.* > &g

Consistency when adding data to collections concurrently?

2016-11-12 Thread Ali Akhtar
I have a table where each record contains a list of labels. I have an endpoint which responds to new labels being added to a record by the user. Consider the following scenario: - Record X, labels = [] - User selects 2 labels, clicks a button, and 2 http requests are generated. - The server rece

Re: Having Counters in a Collection, like a map?

2016-11-09 Thread Ali Akhtar
r: >> >> CREATE TABLE cnt (id int PRIMARY KEY , value counter); >> >> Best regards, Vladimir Yudovin, >> >> *Winguzone <https://winguzone.com?from=list> - Hosted Cloud >> CassandraLaunch your cluster in minutes.* >> >> >> On W

Having Counters in a Collection, like a map?

2016-11-09 Thread Ali Akhtar
I have a use-case where I need to have a dynamic number of counters. The easiest way to do this would be to have a map where the int is the key, and the counter is the value which is incremented / decremented. E.g if something related to 5 happened, then i'd get the counter for 5 and increment / d

Re: Improving performance where a lot of updates and deletes are required?

2016-11-08 Thread Ali Akhtar
re_c.html > > Best regards, Vladimir Yudovin, > > *Winguzone <https://winguzone.com?from=list> - Hosted Cloud > CassandraLaunch your cluster in minutes.* > > > ---- On Tue, 08 Nov 2016 05:04:12 -0500*Ali Akhtar >* wrote > > I have a use case where a lot of up

Re: Improving performance where a lot of updates and deletes are required?

2016-11-08 Thread Ali Akhtar
and do a truncate of the table at the end of the > day. > > On Tue, Nov 8, 2016 at 11:04 AM, Ali Akhtar wrote: > >> I have a use case where a lot of updates and deletes to a table will be >> necessary. >> >> The deletes will be done at a scheduled time, p

Improving performance where a lot of updates and deletes are required?

2016-11-08 Thread Ali Akhtar
I have a use case where a lot of updates and deletes to a table will be necessary. The deletes will be done at a scheduled time, probably at the end of the day, each day. Updates will be done throughout the day, as new data comes in. Are there any guidelines on improving cassandra's performance

Re: Using a Set for UDTs, how is uniqueness established?

2016-11-07 Thread Ali Akhtar
ever, if > the "id" values do match, it does not guarantee that the UDT values match > since it requires that all other fields match. > > > > On Mon, Nov 7, 2016 at 1:14 PM, Ali Akhtar wrote: > >> I have a UDT which contains a text 'id' field, which should

Using a Set for UDTs, how is uniqueness established?

2016-11-07 Thread Ali Akhtar
I have a UDT which contains a text 'id' field, which should be used to establish the uniqueness of the UDT. I'd like to have a set field in a table, and I'd like to use the id of the udts to establish uniqueness. Any ideas how this can be done? Also using Java, and c* 3.7

Re: Are Cassandra writes are faster than reads?

2016-11-06 Thread Ali Akhtar
tl;dr? I just want to know if updates are bad for performance, and if so, for how long. On Mon, Nov 7, 2016 at 10:23 AM, Ben Bromhead wrote: > Check out https://wiki.apache.org/cassandra/WritePathForUsers for the > full gory details. > > On Sun, 6 Nov 2016 at 21:09 Ali Akhtar wro

Re: Are Cassandra writes are faster than reads?

2016-11-06 Thread Ali Akhtar
How long does it take for updates to get merged / compacted into the main data file? On Mon, Nov 7, 2016 at 5:31 AM, Ben Bromhead wrote: > To add some flavor as to how the commitlog implementation is so quick. > > It only flushes to disk every 10s by default. So writes are effectively > done to

Re: Using Instants for timestamps in Java mappings?

2016-11-04 Thread Ali Akhtar
va-driver/blob/3.x/driver-extras/src/main/java/ > com/datastax/driver/extras/codecs/jdk8/InstantCodec.java > > On Fri, Nov 4, 2016 at 8:39 AM, Ali Akhtar wrote: > >> Is it possible to use Instants to represent timestamp columns in java >> mappings of cassandra tables? (Using the official java driver) >> > >

Using Instants for timestamps in Java mappings?

2016-11-04 Thread Ali Akhtar
Is it possible to use Instants to represent timestamp columns in java mappings of cassandra tables? (Using the official java driver)

Re: Cannot mix counter and non counter columns in the same table

2016-11-01 Thread Ali Akhtar
gnore the count of the value >> and use a multi-part column name as it's value. >> >> For example: >> >> create column family stuff ( >> rowkey string, >> column string, >> value string. >> counter_to_ignore long, >> primary key( rowkey, colu

Re: Cannot mix counter and non counter columns in the same table

2016-11-01 Thread Ali Akhtar
the same time, > just query both tables at the same time with asynchronous queries. > > On Nov 1, 2016, at 7:29 AM, Ali Akhtar wrote: > > That's a terrible gotcha rule. > > On Tue, Nov 1, 2016 at 6:27 PM, Cody Yancey wrote: > >> In your table schema, you

Re: Cannot mix counter and non counter columns in the same table

2016-11-01 Thread Ali Akhtar
ALUES, however, must be either all counters or all non-counters. The > official example you posted conforms to this limitation. > > Thanks, > Cody > > On Nov 1, 2016 7:16 AM, "Ali Akhtar" wrote: > >> I'm not referring to the primary key, just to other co

Re: Cannot mix counter and non counter columns in the same table

2016-11-01 Thread Ali Akhtar
mns in the same table' On Tue, Nov 1, 2016 at 6:11 PM, Cody Yancey wrote: > For counter tables, non-counter types are of course allowed in the primary > key. Counters would be meaningless otherwise. > > Thanks, > Cody > > On Nov 1, 2016 7:00 AM, "Ali Akhtar"

Cannot mix counter and non counter columns in the same table

2016-11-01 Thread Ali Akhtar
In the documentation for counters: https://docs.datastax.com/en/cql/3.1/cql/cql_using/use_counter_t.html The example table is created via: CREATE TABLE counterks.page_view_counts (counter_value counter, url_name varchar, page_name varchar, PRIMARY KEY (url_name, page_name) ); Yet if I t

Specifying multiple conditions for lightweight conditions?

2016-11-01 Thread Ali Akhtar
In the following query: UPDATE project SET last_due_at = '2013-01-01 00:00:00+0200' WHERE id = '1' IF last_due_at < '2013-01-01 00:00:00+0200'; The intent is to change the value of 'last_due_at' as long as 'last_due_at' isn't already set to a later date than the one I've supplied. The problem is

Cannot restrict clustering columns by IN relations when a collection is selected by the query

2016-10-27 Thread Ali Akhtar
I have the following table schema: *CREATE TABLE ticket_by_member (* * project_id text,* * member_id text,* * ticket_id text,* * ticket ticket,* *assigned_members list,* * votes list>,* *labels list>,* * PRIMARY KEY ( project_id, member_id, ticket_id )* *);* I have a scenario wher

Re: which one of the following choices is more efficient?

2016-10-26 Thread Ali Akhtar
> > On Wed, Oct 26, 2016 at 1:12 AM, Ali Akhtar wrote: > >> Depends on the use case. No one right answer. >> >> On Wed, Oct 26, 2016 at 1:03 PM, Kant Kodali wrote: >> >>> If one were given a choice of fitting all the data into one table vs >>>

Re: which one of the following choices is more efficient?

2016-10-26 Thread Ali Akhtar
Depends on the use case. No one right answer. On Wed, Oct 26, 2016 at 1:03 PM, Kant Kodali wrote: > If one were given a choice of fitting all the data into one table vs > fitting the data into two tables while say (keeping all the runtime and > space complexity for CRUD operations the same in ei

Re: CommitLogReadHandler$CommitLogReadException: Unexpected error deserializing mutation

2016-10-24 Thread Ali Akhtar
I want some of the newer UDT features, like not needing to have frozen UDTs On Tue, Oct 25, 2016 at 6:34 AM, Ali Akhtar wrote: > 3.0.x? Isn't 3.7 stable? > > On Tue, Oct 25, 2016 at 6:32 AM, Jonathan Haddad > wrote: > >> If you're not in prod *yet*, I once ag

Re: CommitLogReadHandler$CommitLogReadException: Unexpected error deserializing mutation

2016-10-24 Thread Ali Akhtar
3.0.x? Isn't 3.7 stable? On Tue, Oct 25, 2016 at 6:32 AM, Jonathan Haddad wrote: > If you're not in prod *yet*, I once again recommend not using 3.9 for > anything serious. Use the latest 3.0.x. > > On Mon, Oct 24, 2016 at 6:19 PM Ali Akhtar wrote: > >> Stefan

Re: CommitLogReadHandler$CommitLogReadException: Unexpected error deserializing mutation

2016-10-24 Thread Ali Akhtar
a change? This would be 12397. > > If not, and if you don't mind sharing the data, or you have the steps to > reproduce it, could you please open a ticket so it can be looked at? You > need to attach the schema as well. > > On Mon, Oct 24, 2016 at 9:33 PM, Ali Akhtar wrote: >

Doing an upsert into a collection?

2016-10-24 Thread Ali Akhtar
Say I have this UDT: *CREATE TYPE rating (* * user text,* * rating int* *);* And, I have this table: *CREATE TABLE movie (* * id text,* * name text,* * ratings list>,* * PRIMARY KEY ( id )* *);* Say a user 'bob' rated a movie as a 5. Is it possible to do something like this: *UPDATE movie set

Re: CommitLogReadHandler$CommitLogReadException: Unexpected error deserializing mutation

2016-10-24 Thread Ali Akhtar
Its 'text'. Don't know the answer of the 2nd question. On Mon, Oct 24, 2016 at 6:31 PM, Jonathan Haddad wrote: > What type is board id? Is the value a tombstone? > > On Mon, Oct 24, 2016 at 1:38 AM Ali Akhtar wrote: > >> Thanks, but I did come across those, it

Re: CommitLogReadHandler$CommitLogReadException: Unexpected error deserializing mutation

2016-10-24 Thread Ali Akhtar
gt; > On Mon, Oct 24, 2016 at 10:24 AM, Ali Akhtar wrote: > >> Any workarounds that don't involve me having to figure out how to >> uninstall and re-install a different version? >> >> On Mon, Oct 24, 2016 at 1:24 PM, Ali Akhtar wrote: >> >>>

Re: CommitLogReadHandler$CommitLogReadException: Unexpected error deserializing mutation

2016-10-24 Thread Ali Akhtar
Any workarounds that don't involve me having to figure out how to uninstall and re-install a different version? On Mon, Oct 24, 2016 at 1:24 PM, Ali Akhtar wrote: > 3.9.. > > On Mon, Oct 24, 2016 at 1:22 PM, DuyHai Doan wrote: > >> Which version of C* ? There was similar

Re: CommitLogReadHandler$CommitLogReadException: Unexpected error deserializing mutation

2016-10-24 Thread Ali Akhtar
3.9.. On Mon, Oct 24, 2016 at 1:22 PM, DuyHai Doan wrote: > Which version of C* ? There was similar issues with commitlogs in tic-toc > versions. > > On Mon, Oct 24, 2016 at 4:18 AM, Ali Akhtar wrote: > >> I have a single node cassandra installation on my dev laptop, whic

CommitLogReadHandler$CommitLogReadException: Unexpected error deserializing mutation

2016-10-23 Thread Ali Akhtar
I have a single node cassandra installation on my dev laptop, which is used just for dev / testing. Recently, whenever I restart my laptop, Cassandra fails to start when I run it via 'sudo service cassandra start'. Doing a tail on /var/log/cassandra/system.log gives this log: *INFO [main] 2016-

Re: Speeding up schema generation during tests

2016-10-23 Thread Ali Akhtar
;>> As I said, when I bootstrap the server and create some keyspace, >>>> sometimes the schema is not fully initialized and when the test code tried >>>> to insert data, it fails. >>>> >>>> I did not have time to dig into the source code to find th

Re: Hadoop vs Cassandra

2016-10-23 Thread Ali Akhtar
"from a particular query" should be " from a particular country" On Sun, Oct 23, 2016 at 2:36 PM, Ali Akhtar wrote: > They can be, but I would assume that if your Cassandra data model is > inefficient for the kind of queries you want to do, Spark won't magically &

Re: Hadoop vs Cassandra

2016-10-23 Thread Ali Akhtar
done in spark right? > > On 23 Oct 2016 4:08 p.m., "Ali Akhtar" wrote: > > > > > > I would say it depends on your use case. > > > > If you need a lot of queries that require joins, or complex analytics of > the kind that Cassandra isn't suite

Re: Hadoop vs Cassandra

2016-10-23 Thread Ali Akhtar
ll need to do, duplicating data - doing extra writes for faster reads) , then Cassandra should work for you. It is easier to setup and do dev ops with, in my experience. On Sun, Oct 23, 2016 at 2:05 PM, Welly Tambunan wrote: > I mean. HDFS and HBase. > > On Sun, Oct 23, 2016 at 4:00 PM

Re: Hadoop vs Cassandra

2016-10-23 Thread Ali Akhtar
By Hadoop do you mean HDFS? On Sun, Oct 23, 2016 at 1:56 PM, Welly Tambunan wrote: > Hi All, > > I read the following comparison between hadoop and cassandra. Seems the > conclusion that we use hadoop for data lake ( cold data ) and Cassandra for > hot data (real time data). > > http://www.dat

Re: What is the maximum value of Cassandra Counter Column?

2016-10-23 Thread Ali Akhtar
It seems obvious. On Sun, Oct 23, 2016 at 1:15 PM, Kant Kodali wrote: > where does it say counter is implemented as long? > > On Sun, Oct 23, 2016 at 1:13 AM, Ali Akhtar wrote: > >> Probably: https://docs.oracle.com/javase/8/docs/api/java/lan >> g/Long.html#MAX_VALUE &g

Re: What is the maximum value of Cassandra Counter Column?

2016-10-23 Thread Ali Akhtar
Probably: https://docs.oracle.com/javase/8/docs/api/java/lang/Long.html#MAX_VALUE On Sun, Oct 23, 2016 at 1:12 PM, Kant Kodali wrote: > What is the maximum value of Cassandra Counter Column? >

Re: Speeding up schema generation during tests

2016-10-19 Thread Ali Akhtar
omething really stupid and simple to fix. If you want to >> investigate and try out my CassandraDaemon server, I'd be happy to get >> feedbacks >> >> On Wed, Oct 19, 2016 at 9:22 AM, Ali Akhtar wrote: >> >>> Thanks. I've disabled durable writes

Re: Speeding up schema generation during tests

2016-10-19 Thread Ali Akhtar
> (CREATE KEYSPACE ... WITH durable_write=false) > > On Wed, Oct 19, 2016 at 3:24 AM, Ali Akhtar wrote: > >> Is there a way to speed up the creation of keyspace + tables during >> integration tests? I am using an RF of 1, with SimpleStrategy, but it still >> takes upto 10-15 seconds. >> > >

Speeding up schema generation during tests

2016-10-18 Thread Ali Akhtar
Is there a way to speed up the creation of keyspace + tables during integration tests? I am using an RF of 1, with SimpleStrategy, but it still takes upto 10-15 seconds.

Re: mapper.save() throws a ThreadPool error (Java)

2016-10-11 Thread Ali Akhtar
Uh, yeah, I'm a moron. I was doing this inside a try/catch block, and the class containing my session was autoclosing the session at the end of the try/ catch (i.e try (Environment env = new Environment() ). Nvm, I'm an idiot On Tue, Oct 11, 2016 at 8:29 PM, Ali Akhtar wrote: &

Re: mapper.save() throws a ThreadPool error (Java)

2016-10-11 Thread Ali Akhtar
This is a little urgent, so any help would be greatly appreciated. On Tue, Oct 11, 2016 at 8:22 PM, Ali Akhtar wrote: > I'm creating a session, connecting to it, then creating a > mappingManager(), then obtaining a mapper for MyPojo.class > > If I then try to do mapper.save(m

mapper.save() throws a ThreadPool error (Java)

2016-10-11 Thread Ali Akhtar
I'm creating a session, connecting to it, then creating a mappingManager(), then obtaining a mapper for MyPojo.class If I then try to do mapper.save(myPojo), I get the following stacktrace: Oct 11, 2016 8:16:26 PM com.google.common.util.concurrent.ExecutionList executeListener SEVERE: RuntimeExce

Re: Java Driver - Specifying parameters for an IN() query?

2016-10-11 Thread Ali Akhtar
Y ((id, category), val1, val2) > ); > > SELECT * FROM user WHERE id = 1 AND type IN ('user', 'admin') AND val1 = > 3 AND val2 IN ('a', 'v', 'd'); > > On Tue, 11 Oct 2016 at 07:11 Ali Akhtar wrote: > > Do you send the values t

Re: Java Driver - Specifying parameters for an IN() query?

2016-10-11 Thread Ali Akhtar
the brackets around the question mark) > > regards, > Ch > > On Tue, Oct 11, 2016 at 3:14 PM, Ali Akhtar wrote: > >> If I wanted to create an accessor, and have a method which does a query >> like this: >> >> 'Select * from my_table WHERE pk = ? And ck IN

Re: Java Driver - Specifying parameters for an IN() query?

2016-10-11 Thread Ali Akhtar
umn in the > full primary key. > > Example: > > 'Select * from my_table WHERE pk = 'test' And ck IN (1, 2)' > > > On Tue, 11 Oct 2016 at 06:15 Ali Akhtar wrote: > >> If I wanted to create an accessor, and have a method which does a query >>

Java Driver - Specifying parameters for an IN() query?

2016-10-11 Thread Ali Akhtar
If I wanted to create an accessor, and have a method which does a query like this: 'Select * from my_table WHERE pk = ? And ck IN (?)' And there were multiple options that could go inside the IN() query, how can I specify that? Will it e.g, let me pass in an array as the 2nd variable?

Re: Being asked to use frozen for UDT in 3.9

2016-10-10 Thread Ali Akhtar
Is it possible to use fields on the UDT as primary / cluster keys? On Tue, Oct 11, 2016 at 9:49 AM, Ali Akhtar wrote: > Yeah, you're right, it does work if I run it thru cqlsh. I was using > DevCenter which shows that error. > > On Tue, Oct 11, 2016 at 9:48 AM, Andrew Tolber

Re: Being asked to use frozen for UDT in 3.9

2016-10-10 Thread Ali Akhtar
> On Mon, Oct 10, 2016 at 11:25 PM Ali Akhtar wrote: > >> CREATE TYPE test ( >> foo text, >> bar text >> ); >> >> CREATE TABLE test_table ( >> id text, >> this_doesnt_work test, >> PRIMARY KEY (id) >> ); >> >> On Tue,

Re: NamingStrategy for the Java Driver for camelCase / snake_case conversion?

2016-10-10 Thread Ali Akhtar
ble > annotated class. > > Thanks, > Andy > > > > On Mon, Oct 10, 2016 at 11:23 PM Ali Akhtar wrote: > >> Thanks. >> >> Btw, is it possible to use UDTs and have them mapped via the java driver? >> If so, how does that work - do I just create a pojo for t

Re: Being asked to use frozen for UDT in 3.9

2016-10-10 Thread Ali Akhtar
gt;> the primary key? If that is the case it still needs to be frozen (the same >> goes for list, set, tuple as part of primary key). This is the error I get >> when I try that: >> >> InvalidRequest: Error from server: code=2200 [Invalid query] >> message="In

Re: NamingStrategy for the Java Driver for camelCase / snake_case conversion?

2016-10-10 Thread Ali Akhtar
/JAVA-1316> for this. > > Thanks, > Andy > > > > On Mon, Oct 10, 2016 at 10:30 PM Ali Akhtar wrote: > >> Please fix this. >> >> >> >> On Tue, Oct 11, 2016 at 8:28 AM, Andrew Tolbert < >> andrew.tolb...@datastax.com> wrote: >> >&

Re: Being asked to use frozen for UDT in 3.9

2016-10-10 Thread Ali Akhtar
t > when I try that: > > InvalidRequest: Error from server: code=2200 [Invalid query] > message="Invalid non-frozen user-defined type for PRIMARY KEY component > basics" > > Andy > > On Mon, Oct 10, 2016 at 8:27 PM Ali Akhtar wrote: > >> According to http:/

Re: NamingStrategy for the Java Driver for camelCase / snake_case conversion?

2016-10-10 Thread Ali Akhtar
name being mapped by that field. > > Thanks, > Andy > > On Mon, Oct 10, 2016 at 8:03 PM Ali Akhtar wrote: > >> In working with Jackson, it has a NamingStrategy which lets you >> automatically map snake_case fields in json to camelCase fields on the Java >> cla

Being asked to use frozen for UDT in 3.9

2016-10-10 Thread Ali Akhtar
According to http://docs.datastax.com/en/cql/3.3/cql/cql_using/useCreateUDT.html > In Cassandra 3.6 and later, the frozen keyword is not required for UDTs that contain only non-collection fields. However if I create a type with 4-5 all text fields, and try to use that type in another table, I ge

NamingStrategy for the Java Driver for camelCase / snake_case conversion?

2016-10-10 Thread Ali Akhtar
In working with Jackson, it has a NamingStrategy which lets you automatically map snake_case fields in json to camelCase fields on the Java class. Last time I worked w/ Cassandra, I didn't find anything like that, and had to define an @Column annotation for each field. Please tell me this has cha

Re: Ordering by multiple columns?

2016-10-10 Thread Ali Akhtar
and understand the basics before start using it. Otherwise you could easily > abuse it inadvertently. > > Praveen > > On Oct 10, 2016, at 6:22 PM, Ali Akhtar wrote: > > E.g if I wanted to select * from foo where last_updated <= ? > > In this case, (I believe) last_u

Re: Ordering by multiple columns?

2016-10-10 Thread Ali Akhtar
E.g if I wanted to select * from foo where last_updated <= ? In this case, (I believe) last_updated will have to be a clustering key. If the record got updated and I wanted to update last_updated accordingly, that's a bad idea? :S On Tue, Oct 11, 2016 at 3:19 AM, Ali Akhtar wrote: &g

Re: Ordering by multiple columns?

2016-10-10 Thread Ali Akhtar
I wasn’t able to find this information in the official documentation but >> it’s quite well described here: >> http://stackoverflow.com/questions/35708118/where-and- >> order-by-clauses-in-cassandra-cql >> >> Thanks, >> Mikhail >> >> On 10 Oct 2016, at

Re: Where to change the datacenter name?

2016-10-10 Thread Ali Akhtar
values > and snitch settings and there is a risk of node reporting invalid/ missing > data to client. > > > > On Mon, Oct 10, 2016 at 4:08 PM, Ali Akhtar wrote: > >> So I see this: >> >> cluster_name: 'Test Cluster' >> >> But when I grep -i

Re: Where to change the datacenter name?

2016-10-10 Thread Ali Akhtar
dc.properties. So my question is, which file contains 'datacenter1'? On Tue, Oct 11, 2016 at 12:54 AM, Adam Hutson wrote: > There is a cluster name in the cassandra.yaml for naming the cluster, aka > data center. Then you assign keyspaces to the data center within the CREATE &g

Where to change the datacenter name?

2016-10-10 Thread Ali Akhtar
Where can I change the default name 'datacenter1'? I've looked through the configuration files in /etc/cassandra , and can't find where this value is being defined.

Re: Ordering by multiple columns?

2016-10-10 Thread Ali Akhtar
ct 9, 2016 at 2:04 AM, Ali Akhtar wrote: > >> Is it possible to have multiple clustering keys in cassandra, or some >> other way to order by multiple columns? >> >> For example, say I have a table of songs, and each song has a rating and >> a date. >> >>

Doing a calculation in a query?

2016-10-10 Thread Ali Akhtar
I have a table for tracking orders. Each order has an `ordered_at` field (can be a timestamp, or a long with the milliseconds of the timestamp) and `shipped_at` field (ditto, timestamp or long). orderd_at tracks when the order was made. shipped_at tracks when the order was shipped. When retrievi

  1   2   >