Re: State in external db (dynamodb)

2016-07-25 Thread Chen Qin
to check whether eviction of (inactive/quiet) state from >>>>> memory is something that I should consider implementing, or whether Flink >>>>> already had some built-in way of doing it. >>>>> >>>>> Along the same lines, I am also wondering

Re: State in external db (dynamodb)

2016-07-25 Thread Josh
whether Flink already has >>>> means of compacting the state of a window by applying an aggregation >>>> function to the elements so-far (eg. every time window is triggered)? For >>>> example, if you are only executing a sum on the contents of the window, the >>

Re: State in external db (dynamodb)

2016-07-25 Thread Josh
ndow, >>> it only needs to store the sum. Aggregations other than "sum" might have >>> that characteristic too. I don't know if Flink is already that intelligent >>> or whether I should figure out how to aggregate window contents myself when >>> p

Re: State in external db (dynamodb)

2016-07-24 Thread Chen Qin
executing a sum on the contents of the window, the window >>> state doesn't need to store all the individual items in the window, it only >>> needs to store the sum. Aggregations other than "sum" might have that >>> characteristic too. I don't know if Fli

Re: State in external db (dynamodb)

2016-07-22 Thread Josh
ossible with something like a window fold? Another poster (Aljoscha) was >> talking about adding incremental snapshots, but it sounds like that would >> only improve the write throughput not the memory usage. >> >> Thanks again! >> Shannon Carey >> >> >

Re: State in external db (dynamodb)

2016-04-07 Thread Aljoscha Krettek
gregate window contents myself when > possible with something like a window fold? Another poster (Aljoscha) was > talking about adding incremental snapshots, but it sounds like that would > only improve the write throughput not the memory usage. > > Thanks again! > Shannon Carey

Re: State in external db (dynamodb)

2016-04-07 Thread Shannon Carey
roughput not the memory usage. Thanks again! Shannon Carey From: Stephan Ewen mailto:se...@apache.org>> Date: Wednesday, April 6, 2016 at 10:37 PM To: mailto:user@flink.apache.org>> Subject: Re: State in external db (dynamodb) Hi Shannon! Welcome to the Flink community! You are rig

Re: State in external db (dynamodb)

2016-04-06 Thread Stephan Ewen
Hi Shannon! Welcome to the Flink community! You are right, sinks need in general to be idempotent if you want "exactly-once" semantics, because there can be a replay of elements that were already written. However, what you describe later, overwriting of a key with a new value (or the same value

Re: State in external db (dynamodb)

2016-04-05 Thread Sanne de Roever
FYI Cassandra has a TTL on data: https://docs.datastax.com/en/cql/3.1/cql/cql_using/use_expire_t.html On Wed, Apr 6, 2016 at 7:55 AM, Shannon Carey wrote: > Hi, new Flink user here! > > I found a discussion on user@flink.apache.org about using DynamoDB as a > sink. However, as noted, sinks have

State in external db (dynamodb)

2016-04-05 Thread Shannon Carey
Hi, new Flink user here! I found a discussion on user@flink.apache.org about using DynamoDB as a sink. However, as noted, sinks have an at-least-once guarantee so your operations must idempotent. However, another way to go about this (and correct me if I'm wrong) is to write the state to the e