IIRC, all the above support data locality from back in the MR days. Not
sure how much data you're planning to checkpoint though -- is locality
really that important for transient processor state?

On Thu, Jun 9, 2016 at 11:06 AM, CPC <acha...@gmail.com> wrote:

> Cassandra backend would be interesting especially  if flink could benefit
> from cassandra data locality. Cassandra/spark integration is using this for
> information to schedule spark tasks.
>
> On 9 June 2016 at 19:55, Nick Dimiduk <ndimi...@gmail.com> wrote:
>
> > You might also consider support for a Bigtable
> > backend: HBase/Accumulo/Cassandra. The data model should be similar
> > (identical?) to RocksDB and you get HA, recoverability, and support for
> > really large state "for free".
> >
> > On Thursday, June 9, 2016, Chen Qin <qinnc...@gmail.com> wrote:
> >
> > > Hi there,
> > >
> > > What is progress on incremental checkpointing? Does flink dev has plan
> to
> > > work on this or JIRA to track this? super interested to know.
> > >
> > > I also research and consider use rocksdbstatebackend without running
> HDFS
> > > cluster nor talk to S3. Some primitive idea is to use ZK to store /
> > notify
> > > state propagation progress and propagate via implement chain
> replication
> > on
> > > top of YARN provisioned storage node.
> > >
> > > Thanks,
> > > Chen
> > >
> >
>

Reply via email to