[ 
https://issues.apache.org/jira/browse/FLINK-2055?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15509756#comment-15509756
 ] 

ASF GitHub Bot commented on FLINK-2055:
---------------------------------------

Github user fhueske commented on the issue:

    https://github.com/apache/flink/pull/2332
  
    Hi @delding, I'm sorry that I did not mention this earlier, but I just 
noticed that the `HBaseSink` does not implement any logic for checkpointing and 
fault-tolerance. 
    
    The checkpointing logic of a sink and the guarantees of the external data 
store define what kind of emission guarantees a sink function can offer 
(exactly once, at-least once). In order to make this PR mergable, we need to 
figure out what kind of guarantees it provides under which assumptions / in 
which scenarios. As an example see the [documentation of the Cassandra 
sink](https://ci.apache.org/projects/flink/flink-docs-master/dev/connectors/cassandra.html).
 This is very important because users will rely on the stated guarantees.
    
    I am not very familiar with HBase and what kind of write semantics and 
transnational guarantees it provides. Can you describe which kind of guarantees 
an `HBaseSink` can achieve and how it would be integrated with Flink's 
Checkpointing mechanism? 
    
    Thanks, Fabian
    



> Implement Streaming HBaseSink
> -----------------------------
>
>                 Key: FLINK-2055
>                 URL: https://issues.apache.org/jira/browse/FLINK-2055
>             Project: Flink
>          Issue Type: New Feature
>          Components: Streaming, Streaming Connectors
>    Affects Versions: 0.9
>            Reporter: Robert Metzger
>            Assignee: Erli Ding
>
> As per : 
> http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Write-Stream-to-HBase-td1300.html



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to