Re: Write Stream to HBase

2015-05-20 Thread Robert Metzger
Hi, sorry. I was doing too many things at the same time. I confused inputs and outputs ;) Please open a pull request for the changed method name... On Wed, May 20, 2015 at 2:44 PM, Hilmi Yildirim wrote: > createInput creates a stream out of a file and can be used for HBase, > correct? > But

Re: Write Stream to HBase

2015-05-20 Thread Hilmi Yildirim
createInput creates a stream out of a file and can be used for HBase, correct? But I do not want to read from HBase. I only want to write to HBase. For that I implemented an HBaseOutputFormat which I pass to the writeToFile method of the dataStream. Then, the results of the stream processsing a

Re: Write Stream to HBase

2015-05-20 Thread Robert Metzger
Hi, great to hear that it is working. If the PR is going to be only about adding the "write()" method, you probably don't need to open the PR. https://github.com/apache/flink/pull/521 is going to add a method called: public DataStreamSource createInput(InputFormat inputFormat, TypeInformation type

Re: Write Stream to HBase

2015-05-20 Thread Hilmi Yildirim
Hi, I've changed "writeToFile" to public and then I implemented an Outputformat to write the stream into the HBase. This is working very well. I will do later a pull request. Maybe the method name "writeToFile" should be changed in, for example, "write". Alternatively, I can create a metho

Re: Write Stream to HBase

2015-05-20 Thread Márton Balassi
I'm merging the pull request, it was blocked by the streaming operator rework so it is free to go since yesterday. I do agree that it needs some additional love before it can be on the master, but I am positive that it should be there this week. On May 20, 2015 11:16 AM, "Robert Metzger" wrote:

Re: Write Stream to HBase

2015-05-20 Thread Robert Metzger
There is this pending pull request which is addressing exactly the issues I've mentioned (wrong naming, private method): https://github.com/apache/flink/pull/521 I'll see whats blocking the PR ... On Wed, May 20, 2015 at 11:11 AM, Robert Metzger wrote: > Maybe we can also use the Batch HBase Ou

Re: Write Stream to HBase

2015-05-20 Thread Robert Metzger
Maybe we can also use the Batch HBase OutputFormat. In the DataStream API there is a private method: private DataStreamSink writeToFile(OutputFormat format, long millis) { which seems to allow batch output formats. The naming of the method seems weird because its called "toFile" but its expectin

Re: Write Stream to HBase

2015-05-20 Thread Robert Metzger
Hi, I agree with Hilmi, Flavio's examples are for batch. I'm not aware of a StreamingHBaseSink for Flink yet. I'll filed a JIRA for the feature request: https://issues.apache.org/jira/browse/FLINK-2055 Are you interested in implementing this? On Wed, May 20, 2015 at 10:50 AM, Hilmi Yildirim wro

Re: Write Stream to HBase

2015-05-20 Thread Hilmi Yildirim
Thank you Flavio, these are examples for Batch Processing. But I want to write a continuous stream into an HBase within a StreamExecutionEnvironment instead of a ExecutionEnvironment. Best Regards, Hilmi Am 20.05.2015 um 10:42 schrieb Flavio Pompermaier: I've added an example of HBase writing

Re: Write Stream to HBase

2015-05-20 Thread Flavio Pompermaier
I've added an example of HBase writing at flink-staging/flink-hbase/src/test/java/org/apache/flink/addons/hbase/example/HBaseWriteExample.java. Otherwise you can look at these 2 URLs: - https://github.com/fpompermaier/flink/blob/hbaseOutExample/flink-staging/flink-hbase/src/test/java/org/a