Error a simple window example

2017-09-08 Thread philippe
ngProcessingTimeWindows.of(Time.seconds(5))) The sniper directly comes from the doc. Help welcome ! Philippe

re: Elegantly sharing state in a streaming environment

2016-05-30 Thread Philippe CAPARROY
Just transform the list in a DataStream. A datastream can be finite. One solution, in the context of a Streaming environment is to use Kafka, or any other distributed broker, although Flink ships with a KafkaSource.   1)Create a Kafka Topic dedicated to your list of key/values. Inject your va

Socket stream source in Python?

2022-01-28 Thread Philippe Rigaux
briefly mentioned in the documentation. Any help is much appreciated. Philippe

MAP data type (PyFlink)

2022-01-28 Thread Philippe Rigaux
_info' and 'value_type_info' How should I specify for instance the type for {‘url’: ‘’, ‘count’: 2} ? Thanks for your help. Philippe

Re: Socket stream source in Python?

2022-01-30 Thread Philippe Rigaux
Thank you so much. > Le 31 janv. 2022 à 01:11, Francis Conroy a > écrit : > > Hi Philippe, > after checking the source Flink master I think you're right, there is > currently no binding from python to Flink socketTextStream (via py4j) in > pyFlink. The py4j inter

Re: Hourly top-k statistics of DataStream

2016-06-09 Thread Philippe Caparroy
You should have a look at this project : https://github.com/addthis/stream-lib You can use it within Flink, storing intermediate values in a local state. > Le 9 juin 2016 à 15:29, Yukun Guo a écrit : > > Thank you very much for the detailed answer. Now I understand a DataStream > can be re

Using CustomPartitionerWrapper with KeyedStream

2016-08-12 Thread Philippe Caparroy
Hi there, It seems not possible to use some custom partitioner in the context of the KeyedStream, without modifying the KeyedStream. protected DataStream setConnectionType(StreamPartitioner partitioner) { throw new UnsupportedOperationException("Cannot override partitioning for

Re: Using CustomPartitionerWrapper with KeyedStream

2016-08-12 Thread Philippe Caparroy
rit : > > Hi Philippe, > > There is no particular reason other than hash partitioning is a > sensible default for most users. It seems like this is rarely an > issue. When the number of keys is close to the parallelism, having > idle partitions is usually not a problem due t

ProcessFunction example

2017-03-08 Thread Philippe Caparroy
I think there is an error in the code snippet describing the ProcessFunction time out example : https://ci.apache.org/projects/flink/flink-docs-release-1.2/dev/stream/process_function.html @Override public void onTimer(long timestamp, OnTimerContext ctx, Collector> out) throws

Re: deploying flink cluster in AWS - Containerized

2017-03-24 Thread Philippe Caparroy
Hi, If I can give my 2 cents. One simple solution to your problem is using weave (https://www.weave.works/) a Docker network plugin. We’ve been working for more then year with dockerized (Flink+zookeeper+Yarn+spark+Kafka+hadoop+elasticsearch ) cluster using weave. Design your docker container

Re: deploying flink cluster in AWS - Containerized

2017-03-24 Thread Philippe Caparroy
ace isolation and security policies on > these. How do this work if the flink cluster is standalone on AWS ? > > > Best Regards > CVP > > On Fri, Mar 24, 2017 at 8:49 AM, Philippe Caparroy > mailto:philippe.capar...@orange.fr>> wrote: > Hi, > > If I