Re: [ANNOUNCE] Apache Flink Python API(PyFlink) 1.9.2 released

2020-02-22 Thread Chad Dombrova
It’s an interesting idea to use the Beam SDK to execute PyFlink (i.e. non-Beam) UDFs. Thanks for the info. On Thu, Feb 20, 2020 at 5:41 PM Xingbo Huang wrote: > Hi Chad, > Beam Portability module solves the management of the Python runtime > environment and the communication between the Python r

Re: async io parallelism

2020-02-22 Thread Arvid Heise
Hi Alexey, the short answer is: order is preserved in all cases. Basically, ordered asyncIO maintains an internal FIFO queue where all pending elements reside. All async results are saved into this queue, but elements are only emitted once the head element has a result. So assume you have
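
The head-of-queue behaviour described in this preview can be illustrated with a minimal sketch in plain Java, without Flink. It is not Flink's actual implementation: the class OrderedAsyncQueueSketch and its methods submit and drain are hypothetical names chosen for illustration, and results are assumed to be strings. Each pending element gets a slot in a FIFO queue, results may complete in any order, and output only happens while the head slot has a result.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.List;
import java.util.Queue;
import java.util.concurrent.CompletableFuture;

// Hypothetical illustration of ordered async output; not Flink code.
class OrderedAsyncQueueSketch {
    // Pending elements in arrival order; each slot is filled when its async call returns.
    private final Queue<CompletableFuture<String>> pending = new ArrayDeque<>();

    // Register a new pending element and return the slot for its async result.
    public CompletableFuture<String> submit() {
        CompletableFuture<String> slot = new CompletableFuture<>();
        pending.add(slot);
        return slot;
    }

    // Emit results from the head of the queue for as long as the head is complete.
    public List<String> drain() {
        List<String> out = new ArrayList<>();
        while (!pending.isEmpty() && pending.peek().isDone()) {
            out.add(pending.poll().join());
        }
        return out;
    }

    public static void main(String[] args) {
        OrderedAsyncQueueSketch queue = new OrderedAsyncQueueSketch();
        CompletableFuture<String> a = queue.submit();
        CompletableFuture<String> b = queue.submit();
        CompletableFuture<String> c = queue.submit();

        b.complete("b");                    // second element finishes first
        System.out.println(queue.drain());  // prints [] because the head (a) is still pending

        a.complete("a");
        System.out.println(queue.drain());  // prints [a, b] in arrival order

        c.complete("c");
        System.out.println(queue.drain());  // prints [c]
    }
}
```

In this sketch a slow head element holds back every later result, which is the trade-off for preserving order.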

Flink on Kubernetes - Session vs Job cluster mode and storage

2020-02-22 Thread M Singh
Hey Folks: I am trying to figure out the options for running Flink on Kubernetes and am trying to find out the pros and cons of running in Flink Session vs Flink Cluster mode (https://ci.apache.org/projects/flink/flink-docs-stable/ops/deployment/kubernetes.html#flink-session-cluster-on-kubernete

yarn session: one JVM per task

2020-02-22 Thread David Morin
Hi, My app is based on a lib that is not thread safe (yet...). Until the patch has been pushed, how can I be sure that my Sink, which uses this lib, runs in a single JVM? Context: I use one Yarn session and send my Flink jobs to this session. Regards, David

Re: [DISCUSS] Drop Savepoint Compatibility with Flink 1.2

2020-02-22 Thread Ufuk Celebi
Hey Stephan, +1. Reading over the linked ticket and your description here, I think it makes a lot of sense to go ahead with this. Since it's possible to upgrade via intermediate Flink releases as a fail-safe I don't have any concerns. – Ufuk On Fri, Feb 21, 2020 at 4:34 PM Till Rohrmann wrote

Re: [Flink 1.10] How do I use LocalCollectionOutputFormat now that writeUsingOutputFormat is deprecated?

2020-02-22 Thread Niels Basjes
Yes that's it! My code https://github.com/nielsbasjes/yauaa/blob/master/udfs/flink/src/test/java/nl/basjes/parse/useragent/flink/TestUserAgentAnalysisMapperInline.java#L144 now does this: DataStream resultDataStream = ... List result = new ArrayList<>(5); DataStreamUtils .collect(resultData

Re: Can Connected Components run on a streaming dataset using iterate delta?

2020-02-22 Thread kant kodali
Hi, Thanks for that, but it looks like this is already available for streaming at https://github.com/vasia/gelly-streaming. I wonder why it is not part of Flink? There are no releases either. Thanks! On Tue, Feb 18, 2020 at 9:13 AM Yun Gao wrote: >Hi Kant, > > As far as I know