date:20180712

Re: External checkpoint metadata in Flink 1.5.x

2018-07-12 Thread Aljoscha Krettek

Out of curiosity, how will you work around it? And how is it easier for your tooling if checkpoints are in a central location? Best, Aljoscha > On 12. Jul 2018, at 17:55, Gyula Fóra wrote: > > Hi! > > Well it depends on how we look at it FLINK-5627 >

答复: 答复: TumblingProcessingTimeWindow emits extra results for a same window

2018-07-12 Thread Yuan,Youjun

Hi Hequn, I am using Flink 1.4. The job was running with 1 parallelism. I don’t think the extra records are caused by different keys, because: 1. I ran 2 jobs consuming the same source, jobA with 2-minute window, and job with 4-minute window. If there are wired keys, then jobA will get no m

Re: high availability with automated disaster recovery using zookeeper

2018-07-12 Thread Scott Kidder

I've used a multi-datacenter Consul cluster used to coordinate service-discovery. When a service starts up in the primary DC, it registers itself in Consul with a key that has a TTL that must be periodically renewed. If the service shuts down or terminates abruptly, the key expires and is removed f

Taskmanager SSL fails looking for Subject Alternative IP Address

2018-07-12 Thread PACE, JAMES

I have the following SSL configuration for a 3 node HA flink cluster: #taskmanager.data.ssl.enabled: false security.ssl.enabled: true security.ssl.keystore: /opt/app/certificates/server-keystore.jks security.ssl.keystore-password: security.ssl.key-password: security.ssl.truststore: /opt/app/cert

RE: Regarding the use of RichAsyncFunction as a FlatMap

2018-07-12 Thread Martin, Nick

Regarding the ‘partiality’ you mention in option one, wouldn’t you have to give that up anyway to maintain exactly once processing? Suppose input message A results in asynchronous queries/futures B and C, and imagine the following series of events: 1. Your function receives A 2. As

Loading Rules from compacted Kafka Topic - open() vs Connected Streams

2018-07-12 Thread vijayakumar palaniappan

Hello All, I can think of two options of implementing below requirement and request some guidance on choosing the option with pros and cons. Requirements: - A in memory rules cache to be loaded from log compacted kafka topic. This cache has to be loaded prior to arrival of events. - Updates to th

Re: Need assistance : creating remote environment

2018-07-12 Thread Mohan mohan

Yep, that is due to version mismatch. Working fine now. Thank you Chesnay Schepler. On Wed, Jul 11, 2018 at 6:05 PM Chesnay Schepler wrote: > Based on the logs your client is using the RestClusterClient, which means > that the client is either > a) running 1.4 with the flip6 profile enabled > b

Re: External checkpoint metadata in Flink 1.5.x

2018-07-12 Thread Gyula Fóra

Hi! Well it depends on how we look at it FLINK-5627 is not necessarily the current behaviour. You still can't really specify the exact location from within the job as it now goes to a checkpoint specific place determined by the checkpoint dir. I

Regarding the use of RichAsyncFunction as a FlatMap

2018-07-12 Thread Konstantinos Barmpis

I was wondering if there is a way to create an asynchronous flatmap function in Flink. As far as I am aware, the asynchronous function only accepts a single result future as its return (which can be the aggregate list of the flatmap, but then partiality is lost as we have to wait for all of the re

Re: 答复: TumblingProcessingTimeWindow emits extra results for a same window

2018-07-12 Thread Hequn Cheng

Hi Yuan, Haven't heard about this before. Which flink version do you use? The cause may be: 1. userId not 100% identical, for example contains invisible characters. 2. The machine clock vibrated. Otherwise, there are some bugs we don't know. Best, Hequn On Thu, Jul 12, 2018 at 8:00 PM, Yuan,Yo

Re: How to create User Notifications/Reminder ?

2018-07-12 Thread shyla deshpande

Thanks a lot Dawid and Hequn. Dawid, the link you provided is very useful. Thanks shyla On Thu, Jul 12, 2018 at 5:59 AM, Dawid Wysakowicz < wysakowicz.da...@gmail.com> wrote: > Hi shyla, > > It should be doable with CEP. You can create pattern like: > Pattern.begin("start").next/followedBy("end"

Re: How to create User Notifications/Reminder ?

2018-07-12 Thread anna stax

Thanks Hequn and Dawid for your input. Thanks Dawid for the link. Great help! On Thu, Jul 12, 2018 at 5:59 AM, Dawid Wysakowicz < wysakowicz.da...@gmail.com> wrote: > Hi shyla, > > It should be doable with CEP. You can create pattern like: > Pattern.begin("start").next/followedBy("end").where(.

Re: External checkpoint metadata in Flink 1.5.x

2018-07-12 Thread Aljoscha Krettek

Hi, does that mean https://issues.apache.org/jira/browse/FLINK-5627 is no longer relevant for you, since it seems to request the behaviour that we have now? But yes, I think it's currently not possible (with out-of-box functionality) to write

Re: Flink job hangs using rocksDb as backend

2018-07-12 Thread Stefan Richter

Hi, Did you check the metrics for the garbage collector? Stuck with high CPU consumption and lots of timers sound like there could be a possible problem, because timer are currently on-heap objects, but we are working on RocksDB-based timers right now. Best, Stefan > Am 12.07.2018 um 14:54 sc

Re: How to create User Notifications/Reminder ?

2018-07-12 Thread Dawid Wysakowicz

Hi shyla, It should be doable with CEP. You can create pattern like: Pattern.begin("start").next/followedBy("end").where(...).within(/* two weeks*/) and subscribe for timed out events. You can check very similar example here[1]. Best, Dawid [1] https://github.com/dataArtisans/flink-training-exe

Re: Flink job hangs using rocksDb as backend

2018-07-12 Thread shishal singh

Thanks Stefan/Stephan/Nico, Indeed there are 2 problem. For the 2nd problem ,I am almost certain that explanation given by Stephan is the true as in my case as there number of timers are in millions. (Each for different key so I guess coalescing is not an option for me). If I simplify my problem,

Re: How to create User Notifications/Reminder ?

2018-07-12 Thread Hequn Cheng

Hi shyla, Considering window, I think it is not very convenient. Two weeks window is used to process data in the recent 2 weeks while you want to process data beyond 2 weeks. I'm not familiar with CEP, but it sounds like a good idea. Best, Hequn On Thu, Jul 12, 2018 at 10:56 AM, shyla deshpande

External checkpoint metadata in Flink 1.5.x

2018-07-12 Thread Gyula Fóra

Hi, It seems that the behaviour to store the checkpoint metadata files for externalized checkpoints changed from 1.4 to 1.5 and the docs seem to be incorrectly saying that: "state.checkpoints.dir: The target directory for meta data of externalized checkpoints

答复: TumblingProcessingTimeWindow emits extra results for a same window

2018-07-12 Thread Yuan,Youjun

Hi Timo, This problem happens 4-5 times a day on our online server, with ~15k events per second load, and it is using PROCESSING time. So I don’t think I can stably reproduce the issue on my local machine. The user ids are actually the same, I have doubled checked that. Now, I am wondering coul

Re: flink 1.5 Rest API issues

2018-07-12 Thread Will Du

Thank you for the response. The issue is caused by rest API use comma , as separator which breaks the code. Also, single quote is somehow removed by rest call process. Sent from my iPhone will > On Jul 12, 2018, at 01:05, Gary Yao wrote: > > Hi Will, > > Is the HTTP request hanging, or is th

Re: high availability with automated disaster recovery using zookeeper

2018-07-12 Thread Till Rohrmann

Hi Tovi, you can define hard host attribute constraints for the TaskManagers. See the configuration section [1] for more information. If you want to run the JobManager/cluster entry point on Mesos as well, then I recommend starting it with Marathon [2]. This will also give you HA for the master p

Re: State backend configuration

2018-07-12 Thread Aljoscha Krettek

Hi James, the configuration options were consolidates a bit for Flink 1.5. What was called state.backend.fs.checkpointdir before was the same as state.checkpoints.dir, we now made this the same for the two backends. Also, the file-backend does not need local storage on the worker nodes, that's

Re: TumblingProcessingTimeWindow emits extra results for a same window

2018-07-12 Thread Timo Walther

Hi Yuan, this sounds indeed weird. The SQL API uses regular DataStream API windows underneath so this problem should have come up earlier if this is problem in the implementation. Does this behavior reproducible on your local machine? One thing that comes to my mind is that the "userId"s mig

Re: REST API time out ( flink 1.5 ) on SP

2018-07-12 Thread Till Rohrmann

Hi, akka.jvm-exit-on-fatal-error says whether Akka will terminate the JVM if it encounters a fatal error (e.g. OOM). This has nothing todo with quarantining. The option is relevant for the legacy and new (FLIP-6) code base. taskmanager.exit-on-fatal-akka-error is the configuration option which en

State backend configuration

2018-07-12 Thread James Isaac

Comparing the state backend configurations between Flink 1.4 and 1.5, I have a question: In Flink 1.5, we have a *state.checkpoints.dir* location and a *state.backend.rocksdb.localdir*. It was the same in Flink 1.4. However, in the case of filesystem backends, we have *state.backend.fs.checkpointd

Re: Flink job hangs using rocksDb as backend

2018-07-12 Thread Stefan Richter

Hi, adding to what has already been said, I think that here can be two orthogonal problems here: i) why is your job slowing down/getting stuck? and ii) why is cancellation blocked? As for ii) I think Stephan already gave to right reason that shutdown could take longer and that is what gets the

TumblingProcessingTimeWindow emits extra results for a same window

2018-07-12 Thread Yuan,Youjun

Hi community, I have a job which counts event number every 2 minutes, with TumblingWindow in ProcessingTime. However, it occasionally produces extra DUPLICATED records. For instance, for timestamp 153136848 below, it emits a normal result (cnt=1641161), and then followed by a few more recor

Re: External checkpoint metadata in Flink 1.5.x

答复: 答复: TumblingProcessingTimeWindow emits extra results for a same window

Re: high availability with automated disaster recovery using zookeeper

Taskmanager SSL fails looking for Subject Alternative IP Address

RE: Regarding the use of RichAsyncFunction as a FlatMap

Loading Rules from compacted Kafka Topic - open() vs Connected Streams

Re: Need assistance : creating remote environment

Re: External checkpoint metadata in Flink 1.5.x

Regarding the use of RichAsyncFunction as a FlatMap

Re: 答复: TumblingProcessingTimeWindow emits extra results for a same window

Re: How to create User Notifications/Reminder ?

Re: How to create User Notifications/Reminder ?

Re: External checkpoint metadata in Flink 1.5.x

Re: Flink job hangs using rocksDb as backend

Re: How to create User Notifications/Reminder ?

Re: Flink job hangs using rocksDb as backend

Re: How to create User Notifications/Reminder ?

External checkpoint metadata in Flink 1.5.x

答复: TumblingProcessingTimeWindow emits extra results for a same window

Re: flink 1.5 Rest API issues

Re: high availability with automated disaster recovery using zookeeper

Re: State backend configuration

Re: TumblingProcessingTimeWindow emits extra results for a same window

Re: REST API time out ( flink 1.5 ) on SP

State backend configuration

Re: Flink job hangs using rocksDb as backend

TumblingProcessingTimeWindow emits extra results for a same window

27 matches

Site Navigation

Mail list logo

Footer information