Hi,
I see the problem that you are bumping into as the following:
- In your previous job, you seem to be falling back to Kryo for the state
serialization.
- In your new job, you are trying to change that to use a custom serializer.
You can confirm this by looking at the stack trace of the "new st
Hi Kristoff,
case 1:
first of all Flink groups keys internally into so-called "key groups"
for reducing the management overhead. The maximum parallelism decides
about the number of key groups. When performing a rescale, the key
groups are basically distributed using some consistent hashing al
Hi Manas,
first of all, after assigning watermarks at the source level, usually
Flink operators make sure to handle the watermarks.
In case of a `union()`, the subsequent operator will increment its
internal event-time clock and emit a new watermark only if all input
streams (and their paral
Hi Robin,
this is a very good observation and maybe even unintended behavior.
Maybe Arvid in CC is more familiar with the checkpointing?
Regards,
Timo
On 02.04.20 15:37, Robin Cassan wrote:
Hi all,
I am wondering if there is a way to make a flink job fail (not cancel
it) when one or sever
Hi Gordon,
thanks for your response , So I have done a POC on state migration using
avro, it seems it works out well.
I am using custom avro serializer (with avro schema and (TypeSerializer,
TypeSerializerSnapshot) and based on that written my own custom serializer
for the scala case class that I
Hi all,
I am wondering if there is a way to make a flink job fail (not cancel it)
when one or several checkpoints have failed due to being expired (taking
longer than the timeout) ?
I am using Flink 1.9.2 and have set `
*setTolerableCheckpointFailureNumber(1)*` which doesn't do the trick.
Looking
Also
- What happens to watermarks after a union operation? Do I have to
assignTimestampsAndWatermarks() again? I guess I will have to since
multiple streams are being combined and Flink needs to know how to resolve
individual watermarks.
- What is the difference between union() and
Hi Dinesh,
Thanks for sharing the logs. There were couple of HA fixes since 1.7, e.g.
[1] and [2].
I would suggest to try Flink 1.10.
If the problem persists, could you also find the logs of the failed Job
Manager before the failover?
Best,
Andrey
[1] https://jira.apache.org/jira/browse/FLINK-14
Sorry for the copy & paste error in my earlier message. 🙄 I agree with
Robert.
On 2. Apr 2020, at 11:06, Robert Metzger wrote:
Good catch!. Yes, you can add this to FLINK-16696.
On Wed, Apr 1, 2020 at 10:59 PM Aaron Langford
wrote:
> All, it looks like the actual return structure from the API
Hi,
I'm interested with uising Managed Flink service.
does anyone has an experience with hosting long running (generally 24/7)
Flink jobs on AWS EMR?
I'm interested, is it stable enough to host long running state size
intensive job.
With EMR i have all the config, HA zookeeper part handled by
Good catch!. Yes, you can add this to FLINK-16696.
On Wed, Apr 1, 2020 at 10:59 PM Aaron Langford
wrote:
> All, it looks like the actual return structure from the API is:
>
> 1. Success
>
>> {
>> "status": {
>> "id": "completed"
>> },
>> *"operation"*: {
>> "location": "string"
>>
Hi
I have few question regarding Flink's state.
Lets say we have:
Case 1.
stream.keybBy(...).process(myProcessFunction).parallelism(3).
MyProcessFucntion uses a managed state (mapState, ListState etc). I'm using
state checkpoints.
Flink will redistribute events across 3 instances of myProcessF
Hey Laurent,
Flink developed an internal framework for executing end to end tests from
Java. Here's an example of such a test:
https://github.com/apache/flink/blob/master/flink-end-to-end-tests/flink-metrics-availability-test/src/test/java/org/pache/flink/metrics/tests/MetricsAvailabilityITCase.jav
Hi,
sorry for a long wait.
Answering our questions:
1 - yes
2 - thx
3 - rigth, understood
4 - well, in general I want to understand how this works. To be able in
future to modify my job, for example extracting cpu heavy operators to
separate tasks. Actually in my job some of my operators are ch
Thanks,
I would suggest adding my "tutorial" about using testHarnes for
AsynOperators, to the documentation. Or maybe build something based on this
use case, that could be helpful for others in the future :)
Thanks,
Krzysztof
--
Sent from: http://apache-flink-user-mailing-list-archive.2336050
16 matches
Mail list logo