>>>> [...]. Sender[null] sent message of type "akka.actor.Identify".
>>>>
>>>> This reads like the machine lost network connectivity for some reason.
>>>> The tasks start failing because Kafka cannot be reached, and the TM then
>>>> [...] reach the ResourceManager.
>>>
>>> On 25/12/2019 04:34, Zhijiang wrote:
>>>
>> If you use the RocksDB state backend, it might consume extra native memory.
>> Some resource frameworks, such as YARN, kill the container if its memory
>> usage exceeds a threshold. You can also double-check whether that applies
>> in your case.
>>
--
From: John Smith
Send Time: 2019 Dec. 25 (Wed.) 03:40
To: Zhijiang
Cc: user
Subject: Re: Flink task node shut it self off.

The shutdown happened after the massive IO wait. I don't use any state.
Checkpoints are disk based...

On Mon., Dec. 23, 2019, 1:42 a.m. Zhijiang wrote:

Hi John,
Thanks for the positive comments on your Flink usage. Whether you use
at-least-once or exactly-once mode for checkpoints, it would never lose a
message during failure recovery.
Unfortunately I cannot access the logs you posted. Generally speaking, a
longer checkpoint interval would mean r...
Hi John,
In our experience, we set the checkpoint interval to 1-10 minutes and the
timeout usually to 5 * interval. Mostly we use an interval of 2 or 5 minutes
and a timeout of 10 or 20 minutes.
It depends on your data volume per second and on which windows you use.
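
For illustration only, the rule of thumb above (timeout of roughly 5 times the interval) can be sketched in plain Java. The class name `CheckpointTiming` and the constant `TIMEOUT_FACTOR` are made up for this sketch and are not part of Flink. The comments at the end show where the values would go, assuming the usual `StreamExecutionEnvironment.enableCheckpointing(long)` and `CheckpointConfig.setCheckpointTimeout(long)` calls, both of which take milliseconds.

```java
import java.time.Duration;

// Hypothetical helper, not part of Flink: derives a checkpoint timeout
// from a checkpoint interval using the "5 * interval" rule of thumb.
final class CheckpointTiming {
    // Assumption taken from the message above; tune for your own job.
    static final int TIMEOUT_FACTOR = 5;

    static Duration timeoutFor(Duration interval) {
        if (interval.isZero() || interval.isNegative()) {
            throw new IllegalArgumentException("interval must be positive");
        }
        return interval.multipliedBy(TIMEOUT_FACTOR);
    }

    public static void main(String[] args) {
        Duration interval = Duration.ofMinutes(2);
        Duration timeout = timeoutFor(interval);
        System.out.println("interval ms = " + interval.toMillis());
        System.out.println("timeout ms  = " + timeout.toMillis());
        // With Flink on the classpath, the values would be applied like this:
        //   env.enableCheckpointing(interval.toMillis());
        //   env.getCheckpointConfig().setCheckpointTimeout(timeout.toMillis());
    }
}
```

Whether 5 is the right factor depends on the data volume and windowing mentioned above, so treat it as a starting point rather than a recommendation.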

On Sat, Dec 21, 2019 at 5:26 AM, John Smith wrote:

Hi, using Flink 1.8.0
First off, I must say Flink's resiliency is very impressive: we lost a node
and never lost one message by using checkpoints and Kafka. Thanks!
The cluster is self-hosted and we use our own ZooKeeper cluster.
We have...
3 zookeepers: 4 cpu, 8GB (each)
3 job nodes: 4 cpu,