subject:"State Backend"

Re: ForSt State backend seem to try to download all state locally

2025-04-05 Thread Gyula Fóra

state clipping may > involve too much file rewriting affecting the speed. I'll do a similar > experiment. > > > Best, > Zakelly > > On Fri, Apr 4, 2025 at 4:28 PM Gyula Fóra wrote: > >> Hi All! >> >> I am experimenting with the ForSt state backend on 2

Re: ForSt State backend seem to try to download all state locally

2025-04-04 Thread gyula . fora

Hi!This job is definitely using the old , sync data access. Where is this limitation mentioned in the docs? It sounds a bit strange that a fundamental behavior of the state backend depends on this. I assumed without the new async api it would be slower but the general characteristics of remote

Re: ForSt State backend seem to try to download all state locally

2025-04-04 Thread Zakelly Lan

and see if it could start >> running immediately? And it would be great if you could provide some logs >> from the taskmanager during the restore. I suspect that state clipping may >> involve too much file rewriting affecting the speed. I'll do a similar >> experiment.

Re: ForSt State backend seem to try to download all state locally

2025-04-04 Thread Zakelly Lan

(I missed to cc user ML so I re-send this) Hi Gyula, Just because the Sync mode is basically inherited from rocksdb, and the async mode is a completely new code path. You are right, the state backend should have the remote storage even for synchronous state access. I'll open a ticket for

Re: ForSt State backend seem to try to download all state locally

2025-04-04 Thread Zakelly Lan

great if you could provide some logs from the taskmanager during the restore. I suspect that state clipping may involve too much file rewriting affecting the speed. I'll do a similar experiment. Best, Zakelly On Fri, Apr 4, 2025 at 4:28 PM Gyula Fóra wrote: > Hi All! > > I am experiment

ForSt State backend seem to try to download all state locally

2025-04-04 Thread Gyula Fóra

Hi All! I am experimenting with the ForSt state backend on 2.0.0 and I noticed the following thing. If I have a job with a larger state, let's say 500GB and now I want to start the job with a lower parallelism on a single TaskManager, the job will simply not start a

Question about large windows and RocksDB state backend

2025-02-20 Thread Gabriele Mencagli

falling within the boundaries of each triggered window. We are interested in running the application with the RocksDB embedded state backend to reduce the memory footprint. We would like to know if there is information available about how windows are stored and represented in RocksDB, whether

Re: Redis as a State Backend

2024-02-14 Thread David Morávek

ce mechanism, beyond what can be implemented natively in the heap-state backend. It can do a full snapshot of a database (costly, we need incremental, ...) or a text-based append log (changelog state backend). All data needs to fit in memory. In text form. No compression (you can of course com

Re: Redis as a State Backend

2024-01-31 Thread David Anderson

When it comes to decoupling the state store from Flink, I suggest taking a look at FlinkNDB, which is an experimental state backend for Flink that puts the state into an external distributed database. There's a Flink Forward talk [1] and a master's thesis [2] available.

Re: Redis as a State Backend

2024-01-31 Thread Chirag Dewan via user

Thanks Zakelly and Junrui. I was actually exploring RocksDB as a state backend and I thought maybe Redis could offer more features as a state backend. For e.g. maybe state sharing between operators, geo-red of state, partitioning etc. I understand these are not native use cases for Flink, but

Re: Redis as a State Backend

2024-01-30 Thread Zakelly Lan

tter performance compared with Redis I > guess. > > > Best, > Zakelly > > On Tue, Jan 30, 2024 at 2:15 PM Chirag Dewan via user < > user@flink.apache.org> wrote: > >> Hi, >> >> I was looking at the FLIP-254: Redis Streams Connector and I was &g

Re: Redis as a State Backend

2024-01-30 Thread Zakelly Lan

erformance compared with Redis I guess. Best, Zakelly On Tue, Jan 30, 2024 at 2:15 PM Chirag Dewan via user wrote: > Hi, > > I was looking at the FLIP-254: Redis Streams Connector and I was > wondering if Flink ever considered Redis as a state backend? And if yes, > why was

Re: Redis as a State Backend

2024-01-29 Thread Junrui Lee

Hi Chirag, Indeed, the possibility of using Redis as a state backend for Flink has been considered in the past. You can find a detailed discussion about this topic in the JIRA issue FLINK-3035[1] as well as in the comments section of this PR[2]. The outcome of these discussions was that Redis is

Redis as a State Backend

2024-01-29 Thread Chirag Dewan via user

Hi, I was looking at the FLIP-254: Redis Streams Connector and I was wondering if Flink ever considered Redis as a state backend? And if yes, why was it discarded compared to RocksDB? If someone can point me towards any deep dives on why RocksDB is a better fit as a state backend, it would be

Re: RocksDB State Backend GET returns null intermittently

2023-06-27 Thread Prabhu Joseph

zer, as this is one of the >> main differences between RocksDBStateBackend and HashMapStateBackend >> (HashMapStateBackend does not perform serialization and deserialization). >> >> On Wed, Jun 21, 2023 at 3:44 PM Prabhu Joseph >> wrote: >> >>> Hi, >>> >>&

Re: RocksDB State Backend GET returns null intermittently

2023-06-27 Thread Alexander Fedulov

and HashMapStateBackend > (HashMapStateBackend does not perform serialization and deserialization). > > On Wed, Jun 21, 2023 at 3:44 PM Prabhu Joseph > wrote: > >> Hi, >> >> RocksDB State Backend GET call on a key that was PUT into the state like >> 100 ms earli

Re: RocksDB State Backend GET returns null intermittently

2023-06-24 Thread Hangxiang Yu

one of the main differences between RocksDBStateBackend and HashMapStateBackend (HashMapStateBackend does not perform serialization and deserialization). On Wed, Jun 21, 2023 at 3:44 PM Prabhu Joseph wrote: > Hi, > > RocksDB State Backend GET call on a key that was PUT into the state like

RocksDB State Backend GET returns null intermittently

2023-06-21 Thread Prabhu Joseph

Hi, RocksDB State Backend GET call on a key that was PUT into the state like 100 ms earlier but is not returned intermittently. The issue never happened with the HashDB State backend. We are trying to increase block cache size, write buffer size, and enable bloom filter as per the doc: - https

Re: Could not restore keyed state backend for KeyedProcessOperator

2022-12-15 Thread Lars Skjærven

.x. Last time was during a node upgrade, but that was not >>>>> the case this time. >>>>> >>>>> I've not been able to reproduce this issue. I've checked that I can >>>>> kill the taskmanager and jobmanager (using kubectl delete po

Re: Could not restore keyed state backend for KeyedProcessOperator

2022-12-14 Thread Lars Skjærven

node upgrade, but that was not >>>> the case this time. >>>> >>>> I've not been able to reproduce this issue. I've checked that I can >>>> kill the taskmanager and jobmanager (using kubectl delete pod), and the job >>>> restores as expect

Re: Could not restore keyed state backend for KeyedProcessOperator

2022-12-13 Thread Martijn Visser

(using kubectl delete pod), and the job >>> restores as expected. >>> >>> The job is running with kubernetes high availability, rocksdb and >>> incremental checkpointing. >>> >>> Any tips are highly appreciated. >>> >>> Thanks, &g

Re: Could not restore keyed state backend for KeyedProcessOperator

2022-12-09 Thread Lars Skjærven

using kubectl delete pod), and the job >> restores as expected. >> >> The job is running with kubernetes high availability, rocksdb and >> incremental checkpointing. >> >> Any tips are highly appreciated. >> >> Thanks, >> Lars >> >> Caused by

Re: Could not restore keyed state backend for KeyedProcessOperator

2022-12-08 Thread Hangxiang Yu

tal checkpointing. > > Any tips are highly appreciated. > > Thanks, > Lars > > Caused by: org.apache.flink.util.FlinkException: Could not restore keyed > state backend for > KeyedProcessOperator_bf374b55

Could not restore keyed state backend for KeyedProcessOperator

2022-12-08 Thread Lars Skjærven

. Any tips are highly appreciated. Thanks, Lars Caused by: org.apache.flink.util.FlinkException: Could not restore keyed state backend for KeyedProcessOperator_bf374b554824ef28e76619f4fa153430_(2/2) from any of the 1 provided rest

Re: Setting S3 as State Backend in SQL Client

2022-03-16 Thread Martijn Visser

;> Hi, >> >> If I remember correctly, set operations supports only a limited set of >> configurations. >> >> Most of them are table options that are listed on table configuration [1] >> plus some pipeline options. >> >> State backend options are not likely on

Re: Setting S3 as State Backend in SQL Client

2022-03-16 Thread dz902

e table options that are listed on table configuration [1] > plus some pipeline options. > > State backend options are not likely one of them. > > [1] > https://nightlies.apache.org/flink/flink-docs-release-1.14/docs/dev/table/config/ > > Best, > Paul Lam > > 2022年3

Re: Setting S3 as State Backend in SQL Client

2022-03-16 Thread Paul Lam

Hi, If I remember correctly, set operations supports only a limited set of configurations. Most of them are table options that are listed on table configuration [1] plus some pipeline options. State backend options are not likely one of them. [1] https://nightlies.apache.org/flink/flink

Re: Setting S3 as State Backend in SQL Client

2022-03-15 Thread dz902

Just tried editing flink-conf.yaml and it seems SQL Client does not respect that also. Is this an intended behavior? On Tue, Mar 15, 2022 at 7:14 PM dz902 wrote: > Hi, > > I'm using Flink 1.14 and was unable to set S3 as state backend. I tried > combination of: > > SET st

Setting S3 as State Backend in SQL Client

2022-03-15 Thread dz902

Hi, I'm using Flink 1.14 and was unable to set S3 as state backend. I tried combination of: SET state.backend='filesystem'; SET state.checkpoints.dir='s3://xxx/checkpoints/'; SET state.backend.fs.checkpointdir='s3://xxx/checkpoints/'; SET state.checkpoint-sto

RE: Re: Job requiring a lot of memory despite using rocksdb state backend

2022-02-07 Thread Salva Alcántara

ase the > amount of memory you provided to RocksDB? > > On 05/02/2022 07:12, Salva Alcántara wrote: > > I have a job running on Flink 1.14.3 (Java 11) that uses rocksdb as > > the state backend. The problem is that the job requires an amount of > > memory pretty sim

Re: Job requiring a lot of memory despite using rocksdb state backend

2022-02-07 Thread Chesnay Schepler

What where the errors you received that caused you to increase the amount of memory you provided to RocksDB? On 05/02/2022 07:12, Salva Alcántara wrote: I have a job running on Flink 1.14.3 (Java 11) that uses rocksdb as the state backend. The problem is that the job requires an amount of

Job requiring a lot of memory despite using rocksdb state backend

2022-02-04 Thread Salva Alcántara

I have a job running on Flink 1.14.3 (Java 11) that uses rocksdb as the state backend. The problem is that the job requires an amount of memory pretty similar to the overall state size. Indeed, for making it stable (and capable of taking snapshots) this is what I'm using: - 4 TMs with 30

Re: Some question of RocksDB state backend on ARM os

2021-07-20 Thread Robert Metzger

our reply. > > Will this feature be released in version 1.14? > > Best, > > Hui > > *发件人:* Robert Metzger [mailto:rmetz...@apache.org] > *发送时间:* 2021年7月20日 19:45 > *收件人:* Wanghui (HiCampus) > *抄送:* user@flink.apache.org > *主题:* Re: Some question of RocksDB state

RE: Some question of RocksDB state backend on ARM os

2021-07-20 Thread Wanghui (HiCampus)

Hi Robert： Thank you for your reply. Will this feature be released in version 1.14? Best, Hui 发件人: Robert Metzger [mailto:rmetz...@apache.org] 发送时间: 2021年7月20日 19:45 收件人: Wanghui (HiCampus) 抄送: user@flink.apache.org 主题: Re: Some question of RocksDB state backend on ARM os The RocksDB

Re: Some question of RocksDB state backend on ARM os

2021-07-20 Thread Robert Metzger

commenting on https://issues.apache.org/jira/browse/FLINK-13598 to raise awareness. On Tue, Jul 20, 2021 at 5:26 AM Wanghui (HiCampus) wrote: > Hi all: > >When I use RocksDB as state backend on an aarch64 system, the > following error occurs: > > 1. Does the aarch64

Some question of RocksDB state backend on ARM os

2021-07-19 Thread Wanghui (HiCampus)

Hi all: When I use RocksDB as state backend on an aarch64 system, the following error occurs: 1. Does the aarch64 system not support rocksdb? 2. If not, is there a support plan for later versions of flink? Caused by: java.lang.Exception: Exception while creating

Question about setting up Task-local recovery with a RocksDB state backend

2021-04-01 Thread Sonam Mandal

Hello, I've been going through the documentation for task-local recovery and came across this section which discusses that with incremental check

Re: EOFException on attempt to scale up job with RocksDB state backend

2021-03-24 Thread Alexey Trenikhun

Tang ; Tzu-Li (Gordon) Tai ; user@flink.apache.org Subject: Re: EOFException on attempt to scale up job with RocksDB state backend I Yun, I've changed configuration to use block blobs, however due to another issue [1], I can't make savepoint, I hope eventually job will able to proces

Re: EOFException on attempt to scale up job with RocksDB state backend

2021-03-18 Thread Alexey Trenikhun

March 18, 2021 5:08 AM To: Alexey Trenikhun ; Tzu-Li (Gordon) Tai ; user@flink.apache.org Subject: Re: EOFException on attempt to scale up job with RocksDB state backend Hi Alexey, Flink would only write once for checkpointed files. Could you try to write checkpointed files as block blob format

Re: EOFException on attempt to scale up job with RocksDB state backend

2021-03-18 Thread Yun Tang

; Tzu-Li (Gordon) Tai ; user@flink.apache.org Subject: Re: EOFException on attempt to scale up job with RocksDB state backend Hi Yun, How underlying storage explains fact that without re-scale I can restore from savepoint? Does Flink write file once or many times, if many times, then potentially

Re: EOFException on attempt to scale up job with RocksDB state backend

2021-03-17 Thread Alexey Trenikhun

ed in [1] or without compaction? Thanks, Alexey From: Yun Tang Sent: Wednesday, March 17, 2021 9:31 PM To: Alexey Trenikhun ; Tzu-Li (Gordon) Tai ; user@flink.apache.org Subject: Re: EOFException on attempt to scale up job with RocksDB state backend Hi Alexey,

Re: EOFException on attempt to scale up job with RocksDB state backend

2021-03-17 Thread Yun Tang

ordon) Tai ; user@flink.apache.org Subject: Re: EOFException on attempt to scale up job with RocksDB state backend Hi Yun, Azure web UI shows size of all files created by Flink as 128Mib * X (128, 256, 640), see screenshot attached. In my understanding this is because Flink creates them as Page Blob

Re: EOFException on attempt to scale up job with RocksDB state backend

2021-03-17 Thread Yun Tang

Yun Tang From: Alexey Trenikhun Sent: Thursday, March 18, 2021 0:45 To: Yun Tang ; Tzu-Li (Gordon) Tai ; user@flink.apache.org Subject: Re: EOFException on attempt to scale up job with RocksDB state backend Hi Yun, I've copied 77e77928-cb26-4543

Re: EOFException on attempt to scale up job with RocksDB state backend

2021-03-17 Thread Alexey Trenikhun

, not sure does it have effect on savepoint or not) Thanks, Alexey From: Yun Tang Sent: Wednesday, March 17, 2021 12:33 AM To: Alexey Trenikhun ; Tzu-Li (Gordon) Tai ; user@flink.apache.org Subject: Re: EOFException on attempt to scale up job with RocksDB state b

Re: EOFException on attempt to scale up job with RocksDB state backend

2021-03-17 Thread Yun Tang

on attempt to scale up job with RocksDB state backend Attached. From: Yun Tang Sent: Tuesday, March 16, 2021 11:13 PM To: Alexey Trenikhun ; Tzu-Li (Gordon) Tai ; user@flink.apache.org Subject: Re: EOFException on attempt to scale up job with RocksDB state back

Re: EOFException on attempt to scale up job with RocksDB state backend

2021-03-16 Thread Yun Tang

@flink.apache.org Subject: Re: EOFException on attempt to scale up job with RocksDB state backend Hi Yun, I'm attaching shorter version of log, looks like full version didn't come through Thanks, Alexey From: Yun Tang Sent: Tuesday, March 16, 2021

Re: EOFException on attempt to scale up job with RocksDB state backend

2021-03-16 Thread Yun Tang

From: Alexey Trenikhun Sent: Tuesday, March 16, 2021 15:10 To: Yun Tang ; Tzu-Li (Gordon) Tai ; user@flink.apache.org Subject: Re: EOFException on attempt to scale up job with RocksDB state backend Also restore from same savepoint without change in parallelism works fine

Re: EOFException on attempt to scale up job with RocksDB state backend

2021-03-16 Thread Alexey Trenikhun

state backend No, I believe original exception was from 1.12.1 to 1.12.1 Thanks, Alexey From: Yun Tang Sent: Monday, March 15, 2021 8:07:07 PM To: Alexey Trenikhun ; Tzu-Li (Gordon) Tai ; user@flink.apache.org Subject: Re: EOFException on attempt to scale up job

Re: EOFException on attempt to scale up job with RocksDB state backend

2021-03-15 Thread Alexey Trenikhun

RocksDB state backend Hi, Can you scale the job at the same version from 1.12.1 to 1.12.1? Best Yun Tang From: Alexey Trenikhun Sent: Tuesday, March 16, 2021 4:46 To: Tzu-Li (Gordon) Tai ; user@flink.apache.org Subject: Re: EOFException on attempt to scale up job

Re: EOFException on attempt to scale up job with RocksDB state backend

2021-03-15 Thread Yun Tang

state backend Savepoint was taken with 1.12.1, I've tried to scale up using same version and 1.12.2 From: Tzu-Li (Gordon) Tai Sent: Monday, March 15, 2021 12:06 AM To: user@flink.apache.org Subject: Re: EOFException on attempt to scale up job with RocksDB

Re: EOFException on attempt to scale up job with RocksDB state backend

2021-03-15 Thread Alexey Trenikhun

Savepoint was taken with 1.12.1, I've tried to scale up using same version and 1.12.2 From: Tzu-Li (Gordon) Tai Sent: Monday, March 15, 2021 12:06 AM To: user@flink.apache.org Subject: Re: EOFException on attempt to scale up job with RocksDB state backen

Re: EOFException on attempt to scale up job with RocksDB state backend

2021-03-15 Thread Tzu-Li (Gordon) Tai

Hi, Could you provide info on the Flink version used? Cheers, Gordon -- Sent from: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/

EOFException on attempt to scale up job with RocksDB state backend

2021-03-10 Thread Alexey Trenikhun

org.apache.flink.runtime.taskmanager.Task.run(Task.java:570) at java.lang.Thread.run(Thread.java:748) Caused by: org.apache.flink.util.FlinkException: Could not restore keyed state backend for KeyedProcessOperator_8131c39062c4256ee506e2382c4a7bfd_(3/8) from any of the 1 provided restore

Re: OOM error for heap state backend.

2020-08-27 Thread Congxian Qiu

used for other tasks ? >>> >>> >>> I don't quite understand this question. I believe the recommendation in >>> docs has the same reason: use larger state objects so that the Java object >>> overhead pays off. >>> RocksDB keeps state in mem

Re: OOM error for heap state backend.

2020-08-27 Thread Robert Metzger

hat the Java object >> overhead pays off. >> RocksDB keeps state in memory and on disk in the serialized form. >> Therefore it usually has a smaller footprint. >> Other jobs in the same task manager can potentially use other state >> backend depending on their state requireme

Re: OOM error for heap state backend.

2020-08-26 Thread Vishwas Siravara

son: use larger state objects so that the Java object > overhead pays off. > RocksDB keeps state in memory and on disk in the serialized form. > Therefore it usually has a smaller footprint. > Other jobs in the same task manager can potentially use other state > backend depending on th

Re: OOM error for heap state backend.

2020-08-26 Thread Andrey Zagrebin

pays off. RocksDB keeps state in memory and on disk in the serialized form. Therefore it usually has a smaller footprint. Other jobs in the same task manager can potentially use other state backend depending on their state requirements. All tasks in the same task manager share the JVM heap as the task

Re: OOM error for heap state backend.

2020-08-26 Thread Andrey Zagrebin

t;- Memory leak >>>- Not enough GC threads >>>- Concurrent GC starts too late >>>- ... >>> >>> I would suggest taking a look at the GC logs. >>> >>> Thank you~ >>> >>> Xintong Song >>> >>>

Re: OOM error for heap state backend.

2020-08-25 Thread Andrey Zagrebin

> > On Fri, Aug 21, 2020 at 10:34 PM Vishwas Siravara > wrote: > >> Hi guys, >> I use flink version 1.7.2 >> I have a stateful streaming job which uses a keyed process function. I >> use heap state backend. Although I set TM heap size to 16 GB, I get OOM >>

Re: OOM error for heap state backend.

2020-08-23 Thread Xintong Song

, Aug 21, 2020 at 10:34 PM Vishwas Siravara wrote: > Hi guys, > I use flink version 1.7.2 > I have a stateful streaming job which uses a keyed process function. I use > heap state backend. Although I set TM heap size to 16 GB, I get OOM error > when the state size is around 2.5 GB(f

OOM error for heap state backend.

2020-08-21 Thread Vishwas Siravara

Hi guys, I use flink version 1.7.2 I have a stateful streaming job which uses a keyed process function. I use heap state backend. Although I set TM heap size to 16 GB, I get OOM error when the state size is around 2.5 GB(from dashboard I get the state size). I have set taskmanager.memory.fraction

Re: State backend considerations

2020-06-22 Thread Arvid Heise

Hi Nick, Both questions are hard to answer given that it depends on your hardware, access patterns (read/update), record size/structure, parallelism, and probably a ton of other parameters. The usual approach is to simply evaluate it in your setting. Since it's a matter of configuration, you can

State backend considerations

2020-06-21 Thread Nick Bendtner

Hi guys, I have a few questions on state backends. Is there a guideline on how big the state has to be where it makes sense to use RocksDB rather than FsStatebackend ? Is there an analysis on latency for a full checkpoint for FsSateBackend based on increase in state size ? Best, Nick.

Re: Pulsar as a state backend

2020-04-07 Thread Michael Colson

- Work on state integration is also in progress and expected sometime > in May. >- Using Pulsar as a Flink state backend is something that the team >will also be working on once the off-loader (mentioned above) is complete >so this should be expected sometime in June. > >

Re: Pulsar as a state backend

2020-04-07 Thread Markos Sfikas

currently in progress and expected to be out sometime in May. - Work on state integration is also in progress and expected sometime in May. - Using Pulsar as a Flink state backend is something that the team will also be working on once the off-loader (mentioned above) is complete so

Pulsar as a state backend

2020-04-06 Thread Michael Colson

Hello, I recently browse this post : https://flink.apache.org/2019/05/03/pulsar-flink.html and mainly : *Finally, an alternative way of integrating the technologies could include using Pulsar as a state backend with Flink. Since Pulsar has a layered architecture (Streams and Segmented Streams

Re: Flink cluster on k8s with rocksdb state backend

2019-10-17 Thread Steven Nelson

:31 PM, dhanesh arole wrote: > > Hello all, > > I am trying to provision a Flink cluster on k8s. Some of the jobs in our > existing cluster use RocksDB state backend. I wanted to take a look at the > Flink helm chart or deployment manifests that provision task managers with

Flink cluster on k8s with rocksdb state backend

2019-10-17 Thread dhanesh arole

Hello all, I am trying to provision a Flink cluster on k8s. Some of the jobs in our existing cluster use RocksDB state backend. I wanted to take a look at the Flink helm chart or deployment manifests that provision task managers with dynamic PV and how they manage it. We are running on kops

Re: Flink 1.8: Using the RocksDB state backend causes "NoSuchMethodError" when trying to stop a pipeline

2019-08-14 Thread Kaymak, Tobias

t; Yun Tang > > -- > *From:* Kaymak, Tobias > *Sent:* Tuesday, August 13, 2019 21:20 > *To:* user@flink.apache.org > *Subject:* Flink 1.8: Using the RocksDB state backend causes > "NoSuchMethodError" when trying to stop a pipeline > >

Re: Flink 1.8: Using the RocksDB state backend causes "NoSuchMethodError" when trying to stop a pipeline

2019-08-13 Thread Yun Tang

ay, August 13, 2019 21:20 To: user@flink.apache.org Subject: Flink 1.8: Using the RocksDB state backend causes "NoSuchMethodError" when trying to stop a pipeline Hi, I am using Apache Beam 2.14.0 with Flink 1.8.0 and I have included the RocksDb dependency in my projects pom.xml as w

Flink 1.8: Using the RocksDB state backend causes "NoSuchMethodError" when trying to stop a pipeline

2019-08-13 Thread Kaymak, Tobias

Hi, I am using Apache Beam 2.14.0 with Flink 1.8.0 and I have included the RocksDb dependency in my projects pom.xml as well as baked it into the Dockerfile like this: FROM flink:1.8.0-scala_2.11 ADD --chown=flink:flink http://central.maven.org/maven2/org/apache/flink/flink-statebackend-rocksdb_

Re: HDFS checkpoints for rocksDB state backend:

2019-06-27 Thread Yang Wang

Hi, Andrea If you are running flink cluster on Yarn, the jar `flink-shaded-hadoop2-uber-1.6.4.jar` should exist in the lib dir of the flink client, so that it could be uploaded to the Yarn Distributed Cache and then be available on JM and TM. And if you are running flink standalone cluster, the j

Re: HDFS checkpoints for rocksDB state backend:

2019-06-27 Thread Andrea Spina

HI Qiu, my jar does not contain the class `org.apache.hadoop.hdfs.protocol.HdfsConstants*`, *but I do expect it is contained within `flink-shaded-hadoop2-uber-1.6.4.jar` which is located in Flink cluster libs. Il giorno gio 27 giu 2019 alle ore 04:03 Congxian Qiu < qcx978132...@gmail.com> ha scrit

Re: HDFS checkpoints for rocksDB state backend:

2019-06-26 Thread Congxian Qiu

Hi Andrea As the NoClassDefFoundError, could you please verify that there exist `org.apache.hadoop.hdfs.protocol.HdfsConstants*` *in your jar. Or could you use Arthas[1] to check if there exists the class when running the job? [1] https://github.com/alibaba/arthas Best, Congxian Andrea Spina

HDFS checkpoints for rocksDB state backend:

2019-06-26 Thread Andrea Spina

Dear community, I'm trying to use HDFS checkpoints in flink-1.6.4 with the following configuration state.backend: rocksdb state.checkpoints.dir: hdfs:// rbl1.stage.certilogo.radicalbit.io:8020/flink/checkpoint state.savepoints.dir: hdfs:// rbl1.stage.certilogo.radicalbit.io:8020/flink/savepoints

Re: Unable to set S3 like object storage for state backend.

2019-06-24 Thread Vishwas Siravara

Hi Ken, Thanks for reaching out, I created a compliant bucket with name aip-featuretoolkit. I now get the exception "Unable to execute HTTP request: aip-featuretoolkit.SU73ECSG1P1d.***.COM: Name or service not known" from org.apache.flink.fs.shaded.hadoop3.org.apache.hadoop.fs.s3a.Invoker.class l

Re: Unable to set S3 like object storage for state backend.

2019-06-20 Thread Ken Krugler

Hi Vishwas, It might be that you’ve got a legacy bucket name (“aip_featuretoolkit”), as AWS no longer allows bucket names to contain an underscore . I’m guessing that the Hadoop S3 code is trying to treat your path as a va

Unable to set S3 like object storage for state backend.

2019-06-20 Thread Vishwas Siravara

Hi, I am using flink version 1.7.2 , I am trying to use S3 like object storage EMC ECS( https://www.emc.com/techpubs/ecs/ecs_s3_supported_features-1.htm) . I am using the flink-s3-fs-hadoop-1.7.2.jar file as a dependency for s3 filesystem and I have placed it under the lib folder and is available

Re: Checkpoints periodically fail with hdfs as the state backend - Could not flush and close the file system output stream

2019-05-22 Thread PedroMrChaves

Unfortunately the audit logs for hdfs were not enabled. We will enable them and post he results when the problem happens again. Nonetheless, we don't have ay other process using hadoop besides flink - Best Regards, Pedro Chaves -- Sent from: http://apache-flink-user-mailing-list-archive.2336

Re: Checkpoints periodically fail with hdfs as the state backend - Could not flush and close the file system output stream

2019-05-21 Thread Congxian Qiu

Hi Pedro From the previous given log, I found that checkpoint 65912 has been expired then, the raise the IOException. When some checkpoint expired, the checkpoint dir will be deleted(CheckpointCoordinator#549 on release-1.6 branch), and the unfinished task will still write to the previous files,

Re: Checkpoints periodically fail with hdfs as the state backend - Could not flush and close the file system output stream

2019-05-21 Thread PedroMrChaves

The issue happened again. /AsynchronousException{java.lang.Exception: Could not materialize checkpoint 47400 for operator ENRICH (1/4).} at org.apache.flink.streaming.runtime.tasks.StreamTask$AsyncCheckpointExceptionHandler.tryHandleCheckpointException(StreamTask.java:1153) at org.

Re: Checkpoints periodically fail with hdfs as the state backend - Could not flush and close the file system output stream

2019-05-16 Thread PedroMrChaves

Hello Andrey, The audit log doesn't have anything that would point to it being deleted. The only thing worth mentioning is the following line. /2019-05-15 10:01:39,082 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: BLOCK* blk_1248714854_174974084 is COMMITTED but not COMPLETE(numNodes=

Re: Checkpoints periodically fail with hdfs as the state backend - Could not flush and close the file system output stream

2019-05-16 Thread PedroMrChaves

Hello, Thanks for the help. I've attached the logs. Our cluster has 2 job managers (HA) and 4 task managers. logs.tgz Regards, Pedro - Best Regards, Pedro Chaves -- Sent from: http://apache-flink

Re: Checkpoints periodically fail with hdfs as the state backend - Could not flush and close the file system output stream

2019-05-16 Thread Congxian Qiu

Hi Pedro Could you please share the audit log for file `/flink/data/checkpoints/76f7b4f5c679e8f2d822c9c3c73faf5d/chk-65912/68776faf-b687-403b-ba0c-17419f8684dc`, seems this did not exist cause this problem (maybe this file was created and deleted for some reason) Best, Congxian Andrey Zagrebin

Re: flink 1.4.2. java.lang.IllegalStateException: Could not initialize operator state backend

2019-05-16 Thread Andrey Zagrebin

The stack trace shows that the state is being restored which has probably already happened after job restart. I am wondering why it has been restarted after some time of running. Could you share full job/task manager logs? On Thu, May 16, 2019 at 6:26 AM anaray wrote: > Thank You Andrey. Arity o

Re: Checkpoints periodically fail with hdfs as the state backend - Could not flush and close the file system output stream

2019-05-16 Thread Andrey Zagrebin

Hi, could you also post job master logs? and ideally full task manager logs. This failure can be caused by some other previous failure. Best, Andrey On Wed, May 15, 2019 at 2:48 PM PedroMrChaves wrote: > Hello, > > Every once in a while our checkpoints fail with the following exception: > > /A

Re: flink 1.4.2. java.lang.IllegalStateException: Could not initialize operator state backend

2019-05-15 Thread anaray

Thank You Andrey. Arity of the job has not changed. Here issue is that job will run for sometime (with checkpoint enabled) and then after some time will get into above exception. The job keeps restarting afterwards. One thing that I want point out here is that we have a custom *serialization sche

Re: flink 1.4.2. java.lang.IllegalStateException: Could not initialize operator state backend

2019-05-15 Thread Andrey Zagrebin

Hi, I am not sure that FLINK-8836 is related to the failure in the stack trace. You say you are using Flink in production, does it mean it always worked and has started to fail recently? >From the stack trace, it looks like the arity of some Tup

Checkpoints periodically fail with hdfs as the state backend - Could not flush and close the file system output stream

2019-05-15 Thread PedroMrChaves

Hello, Every once in a while our checkpoints fail with the following exception: /AsynchronousException{java.lang.Exception: Could not materialize checkpoint 65912 for operator AGGREGATION-FILTER (2/2).} at org.apache.flink.streaming.runtime.tasks.StreamTask$AsyncCheckpointExceptionHandler

flink 1.4.2. java.lang.IllegalStateException: Could not initialize operator state backend

2019-05-14 Thread anaray

options. Please advice. java.lang.IllegalStateException: Could not initialize operator state backend. at org.apache.flink.streaming.api.operators.AbstractStreamOperator.initOperatorState(AbstractStreamOperator.java:301) at

Re: [EXTERNAL] Re: Looking for help in configuring Swift as State Backend

2019-04-24 Thread Till Rohrmann

.apache.flink.runtime.client.JobExecutionException: Could > not set up JobManager > >at > org.apache.flink.runtime.jobmaster.JobManagerRunner.(JobManagerRunner.java:176) > >at > org.apache.flink.runtime.dispatcher.Dis

Re: [EXTERNAL] Re: Looking for help in configuring Swift as State Backend

2019-04-24 Thread PoolakkalMukkath, Shakir

4) ... 7 more Caused by: org.apache.flink.runtime.client.JobExecutionException: Could not instantiate configured state backend at org.apache.flink.runtime.executiongraph.ExecutionGraphBuilder.buildGraph(ExecutionGr

Re: Looking for help in configuring Swift as State Backend

2019-04-24 Thread Till Rohrmann

kkalmukk...@comcast.com> wrote: > Hi, > > > > I am looking for some help in configuring the Swift Fs as State Backend. I > am unable to configure it, let me know if anyone has prior done this or > knowledge to help me > > Do we still need to run an HDFS to use this feature ? > > > > Thanks, > > Shakir >

Looking for help in configuring Swift as State Backend

2019-04-23 Thread PoolakkalMukkath, Shakir

Hi, I am looking for some help in configuring the Swift Fs as State Backend. I am unable to configure it, let me know if anyone has prior done this or knowledge to help me Do we still need to run an HDFS to use this feature ? Thanks, Shakir

Re: getting an error when configuring state backend to hdfs

2018-12-24 Thread Avi Levi

[1] > > [1] https://flink.apache.org/downloads.html#latest-stable-release-v171 > > > Best > Yun Tang > -- > *From:* Avi Levi > *Sent:* Thursday, December 20, 2018 2:11 > *To:* Steven Nelson > *Cc:* Chesnay Schepler; user@flink.apache.o

Re: getting an error when configuring state backend to hdfs

2018-12-23 Thread Yun Tang

error when configuring state backend to hdfs when I try running from my IDE (intellij) I am getting this exception Exception in thread "main" org.apache.flink.runtime.client.JobExecutionException: Could not retrieve JobResult. at org.apache.flink.runtime.minicluster.MiniCluste

Re: getting an error when configuring state backend to hdfs

2018-12-19 Thread Avi Levi

when I try running from my IDE (intellij) I am getting this exception Exception in thread "main" org.apache.flink.runtime.client.JobExecutionException: Could not retrieve JobResult. at org.apache.flink.runtime.minicluster.MiniCluster.executeJobBlocking(MiniCluster.java:643) at org.apache.flink.stre

Re: getting an error when configuring state backend to hdfs

2018-12-19 Thread Steven Nelson

What image are you using? Sent from my iPhone > On Dec 19, 2018, at 9:44 AM, Avi Levi wrote: > > Hi Chesnay, > What do you mean? I am creating a fat jar with all dependencies (using sbt > assembly). which jar I should place in the /lib directory ? > >> On Wed, Dec 19, 2018 at 4:44 PM Chesnay

Re: getting an error when configuring state backend to hdfs

2018-12-19 Thread Chesnay Schepler

flink-hadoop-fs should be in /lib On 19.12.2018 16:44, Avi Levi wrote: Hi Chesnay, What do you mean? I am creating a fat jar with all dependencies (using sbt assembly). which jar I should place in the /lib directory ? On Wed, Dec 19, 2018 at 4:44 PM Chesnay Schepler

Re: getting an error when configuring state backend to hdfs

2018-12-19 Thread Avi Levi

Hi Chesnay, What do you mean? I am creating a fat jar with all dependencies (using sbt assembly). which jar I should place in the /lib directory ? On Wed, Dec 19, 2018 at 4:44 PM Chesnay Schepler wrote: > Are you including the filesystems in your jar? Filesystem jars must be > placed in the /lib

1 2 3 >

1 - 100 of 263 matches

Mail list logo