Re: Re: Backpressure causing operators to stop ingestion completely

2024-10-23 Thread Raihan Sunny via user
Hi Jake, Thanks for the suggestion. I'm actually using PyFlink and it seems that the flame graph can only account for Java methods. Are there any other ways to debug this? I was curious about the network buffers tuning. Given the situation that there's a surge of input data at times while oth

Backpressure causing operators to stop ingestion completely

2024-10-21 Thread Raihan Sunny via user
Hello, I have an aggregator job that experiences backpressure after running for a while and completely stops processing. It doesn't take any further input from the source. Here's a bit of context: - There are 3 producer jobs, all of which write data to a common Kafka topic - The aggregator job r

Flink metrics to Prometheus on Kubernetes

2023-11-07 Thread Raihan Sunny via user
Hi, I have a few Flink jobs running on Kubernetes using the Flink Kubernetes Operator. By following the documentation [1] I was able to set up monitoring for the Operator itself. As for the jobs themselves, I'm a bit confused about how to properly set it up. Here's my FlinkDeployment configuration

Re: Flink HDFS with Flink Kubernetes Operator

2023-10-19 Thread Raihan Sunny via user
decorators. It's true by default, but by setting it to false > Flink won't add this mount. > > [1] > https://nightlies.apache.org/flink/flink-docs-release-1.17/docs/deployment/config/#kubernetes-decorator-hadoop-conf-mount-enabled > > Regards, > Mate > > Raihan

Flink HDFS with Flink Kubernetes Operator

2023-10-19 Thread Raihan Sunny via user
Hi, I've been using HDFS with Flink for checkpoint and savepoint storage, which works perfectly fine. Now I have another use case where I want to read from and write to HDFS from the application code as well. For this, I'm using the "pyarrow" library which is already installed with PyFlink as a dependen

Re: Flink Kubernetes Operator autoscaling GPU-based workload

2023-08-01 Thread Raihan Sunny via user
nt. > After that you can check the autoscaler logs for more info. > > Gyula > > On Tue, Aug 1, 2023 at 10:33 AM Raihan Sunny via user < > user@flink.apache.org> wrote: > >> Hi, >> >> I have a workload that depends on the GPU. I have only 1 GPU card. As per

Flink Kubernetes Operator autoscaling GPU-based workload

2023-08-01 Thread Raihan Sunny via user
Hi, I have a workload that depends on the GPU. I have only 1 GPU card. As per the documentation I have added the necessary configurations and can run the GPU workload in standalone REACTIVE mode with as many taskmanager instances as required. I have set the number of task slots to 1 so that a rai