Re: "Self-service ingestion pipelines with evolving schema via Flink and Iceberg" presentation recording from Flink Forward Seattle 2023

2024-05-24 Thread Giannis Polyzos
before. One job to sync and > evolve an entire database? That is amazing. > > If we could do this with Iceberg, we might be able to make an argument to > product managers to push for CDC. > > > > On Fri, May 24, 2024 at 8:36 AM Giannis Polyzos > wrote: > >> I’m c

Re: "Self-service ingestion pipelines with evolving schema via Flink and Iceberg" presentation recording from Flink Forward Seattle 2023

2024-05-24 Thread Giannis Polyzos
I’m curious if there is any reason for choosing Iceberg instead of Paimon (other than Iceberg being more popular), especially for a use case like CDC that Iceberg struggles to support. On Fri, 24 May 2024 at 3:22 PM, Andrew Otto wrote: > Interesting thank you! > > I asked this in the Paimon users

Re: Continuous transfer of data from a partitioned table

2024-02-13 Thread Giannis Polyzos
You can check the Oracle CDC connector, which provides that https://ververica.github.io/flink-cdc-connectors/master/content/connectors/oracle-cdc.html Best, G. On Tue, Feb 13, 2024 at 3:25 PM К В wrote: > Hello! > > We need to read data from an Oracle database table in order to pass it to > Kaf

Re: Optional fields during SQL insert

2024-01-11 Thread Giannis Polyzos
Hi Gyula, to the best of my knowledge, this is not feasible and you will have to do something like *CAST(NULL AS STRING)* to insert null values manually. Best, Giannis On Thu, Jan 11, 2024 at 9:58 AM Gyula Fóra wrote: > Hi All! > > Is it possible to insert into a table without specifying all co
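
A minimal sketch of the workaround described in this reply, assuming every column has to appear in the INSERT and missing ones are filled with a typed NULL. The helper `build_insert`, the table name `orders`, and the column names are hypothetical and only illustrate the idea; they are not part of Flink.

```python
def build_insert(table, schema, values):
    """Build an INSERT statement that covers every column of `schema`.

    schema: ordered mapping of column name -> SQL type (hypothetical table).
    values: mapping of column name -> SQL literal for the columns we have.
    Columns missing from `values` are filled with CAST(NULL AS <type>).
    """
    unknown = set(values) - set(schema)
    if unknown:
        raise ValueError(f"unknown columns: {sorted(unknown)}")
    exprs = [
        values.get(col, f"CAST(NULL AS {sql_type})")
        for col, sql_type in schema.items()
    ]
    return f"INSERT INTO {table} VALUES ({', '.join(exprs)})"


# Hypothetical three-column table; only two values are supplied.
schema = {"id": "BIGINT", "name": "STRING", "note": "STRING"}
stmt = build_insert("orders", schema, {"id": "1", "name": "'a'"})
print(stmt)
```

The generated statement keeps the column order of the schema, so the typed NULL lands in the position of the column that was left out.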

Re: Barriers in Flink SQL

2023-10-25 Thread Giannis Polyzos
Hi Ralph, can you explain a bit more? When you say "barriers" you should be referring to checkpoints, but from your description it seems more like watermarks. What functionality is supported in Flink and not Flink SQL? In terms of watermarks, there were a few shortcomings between the two APIs whic

Re: Dealing with stale Watermark

2023-10-20 Thread Giannis Polyzos
Hi Irakli, if you look at the watermarks tab on the operator, do you see them being propagated? If, for example, your source has multiple splits (like Kafka partitions) and one is idle or lags behind, the watermark won’t be propagated, as it is the minimum across all inputs (partitions/splits). Best On Fri, 20 Oc
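
The minimum rule mentioned in this reply can be sketched in a few lines of plain Python. This is a simplified model, not Flink code: `operator_watermark`, the split names, and the timestamps are hypothetical, and the `idle` set stands in for splits that an idleness setting would exclude from the calculation.

```python
def operator_watermark(split_watermarks, idle=frozenset()):
    """Return the watermark an operator would emit in this simplified model.

    split_watermarks: mapping of split name -> last watermark seen (ms).
    idle: names of splits treated as idle and therefore ignored.
    The result is the minimum over the remaining splits, or None when
    every split is idle.
    """
    active = [wm for split, wm in split_watermarks.items() if split not in idle]
    if not active:
        return None
    return min(active)


# Hypothetical Kafka partitions; p2 stopped receiving data long ago.
watermarks = {"p0": 1000, "p1": 1500, "p2": 200}

stalled = operator_watermark(watermarks)               # held back by p2
advanced = operator_watermark(watermarks, idle={"p2"}) # p2 ignored
print(stalled, advanced)
```

In this model a single lagging split pins the result at its own watermark, which matches the stale-watermark symptom; once that split is treated as idle, the minimum is taken over the remaining splits only.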

Re: Urgent: Mitigating Slow Consumer Impact and Seeking Open-Source Solutions in Apache Kafka Consumers

2023-09-16 Thread Giannis Polyzos
appears to be a common challenge related to a slow-consuming >> situation. Those with relevant experience in addressing such matters should >> be capable of providing assistance. >> >> Thanks and regards, >> Gowtham S >> >> >> On Fri, 15 Sept 2023 at 23

Re: Urgent: Mitigating Slow Consumer Impact and Seeking Open-Source Solutions in Apache Kafka Consumers

2023-09-15 Thread Giannis Polyzos
Hi Karthick, at a high level this seems like a data skew issue, where some partitions have way more data than others? How many devices do you have? How many messages are you processing? Most of the things you share above sound like you are looking for suggestions around load distribution for Kafka.

Re: Usecase advise for Apache Flink Table API

2023-08-27 Thread Giannis Polyzos
You can also check the Apache Paimon project https://paimon.apache.org/ (previously known as Flink Table Store). It might help in some scenarios. On Mon, Aug 28, 2023 at 5:05 AM liu ron wrote: > Hi, Nirmal > > Flink SQL is standard ANSI SQL and extends upon it. Flink SQL provides > rich Join and Agg

[Discussion] Slack Channel

2023-08-22 Thread Giannis Polyzos
Hello folks, considering how Apache Flink is gaining more and more popularity and seeing how other open-source projects use Slack, I wanted to start this thread to see how we can grow the community. First of all, one thing I have noticed: although there are people involved with Flink only lately they sta

Re: Async IO For Cassandra

2023-07-17 Thread Giannis Polyzos
Hi Pritam, since this is a lookup to an external system, with network I/O involved plus the time to get the results, it might be normal to notice backpressure there. Also note that the queries in Cassandra highly depend on the data model, so data can be easy to find between the

Re: Data Stream Processing applications testing

2023-05-22 Thread Giannis Polyzos
Seems interesting, thanks for sharing On Tue, May 23, 2023 at 2:04 AM Alexandre Strapacao Guedes Vianna < a...@cin.ufpe.br> wrote: > Hey everyone, > > I wanted to share my latest paper, "A Grey Literature Review on Data > Stream Processing Applications Testing," in the Journal of Systems and > So

Re: Flink SQL Async UDF

2023-05-08 Thread Giannis Polyzos
What I'm curious about is the completable future in the function signature (eval) public final void eval(CompletableFuture> future, Object... keys): is it injected automatically? Best On Mon, May 8, 2023 at 10:02 AM Giannis Polyzos wrote: > Hi Biao, > yeah, this means that since it&

Re: Flink SQL Async UDF

2023-05-08 Thread Giannis Polyzos
es, in your example, you actually implement an async user-defined > table function (see here > <https://nightlies.apache.org/flink/flink-docs-master/docs/dev/table/functions/udfs/#table-functions> > for > more details about UDTF usage), not a scalar UDF. The usage could be > diffe

Re: Flink SQL Async UDF

2023-05-07 Thread Giannis Polyzos
r message "No match found for function signature" usually > indicates that the function does not exist or the parameters do not match. > > Best, > Shammon FY > > On Sun, May 7, 2023 at 2:55 PM Giannis Polyzos > wrote: > >> I can't really find any examp

Flink SQL Async UDF

2023-05-06 Thread Giannis Polyzos
I can't really find any examples / docs for Flink's AsyncTableFunction and I have a hard time getting it to work. Is there any example you can share that just takes a String key as input and outputs, let's say, a record (returned by the lookup)? Also, it is not clear to me how the async happens internally

Re: Flink SQL State

2023-04-27 Thread Giannis Polyzos
t? I think it can be > extremely useful for the community. > > Thank you. > > On Thu, Apr 27, 2023 at 8:08 AM Giannis Polyzos > wrote: > >> Correct, it's some custom code I put together to investigate what gets >> written in RocksDB >> >> On Thu, Apr 2

Re: Flink SQL State

2023-04-27 Thread Giannis Polyzos
> Thank you. > > On Wed, Apr 26, 2023 at 10:38 PM Giannis Polyzos > wrote: > >> This is really helpful, >> >> Thanks >> >> On Thu, Apr 27, 2023 at 5:46 AM Yanfei Lei wrote: >> >>> Hi Giannis, >>> >>> Except “default” C

Re: Flink SQL State

2023-04-26 Thread Giannis Polyzos
> implements the logic of temporal join, this post[2] might be helpful > in understanding what happened to temporal join. > > [1] > https://www.alibabacloud.com/help/en/realtime-compute-for-apache-flink/latest/datastream-timer > [2] > https://nightlies.apache.org/flink/flink-

Re: UI error in 1.17 release

2023-04-07 Thread Giannis Polyzos
Nice 👍 P.S. I forgot to mention that I use Kafka as the source. On Fri, Apr 7, 2023 at 11:03 AM Weihua Hu wrote: > Hi, > Thanks for reporting this issue. > > I created a ticket to track this[1]. > > [1]https://issues.apache.org/jira/browse/FLINK-31752 > > Best, > Weihua > > > On Fri, Apr 7, 2023 at 1