Flink's Data sources

2022-02-08 Thread mohan radhakrishnan
Hi, The source for Flink ( or even Kafka ) is a problem we find hard to solve. This data seems to indicate that the source could be MQ. Is there a need to pull from MQ to Hive and then write to Flink ? What can be the flow ? Kafka connect workers can issue JDBC queries and pull to Kafka. Is there

How to access Task.isBackPressured() from a SourceFunction?

2022-02-08 Thread Darren Whobrey
Hi, is there a way for the UDF of a source function, extended from RichParallelSourceFunction, to access its Task instance, so as to call Task.isBackPressured()? I'm trying to give priorities to different input sources that need to be managed from within the same source function and want to stop

Re: Inaccurate checkpoint trigger time

2022-02-08 Thread Paul Lam
Hi Robert & Yun, Thanks a lot for your helpful analysis! I’ve confirmed that it’s the checkpoint cleanup problem that caused the inaccurate checkpoint trigger time. Best, Paul Lam > 2022年1月30日 19:45,Yun Tang 写道: > > Hi Paul, > > I think Robert's idea might be right. > > From the log you pas

Using Universal Kafka Connector with Kafka Log Message Version 0.10.2.0

2022-02-08 Thread Mohammad Haseeb
Hi, We are currently running Flink 1.11 and I am exploring the option of using the universal Kafka connector with our Kafka cluster running on Kafka version 1.1.1 but log message format version 0.10.2.0. On the wiki page of the connector ( https://nightlies.apache.org/flink/flink-docs-release-1.11

Re: Flink Table API and Rules

2022-02-08 Thread Roman Khachatryan
Hi, Could you clarify whether your question is about - connecting Table and DataStream APIs? [1] - matching the records using the Table API? - or matching in the DataStream API? [3] ... You should probably look at the temporal joins to match records using Table API [2]; and connect/join/ or broad

Re: Temporal join unexpected behavior

2022-02-08 Thread Roman Khachatryan
Hi, According to the documentation, event-time temporal join is triggered by a watermark from the left and right sides. So the described behavior seems correct. [1] https://nightlies.apache.org/flink/flink-docs-release-1.14/docs/dev/table/sql/queries/joins/#event-time-temporal-join Regards, Roma

Re: Flink JDBC connector behavior

2022-02-08 Thread Roman Khachatryan
Hi, scan.partition.num (the number of partitions [1]) translates into parallel queries to the database (with different to/from). Batch size is further calculated from lower and upper bounds and the number of partitions. scan.fetch-size hints JDBC driver to adjust the fetch size (see [2]). The fi

Flink JDBC connector behavior

2022-02-08 Thread Qihua Yang
Hi, We are using flink jdbc connector to read whole database table line by line. A few things I don't quite understand. We configured BATCH_SIZE=100 and PARTITION_NUM=1000, table is pretty big. What is the flink internal behavior to read data from table? Flink read BATCH_SIZE data each time? Or it

Re: JSONKeyValueDeserializationSchema cannot be converted to ObjectNode>

2022-02-08 Thread Roman Khachatryan
Hi, setDeserializer() expects KafkaRecordDeserializationSchema; JSONKeyValueDeserializationSchema you provided is not compatible with it. You can convert it using [1] [1] https://nightlies.apache.org/flink/flink-docs-master/api/java/org/apache/flink/connector/kafka/source/reader/deserializer/Kafk

Re: Avro BulkFormat for the new FileSource API?

2022-02-08 Thread Roman Khachatryan
Hi Kevin, I'm afraid AvroFileFormatFactory is not intended for the DataStream API. Have you tried adapting AvroParquetRecordFormat via StreamFormatAdapter? Regards, Roman On Fri, Jan 7, 2022 at 7:23 PM Kevin Lam wrote: > > Hi all, > > We're looking into using the new FileSource API, we see that

Temporal join unexpected behavior

2022-02-08 Thread Ahmad Ebrahimi
Hi. It seems event time temporal join has a bug in version 1.14.3. the left and right side of the join are inverted. In the this query: SELECT * FROM orders AS o LEFT OUTER JOIN products FOR SYSTEM_TIME AS OF o.updatedAt AS p ON o.pId = p.id join triggered when a new event is published to the p

Flink Table API and Rules

2022-02-08 Thread Shahdat Hossain
I am trying to match streaming data against my database table using Flink Table API. I am able to match the data that exists in my streaming data and the specific column of the database table. Now I need to activate some rules that users have entered in the system (another database table) and I wa

Re: Issue with Flink UI for Flink 1.14.0

2022-02-08 Thread Sweta Kalakuntla
I am facing the same issue, do we know when 1.14.4 will be released? Thanks. On Fri, Jan 21, 2022 at 3:28 AM Chesnay Schepler wrote: > While FLINK-24550 was indeed fixed unfortunately a similar bug was also > introduced (https://issues.apache.org/jira/browse/FLINK-25732). > On 20/01/2022 21:18,

JSONKeyValueDeserializationSchema cannot be converted to ObjectNode>

2022-02-08 Thread HG
Hi all, When I build this code: KafkaSource source = KafkaSource.builder() .setProperties(kafkaProps) .setProperty("ssl.truststore.type",trustStoreType) .setProperty("ssl.truststore.password",trustStorePassword) .setProperty("ssl.truststore.location",trustStoreLoca

Re: Questions about Kryo setRegistrationRequired(false)

2022-02-08 Thread Shane Bishop
I tried ExecutionConfig#disableGenericTypes and I get this error when I try to execute my Flink job with StreamExecutionEnvironment#execute (attached is the full stacktrace): java.lang.UnsupportedOperationException: Generic types have been disabled in the ExecutionConfig and type java.util.List

Re: How to prevent check pointing of timers ?

2022-02-08 Thread Alex Drobinsky
Event time would be fine for window processing or if all keyed streams use at least the same timeline - however, in our case it isn't always the case. Imagine that half of streams have real time timestamps and other half is historic data ( in our case from pcap files ). The real time timestamps are

Re: Avro BulkFormat for the new FileSource API?

2022-02-08 Thread Kevin Lam
Sorry to revive this thread, I'm just circling back to this now. Is it possible to use https://issues.apache.org/jira/browse/FLINK-24565 with the DataStream API? I am not sure how to make use of AvroFileFormatFactory in the DataStream APi context, and couldn't find any examples. On Mon, Jan 10,

StateFun NoSuchMethodError when deploying to existing Cluster

2022-02-08 Thread Christopher Gustafson
Hi! I am having continued issues running a StateFun job in an existing Flink Cluster. My Flink cluster is using Flink version 1.14.3 and the StateFun job is using version 3.2.0 of the java SDK and statefun distribution. I get the following error: Caused by: java.lang.NoSuchMethodError: org.

Re: Reading from Kafka kafkarecorddeserializationschema

2022-02-08 Thread Roman Khachatryan
Hi, You can use other (de)serialization methods besides ByteBuffer as well. Endianness is set explicitly to have the same byte order during serialization and deserialization. Regards, Roman On Fri, Feb 4, 2022 at 2:43 PM Dawid Wysakowicz wrote: > > Hi, > > You can use DeserializationSchema with

[RESULT] [VOTE] Deprecate Per-Job Mode

2022-02-08 Thread Konstantin Knauf
Hi everyone, The vote on deprecating per-job mode in Flink 1.15 has been unanimously approved in [1]. I've created a ticket for deprecation [2] and dropping [3] and linked the current blockers for dropping it to the latter. Binding +1 Thomas Weise Xintong Song Yang Wang Jing Zhang Till Rohrmann