Hi Dian,
thanks, that did the trick.
Unfortunately, I have a new problem now.
As I said, I'm trying to read JSON data from a Kafka topic into a DataStream. I
tried doing this using the JsonRowDeserializationSchema class as below (the
JSON objects are tweets and thus pretty nested and complex).
Hi,
I am trying to run a very basic job in PyFlink (reading data from a Kafka topic
and printing the stream).
On the command line I run:
./bin/flink run \
--python /home/ubuntu/load_kafka.py \
--jarfile /home/ubuntu/flink-connector-kafka_2.12-1.12.2.jar
I downloaded the jar from:
https://mvnr
Thanks to everyone for the tips. They help a lot. I'll try the Table API first, and if that doesn't work I'll do as Klemens says.
Cheers,
Giacomo
Sent: Wednesday, 14 April 2021 at 16:18
From: "Dian Fu"
To: "Klemens Muthmann"
Cc: "Yik San Chan", g.g.m.5...@web.de, "user"
Subject:
Hi,
I'm new to Flink and I am trying to create a stream from locally downloaded tweets. The tweets are in JSON format, like in this example:
{"data":{"text":"Polsek Kakas Cegah Covid-19 https://t.co/ADjEgpt7bC","public_metrics":{"retweet_count":0,"reply_count":0,"like_count":0,"quote_count":0}},
"a