Re: PyFlink and parallelism

2022-07-18 Thread John Tipper
s broken if the tablestream contains a timestamp, which I reported a little while ago and Dian filed as FLINK-28253. Kind regards, John From: Juntao Hu Sent: 18 July 2022 04:13 To: John Tipper Cc: user@flink.apache.org Subject: Re: PyFlink and parallelism It&

Re: PyFlink and parallelism

2022-07-17 Thread Dian Fu
llelism([class java.lang.Integer]) does not exist. > > > > Looks like Python is converting to the Integer object in Java and not the > int primitive. I actually see this if I just call set_parallelism(1)​ > without the call to get_config()​. Is this a bug or is there a workaround

Re: PyFlink and parallelism

2022-07-17 Thread Juntao Hu
and not the > int primitive. I actually see this if I just call set_parallelism(1)​ > without the call to get_config()​. Is this a bug or is there a workaround? > > -- > *From:* John Tipper > *Sent:* 15 July 2022 16:44 > *To:* user@flink.apache.org >

Re: PyFlink and parallelism

2022-07-16 Thread John Tipper
t in Java and not the int primitive. I actually see this if I just call set_parallelism(1)​ without the call to get_config()​. Is this a bug or is there a workaround? From: John Tipper Sent: 15 July 2022 16:44 To: user@flink.apache.org Subject: PyFlink and p

PyFlink and parallelism

2022-07-15 Thread John Tipper
Hi all, I have a processing topology using PyFlink and SQL where there is data skew: I'm splitting a stream of heterogenous data into separate streams based on the type of data that's in it and some of these substreams have very many more events than others and this is causing issues when check