Re: [DISCUSS] Support scalar vectorized Python UDF in PyFlink

2020-02-05 Thread dianfu
on call. > - ColumnarRow for vectorization reading seems that we need implement > ArrowColumnVectors. > > Best, > Jingsong Lee > > On Wed, Feb 5, 2020 at 12:45 PM dianfu wrote: > >> Hi all, >> >> Scalar Python UDF has already been supported in the coming re

Re: [DISCUSS] Upload the Flink Python API 1.9.x to PyPI for user convenience.

2020-02-04 Thread dianfu
Hi Jincheng, Thanks for the proposal. I think this is a good idea. Usually Python users will firstly try to install a Python package from PyPI when they are trying it out. This would definitely benefit Python users. Regards, Dian > 在 2020年2月5日,上午8:56,jincheng sun 写道: > > I see, thanks for th

[DISCUSS] Support scalar vectorized Python UDF in PyFlink

2020-02-04 Thread dianfu
Hi all, Scalar Python UDF has already been supported in the coming release 1.10 (FLIP-58[1]). It operates one row at a time. It works in the way that the Java operator serializes one input row to bytes and sends them to the Python worker; the Python worker deserializes the input row and evaluat