Re: [DISCUSS] Enhance convenience of TableEnvironment in TableAPI/SQL

Timo Walther Mon, 10 Dec 2018 23:13:43 -0800

Hi Jincheng,

thanks for the proposal. I totally agree with the problem of having 3StreamTableEnvironments and 3 BatchTableEnvironments. We also identifiedthis problem when doing Flink trainings and introductions to the Table &SQL API.

Actually, @Dawid and I were already discussing to remove thisshortcoming while working on FLINK-11067 [1]. The porting allows to fixthe class hierarchy because some visibilities of members change as wellfrom Scala to Java. This would not break backwards compatibility as thebase classes should not be used by users anyway.

However, I don't like the design of putting all methods of Batch andStream environments into the base class and throw exceptions if notsupported by base classes. This sounds not like a nice object orienteddesign and confuses users.

I added some comments to the document. I think we can improve thecurrent situation without breaking backwards compatibility. Methods thatinteract with Scala and Java API such as toDataSet/toDataStream shouldnot be moved to an abstract class as they would otherwise pull in Scaladependencies transitively or do not incoperate with the type extractionlogic of the target API.


Regards,
Timo

[1] https://issues.apache.org/jira/browse/FLINK-11067


Am 11.12.18 um 06:12 schrieb Zhang, Xuefu:

Hi Jincheng,

Thanks for bringing this up. It seems making good sense to me. However, one
concern I have is about backward compatibility. Could you clarify whether
existing user program will break with the proposed changes?

The answer to the question would largely determine when this can be introduced.

Thanks,
Xuefu

------------------------------------------------------------------
Sender:jincheng sun <sunjincheng...@gmail.com>
Sent at:2018 Dec 10 (Mon) 18:14
Recipient:dev <dev@flink.apache.org>
Subject:[DISCUSS] Enhance convenience of TableEnvironment in TableAPI/SQL

Hi All,

According to the feedback from users, the design of TableEnvironment is very
inconvenient for users, and often mistakenly imported by IDE, especially for
Java users, such as:

ExecutionEnvironment env = ...BatchTableEnvironment tEnv =
TableEnvironment.getTableEnvironment(env);

The user does not know which BatchTableEnvironment should be imported, because
there are three implementations of BatchTableEnvironment, shown as below:

1. org.apache.flink.table.api.BatchTableEnvironment 2.
org.apache.flink.table.api.java.BatchTableEnvironment 3.
org.apache.flink.table.api.scala.BatchTableEnvironment
[image.png]

This brings unnecessary inconveniences to the flink user. To solve this
problem, Wei Zhong, Hequn Cheng, Dian Fu, Shaoxuan Wang and myself discussed
offline a bit and propose to change the inheritance diagram of TableEnvironment
is shown as follows:
1. AbstractTaleEnvironment - rename current TableEnvironment to
AbstractTableEnvironment, The functionality implemented by Abstract
TableEnvironment is stream and batch shared.2. TableEnvironment - Create a new
TableEnvironment(abstract), and defined all methods in
(java/scala)StreamTableEnvironment and (java/scala)BatchTableEnvironment. In
the implementation of BatchTableEnviroment and StreamTableEnviroment, the
unsupported operations will be reported as an error.
[image.png]
Then the usage as follows:

ExecutionEnvironment env = …TableEnvironment tEnv =
TableEnvironment.getTableEnvironment(env)
For detailed proposals please refer to the Google doc:
https://docs.google.com/document/d/1t-AUGuaChADddyJi6e0WLsTDEnf9ZkupvvBiQ4yTTEI/edit?usp=sharing

Any mail feedback and Google doc comment are welcome.

Thanks,
Jincheng

Re: [DISCUSS] Enhance convenience of TableEnvironment in TableAPI/SQL

Reply via email to