[GitHub] [flink] twalthr commented on a change in pull request #14053: [FLINK-18279][docs] Simplify Table Overview

GitBox Mon, 23 Nov 2020 06:31:21 -0800


twalthr commented on a change in pull request #14053:
URL: https://github.com/apache/flink/pull/14053#discussion_r528738848




##########
File path: docs/dev/table/index.md
##########
@@ -25,93 +25,106 @@ specific language governing permissions and limitations
 under the License.
 -->
 
-Apache Flink features two relational APIs - the Table API and SQL - for 
unified stream and batch processing. The Table API is a language-integrated 
query API for Scala and Java that allows the composition of queries from 
relational operators such as selection, filter, and join in a very intuitive 
way. Flink's SQL support is based on [Apache 
Calcite](https://calcite.apache.org) which implements the SQL standard. Queries 
specified in either interface have the same semantics and specify the same 
result regardless whether the input is a batch input (DataSet) or a stream 
input (DataStream).
-
-The Table API and the SQL interfaces are tightly integrated with each other as 
well as Flink's DataStream and DataSet APIs. You can easily switch between all 
APIs and libraries which build upon the APIs. For instance, you can extract 
patterns from a DataStream using the [CEP library]({{ site.baseurl 
}}/dev/libs/cep.html) and later use the Table API to analyze the patterns, or 
you might scan, filter, and aggregate a batch table using a SQL query before 
running a [Gelly graph algorithm]({{ site.baseurl }}/dev/libs/gelly) on the 
preprocessed data.
-
-**Please note that the Table API and SQL are not yet feature complete and are 
being actively developed. Not all operations are supported by every combination 
of \[Table API, SQL\] and \[stream, batch\] input.**
-
-Dependency Structure
---------------------
-
-Starting from Flink 1.9, Flink provides two different planner implementations 
for evaluating Table & SQL API programs: the Blink planner and the old planner 
that was available before Flink 1.9. Planners are responsible for
-translating relational operators into an executable, optimized Flink job. Both 
of the planners come with different optimization rules and runtime classes.
-They may also differ in the set of supported features.
-
-<span class="label label-danger">Attention</span> For production use cases, we 
recommend the blink planner that has become the default planner since 1.11.
-
-All Table API and SQL components are bundled in the `flink-table` or 
`flink-table-blink` Maven artifacts.
-
-The following dependencies are relevant for most projects:
-
-* `flink-table-common`: A common module for extending the table ecosystem by 
custom functions, formats, etc.
-* `flink-table-api-java`: The Table & SQL API for pure table programs using 
the Java programming language (in early development stage, not recommended!).
-* `flink-table-api-scala`: The Table & SQL API for pure table programs using 
the Scala programming language (in early development stage, not recommended!).
-* `flink-table-api-java-bridge`: The Table & SQL API with DataStream/DataSet 
API support using the Java programming language.
-* `flink-table-api-scala-bridge`: The Table & SQL API with DataStream/DataSet 
API support using the Scala programming language.
-* `flink-table-planner`: The table program planner and runtime. This was the 
only planner of Flink before the 1.9 release. It's no longer recommended since 
Flink 1.11.
-* `flink-table-planner-blink`: The new Blink planner, which has become the 
default one since Flink 1.11.
-* `flink-table-runtime-blink`: The new Blink runtime.
-* `flink-table-uber`: Packages the API modules above plus the old planner into 
a distribution for most Table & SQL API use cases. The uber JAR file 
`flink-table-*.jar` is located in the `/lib` directory of a Flink release by 
default.
-* `flink-table-uber-blink`: Packages the API modules above plus the Blink 
specific modules into a distribution for most Table & SQL API use cases. The 
uber JAR file `flink-table-blink-*.jar` is located in the `/lib` directory of a 
Flink release by default.
-
-See the [common API](common.html) page for more information about how to 
switch between the old and new Blink planner in table programs.
+Apache Flink features two relational APIs - the Table API and SQL - for 
unified stream and batch
+processing. The Table API is a language-integrated query API for Java, Scala, 
and Python that
+allows the composition of queries from relational operators such as selection, 
filter, and join in
+a very intuitive way. Flink's SQL support is based on [Apache 
Calcite](https://calcite.apache.org)
+which implements the SQL standard. Queries specified in either interface have 
the same semantics
+and specify the same result regardless of whether the input is continuous 
(streaming) or bounded (batch).
+
+The Table API and SQL interfaces integrate seamlessly with each other and 
Flink's DataStream API. 
+You can easily switch between all APIs and libraries which build upon them.
+For instance, you can extract patterns from a Table using [Match Recognize]({% 
link dev/table/streaming/match_recognize.md %})
+and later use the DataStream API to build alerting based on the matched 
patterns.
+
+Table Planners
+--------------
+
+Table planners are responsible for translating relational operators into an 
executable, optimized Flink job.
+Flink supports two different planner implementations; the modern Blink planner 
and the legacy planner.
+For production use cases, we recommend the blink planner which has been the 
default planner since 1.11.

Review comment:
       ```suggestion
   For production use cases, we recommend the Blink planner which has been the 
default planner since 1.11.
   ```

##########
File path: docs/dev/table/index.md
##########
@@ -25,93 +25,106 @@ specific language governing permissions and limitations
 under the License.
 -->
 
-Apache Flink features two relational APIs - the Table API and SQL - for 
unified stream and batch processing. The Table API is a language-integrated 
query API for Scala and Java that allows the composition of queries from 
relational operators such as selection, filter, and join in a very intuitive 
way. Flink's SQL support is based on [Apache 
Calcite](https://calcite.apache.org) which implements the SQL standard. Queries 
specified in either interface have the same semantics and specify the same 
result regardless whether the input is a batch input (DataSet) or a stream 
input (DataStream).
-
-The Table API and the SQL interfaces are tightly integrated with each other as 
well as Flink's DataStream and DataSet APIs. You can easily switch between all 
APIs and libraries which build upon the APIs. For instance, you can extract 
patterns from a DataStream using the [CEP library]({{ site.baseurl 
}}/dev/libs/cep.html) and later use the Table API to analyze the patterns, or 
you might scan, filter, and aggregate a batch table using a SQL query before 
running a [Gelly graph algorithm]({{ site.baseurl }}/dev/libs/gelly) on the 
preprocessed data.
-
-**Please note that the Table API and SQL are not yet feature complete and are 
being actively developed. Not all operations are supported by every combination 
of \[Table API, SQL\] and \[stream, batch\] input.**
-
-Dependency Structure
---------------------
-
-Starting from Flink 1.9, Flink provides two different planner implementations 
for evaluating Table & SQL API programs: the Blink planner and the old planner 
that was available before Flink 1.9. Planners are responsible for
-translating relational operators into an executable, optimized Flink job. Both 
of the planners come with different optimization rules and runtime classes.
-They may also differ in the set of supported features.
-
-<span class="label label-danger">Attention</span> For production use cases, we 
recommend the blink planner that has become the default planner since 1.11.
-
-All Table API and SQL components are bundled in the `flink-table` or 
`flink-table-blink` Maven artifacts.
-
-The following dependencies are relevant for most projects:
-
-* `flink-table-common`: A common module for extending the table ecosystem by 
custom functions, formats, etc.
-* `flink-table-api-java`: The Table & SQL API for pure table programs using 
the Java programming language (in early development stage, not recommended!).
-* `flink-table-api-scala`: The Table & SQL API for pure table programs using 
the Scala programming language (in early development stage, not recommended!).
-* `flink-table-api-java-bridge`: The Table & SQL API with DataStream/DataSet 
API support using the Java programming language.
-* `flink-table-api-scala-bridge`: The Table & SQL API with DataStream/DataSet 
API support using the Scala programming language.
-* `flink-table-planner`: The table program planner and runtime. This was the 
only planner of Flink before the 1.9 release. It's no longer recommended since 
Flink 1.11.
-* `flink-table-planner-blink`: The new Blink planner, which has become the 
default one since Flink 1.11.
-* `flink-table-runtime-blink`: The new Blink runtime.
-* `flink-table-uber`: Packages the API modules above plus the old planner into 
a distribution for most Table & SQL API use cases. The uber JAR file 
`flink-table-*.jar` is located in the `/lib` directory of a Flink release by 
default.
-* `flink-table-uber-blink`: Packages the API modules above plus the Blink 
specific modules into a distribution for most Table & SQL API use cases. The 
uber JAR file `flink-table-blink-*.jar` is located in the `/lib` directory of a 
Flink release by default.
-
-See the [common API](common.html) page for more information about how to 
switch between the old and new Blink planner in table programs.
+Apache Flink features two relational APIs - the Table API and SQL - for 
unified stream and batch
+processing. The Table API is a language-integrated query API for Java, Scala, 
and Python that
+allows the composition of queries from relational operators such as selection, 
filter, and join in
+a very intuitive way. Flink's SQL support is based on [Apache 
Calcite](https://calcite.apache.org)
+which implements the SQL standard. Queries specified in either interface have 
the same semantics
+and specify the same result regardless of whether the input is continuous 
(streaming) or bounded (batch).
+
+The Table API and SQL interfaces integrate seamlessly with each other and 
Flink's DataStream API. 
+You can easily switch between all APIs and libraries which build upon them.
+For instance, you can extract patterns from a Table using [Match Recognize]({% 
link dev/table/streaming/match_recognize.md %})

Review comment:
       ```suggestion
   For instance, you can detect patterns from a table using [`MATCH_RECOGNIZE` 
clause]({% link dev/table/streaming/match_recognize.md %})
   ```

##########
File path: docs/dev/table/index.md
##########
@@ -25,93 +25,106 @@ specific language governing permissions and limitations
 under the License.
 -->
 
-Apache Flink features two relational APIs - the Table API and SQL - for 
unified stream and batch processing. The Table API is a language-integrated 
query API for Scala and Java that allows the composition of queries from 
relational operators such as selection, filter, and join in a very intuitive 
way. Flink's SQL support is based on [Apache 
Calcite](https://calcite.apache.org) which implements the SQL standard. Queries 
specified in either interface have the same semantics and specify the same 
result regardless whether the input is a batch input (DataSet) or a stream 
input (DataStream).
-
-The Table API and the SQL interfaces are tightly integrated with each other as 
well as Flink's DataStream and DataSet APIs. You can easily switch between all 
APIs and libraries which build upon the APIs. For instance, you can extract 
patterns from a DataStream using the [CEP library]({{ site.baseurl 
}}/dev/libs/cep.html) and later use the Table API to analyze the patterns, or 
you might scan, filter, and aggregate a batch table using a SQL query before 
running a [Gelly graph algorithm]({{ site.baseurl }}/dev/libs/gelly) on the 
preprocessed data.
-
-**Please note that the Table API and SQL are not yet feature complete and are 
being actively developed. Not all operations are supported by every combination 
of \[Table API, SQL\] and \[stream, batch\] input.**
-
-Dependency Structure
---------------------
-
-Starting from Flink 1.9, Flink provides two different planner implementations 
for evaluating Table & SQL API programs: the Blink planner and the old planner 
that was available before Flink 1.9. Planners are responsible for
-translating relational operators into an executable, optimized Flink job. Both 
of the planners come with different optimization rules and runtime classes.
-They may also differ in the set of supported features.
-
-<span class="label label-danger">Attention</span> For production use cases, we 
recommend the blink planner that has become the default planner since 1.11.
-
-All Table API and SQL components are bundled in the `flink-table` or 
`flink-table-blink` Maven artifacts.
-
-The following dependencies are relevant for most projects:
-
-* `flink-table-common`: A common module for extending the table ecosystem by 
custom functions, formats, etc.
-* `flink-table-api-java`: The Table & SQL API for pure table programs using 
the Java programming language (in early development stage, not recommended!).
-* `flink-table-api-scala`: The Table & SQL API for pure table programs using 
the Scala programming language (in early development stage, not recommended!).
-* `flink-table-api-java-bridge`: The Table & SQL API with DataStream/DataSet 
API support using the Java programming language.
-* `flink-table-api-scala-bridge`: The Table & SQL API with DataStream/DataSet 
API support using the Scala programming language.
-* `flink-table-planner`: The table program planner and runtime. This was the 
only planner of Flink before the 1.9 release. It's no longer recommended since 
Flink 1.11.
-* `flink-table-planner-blink`: The new Blink planner, which has become the 
default one since Flink 1.11.
-* `flink-table-runtime-blink`: The new Blink runtime.
-* `flink-table-uber`: Packages the API modules above plus the old planner into 
a distribution for most Table & SQL API use cases. The uber JAR file 
`flink-table-*.jar` is located in the `/lib` directory of a Flink release by 
default.
-* `flink-table-uber-blink`: Packages the API modules above plus the Blink 
specific modules into a distribution for most Table & SQL API use cases. The 
uber JAR file `flink-table-blink-*.jar` is located in the `/lib` directory of a 
Flink release by default.
-
-See the [common API](common.html) page for more information about how to 
switch between the old and new Blink planner in table programs.
+Apache Flink features two relational APIs - the Table API and SQL - for 
unified stream and batch
+processing. The Table API is a language-integrated query API for Java, Scala, 
and Python that
+allows the composition of queries from relational operators such as selection, 
filter, and join in
+a very intuitive way. Flink's SQL support is based on [Apache 
Calcite](https://calcite.apache.org)
+which implements the SQL standard. Queries specified in either interface have 
the same semantics
+and specify the same result regardless of whether the input is continuous 
(streaming) or bounded (batch).
+
+The Table API and SQL interfaces integrate seamlessly with each other and 
Flink's DataStream API. 
+You can easily switch between all APIs and libraries which build upon them.
+For instance, you can extract patterns from a Table using [Match Recognize]({% 
link dev/table/streaming/match_recognize.md %})
+and later use the DataStream API to build alerting based on the matched 
patterns.
+
+Table Planners
+--------------
+
+Table planners are responsible for translating relational operators into an 
executable, optimized Flink job.
+Flink supports two different planner implementations; the modern Blink planner 
and the legacy planner.
+For production use cases, we recommend the blink planner which has been the 
default planner since 1.11.
+See the [common API]({% link dev/table/common.md %}) page for more information 
on how to switch between the two planners.
 
 ### Table Program Dependencies
 
-Depending on the target programming language, you need to add the Java or 
Scala API to a project in order to use the Table API & SQL for defining 
pipelines:
+Depending on the target programming language, you need to add the Java or 
Scala API to a project

Review comment:
       mention Python here as well?

##########
File path: docs/dev/table/index.md
##########
@@ -25,93 +25,106 @@ specific language governing permissions and limitations
 under the License.
 -->
 
-Apache Flink features two relational APIs - the Table API and SQL - for 
unified stream and batch processing. The Table API is a language-integrated 
query API for Scala and Java that allows the composition of queries from 
relational operators such as selection, filter, and join in a very intuitive 
way. Flink's SQL support is based on [Apache 
Calcite](https://calcite.apache.org) which implements the SQL standard. Queries 
specified in either interface have the same semantics and specify the same 
result regardless whether the input is a batch input (DataSet) or a stream 
input (DataStream).
-
-The Table API and the SQL interfaces are tightly integrated with each other as 
well as Flink's DataStream and DataSet APIs. You can easily switch between all 
APIs and libraries which build upon the APIs. For instance, you can extract 
patterns from a DataStream using the [CEP library]({{ site.baseurl 
}}/dev/libs/cep.html) and later use the Table API to analyze the patterns, or 
you might scan, filter, and aggregate a batch table using a SQL query before 
running a [Gelly graph algorithm]({{ site.baseurl }}/dev/libs/gelly) on the 
preprocessed data.
-
-**Please note that the Table API and SQL are not yet feature complete and are 
being actively developed. Not all operations are supported by every combination 
of \[Table API, SQL\] and \[stream, batch\] input.**
-
-Dependency Structure
---------------------
-
-Starting from Flink 1.9, Flink provides two different planner implementations 
for evaluating Table & SQL API programs: the Blink planner and the old planner 
that was available before Flink 1.9. Planners are responsible for
-translating relational operators into an executable, optimized Flink job. Both 
of the planners come with different optimization rules and runtime classes.
-They may also differ in the set of supported features.
-
-<span class="label label-danger">Attention</span> For production use cases, we 
recommend the blink planner that has become the default planner since 1.11.
-
-All Table API and SQL components are bundled in the `flink-table` or 
`flink-table-blink` Maven artifacts.
-
-The following dependencies are relevant for most projects:
-
-* `flink-table-common`: A common module for extending the table ecosystem by 
custom functions, formats, etc.
-* `flink-table-api-java`: The Table & SQL API for pure table programs using 
the Java programming language (in early development stage, not recommended!).
-* `flink-table-api-scala`: The Table & SQL API for pure table programs using 
the Scala programming language (in early development stage, not recommended!).
-* `flink-table-api-java-bridge`: The Table & SQL API with DataStream/DataSet 
API support using the Java programming language.
-* `flink-table-api-scala-bridge`: The Table & SQL API with DataStream/DataSet 
API support using the Scala programming language.
-* `flink-table-planner`: The table program planner and runtime. This was the 
only planner of Flink before the 1.9 release. It's no longer recommended since 
Flink 1.11.
-* `flink-table-planner-blink`: The new Blink planner, which has become the 
default one since Flink 1.11.
-* `flink-table-runtime-blink`: The new Blink runtime.
-* `flink-table-uber`: Packages the API modules above plus the old planner into 
a distribution for most Table & SQL API use cases. The uber JAR file 
`flink-table-*.jar` is located in the `/lib` directory of a Flink release by 
default.
-* `flink-table-uber-blink`: Packages the API modules above plus the Blink 
specific modules into a distribution for most Table & SQL API use cases. The 
uber JAR file `flink-table-blink-*.jar` is located in the `/lib` directory of a 
Flink release by default.
-
-See the [common API](common.html) page for more information about how to 
switch between the old and new Blink planner in table programs.
+Apache Flink features two relational APIs - the Table API and SQL - for 
unified stream and batch
+processing. The Table API is a language-integrated query API for Java, Scala, 
and Python that
+allows the composition of queries from relational operators such as selection, 
filter, and join in
+a very intuitive way. Flink's SQL support is based on [Apache 
Calcite](https://calcite.apache.org)
+which implements the SQL standard. Queries specified in either interface have 
the same semantics
+and specify the same result regardless of whether the input is continuous 
(streaming) or bounded (batch).
+
+The Table API and SQL interfaces integrate seamlessly with each other and 
Flink's DataStream API. 
+You can easily switch between all APIs and libraries which build upon them.
+For instance, you can extract patterns from a Table using [Match Recognize]({% 
link dev/table/streaming/match_recognize.md %})
+and later use the DataStream API to build alerting based on the matched 
patterns.
+
+Table Planners
+--------------
+
+Table planners are responsible for translating relational operators into an 
executable, optimized Flink job.
+Flink supports two different planner implementations; the modern Blink planner 
and the legacy planner.
+For production use cases, we recommend the blink planner which has been the 
default planner since 1.11.
+See the [common API]({% link dev/table/common.md %}) page for more information 
on how to switch between the two planners.
 
 ### Table Program Dependencies
 
-Depending on the target programming language, you need to add the Java or 
Scala API to a project in order to use the Table API & SQL for defining 
pipelines:
+Depending on the target programming language, you need to add the Java or 
Scala API to a project
+in order to use the Table API & SQL for defining pipelines.
 
+<div class="codetabs" markdown="1">
+<div data-lang="java" markdown="1">
 {% highlight xml %}
-<!-- Either... -->
 <dependency>
   <groupId>org.apache.flink</groupId>
   <artifactId>flink-table-api-java-bridge{{ site.scala_version_suffix 
}}</artifactId>
   <version>{{site.version}}</version>
   <scope>provided</scope>
 </dependency>
-<!-- or... -->
+{% endhighlight %}
+</div>
+<div data-lang="scala" markdown="1">
+{% highlight xml %}
 <dependency>
   <groupId>org.apache.flink</groupId>
   <artifactId>flink-table-api-scala-bridge{{ site.scala_version_suffix 
}}</artifactId>
   <version>{{site.version}}</version>
   <scope>provided</scope>
 </dependency>
 {% endhighlight %}
+</div>
+<div data-lang="python">
+{% highlight bash %}
+{% if site.is_stable %}
+$ python -m pip install apache-flink {{ site.version }}
+{% else %}
+$ python -m pip install apache-flink
+{% endif %}
+{% endhighlight %}
+</div>
+</div>
 
-Additionally, if you want to run the Table API & SQL programs locally within 
your IDE, you must add one of the
-following set of modules, depending which planner you want to use:
+Additionally, if you want to run the Table API & SQL programs locally within 
your IDE, you must add the
+following set of modules, depending which planner you want to use.
 
+<div class="codetabs" markdown="1">
+<div data-lang="Blink Planner" markdown="1">
 {% highlight xml %}
-<!-- Either... (for the old planner that was available before Flink 1.9) -->
 <dependency>
   <groupId>org.apache.flink</groupId>
-  <artifactId>flink-table-planner{{ site.scala_version_suffix }}</artifactId>
+  <artifactId>flink-table-planner-blink{{ site.scala_version_suffix 
}}</artifactId>
   <version>{{site.version}}</version>
   <scope>provided</scope>
 </dependency>
-<!-- or.. (for the new Blink planner) -->
 <dependency>
   <groupId>org.apache.flink</groupId>
-  <artifactId>flink-table-planner-blink{{ site.scala_version_suffix 
}}</artifactId>
+  <artifactId>flink-streaming-scala{{ site.scala_version_suffix }}</artifactId>
   <version>{{site.version}}</version>
   <scope>provided</scope>
 </dependency>
 {% endhighlight %}
-
-Internally, parts of the table ecosystem are implemented in Scala. Therefore, 
please make sure to add the following dependency for both batch and streaming 
applications:
-
+</div>
+<div data-lang="Legacy Planner" markdown="1">
 {% highlight xml %}
+<dependency>
+  <groupId>org.apache.flink</groupId>
+  <artifactId>flink-table-planner{{ site.scala_version_suffix }}</artifactId>
+  <version>{{site.version}}</version>
+  <scope>provided</scope>
+</dependency>
 <dependency>
   <groupId>org.apache.flink</groupId>
   <artifactId>flink-streaming-scala{{ site.scala_version_suffix }}</artifactId>
   <version>{{site.version}}</version>
   <scope>provided</scope>
 </dependency>
 {% endhighlight %}
+</div>
+</div>
 
 ### Extension Dependencies
 
-If you want to implement a [custom format]({{ site.baseurl 
}}/dev/table/sourceSinks.html#define-a-tablefactory) for interacting with Kafka 
or a set of [user-defined functions]({{ site.baseurl 
}}/dev/table/functions/systemFunctions.html), the following dependency is 
sufficient and can be used for JAR files for the SQL Client:
+If you want to implement a [custom format]({% link dev/table/sourceSinks.md 
%}#define-a-tablefactory) 
+for (de)serializing rows or a set of [user-defined functions]({% link 
dev/table/functions/systemFunctions.md %}),

Review comment:
       ```suggestion
   for (de)serializing rows or a set of [user-defined functions]({% link 
dev/table/functions/udfs.md %}),
   ```

##########
File path: docs/dev/table/index.md
##########
@@ -25,93 +25,106 @@ specific language governing permissions and limitations
 under the License.
 -->
 
-Apache Flink features two relational APIs - the Table API and SQL - for 
unified stream and batch processing. The Table API is a language-integrated 
query API for Scala and Java that allows the composition of queries from 
relational operators such as selection, filter, and join in a very intuitive 
way. Flink's SQL support is based on [Apache 
Calcite](https://calcite.apache.org) which implements the SQL standard. Queries 
specified in either interface have the same semantics and specify the same 
result regardless whether the input is a batch input (DataSet) or a stream 
input (DataStream).
-
-The Table API and the SQL interfaces are tightly integrated with each other as 
well as Flink's DataStream and DataSet APIs. You can easily switch between all 
APIs and libraries which build upon the APIs. For instance, you can extract 
patterns from a DataStream using the [CEP library]({{ site.baseurl 
}}/dev/libs/cep.html) and later use the Table API to analyze the patterns, or 
you might scan, filter, and aggregate a batch table using a SQL query before 
running a [Gelly graph algorithm]({{ site.baseurl }}/dev/libs/gelly) on the 
preprocessed data.
-
-**Please note that the Table API and SQL are not yet feature complete and are 
being actively developed. Not all operations are supported by every combination 
of \[Table API, SQL\] and \[stream, batch\] input.**
-
-Dependency Structure
---------------------
-
-Starting from Flink 1.9, Flink provides two different planner implementations 
for evaluating Table & SQL API programs: the Blink planner and the old planner 
that was available before Flink 1.9. Planners are responsible for
-translating relational operators into an executable, optimized Flink job. Both 
of the planners come with different optimization rules and runtime classes.
-They may also differ in the set of supported features.
-
-<span class="label label-danger">Attention</span> For production use cases, we 
recommend the blink planner that has become the default planner since 1.11.
-
-All Table API and SQL components are bundled in the `flink-table` or 
`flink-table-blink` Maven artifacts.
-
-The following dependencies are relevant for most projects:
-
-* `flink-table-common`: A common module for extending the table ecosystem by 
custom functions, formats, etc.
-* `flink-table-api-java`: The Table & SQL API for pure table programs using 
the Java programming language (in early development stage, not recommended!).
-* `flink-table-api-scala`: The Table & SQL API for pure table programs using 
the Scala programming language (in early development stage, not recommended!).
-* `flink-table-api-java-bridge`: The Table & SQL API with DataStream/DataSet 
API support using the Java programming language.
-* `flink-table-api-scala-bridge`: The Table & SQL API with DataStream/DataSet 
API support using the Scala programming language.
-* `flink-table-planner`: The table program planner and runtime. This was the 
only planner of Flink before the 1.9 release. It's no longer recommended since 
Flink 1.11.
-* `flink-table-planner-blink`: The new Blink planner, which has become the 
default one since Flink 1.11.
-* `flink-table-runtime-blink`: The new Blink runtime.
-* `flink-table-uber`: Packages the API modules above plus the old planner into 
a distribution for most Table & SQL API use cases. The uber JAR file 
`flink-table-*.jar` is located in the `/lib` directory of a Flink release by 
default.
-* `flink-table-uber-blink`: Packages the API modules above plus the Blink 
specific modules into a distribution for most Table & SQL API use cases. The 
uber JAR file `flink-table-blink-*.jar` is located in the `/lib` directory of a 
Flink release by default.
-
-See the [common API](common.html) page for more information about how to 
switch between the old and new Blink planner in table programs.
+Apache Flink features two relational APIs - the Table API and SQL - for 
unified stream and batch
+processing. The Table API is a language-integrated query API for Java, Scala, 
and Python that
+allows the composition of queries from relational operators such as selection, 
filter, and join in
+a very intuitive way. Flink's SQL support is based on [Apache 
Calcite](https://calcite.apache.org)
+which implements the SQL standard. Queries specified in either interface have 
the same semantics
+and specify the same result regardless of whether the input is continuous 
(streaming) or bounded (batch).
+
+The Table API and SQL interfaces integrate seamlessly with each other and 
Flink's DataStream API. 
+You can easily switch between all APIs and libraries which build upon them.
+For instance, you can extract patterns from a Table using [Match Recognize]({% 
link dev/table/streaming/match_recognize.md %})
+and later use the DataStream API to build alerting based on the matched 
patterns.
+
+Table Planners
+--------------
+
+Table planners are responsible for translating relational operators into an 
executable, optimized Flink job.
+Flink supports two different planner implementations; the modern Blink planner 
and the legacy planner.
+For production use cases, we recommend the blink planner which has been the 
default planner since 1.11.
+See the [common API]({% link dev/table/common.md %}) page for more information 
on how to switch between the two planners.
 
 ### Table Program Dependencies
 
-Depending on the target programming language, you need to add the Java or 
Scala API to a project in order to use the Table API & SQL for defining 
pipelines:
+Depending on the target programming language, you need to add the Java or 
Scala API to a project
+in order to use the Table API & SQL for defining pipelines.
 
+<div class="codetabs" markdown="1">
+<div data-lang="java" markdown="1">
 {% highlight xml %}
-<!-- Either... -->
 <dependency>
   <groupId>org.apache.flink</groupId>
   <artifactId>flink-table-api-java-bridge{{ site.scala_version_suffix 
}}</artifactId>
   <version>{{site.version}}</version>
   <scope>provided</scope>
 </dependency>
-<!-- or... -->
+{% endhighlight %}
+</div>
+<div data-lang="scala" markdown="1">
+{% highlight xml %}
 <dependency>
   <groupId>org.apache.flink</groupId>
   <artifactId>flink-table-api-scala-bridge{{ site.scala_version_suffix 
}}</artifactId>
   <version>{{site.version}}</version>
   <scope>provided</scope>
 </dependency>
 {% endhighlight %}
+</div>
+<div data-lang="python">
+{% highlight bash %}
+{% if site.is_stable %}
+$ python -m pip install apache-flink {{ site.version }}
+{% else %}
+$ python -m pip install apache-flink
+{% endif %}
+{% endhighlight %}
+</div>
+</div>
 
-Additionally, if you want to run the Table API & SQL programs locally within 
your IDE, you must add one of the
-following set of modules, depending which planner you want to use:
+Additionally, if you want to run the Table API & SQL programs locally within 
your IDE, you must add the
+following set of modules, depending which planner you want to use.
 
+<div class="codetabs" markdown="1">
+<div data-lang="Blink Planner" markdown="1">
 {% highlight xml %}
-<!-- Either... (for the old planner that was available before Flink 1.9) -->
 <dependency>
   <groupId>org.apache.flink</groupId>
-  <artifactId>flink-table-planner{{ site.scala_version_suffix }}</artifactId>
+  <artifactId>flink-table-planner-blink{{ site.scala_version_suffix 
}}</artifactId>
   <version>{{site.version}}</version>
   <scope>provided</scope>
 </dependency>
-<!-- or.. (for the new Blink planner) -->
 <dependency>
   <groupId>org.apache.flink</groupId>
-  <artifactId>flink-table-planner-blink{{ site.scala_version_suffix 
}}</artifactId>
+  <artifactId>flink-streaming-scala{{ site.scala_version_suffix }}</artifactId>
   <version>{{site.version}}</version>
   <scope>provided</scope>
 </dependency>
 {% endhighlight %}
-
-Internally, parts of the table ecosystem are implemented in Scala. Therefore, 
please make sure to add the following dependency for both batch and streaming 
applications:
-
+</div>
+<div data-lang="Legacy Planner" markdown="1">
 {% highlight xml %}
+<dependency>
+  <groupId>org.apache.flink</groupId>
+  <artifactId>flink-table-planner{{ site.scala_version_suffix }}</artifactId>
+  <version>{{site.version}}</version>
+  <scope>provided</scope>
+</dependency>
 <dependency>
   <groupId>org.apache.flink</groupId>
   <artifactId>flink-streaming-scala{{ site.scala_version_suffix }}</artifactId>
   <version>{{site.version}}</version>
   <scope>provided</scope>
 </dependency>
 {% endhighlight %}
+</div>
+</div>
 
 ### Extension Dependencies
 
-If you want to implement a [custom format]({{ site.baseurl 
}}/dev/table/sourceSinks.html#define-a-tablefactory) for interacting with Kafka 
or a set of [user-defined functions]({{ site.baseurl 
}}/dev/table/functions/systemFunctions.html), the following dependency is 
sufficient and can be used for JAR files for the SQL Client:
+If you want to implement a [custom format]({% link dev/table/sourceSinks.md 
%}#define-a-tablefactory) 

Review comment:
       ```suggestion
   If you want to implement a [custom format or connector]({% link 
dev/table/sourceSinks.md %}) 
   ```




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

[GitHub] [flink] twalthr commented on a change in pull request #14053: [FLINK-18279][docs] Simplify Table Overview

Reply via email to