This is an automated email from the ASF dual-hosted git repository. morningman pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/incubator-doris.git
The following commit(s) were added to refs/heads/master by this push: new 40f993c [docs][seatunnel] add seatunnel flink doris sink doc (#7844) 40f993c is described below commit 40f993ca15015450fcd7bd596153314c3a31be57 Author: wudi <676366...@qq.com> AuthorDate: Mon Jan 24 21:13:06 2022 +0800 [docs][seatunnel] add seatunnel flink doris sink doc (#7844) --- docs/.vuepress/sidebar/en.js | 9 +- docs/.vuepress/sidebar/zh-CN.js | 9 +- docs/en/extending-doris/seatunnel/flink-sink.md | 116 +++++++++++++++++++++ .../{seatunnel.md => seatunnel/spark-sink.md} | 4 +- docs/zh-CN/extending-doris/seatunnel/flink-sink.md | 116 +++++++++++++++++++++ .../{seatunnel.md => seatunnel/spark-sink.md} | 4 +- 6 files changed, 252 insertions(+), 6 deletions(-) diff --git a/docs/.vuepress/sidebar/en.js b/docs/.vuepress/sidebar/en.js index 868958b..9081d69 100644 --- a/docs/.vuepress/sidebar/en.js +++ b/docs/.vuepress/sidebar/en.js @@ -240,7 +240,14 @@ module.exports = [ "spark-doris-connector", "flink-doris-connector", "datax", - "seatunnel", + { + title: "Seatunnel", + directoryPath: "seatunnel/", + children: [ + "spark-sink", + "flink-sink", + ], + }, { title: "UDF", directoryPath: "udf/", diff --git a/docs/.vuepress/sidebar/zh-CN.js b/docs/.vuepress/sidebar/zh-CN.js index 1f173eb..e94119d 100644 --- a/docs/.vuepress/sidebar/zh-CN.js +++ b/docs/.vuepress/sidebar/zh-CN.js @@ -241,7 +241,14 @@ module.exports = [ "spark-doris-connector", "flink-doris-connector", "datax", - "seatunnel", + { + title: "Seatunnel", + directoryPath: "seatunnel/", + children: [ + "spark-sink", + "flink-sink", + ], + }, { title: "UDF", directoryPath: "udf/", diff --git a/docs/en/extending-doris/seatunnel/flink-sink.md b/docs/en/extending-doris/seatunnel/flink-sink.md new file mode 100644 index 0000000..4a04f3d --- /dev/null +++ b/docs/en/extending-doris/seatunnel/flink-sink.md @@ -0,0 +1,116 @@ +--- +{ + "title": "Seatunnel Connector Flink Doris", + "language": "en" +} +--- + +<!-- +Licensed to the Apache Software Foundation (ASF) under one +or more contributor license agreements. See the NOTICE file +distributed with this work for additional information +regarding copyright ownership. The ASF licenses this file +to you under the Apache License, Version 2.0 (the +"License"); you may not use this file except in compliance +with the License. You may obtain a copy of the License at + + http://www.apache.org/licenses/LICENSE-2.0 + +Unless required by applicable law or agreed to in writing, +software distributed under the License is distributed on an +"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY +KIND, either express or implied. See the License for the +specific language governing permissions and limitations +under the License. +--> + +# Seatunnel +[Apache SeaTunnel (formerly waterdrop )](https://github.com/apache/incubator-seatunnel) already supports Doris's connector, SeaTunnel can use Spark engine and Flink engine to synchronize data to Doris. +## Flink Sink Doris(2.x) +Seatunnel Flink Sink Doris [plugin code](https://github.com/apache/incubator-seatunnel/tree/dev/seatunnel-connectors/seatunnel-connector-flink-doris) + +### Options +| name | type | required | default value | engine | +| --- | --- | --- | --- | --- | +| fenodes | string | yes | - | Flink | +| database | string | yes | - | Flink | +| table | string | yes | - | Flink | +| user | string | yes | - | Flink | +| password | string | yes | - | Flink | +| batch_size | int | no | 100 | Flink | +| interval | int | no |1000 | Flink | +| max_retries | int | no | 1 | Flink| +| doris.* | - | no | - | Flink | + +`fenodes [string]` + +Doris Fe http url, eg: 127.0.0.1:8030 + +`database [string]` + +Doris database + +`table [string]` + +Doris table + +`user [string]` + +Doris user + +`password [string]` + +Doris password + +`batch_size [int]` + +The maximum number of lines to write to Doris at a time, the default value is 100 + +`interval [int]` + +The flush interval (in milliseconds), after which the asynchronous thread writes the data in the cache to Doris. Set to 0 to turn off periodic writes. + +`max_retries [int]` + +Number of retries after writing to Doris fails + +`doris.* [string]` + +Import parameters for Stream load. For example: 'doris.column_separator' = ', ' etc. + +[More Stream Load parameter configuration](https://doris.apache.org/administrator-guide/load-data/stream-load-manual.html) + +### Examples +Socket To Doris +``` +env { + execution.parallelism = 1 +} +source { + SocketStream { + host = 127.0.0.1 + port = 9999 + result_table_name = "socket" + field_name = "info" + } +} +transform { +} +sink { + DorisSink { + fenodes = "127.0.0.1:8030" + user = root + password = 123456 + database = test + table = test_tbl + batch_size = 5 + max_retries = 1 + interval = 5000 + } +} + +``` +### Start command +``` +sh bin/start-seatunnel-flink.sh --config config/flink.streaming.conf +``` \ No newline at end of file diff --git a/docs/en/extending-doris/seatunnel.md b/docs/en/extending-doris/seatunnel/spark-sink.md similarity index 98% rename from docs/en/extending-doris/seatunnel.md rename to docs/en/extending-doris/seatunnel/spark-sink.md index 56a2041..000e04c 100644 --- a/docs/en/extending-doris/seatunnel.md +++ b/docs/en/extending-doris/seatunnel/spark-sink.md @@ -1,6 +1,6 @@ --- { - "title": "SeaTunnel", + "title": "Seatunnel Connector Spark Doris", "language": "en" } --- @@ -120,4 +120,4 @@ Doris { Start command ``` sh bin/start-waterdrop-spark.sh --master local[4] --deploy-mode client --config ./config/spark.conf -``` +``` \ No newline at end of file diff --git a/docs/zh-CN/extending-doris/seatunnel/flink-sink.md b/docs/zh-CN/extending-doris/seatunnel/flink-sink.md new file mode 100644 index 0000000..ed0b547 --- /dev/null +++ b/docs/zh-CN/extending-doris/seatunnel/flink-sink.md @@ -0,0 +1,116 @@ +--- +{ + "title": "Seatunnel Connector Flink Doris", + "language": "zh-CN" +} +--- + +<!-- +Licensed to the Apache Software Foundation (ASF) under one +or more contributor license agreements. See the NOTICE file +distributed with this work for additional information +regarding copyright ownership. The ASF licenses this file +to you under the Apache License, Version 2.0 (the +"License"); you may not use this file except in compliance +with the License. You may obtain a copy of the License at + + http://www.apache.org/licenses/LICENSE-2.0 + +Unless required by applicable law or agreed to in writing, +software distributed under the License is distributed on an +"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY +KIND, either express or implied. See the License for the +specific language governing permissions and limitations +under the License. +--> + +# Seatunnel +[Apache SeaTunnel (原 waterdrop )](https://github.com/apache/incubator-seatunnel) 已经支持 Doris 的连接器, SeaTunnel 可以用过 Spark 引擎和 Flink 引擎同步数据至 Doris 中. + +## Flink Sink Doris(2.x) +Seatunnel Flink Sink Doris [插件代码](https://github.com/apache/incubator-seatunnel/tree/dev/seatunnel-connectors/seatunnel-connector-flink-doris) +### 参数列表 +| 配置项 | 类型 | 必填 | 默认值 | 支持引擎 | +| --- | --- | --- | --- | --- | +| fenodes | string | yes | - | Flink | +| database | string | yes | - | Flink | +| table | string | yes | - | Flink | +| user | string | yes | - | Flink | +| password | string | yes | - | Flink | +| batch_size | int | no | 100 | Flink | +| interval | int | no |1000 | Flink | +| max_retries | int | no | 1 | Flink| +| doris.* | - | no | - | Flink | + +`fenodes [string]` + +Doris Fe Http访问地址, eg: 127.0.01:8030 + +`database [string]` + +写入 Doris 的库名 + +`table [string]` + +写入 Doris 的表名 + +`user [string]` + +Doris 访问用户 + +`password [string]` + +Doris 访问用户密码 + +`batch_size [int]` + +单次写Doris的最大行数,默认值100 + +`interval [int]` + +flush 间隔时间(毫秒),超过该时间后异步线程将 缓存中数据写入Doris。设置为0表示关闭定期写入。 + +`max_retries [int]` + +写Doris失败之后的重试次数 + +`doris.* [string]` + +Stream load 的导入参数。例如:'doris.column_separator' = ', '等 + +[更多 Stream Load 参数配置](https://doris.apache.org/zh-CN/administrator-guide/load-data/stream-load-manual.html) + +### Examples +Socket 数据写入 Doris +``` +env { + execution.parallelism = 1 +} +source { + SocketStream { + host = 127.0.0.1 + port = 9999 + result_table_name = "socket" + field_name = "info" + } +} +transform { +} +sink { + DorisSink { + fenodes = "127.0.0.1:8030" + user = root + password = 123456 + database = test + table = test_tbl + batch_size = 5 + max_retries = 1 + interval = 5000 + } +} + +``` +### 启动命令 +``` +sh bin/start-seatunnel-flink.sh --config config/flink.streaming.conf +``` \ No newline at end of file diff --git a/docs/zh-CN/extending-doris/seatunnel.md b/docs/zh-CN/extending-doris/seatunnel/spark-sink.md similarity index 98% rename from docs/zh-CN/extending-doris/seatunnel.md rename to docs/zh-CN/extending-doris/seatunnel/spark-sink.md index d861c62..fc868d3 100644 --- a/docs/zh-CN/extending-doris/seatunnel.md +++ b/docs/zh-CN/extending-doris/seatunnel/spark-sink.md @@ -1,6 +1,6 @@ --- { - "title": "SeaTunnel", + "title": "Seatunnel Connector Spark Doris", "language": "zh-CN" } --- @@ -121,4 +121,4 @@ Doris { 启动命令 ``` sh bin/start-waterdrop-spark.sh --master local[4] --deploy-mode client --config ./config/spark.conf -``` +``` \ No newline at end of file --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org For additional commands, e-mail: commits-h...@doris.apache.org