yangzhg commented on a change in pull request #6925:
URL: https://github.com/apache/incubator-doris/pull/6925#discussion_r736105562



##########
File path: be/src/exec/tablet_sink.h
##########
@@ -191,15 +192,20 @@ class NodeChannel {
     }
 
     int64_t node_id() const { return _node_id; }
-    const NodeInfo* node_info() const { return _node_info; }
-    std::string print_load_info() const { return _load_info; }
     std::string name() const { return _name; }
 
     Status none_of(std::initializer_list<bool> vars);
 
     // TODO(HW): remove after mem tracker shared
     void clear_all_batches();
 
+    std::string print_channel_info() const {

Review comment:
       print_xxx means should output this string, for return the string, use 
`channel_info` is better

##########
File path: tools/ssb-tools/doris-cluster.conf
##########
@@ -0,0 +1,27 @@
+# Licensed to the Apache Software Foundation (ASF) under one
+# or more contributor license agreements.  See the NOTICE file
+# distributed with this work for additional information
+# regarding copyright ownership.  The ASF licenses this file
+# to you under the Apache License, Version 2.0 (the
+# "License"); you may not use this file except in compliance
+# with the License.  You may obtain a copy of the License at
+#
+#   http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing,
+# software distributed under the License is distributed on an
+# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+# KIND, either express or implied.  See the License for the
+# specific language governing permissions and limitations
+# under the License.
+
+# Any of FE host
+export FE_HOST='xafj-palo-rpm65.xafj.baidu.com'

Review comment:
       should mask this host name

##########
File path: docs/en/best-practices/star-schema-benchmark.md
##########
@@ -0,0 +1,163 @@
+---
+{
+    "title": "Star-Schema-Benchmark",
+    "language": "en"
+}
+---
+
+<!--
+Licensed to the Apache Software Foundation (ASF) under one
+or more contributor license agreements.  See the NOTICE file
+distributed with this work for additional information
+regarding copyright ownership.  The ASF licenses this file
+to you under the Apache License, Version 2.0 (the
+"License"); you may not use this file except in compliance
+with the License.  You may obtain a copy of the License at
+
+  http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing,
+software distributed under the License is distributed on an
+"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+KIND, either express or implied.  See the License for the
+specific language governing permissions and limitations
+under the License.
+-->
+
+# Star Schema Benchmark
+
+[Star Schema Benchmark(SSB)](https://www.cs.umb.edu/~poneil/StarSchemaB.PDF) 
is a lightweight data warehouse scenario performance test set. Based on 
[TPC-H](http://www.tpc.org/tpch/), SSB provides a simplified version of the 
star model data set, which is mainly used to test the performance of 
multi-table association queries under the star model.
+
+This document mainly introduces how to pass the preliminary performance test 
of the SSB process in Doris.
+
+> Note 1: The standard test set including SSB is usually far from the actual 
business scenario, and some tests will perform parameter tuning for the test 
set. Therefore, the test results of the standard test set can only reflect the 
performance of the database in a specific scenario. It is recommended that 
users use actual business data for further testing.
+>
+> Note 2: The operations involved in this document are all performed in the 
CentOS 7 environment.
+
+## Environmental preparation
+
+Please refer to the [official 
document](http://doris.incubator.apache.org/master/en/installing/install-deploy.html)
 to install and deploy Doris to obtain a normal running Doris cluster ( Contain 
at least 1 FE, 1 BE).
+
+The scripts involved in the following documents are all stored under 
`tools/ssb-tools/` in the Doris code base.
+
+## data preparation
+
+### 1. Download and install the SSB data generation tool.
+
+Execute the following script to download and compile the 
[ssb-dbgen](https://github.com/electrum/ssb-dbgen.git) tool.
+
+```
+sh build-ssb-dbgen.sh
+```
+
+After the installation is successful, the `dbgen` binary file will be 
generated in the `ssb-dbgen/` directory.
+
+### 2. Generate SSB test set
+
+Execute the following script to generate the SSB data set:
+
+```
+sh gen-ssb-data.sh -s 100 -c 100
+```
+
+> Note 1: `sh gen-ssb-data.sh -h View help`
+>
+> Note 2: The data will be generated under the directory `ssb-data/` with a 
suffix of `.tbl`. The total file size is about 60GB. The generation time may 
vary from a few minutes to an hour.
+>
+> Note 3: `-s 100` means that the test set size factor is 100, `-c 100` means 
that 100 threads concurrently generate data in the lineorder table. The `-c` 
parameter also determines the number of files in the final lineorder table. The 
larger the parameter, the more files and the smaller each file.
+
+Under the `-s 100` parameter, the generated data set size is:
+
+|Table |Rows |Size | File Number |
+|---|---|---|---|
+|lineorder| 600 million (600037902) | 60GB | 100|
+|customer|30 million (3000000) |277M |1|
+|part|1.4 million (1400000) | 116M|1|
+|supplier|200,000 (200,000) |17M |1|
+|date| 2556|228K |1|
+
+3. Build a table
+
+    Copy the table creation statement in 
[create-tables.sql](https://github.com/apache/incubator-doris/tree/master/tools/ssb-tools/create-tables.sql)
 and execute it in Doris.
+
+4. Import data
+
+    1. Import 4 dimension table data (customer, part, supplier and date)
+
+        Because the data volume of these 4 dimension tables is small, and the 
import is simpler, we use the following command to import the data of these 4 
tables first:
+
+        `sh load-dimension-data.sh`
+
+    2. Import the fact table lineorder.
+
+        Import the lineorder table data with the following command:
+
+        `sh load-fact-data.sh -c 3`
+
+        `-c 3` means to start 5 concurrent threads to import (the default is 
3). In the case of a single BE node, the import time of lineorder data 
generated by `sh gen-ssb-data.sh -s 100 -c 100` using `sh load-fact-data.sh -c 
3` is about 10 minutes. The memory overhead is about 5-6GB. If you turn on more 
threads, you can speed up the import speed, but it will increase additional 
memory overhead.
+
+5. Check the imported data
+
+    ```
+    select count(*) from part;
+    select count(*) from customer;
+    select count(*) from supplier;
+    select count(*) from date;
+    select count(*) from lineorder;
+    ```
+
+    The amount of data should be the same as the number of rows of generated 
data.
+
+## Query test
+
+There are 4 groups of 14 SQL in the SSB test set. The query statement is in 
the 
[queries/](https://github.com/apache/incubator-doris/tree/master/tools/ssb-tools/queries)
 directory.
+
+## testing report
+
+The following test report is based on Doris 
[branch-0.15](https://github.com/apache/incubator-doris/tree/branch-0.15) 
branch code test, for reference only. (Update time: October 25, 2021)
+
+1. Hardware environment
+
+    * 1 FE + 1-3 BE mixed
+    * CPU: 96core, Intel(R) Xeon(R) Gold 6271C CPU @ 2.60GHz
+    * Memory: 384GB
+    * Hard disk: 1 mechanical hard disk

Review comment:
       ```suggestion
       * Hard disk: 1 HDD
   ```




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org

Reply via email to