wangbo commented on a change in pull request #3938:
URL: https://github.com/apache/incubator-doris/pull/3938#discussion_r446591322



##########
File path: docs/zh-CN/administrator-guide/load-data/spark-load-manual.md
##########
@@ -88,16 +89,27 @@ Execution of a Spark load task is divided into the following 5 stages.
 
 
 
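-### Global Dictionary
+## Global Dictionary
+### Applicable Scenarios
+Bitmap columns in Doris are currently implemented with the ```Roaringbitmap``` library, and ```Roaringbitmap``` accepts only integer input. To precompute Bitmap columns during the load process, the input data must therefore be converted to an integer type.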
 
-TBD
-
-
-
-### Data Preprocessing (DPP)
-
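-TBD
+In the existing Doris load process, the global dictionary's data structure is implemented on top of a Hive table, which stores the mapping from original values to encoded values.
+### Build Process
+1. Read data from the upstream data source and generate a temporary Hive table, denoted hive_table.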

Review comment:
       👌




