[GitHub] [incubator-doris] yiguolei commented on issue #8374: [Feature] support TiKV as an option for metadata storage

2022-03-07 Thread GitBox


yiguolei commented on issue #8374:
URL: 
https://github.com/apache/incubator-doris/issues/8374#issuecomment-1060291918


   Currently, doris store rowset meta and tablet meta in local rocksdb. Could 
you explain more detail why need to store it to a external global KV store? 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] wangbo commented on issue #8374: [Feature] support TiKV as an option for metadata storage

2022-03-07 Thread GitBox


wangbo commented on issue #8374:
URL: 
https://github.com/apache/incubator-doris/issues/8374#issuecomment-1060298717


   Doris's FE and BE both has metadata, so which part do you want to replace?
   Can you offer more detailed design?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] wangbo commented on pull request #8292: (improving) refactor column::insert_many_dict_data

2022-03-07 Thread GitBox


wangbo commented on pull request #8292:
URL: https://github.com/apache/incubator-doris/pull/8292#issuecomment-1060312019


   We did a performance test for this pr in SSB, no significant improvement. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] dataroaring opened a new pull request #8375: add an aggregation test from trino.

2022-03-07 Thread GitBox


dataroaring opened a new pull request #8375:
URL: https://github.com/apache/incubator-doris/pull/8375


   # Proposed changes
   
   Issue Number: close #xxx
   
   ## Problem Summary:
   
   Describe the overview of changes.
   
   ## Checklist(Required)
   
   1. Does it affect the original behavior: (Yes/No/I Don't know)
   2. Has unit tests been added: (Yes/No/No Need)
   3. Has document been added or modified: (Yes/No/No Need)
   4. Does it need to update dependencies: (Yes/No)
   5. Are there any changes that cannot be rolled back: (Yes/No)
   
   ## Further comments
   
   If this is a relatively large or complex change, kick off the discussion at 
[d...@doris.apache.org](mailto:d...@doris.apache.org) by explaining why you 
chose the solution you did and what alternatives you considered, etc...
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] yangzhg commented on pull request #8358: [Doc][Vectorized] Add vectorized execution engine docs

2022-03-07 Thread GitBox


yangzhg commented on pull request #8358:
URL: https://github.com/apache/incubator-doris/pull/8358#issuecomment-1060327441


   It is better to redraw those two images, the source of those two images is 
unknown


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] github-actions[bot] commented on pull request #8363: [fix](broker-load) fix bug that a cancelled job's state is LOADING

2022-03-07 Thread GitBox


github-actions[bot] commented on pull request #8363:
URL: https://github.com/apache/incubator-doris/pull/8363#issuecomment-1060329065






-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] BiteTheDDDDt opened a new issue #8376: [Bug] fe start fail

2022-03-07 Thread GitBox


BiteThet opened a new issue #8376:
URL: https://github.com/apache/incubator-doris/issues/8376


   ### Search before asking
   
   - [X] I had searched in the 
[issues](https://github.com/apache/incubator-doris/issues?q=is%3Aissue) and 
found no similar issues.
   
   
   ### Version
   
   master
   
   ### What's Wrong?
   
   java.io.IOException: failed to read session variable: Can not set int field 
org.apache.doris.qe.SessionVariable.queryTimeoutS to java.lang.Long
at 
org.apache.doris.qe.SessionVariable.readFromJson(SessionVariable.java:1023)
at 
org.apache.doris.qe.SessionVariable.readFields(SessionVariable.java:981)
at org.apache.doris.qe.VariableMgr.read(VariableMgr.java:312)
at 
org.apache.doris.catalog.Catalog.loadGlobalVariable(Catalog.java:1902)
at org.apache.doris.common.MetaReader.read(MetaReader.java:89)
at org.apache.doris.catalog.Catalog.loadImage(Catalog.java:1618)
at org.apache.doris.catalog.Catalog.initialize(Catalog.java:848)
at org.apache.doris.PaloFe.start(PaloFe.java:125)
at org.apache.doris.PaloFe.main(PaloFe.java:63)
   
   ### What You Expected?
   
   fix it
   
   ### How to Reproduce?
   
   _No response_
   
   ### Anything Else?
   
   _No response_
   
   ### Are you willing to submit PR?
   
   - [ ] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of 
Conduct](https://www.apache.org/foundation/policies/conduct)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] BiteTheDDDDt opened a new pull request #8377: [Bug] fe start fail

2022-03-07 Thread GitBox


BiteThet opened a new pull request #8377:
URL: https://github.com/apache/incubator-doris/pull/8377


   # Proposed changes
   
   Issue Number: close #8376
   
   ## Problem Summary:
   
   Describe the overview of changes.
   
   ## Checklist(Required)
   
   1. Does it affect the original behavior: (Yes/No/I Don't know)
   2. Has unit tests been added: (Yes/No/No Need)
   3. Has document been added or modified: (Yes/No/No Need)
   4. Does it need to update dependencies: (Yes/No)
   5. Are there any changes that cannot be rolled back: (Yes/No)
   
   ## Further comments
   
   If this is a relatively large or complex change, kick off the discussion at 
[d...@doris.apache.org](mailto:d...@doris.apache.org) by explaining why you 
chose the solution you did and what alternatives you considered, etc...
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] xinyiZzz closed pull request #7911: [Help] gcc11 compiles `thread_local` variable, BE start: version `GLIBC_2.18' not found

2022-03-07 Thread GitBox


xinyiZzz closed pull request #7911:
URL: https://github.com/apache/incubator-doris/pull/7911


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] morrySnow opened a new issue #8378: [Enhancement] [doc] there are some dead link in fe doc pages

2022-03-07 Thread GitBox


morrySnow opened a new issue #8378:
URL: https://github.com/apache/incubator-doris/issues/8378


   ### Search before asking
   
   - [X] I had searched in the 
[issues](https://github.com/apache/incubator-doris/issues?q=is%3Aissue) and 
found no similar issues.
   
   
   ### Description
   
   when i read doris developer docs, i found some dead link:
   1. link for [problems with mounting] in 
https://doris.incubator.apache.org/developer-guide/docker-dev.html
   2. link for [挂载的问题] in 
https://doris.incubator.apache.org/zh-CN/developer-guide/docker-dev.html
   
   i don't know which is the correct link. so just raise an issue here.
   if anybody know the correct link. i'd like to commit a pr to fix it.
   
   ### Solution
   
   _No response_
   
   ### Are you willing to submit PR?
   
   - [X] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of 
Conduct](https://www.apache.org/foundation/policies/conduct)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] xinyiZzz commented on a change in pull request #8322: [refactor] Impl of MemTracker, and related use

2022-03-07 Thread GitBox


xinyiZzz commented on a change in pull request #8322:
URL: https://github.com/apache/incubator-doris/pull/8322#discussion_r820504386



##
File path: be/src/exec/es/es_scroll_parser.cpp
##
@@ -410,11 +411,12 @@ Status ScrollParser::fill_tuple(const TupleDescriptor* 
tuple_desc, Tuple* tuple,
 }
 }
 size_t val_size = val.length();
-char* buffer = 
reinterpret_cast(tuple_pool->try_allocate_unaligned(val_size));
+Status rst;
+char* buffer = 
reinterpret_cast(tuple_pool->try_allocate_unaligned(val_size, &rst));

Review comment:
   There are so many places to call MemPool allocate, "return Status and 
use buffer as parameter" will cause a lot of changes, so your suggestion is 
whether to change it.
   
   The purpose of getting the status here is to print out the real reason of 
allocate failure externally, such as the details of tracker exceed, not just 
the location of the failure.




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] morrySnow opened a new pull request #8379: Fix dead link in build docker environment in developer guide

2022-03-07 Thread GitBox


morrySnow opened a new pull request #8379:
URL: https://github.com/apache/incubator-doris/pull/8379


   # Proposed changes
   
   Issue Number: close #8378
   
   ## Problem Summary:
   
   Describe the overview of changes.
   
   ## Checklist(Required)
   
   1. Does it affect the original behavior: (No)
   2. Has unit tests been added: (No Need)
   3. Has document been added or modified: (Yes)
   4. Does it need to update dependencies: (No)
   5. Are there any changes that cannot be rolled back: (No)
   
   ## Further comments
   
   If this is a relatively large or complex change, kick off the discussion at 
[d...@doris.apache.org](mailto:d...@doris.apache.org) by explaining why you 
chose the solution you did and what alternatives you considered, etc...
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] github-actions[bot] commented on pull request #8379: Fix dead link in build docker environment in developer guide

2022-03-07 Thread GitBox


github-actions[bot] commented on pull request #8379:
URL: https://github.com/apache/incubator-doris/pull/8379#issuecomment-1060399237


   PR approved by anyone and no changes requested.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] HappenLee commented on pull request #8358: [Doc][Vectorized] Add vectorized execution engine docs

2022-03-07 Thread GitBox


HappenLee commented on pull request #8358:
URL: https://github.com/apache/incubator-doris/pull/8358#issuecomment-1060407124


   > It is better to redraw those two images, the source of those two images is 
unknown
   
   had set quoted from [Column-Oriented
   Database 
Systems](https://web.stanford.edu/class/cs346/2015/notes/old/column.pdf


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] github-actions[bot] commented on pull request #8379: Fix dead link in build docker environment in developer guide

2022-03-07 Thread GitBox


github-actions[bot] commented on pull request #8379:
URL: https://github.com/apache/incubator-doris/pull/8379#issuecomment-1060413099


   PR approved by at least one committer and no changes requested.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] HappenLee commented on a change in pull request #8373: [Refactor] to_bitmap(-1) return NULL instead of return parse failed error_message

2022-03-07 Thread GitBox


HappenLee commented on a change in pull request #8373:
URL: https://github.com/apache/incubator-doris/pull/8373#discussion_r820543213



##
File path: be/src/exprs/bitmap_function.cpp
##
@@ -358,21 +358,17 @@ BigIntVal BitmapFunctions::bitmap_min(FunctionContext* 
ctx, const StringVal& src
 
 StringVal BitmapFunctions::to_bitmap(doris_udf::FunctionContext* ctx,
  const doris_udf::StringVal& src) {
+if(src.is_null) {

Review comment:
   `if (`

##
File path: be/src/vec/functions/function_bitmap.cpp
##
@@ -125,23 +151,48 @@ struct BitmapHash {
 }
 };
 
-struct NameBitmapCount {
+class FunctionBitmapCount : public IFunction {

Review comment:
   why bitmap count is always nullable?

##
File path: be/src/vec/functions/function_bitmap.cpp
##
@@ -125,23 +151,48 @@ struct BitmapHash {
 }
 };
 
-struct NameBitmapCount {
+class FunctionBitmapCount : public IFunction {
+public:
 static constexpr auto name = "bitmap_count";
-};
 
-struct BitmapCount {
-using ReturnType = DataTypeInt64;
-static constexpr auto TYPE_INDEX = TypeIndex::BitMap;
-using Type = DataTypeBitMap::FieldType;
-using ReturnColumnType = ColumnVector;
-using ReturnColumnContainer = ColumnVector::Container;
+String get_name() const override { return name; }
 
-static Status vector(const std::vector& data, 
ReturnColumnContainer& res) {
-size_t size = data.size();
-res.reserve(size);
-for (size_t i = 0; i < size; ++i) {
-res.push_back(data[i].cardinality());
+static FunctionPtr create() { return 
std::make_shared(); }
+
+DataTypePtr get_return_type_impl(const DataTypes& arguments) const 
override {
+return std::make_shared();
+}
+
+size_t get_number_of_arguments() const override { return 1; }
+
+bool use_default_implementation_for_nulls() const override { return false; 
}
+
+bool use_default_implementation_for_constants() const override { return 
true; }
+
+Status execute_impl(FunctionContext* context, Block& block, const 
ColumnNumbers& arguments,
+size_t result, size_t input_rows_count) override {
+auto res_data_column = ColumnInt64::create();
+auto& res = res_data_column->get_data();
+auto data_null_map = ColumnUInt8::create(input_rows_count, 0);
+auto& null_map = data_null_map->get_data();
+
+auto column = 
block.get_by_position(arguments[0]).column->convert_to_full_column_if_const();
+if (auto* nullable = check_and_get_column(*column)) {
+VectorizedUtils::update_null_map(null_map, 
nullable->get_null_map_data());
+column = nullable->get_nested_column_ptr();
+}
+auto str_col = assert_cast(column.get());
+const auto& col_data = str_col->get_data();
+
+res.reserve(input_rows_count);
+for (size_t i = 0; i < input_rows_count; ++i) {
+if(null_map[i]) {

Review comment:
   if (




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] morningman commented on pull request #8377: [Bug] fe start fail

2022-03-07 Thread GitBox


morningman commented on pull request #8377:
URL: https://github.com/apache/incubator-doris/pull/8377#issuecomment-1060418509


   Hi @BiteThet , this bug has been fixed in #8355


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] BiteTheDDDDt commented on pull request #8377: [Bug] fe start fail

2022-03-07 Thread GitBox


BiteThet commented on pull request #8377:
URL: https://github.com/apache/incubator-doris/pull/8377#issuecomment-1060506691


   > Hi @BiteThet , this bug has been fixed in #8355
   
   ok


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] BiteTheDDDDt closed pull request #8377: [Bug] fe start fail

2022-03-07 Thread GitBox


BiteThet closed pull request #8377:
URL: https://github.com/apache/incubator-doris/pull/8377


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] BiteTheDDDDt closed issue #8376: [Bug] fe start fail

2022-03-07 Thread GitBox


BiteThet closed issue #8376:
URL: https://github.com/apache/incubator-doris/issues/8376


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] BiteTheDDDDt opened a new issue #8380: [Bug] fix core dump on minmax_filter with decimal type

2022-03-07 Thread GitBox


BiteThet opened a new issue #8380:
URL: https://github.com/apache/incubator-doris/issues/8380


   ### Search before asking
   
   - [X] I had searched in the 
[issues](https://github.com/apache/incubator-doris/issues?q=is%3Aissue) and 
found no similar issues.
   
   
   ### Version
   
   master
   
   ### What's Wrong?
   
   ```cpp
   start time: 2022年 03月 07日 星期一 16:45:23 CST
   WARNING: Logging before InitGoogleLogging() is written to STDERR
   I0307 16:45:23.564831 13352 env.cpp:46] Env init successfully.
   Minidump created at: 
/home/disk2/pxl/dev/baidu/bdg/doris/core/output/be/minidump/fdbac494-2ca6-4a14-92c2958e-a6d9a153.dmp
   *** Aborted at 1646642730 (unix time) try "date -d @1646642730" if you are 
using GNU date ***
   PC: @ 0x564a1bca5d41 doris::MinMaxNumFunc<>::insert()
   *** SIGSEGV (@0x0) received by PID 13352 (TID 0x7f5495f6c700) from PID 0; 
stack trace: ***
   @ 0x564a1dad7042 google::(anonymous 
namespace)::FailureSignalHandler()
   @ 0x7f54f35ff920 (unknown)
   @ 0x564a1bca5d41 doris::MinMaxNumFunc<>::insert()
   @ 0x564a1bcc324d doris::HashJoinNode::open()
   @ 0x564a1bec3484 doris::PlanFragmentExecutor::open_internal()
   @ 0x564a1bec4745 doris::PlanFragmentExecutor::open()
   @ 0x564a1bea7265 doris::FragmentExecState::execute()
   @ 0x564a1beabada doris::FragmentMgr::_exec_actual()
   @ 0x564a1beb724c std::_Function_handler<>::_M_invoke()
   @ 0x564a1c141299 doris::ThreadPool::dispatch_thread()
   @ 0x564a1c13667f doris::Thread::supervise_thread()
   @ 0x7f54f33b7851 start_thread
   @ 0x7f54f36b467d clone
   @0x0 (unknown)
   ```
   
   ### What You Expected?
   
   fix it
   
   ### How to Reproduce?
   
   _No response_
   
   ### Anything Else?
   
   _No response_
   
   ### Are you willing to submit PR?
   
   - [X] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of 
Conduct](https://www.apache.org/foundation/policies/conduct)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] BiteTheDDDDt opened a new pull request #8381: [Bug] fix core dump on minmax_filter with decimal type

2022-03-07 Thread GitBox


BiteThet opened a new pull request #8381:
URL: https://github.com/apache/incubator-doris/pull/8381


   # Proposed changes
   
   Issue Number: close #8380
   
   ## Problem Summary:
   
   Describe the overview of changes.
   
   ## Checklist(Required)
   
   1. Does it affect the original behavior: (Yes/No/I Don't know)
   2. Has unit tests been added: (Yes/No/No Need)
   3. Has document been added or modified: (Yes/No/No Need)
   4. Does it need to update dependencies: (Yes/No)
   5. Are there any changes that cannot be rolled back: (Yes/No)
   
   ## Further comments
   
   If this is a relatively large or complex change, kick off the discussion at 
[d...@doris.apache.org](mailto:d...@doris.apache.org) by explaining why you 
chose the solution you did and what alternatives you considered, etc...
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] BiteTheDDDDt commented on issue #8317: [Bug] core dump on BrokerScanNode with clang

2022-03-07 Thread GitBox


BiteThet commented on issue #8317:
URL: 
https://github.com/apache/incubator-doris/issues/8317#issuecomment-1060523949


   > Does it show the core dump reason, like segment fault?
   
   yes, the reason is segmentation fault.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] xinyiZzz commented on a change in pull request #8322: [refactor] Impl of MemTracker, and related use

2022-03-07 Thread GitBox


xinyiZzz commented on a change in pull request #8322:
URL: https://github.com/apache/incubator-doris/pull/8322#discussion_r820598899



##
File path: be/src/exprs/expr_context.cpp
##
@@ -371,10 +376,11 @@ Status ExprContext::get_const_value(RuntimeState* state, 
Expr& expr, AnyVal** co
 StringVal* sv = reinterpret_cast(*const_val);
 if (!sv->is_null && sv->len > 0) {
 // Make sure the memory is owned by this evaluator.
-char* ptr_copy = 
reinterpret_cast(_pool->try_allocate(sv->len));
+Status rst;
+char* ptr_copy = 
reinterpret_cast(_pool->try_allocate(sv->len, &rst));
 if (ptr_copy == nullptr) {
-return _pool->mem_tracker()->MemLimitExceeded(
-state, "Could not allocate constant string value", 
sv->len);
+RETURN_ALLOC_LIMIT_EXCEEDED(_pool->mem_tracker(), state,

Review comment:
   done




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] dataroaring opened a new issue #8382: [Bug] variance is different with trino

2022-03-07 Thread GitBox


dataroaring opened a new issue #8382:
URL: https://github.com/apache/incubator-doris/issues/8382


   ### Search before asking
   
   - [X] I had searched in the 
[issues](https://github.com/apache/incubator-doris/issues?q=is%3Aissue) and 
found no similar issues.
   
   
   ### Version
   
   master
   
   ### What's Wrong?
   
   variance's result is different with trino.
   
   
https://github.com/trinodb/trino/blob/master/testing/trino-product-tests/src/main/resources/sql-tests/testcases/aggregate/variance.result
   
   ### What You Expected?
   
   may be a bug, and i am not sure.
   
   ### How to Reproduce?
   
   _No response_
   
   ### Anything Else?
   
   _No response_
   
   ### Are you willing to submit PR?
   
   - [ ] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of 
Conduct](https://www.apache.org/foundation/policies/conduct)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] BiteTheDDDDt closed issue #8262: [Bug][Vectorized] Be core dump on memtable

2022-03-07 Thread GitBox


BiteThet closed issue #8262:
URL: https://github.com/apache/incubator-doris/issues/8262


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] coderjiang commented on issue #5103: doris怎么删除昨天的数据

2022-03-07 Thread GitBox


coderjiang commented on issue #5103:
URL: 
https://github.com/apache/incubator-doris/issues/5103#issuecomment-1060563392


   DELETE FROM ods_order_report WHERE purchase_date_local >= 
DATE_ADD('2022-03-06 00:00:00', INTERVAL 1 day);
   这语句也是不行的,会报错:
   ERROR 1064 (HY000): errCode = 2, detailMessage = Right expr of binary 
predicate should be value


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] coderjiang commented on issue #5103: doris怎么删除昨天的数据

2022-03-07 Thread GitBox


coderjiang commented on issue #5103:
URL: 
https://github.com/apache/incubator-doris/issues/5103#issuecomment-1060564080


   mysql> DELETE FROM ods_order_report WHERE purchase_date_local >= 
DATE_ADD('2022-03-06 00:00:00', INTERVAL 1 day);
   ERROR 1064 (HY000): errCode = 2, detailMessage = Right expr of binary 
predicate should be value
   mysql> select DATE_ADD('2022-03-06 00:00:00', INTERVAL 1 day);
   +-+
   | date_add('2022-03-06 00:00:00', INTERVAL 1 day) |
   +-+
   | 2022-03-07 00:00:00 |
   +-+
   1 row in set (0.00 sec)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] yangzhg commented on a change in pull request #8364: [Function] Add compare_version function and ut

2022-03-07 Thread GitBox


yangzhg commented on a change in pull request #8364:
URL: https://github.com/apache/incubator-doris/pull/8364#discussion_r820612758



##
File path: be/src/exprs/utility_functions.cpp
##
@@ -39,4 +39,82 @@ BooleanVal UtilityFunctions::sleep(FunctionContext* ctx, 
const IntVal& seconds)
 return BooleanVal(true);
 }
 
+static void convert_to_string(doris_udf::StringVal& src, std::string& str_dst) 
{

Review comment:
   why called `convert_to_string` ? just a concat ?




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] yangzhg commented on a change in pull request #8364: [Function] Add compare_version function and ut

2022-03-07 Thread GitBox


yangzhg commented on a change in pull request #8364:
URL: https://github.com/apache/incubator-doris/pull/8364#discussion_r820614209



##
File path: be/src/exprs/utility_functions.cpp
##
@@ -39,4 +39,82 @@ BooleanVal UtilityFunctions::sleep(FunctionContext* ctx, 
const IntVal& seconds)
 return BooleanVal(true);
 }
 
+static void convert_to_string(doris_udf::StringVal& src, std::string& str_dst) 
{
+for (int i = 0; i < src.len; i++) {
+str_dst += src.ptr[i];
+}
+return;
+}
+
+static BooleanVal split_string(std::string& str_src, std::vector& 
str_vec_dst) {

Review comment:
there is already have split in the util, refer to 
be/src/util/string_util.h or gutils




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] BiteTheDDDDt opened a new issue #8383: [Feature][Vectorized] support replace()

2022-03-07 Thread GitBox


BiteThet opened a new issue #8383:
URL: https://github.com/apache/incubator-doris/issues/8383


   ### Search before asking
   
   - [X] I had searched in the 
[issues](https://github.com/apache/incubator-doris/issues?q=is%3Aissue) and 
found no similar issues.
   
   
   ### Description
   
support replace()
   
   ### Use case
   
   ```sql
   MySQL [test]> select replace("http://www.baidu.com:9090";, "9090", "");
   +--+
   | replace('http://www.baidu.com:9090', '9090', '') |
   +--+
   | http://www.baidu.com:|
   +--+
   1 row in set (0.010 sec)
   ```
   
   ### Related issues
   
   _No response_
   
   ### Are you willing to submit PR?
   
   - [X] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of 
Conduct](https://www.apache.org/foundation/policies/conduct)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] BiteTheDDDDt opened a new pull request #8384: [Feature][Vectorized] support replace()

2022-03-07 Thread GitBox


BiteThet opened a new pull request #8384:
URL: https://github.com/apache/incubator-doris/pull/8384


   # Proposed changes
   @HappenLee 
   Issue Number: close #8383
   
   ## Problem Summary:
   
   Describe the overview of changes.
   
   ## Checklist(Required)
   
   1. Does it affect the original behavior: (Yes/No/I Don't know)
   2. Has unit tests been added: (Yes/No/No Need)
   3. Has document been added or modified: (Yes/No/No Need)
   4. Does it need to update dependencies: (Yes/No)
   5. Are there any changes that cannot be rolled back: (Yes/No)
   
   ## Further comments
   
   If this is a relatively large or complex change, kick off the discussion at 
[d...@doris.apache.org](mailto:d...@doris.apache.org) by explaining why you 
chose the solution you did and what alternatives you considered, etc...
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] xinyiZzz commented on a change in pull request #8322: [refactor] Impl of MemTracker, and related use

2022-03-07 Thread GitBox


xinyiZzz commented on a change in pull request #8322:
URL: https://github.com/apache/incubator-doris/pull/8322#discussion_r820638296



##
File path: be/src/olap/task/engine_alter_tablet_task.cpp
##
@@ -18,13 +18,21 @@
 #include "olap/task/engine_alter_tablet_task.h"
 
 #include "olap/schema_change.h"
+#include "runtime/mem_tracker.h"
 
 namespace doris {
 
 using std::to_string;
 
 EngineAlterTabletTask::EngineAlterTabletTask(const TAlterTabletReqV2& request)
-: _alter_tablet_req(request) {}
+: _alter_tablet_req(request) {
+_mem_tracker = MemTracker::create_tracker(

Review comment:
   done

##
File path: be/src/olap/task/engine_batch_load_task.h
##
@@ -76,6 +76,8 @@ class EngineBatchLoadTask : public EngineTask {
 AgentStatus* _res_status;
 std::string _remote_file_path;
 std::string _local_file_path;
+
+std::shared_ptr _mem_tracker;

Review comment:
   done




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] HappenLee commented on a change in pull request #8381: [Bug] fix core dump on minmax_filter with decimal type

2022-03-07 Thread GitBox


HappenLee commented on a change in pull request #8381:
URL: https://github.com/apache/incubator-doris/pull/8381#discussion_r820653466



##
File path: be/src/exprs/minmax_predicate.h
##
@@ -42,9 +42,14 @@ class MinMaxNumFunc : public MinMaxFuncBase {
 public:
 MinMaxNumFunc() = default;
 ~MinMaxNumFunc() = default;
-virtual void insert(const void* data) {
-if (data == nullptr) return;
-const T val_data = *reinterpret_cast(data);
+void insert(const void* data) override {
+if (data == nullptr) {
+return;
+}
+

Review comment:
   only core when int128_t, so maybe constexpr + sizeof.
   and add comment




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] HappenLee opened a new issue #8385: [Bug][Vectorized] Datetime type read error result

2022-03-07 Thread GitBox


HappenLee opened a new issue #8385:
URL: https://github.com/apache/incubator-doris/issues/8385


   ### Search before asking
   
   - [X] I had searched in the 
[issues](https://github.com/apache/incubator-doris/issues?q=is%3Aissue) and 
found no similar issues.
   
   
   ### Version
   
   vectorized
   
   ### What's Wrong?
   
   get 2001 when year is 
   
   ### What You Expected?
   
   get right data
   
   ### How to Reproduce?
   
   _No response_
   
   ### Anything Else?
   
   _No response_
   
   ### Are you willing to submit PR?
   
   - [X] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of 
Conduct](https://www.apache.org/foundation/policies/conduct)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] HappenLee opened a new pull request #8386: [Bug][Vectorized] Fix the datetime type read error and is_same set error in reader

2022-03-07 Thread GitBox


HappenLee opened a new pull request #8386:
URL: https://github.com/apache/incubator-doris/pull/8386


   # Proposed changes
   
   Issue Number: close #8385 
   
   ## Problem Summary:
   
   Describe the overview of changes.
   
   ## Checklist(Required)
   
   1. Does it affect the original behavior: (Yes/No/I Don't know)
   2. Has unit tests been added: (Yes/No/No Need)
   3. Has document been added or modified: (Yes/No/No Need)
   4. Does it need to update dependencies: (Yes/No)
   5. Are there any changes that cannot be rolled back: (Yes/No)
   
   ## Further comments
   
   If this is a relatively large or complex change, kick off the discussion at 
[d...@doris.apache.org](mailto:d...@doris.apache.org) by explaining why you 
chose the solution you did and what alternatives you considered, etc...
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] cambyzju commented on a change in pull request #8217: [feature][array-type]support select ARRAY data type on vectorized engine

2022-03-07 Thread GitBox


cambyzju commented on a change in pull request #8217:
URL: https://github.com/apache/incubator-doris/pull/8217#discussion_r820707304



##
File path: be/src/vec/columns/column_array.cpp
##
@@ -0,0 +1,700 @@
+// Licensed to the Apache Software Foundation (ASF) under one
+// or more contributor license agreements.  See the NOTICE file
+// distributed with this work for additional information
+// regarding copyright ownership.  The ASF licenses this file
+// to you under the Apache License, Version 2.0 (the
+// "License"); you may not use this file except in compliance
+// with the License.  You may obtain a copy of the License at
+//
+//   http://www.apache.org/licenses/LICENSE-2.0
+//
+// Unless required by applicable law or agreed to in writing,
+// software distributed under the License is distributed on an
+// "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+// KIND, either express or implied.  See the License for the
+// specific language governing permissions and limitations
+// under the License.
+// This file is copied from
+// 
https://github.com/ClickHouse/ClickHouse/blob/master/src/Columns/ColumnArray.cpp
+// and modified by Doris
+
+#include  // memcpy
+
+#include "vec/common/assert_cast.h"
+#include "vec/columns/collator.h"
+#include "vec/columns/column_array.h"
+#include "vec/columns/column_const.h"
+#include "vec/columns/column_nullable.h"
+#include "vec/columns/column_string.h"
+#include "vec/columns/columns_common.h"
+#include "vec/columns/columns_number.h"
+
+namespace doris::vectorized {
+
+namespace ErrorCodes {
+extern const int NOT_IMPLEMENTED;
+extern const int BAD_ARGUMENTS;
+extern const int PARAMETER_OUT_OF_BOUND;
+extern const int SIZES_OF_COLUMNS_DOESNT_MATCH;
+extern const int LOGICAL_ERROR;
+extern const int TOO_LARGE_ARRAY_SIZE;
+}
+
+/** Obtaining array as Field can be slow for large arrays and consume vast 
amount of memory.
+  * Just don't allow to do it.
+  * You can increase the limit if the following query:
+  *  SELECT range(1000)
+  * will take less than 500ms on your machine.
+  */
+static constexpr size_t max_array_size_as_field = 100;
+
+ColumnArray::ColumnArray(MutableColumnPtr && nested_column, MutableColumnPtr 
&& offsets_column)
+: data(std::move(nested_column)), offsets(std::move(offsets_column)) {
+const ColumnOffsets * offsets_concrete = typeid_cast(offsets.get());
+
+if (!offsets_concrete) {
+LOG(FATAL) << "offsets_column must be a ColumnUInt64";
+}
+
+if (!offsets_concrete->empty() && nested_column) {
+Offset last_offset = offsets_concrete->get_data().back();
+
+/// This will also prevent possible overflow in offset.
+if (nested_column->size() != last_offset) {
+LOG(FATAL) << "offsets_column has data inconsistent with 
nested_column";
+}
+}
+
+/** NOTE
+  * Arrays with constant value are possible and used in implementation of 
higher order functions (see FunctionReplicate).
+  * But in most cases, arrays with constant value are unexpected and code 
will work wrong. Use with caution.
+  */
+}
+
+ColumnArray::ColumnArray(MutableColumnPtr && nested_column)
+: data(std::move(nested_column)) {
+if (!data->empty()) {
+LOG(FATAL) << "Not empty data passed to ColumnArray, but no offsets 
passed";
+}
+
+offsets = ColumnOffsets::create();
+}
+
+std::string ColumnArray::get_name() const { return "Array(" + 
get_data().get_name() + ")"; }
+
+MutableColumnPtr ColumnArray::clone_resized(size_t to_size) const {
+auto res = ColumnArray::create(get_data().clone_empty());
+
+if (to_size == 0)
+return res;
+size_t from_size = size();
+
+if (to_size <= from_size) {
+/// Just cut column.
+res->get_offsets().assign(get_offsets().begin(), get_offsets().begin() 
+ to_size);
+res->get_data().insert_range_from(get_data(), 0, get_offsets()[to_size 
- 1]);
+} else {
+/// Copy column and append empty arrays for extra elements.
+Offset offset = 0;
+if (from_size > 0) {
+res->get_offsets().assign(get_offsets().begin(), 
get_offsets().end());
+res->get_data().insert_range_from(get_data(), 0, 
get_data().size());
+offset = get_offsets().back();
+}
+
+res->get_offsets().resize(to_size);
+for (size_t i = from_size; i < to_size; ++i)
+res->get_offsets()[i] = offset;
+}
+
+return res;
+}
+
+size_t ColumnArray::size() const {
+return get_offsets().size();
+}
+
+Field ColumnArray::operator[](size_t n) const {
+size_t offset = offset_at(n);
+size_t size = size_at(n);
+
+if (size > max_array_size_as_field)
+LOG(FATAL) << "Array of size " << size << " is too large to be 
manipulated as single field,"
+   << "maximum size " << max_array_size_as_field;
+
+Array res(size);
+
+for (size_t i = 0; i < size; ++i)
+res[i] = get_d

[GitHub] [incubator-doris] cambyzju commented on a change in pull request #8217: [feature][array-type]support select ARRAY data type on vectorized engine

2022-03-07 Thread GitBox


cambyzju commented on a change in pull request #8217:
URL: https://github.com/apache/incubator-doris/pull/8217#discussion_r820727165



##
File path: be/src/vec/columns/column_array.cpp
##
@@ -0,0 +1,700 @@
+// Licensed to the Apache Software Foundation (ASF) under one
+// or more contributor license agreements.  See the NOTICE file
+// distributed with this work for additional information
+// regarding copyright ownership.  The ASF licenses this file
+// to you under the Apache License, Version 2.0 (the
+// "License"); you may not use this file except in compliance
+// with the License.  You may obtain a copy of the License at
+//
+//   http://www.apache.org/licenses/LICENSE-2.0
+//
+// Unless required by applicable law or agreed to in writing,
+// software distributed under the License is distributed on an
+// "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+// KIND, either express or implied.  See the License for the
+// specific language governing permissions and limitations
+// under the License.
+// This file is copied from
+// 
https://github.com/ClickHouse/ClickHouse/blob/master/src/Columns/ColumnArray.cpp
+// and modified by Doris
+
+#include  // memcpy
+
+#include "vec/common/assert_cast.h"
+#include "vec/columns/collator.h"
+#include "vec/columns/column_array.h"
+#include "vec/columns/column_const.h"
+#include "vec/columns/column_nullable.h"
+#include "vec/columns/column_string.h"
+#include "vec/columns/columns_common.h"
+#include "vec/columns/columns_number.h"
+
+namespace doris::vectorized {
+
+namespace ErrorCodes {
+extern const int NOT_IMPLEMENTED;
+extern const int BAD_ARGUMENTS;
+extern const int PARAMETER_OUT_OF_BOUND;
+extern const int SIZES_OF_COLUMNS_DOESNT_MATCH;
+extern const int LOGICAL_ERROR;
+extern const int TOO_LARGE_ARRAY_SIZE;
+}
+
+/** Obtaining array as Field can be slow for large arrays and consume vast 
amount of memory.
+  * Just don't allow to do it.
+  * You can increase the limit if the following query:
+  *  SELECT range(1000)
+  * will take less than 500ms on your machine.
+  */
+static constexpr size_t max_array_size_as_field = 100;
+
+ColumnArray::ColumnArray(MutableColumnPtr && nested_column, MutableColumnPtr 
&& offsets_column)
+: data(std::move(nested_column)), offsets(std::move(offsets_column)) {
+const ColumnOffsets * offsets_concrete = typeid_cast(offsets.get());
+
+if (!offsets_concrete) {
+LOG(FATAL) << "offsets_column must be a ColumnUInt64";
+}
+
+if (!offsets_concrete->empty() && nested_column) {
+Offset last_offset = offsets_concrete->get_data().back();
+
+/// This will also prevent possible overflow in offset.
+if (nested_column->size() != last_offset) {
+LOG(FATAL) << "offsets_column has data inconsistent with 
nested_column";
+}
+}
+
+/** NOTE
+  * Arrays with constant value are possible and used in implementation of 
higher order functions (see FunctionReplicate).
+  * But in most cases, arrays with constant value are unexpected and code 
will work wrong. Use with caution.
+  */
+}
+
+ColumnArray::ColumnArray(MutableColumnPtr && nested_column)
+: data(std::move(nested_column)) {
+if (!data->empty()) {
+LOG(FATAL) << "Not empty data passed to ColumnArray, but no offsets 
passed";
+}
+
+offsets = ColumnOffsets::create();
+}
+
+std::string ColumnArray::get_name() const { return "Array(" + 
get_data().get_name() + ")"; }
+
+MutableColumnPtr ColumnArray::clone_resized(size_t to_size) const {
+auto res = ColumnArray::create(get_data().clone_empty());
+
+if (to_size == 0)
+return res;
+size_t from_size = size();
+
+if (to_size <= from_size) {
+/// Just cut column.
+res->get_offsets().assign(get_offsets().begin(), get_offsets().begin() 
+ to_size);
+res->get_data().insert_range_from(get_data(), 0, get_offsets()[to_size 
- 1]);
+} else {
+/// Copy column and append empty arrays for extra elements.
+Offset offset = 0;
+if (from_size > 0) {
+res->get_offsets().assign(get_offsets().begin(), 
get_offsets().end());
+res->get_data().insert_range_from(get_data(), 0, 
get_data().size());
+offset = get_offsets().back();
+}
+
+res->get_offsets().resize(to_size);
+for (size_t i = from_size; i < to_size; ++i)
+res->get_offsets()[i] = offset;
+}
+
+return res;
+}
+
+size_t ColumnArray::size() const {
+return get_offsets().size();
+}
+
+Field ColumnArray::operator[](size_t n) const {
+size_t offset = offset_at(n);
+size_t size = size_at(n);
+
+if (size > max_array_size_as_field)
+LOG(FATAL) << "Array of size " << size << " is too large to be 
manipulated as single field,"
+   << "maximum size " << max_array_size_as_field;
+
+Array res(size);
+
+for (size_t i = 0; i < size; ++i)
+res[i] = get_d

[GitHub] [incubator-doris] caiconghui opened a new pull request #8387: [Refactor](storage_engine) Use std::shared_lock to replace RWMutex

2022-03-07 Thread GitBox


caiconghui opened a new pull request #8387:
URL: https://github.com/apache/incubator-doris/pull/8387


   # Proposed changes
   
   Issue Number: close #xxx
   
   ## Problem Summary:
   
   Describe the overview of changes.
   
   ## Checklist(Required)
   
   1. Does it affect the original behavior: (No)
   2. Has unit tests been added: (No)
   3. Has document been added or modified: (No)
   4. Does it need to update dependencies: (No)
   5. Are there any changes that cannot be rolled back: (No)
   
   ## Further comments
   
   If this is a relatively large or complex change, kick off the discussion at 
[d...@doris.apache.org](mailto:d...@doris.apache.org) by explaining why you 
chose the solution you did and what alternatives you considered, etc...
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] morningman commented on pull request #8387: [Refactor](storage_engine) Use std::shared_lock to replace RWMutex

2022-03-07 Thread GitBox


morningman commented on pull request #8387:
URL: https://github.com/apache/incubator-doris/pull/8387#issuecomment-1060749080


   Please describe the motivation and any possible performance impact, so that 
other reviewers can directly know what is going on.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] morningman commented on pull request #8336: [improvment](broker load) fix broker load check logic

2022-03-07 Thread GitBox


morningman commented on pull request #8336:
URL: https://github.com/apache/incubator-doris/pull/8336#issuecomment-1060793086


   Seems like the code you changed in this PR does not related the problem you 
described in #8335 ?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] morningman commented on pull request #8340: Translate Chinese comment to English

2022-03-07 Thread GitBox


morningman commented on pull request #8340:
URL: https://github.com/apache/incubator-doris/pull/8340#issuecomment-1060795516


   > You'd better cleanup your commits by removing useless commits like 
'第一次提交'. You can use git force push.
   
   That would be OK, because we using "Squash" merge. So the commit message 
will be reorganized finally.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] morningman commented on pull request #8340: Translate Chinese comment to English

2022-03-07 Thread GitBox


morningman commented on pull request #8340:
URL: https://github.com/apache/incubator-doris/pull/8340#issuecomment-1060796247


   > There are a lot of Chinese comments, I prefer to clean together.
   
   I think we can simply merge this PR. And doing what you prefer later?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] morningman merged pull request #8275: [community] Modify doris connector release doc

2022-03-07 Thread GitBox


morningman merged pull request #8275:
URL: https://github.com/apache/incubator-doris/pull/8275


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] github-actions[bot] commented on pull request #8340: Translate Chinese comment to English

2022-03-07 Thread GitBox


github-actions[bot] commented on pull request #8340:
URL: https://github.com/apache/incubator-doris/pull/8340#issuecomment-1060797540


   PR approved by at least one committer and no changes requested.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[incubator-doris] branch master updated (d9c2c2c -> c647def)

2022-03-07 Thread morningman
This is an automated email from the ASF dual-hosted git repository.

morningman pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/incubator-doris.git.


from d9c2c2c  Revert "[refactor] remove unused new_in_predicate code 
(#8263)" (#8372)
 add c647def  [community] Modify doris connector release doc (#8275)

No new revisions were added by this update.

Summary of changes:
 .../release-and-verify/release-doris-connectors.md | 82 ++
 .../release-and-verify/release-doris-connectors.md | 78 +---
 2 files changed, 74 insertions(+), 86 deletions(-)

-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] morningman commented on pull request #8324: [Feature](proc) Support proc dir for showing tablet health status

2022-03-07 Thread GitBox


morningman commented on pull request #8324:
URL: https://github.com/apache/incubator-doris/pull/8324#issuecomment-1060799805


   Why don't just add new columns to `show proc "/statistic"`?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] morningman merged pull request #8325: [typo]fix some typo in fe_config

2022-03-07 Thread GitBox


morningman merged pull request #8325:
URL: https://github.com/apache/incubator-doris/pull/8325


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] morningman merged pull request #8350: [license] Organize third-party dependent licenses for bianry releases

2022-03-07 Thread GitBox


morningman merged pull request #8350:
URL: https://github.com/apache/incubator-doris/pull/8350


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] morningman merged pull request #8352: [improvement]support build with parallel parameter only

2022-03-07 Thread GitBox


morningman merged pull request #8352:
URL: https://github.com/apache/incubator-doris/pull/8352


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[incubator-doris] branch master updated (c647def -> ca3be24)

2022-03-07 Thread morningman
This is an automated email from the ASF dual-hosted git repository.

morningman pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/incubator-doris.git.


from c647def  [community] Modify doris connector release doc (#8275)
 add ca3be24  [typo]fix some typo in fe_config (#8325)

No new revisions were added by this update.

Summary of changes:
 docs/en/administrator-guide/config/fe_config.md|  45 -
 docs/zh-CN/administrator-guide/config/fe_config.md | 112 ++---
 2 files changed, 72 insertions(+), 85 deletions(-)

-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[incubator-doris] branch master updated (ca3be24 -> 50a59f3)

2022-03-07 Thread morningman
This is an automated email from the ASF dual-hosted git repository.

morningman pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/incubator-doris.git.


from ca3be24  [typo]fix some typo in fe_config (#8325)
 add 50a59f3  [license] Organize third-party dependent licenses for bianry 
releases (#8350)

No new revisions were added by this update.

Summary of changes:
 NOTICE.txt |2 +-
 dist/LICENSE-dist.txt  | 1561 
 dist/README|   30 +
 dist/licenses/LICENSE-CC0.txt  |   29 +
 dist/licenses/LICENSE-CDDL-1.0.txt |  131 +++
 dist/licenses/LICENSE-CDDL-1.1.txt |   77 ++
 dist/licenses/LICENSE-CUP.txt  |5 +
 dist/licenses/LICENSE-EDL-1.0.txt  |   11 +
 dist/licenses/LICENSE-EPL-1.0.txt  |   60 ++
 dist/licenses/LICENSE-EPL-2.0.txt  |   79 ++
 dist/licenses/LICENSE-GPLv2-CE.txt |  131 +++
 dist/licenses/LICENSE-JSch.txt |   30 +
 dist/licenses/LICENSE-MIT.txt  |9 +
 dist/licenses/LICENSE-antlr4.txt   |   52 +
 dist/licenses/LICENSE-asm.txt  |   27 +
 dist/licenses/LICENSE-automaton.txt|   24 +
 dist/licenses/LICENSE-bitshuffle.txt   |   21 +
 dist/licenses/LICENSE-boost.txt|   23 +
 dist/licenses/LICENSE-breakpad.txt |  132 +++
 dist/licenses/LICENSE-brotli.txt   |   19 +
 dist/licenses/LICENSE-bzip2.txt|   42 +
 dist/licenses/LICENSE-commons-compiler.txt |   31 +
 dist/licenses/LICENSE-curl.txt |   22 +
 dist/licenses/LICENSE-cyrus-sasl.txt   |   44 +
 dist/licenses/LICENSE-fmt.txt  |   11 +
 dist/licenses/LICENSE-gflag.txt|   28 +
 dist/licenses/LICENSE-glog.txt |   65 ++
 dist/licenses/LICENSE-gperftools.txt   |   28 +
 dist/licenses/LICENSE-gtest.txt|   28 +
 dist/licenses/LICENSE-hamcrest.txt |   27 +
 dist/licenses/LICENSE-javolution.txt   |   27 +
 dist/licenses/LICENSE-jemalloc.txt |   26 +
 dist/licenses/LICENSE-jline.txt|   34 +
 dist/licenses/LICENSE-jta.txt  |  245 +
 dist/licenses/LICENSE-kryo.txt |   10 +
 dist/licenses/LICENSE-leveldb.txt  |   27 +
 dist/licenses/LICENSE-leveldbjni.txt   |   27 +
 dist/licenses/LICENSE-libdivide.txt|   29 +
 dist/licenses/LICENSE-libevent.txt |   99 ++
 dist/licenses/LICENSE-librdkafka.txt   |   31 +
 dist/licenses/LICENSE-lz4.txt  |   11 +
 dist/licenses/LICENSE-lzma.txt |   64 ++
 dist/licenses/LICENSE-minlog.txt   |   10 +
 dist/licenses/LICENSE-openssl.txt  |  125 +++
 dist/licenses/LICENSE-paranamer.txt|   28 +
 dist/licenses/LICENSE-pdqsort.txt  |   16 +
 dist/licenses/LICENSE-protobuf.txt |   33 +
 dist/licenses/LICENSE-py4j.txt |   26 +
 dist/licenses/LICENSE-rapidjson.txt|   57 +
 dist/licenses/LICENSE-re2.txt  |   27 +
 dist/licenses/LICENSE-snappy.txt   |   54 +
 dist/licenses/LICENSE-unixodbc.txt |  511 +
 dist/licenses/LICENSE-xmlenc.txt   |   23 +
 dist/licenses/LICENSE-xsimd.txt|   28 +
 dist/licenses/LICENSE-zlib.txt |   31 +
 dist/licenses/LICENSE-zstd-jni.txt |   26 +
 dist/licenses/LICENSE-zstd.txt |   30 +
 dist/tools/README  |   45 +
 dist/tools/parse_licenses.py   |  180 
 fe/fe-core/pom.xml |  110 +-
 fe/pom.xml |  217 ++--
 fe/spark-dpp/pom.xml   |  109 --
 fs_brokers/apache_hdfs_broker/pom.xml  |   13 +
 thirdparty/vars.sh |   10 +-
 64 files changed, 4820 insertions(+), 298 deletions(-)
 create mode 100644 dist/LICENSE-dist.txt
 create mode 100644 dist/README
 create mode 100644 dist/licenses/LICENSE-CC0.txt
 create mode 100644 dist/licenses/LICENSE-CDDL-1.0.txt
 create mode 100644 dist/licenses/LICENSE-CDDL-1.1.txt
 create mode 100644 dist/licenses/LICENSE-CUP.txt
 create mode 100644 dist/licenses/LICENSE-EDL-1.0.txt
 create mode 100644 dist/licenses/LICENSE-EPL-1.0.txt
 create mode 100644 dist/licenses/LICENSE-EPL-2.0.txt
 create mode 100644 dist/licenses/LICENSE-GPLv2-CE.txt
 create mode 100644 dist/licenses/LICENSE-JSch.txt
 create mode 100644 dist/licenses/LICENSE-MIT.txt
 create mode 100644 dist/licenses/LICENSE-antlr4.txt
 create mode 100644 dist/licenses/LICENSE-asm.txt
 create mode 100644 dist/licenses/LICENSE-automaton.txt
 create mode 100644 dist/licenses/LICENSE-bitshuffle.txt
 create mode 100644 dist/licenses/LICENSE-boost.txt
 create mode 100644 dist/licenses/LICENSE-breakpad.txt
 create mode 100644 dist/licenses/LICENSE-brotli.txt
 create mode 100644 dist/licenses/LICENSE-bzip2.txt
 create mode 100644 dist/licenses/LICENSE-commons-

[incubator-doris] branch master updated (50a59f3 -> 089ef53)

2022-03-07 Thread morningman
This is an automated email from the ASF dual-hosted git repository.

morningman pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/incubator-doris.git.


from 50a59f3  [license] Organize third-party dependent licenses for bianry 
releases (#8350)
 add 089ef53  [chore] fix build with parallel parameter only (#8352)

No new revisions were added by this update.

Summary of changes:
 build.sh | 13 -
 1 file changed, 12 insertions(+), 1 deletion(-)

-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] morningman commented on pull request #7529: Support remote storage, step2, only for be: hot data trans to cold data. clean cold data when drop table

2022-03-07 Thread GitBox


morningman commented on pull request #7529:
URL: https://github.com/apache/incubator-doris/pull/7529#issuecomment-1060820068


   Please update the PR comment to describe the new implementation


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] github-actions[bot] commented on pull request #8096: [optimize] optimze tablet read, avoid to create too much scanner for small tablet

2022-03-07 Thread GitBox


github-actions[bot] commented on pull request #8096:
URL: https://github.com/apache/incubator-doris/pull/8096#issuecomment-1060826763






-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] morningman commented on a change in pull request #8305: [feature-wip][array-type] Support nested array insertion.

2022-03-07 Thread GitBox


morningman commented on a change in pull request #8305:
URL: https://github.com/apache/incubator-doris/pull/8305#discussion_r820833353



##
File path: fe/fe-core/src/main/java/org/apache/doris/catalog/Column.java
##
@@ -373,6 +373,9 @@ private void toChildrenThrift(Column column, TColumn 
tColumn) {
 childrenTColumnType.setIndexLen(children.getOlapColumnIndexSize());
 childrenTColumn.setColumnType(childrenTColumnType);
 childrenTColumn.setIsAllowNull(children.isAllowNull());
+if (tColumn.getAggregationType() != null) {
+
childrenTColumn.setAggregationType(tColumn.getAggregationType());

Review comment:
   why need to set agg type for child column?




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] morningman commented on a change in pull request #8368: [feature-wip][array-type] Array data can be loaded in stream load.

2022-03-07 Thread GitBox


morningman commented on a change in pull request #8368:
URL: https://github.com/apache/incubator-doris/pull/8368#discussion_r820839495



##
File path: be/src/util/array_parser.hpp
##
@@ -0,0 +1,215 @@
+// Licensed to the Apache Software Foundation (ASF) under one
+// or more contributor license agreements.  See the NOTICE file
+// distributed with this work for additional information
+// regarding copyright ownership.  The ASF licenses this file
+// to you under the Apache License, Version 2.0 (the
+// "License"); you may not use this file except in compliance
+// with the License.  You may obtain a copy of the License at
+//
+//   http://www.apache.org/licenses/LICENSE-2.0
+//
+// Unless required by applicable law or agreed to in writing,
+// software distributed under the License is distributed on an
+// "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+// KIND, either express or implied.  See the License for the
+// specific language governing permissions and limitations
+// under the License.
+
+#ifndef DORIS_BE_SRC_UTIL_ARRAY_PARSER_HPP
+#define DORIS_BE_SRC_UTIL_ARRAY_PARSER_HPP

Review comment:
   ```suggestion
   #pragma once
   ```

##
File path: be/src/util/array_parser.hpp
##
@@ -0,0 +1,215 @@
+// Licensed to the Apache Software Foundation (ASF) under one
+// or more contributor license agreements.  See the NOTICE file
+// distributed with this work for additional information
+// regarding copyright ownership.  The ASF licenses this file
+// to you under the Apache License, Version 2.0 (the
+// "License"); you may not use this file except in compliance
+// with the License.  You may obtain a copy of the License at
+//
+//   http://www.apache.org/licenses/LICENSE-2.0
+//
+// Unless required by applicable law or agreed to in writing,
+// software distributed under the License is distributed on an
+// "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+// KIND, either express or implied.  See the License for the
+// specific language governing permissions and limitations
+// under the License.
+
+#ifndef DORIS_BE_SRC_UTIL_ARRAY_PARSER_HPP
+#define DORIS_BE_SRC_UTIL_ARRAY_PARSER_HPP
+
+#include 
+
+#include 
+
+#include "common/status.h"
+#include "exprs/anyval_util.h"
+#include "runtime/collection_value.h"
+#include "runtime/primitive_type.h"
+#include "runtime/types.h"
+#include "util/mem_util.hpp"
+
+namespace doris {
+
+template 
+using ConstArray = typename rapidjson::GenericValue::ConstArray;
+
+template 
+using ConstArrayIterator = typename ConstArray::ValueIterator;
+
+class ArrayParser {
+public:
+static Status parse(CollectionVal& array_val, FunctionContext* context,
+const StringVal& str_val) {
+rapidjson::Document document;
+if (document.Parse(reinterpret_cast(str_val.ptr), 
str_val.len).HasParseError()) {
+return Status::RuntimeError("Failed to parse the json to array.");
+}
+if (document.IsNull()) {
+array_val = CollectionVal::null();
+return Status::OK();
+} else if (!document.IsArray()) {
+return Status::RuntimeError("Failed to parse the json to array.");
+}
+auto type_desc = convertToTypeDescriptor(context->get_return_type());
+return parse>(
+array_val, context,
+reinterpret_cast(&document)->GetArray(), type_desc);
+}
+
+private:
+static TypeDescriptor convertToTypeDescriptor(FunctionContext::TypeDesc 
function_type_desc) {

Review comment:
   ```suggestion
   static TypeDescriptor 
_convert_to_type_descriptor(FunctionContext::TypeDesc function_type_desc) {
   ```




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] adonis0147 commented on a change in pull request #8305: [feature-wip][array-type] Support nested array insertion.

2022-03-07 Thread GitBox


adonis0147 commented on a change in pull request #8305:
URL: https://github.com/apache/incubator-doris/pull/8305#discussion_r820847392



##
File path: fe/fe-core/src/main/java/org/apache/doris/catalog/Column.java
##
@@ -373,6 +373,9 @@ private void toChildrenThrift(Column column, TColumn 
tColumn) {
 childrenTColumnType.setIndexLen(children.getOlapColumnIndexSize());
 childrenTColumn.setColumnType(childrenTColumnType);
 childrenTColumn.setIsAllowNull(children.isAllowNull());
+if (tColumn.getAggregationType() != null) {
+
childrenTColumn.setAggregationType(tColumn.getAggregationType());

Review comment:
   The parent aggregate type is `NONE` by default for array type. If we 
don't set the aggregate type for the children column, after deserializing in 
BE, the aggregate type of the children column is `TAggregationType::type::SUM`, 
because the value of `TAggregationType::type::SUM` is zero.




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] adonis0147 commented on a change in pull request #8305: [feature-wip][array-type] Support nested array insertion.

2022-03-07 Thread GitBox


adonis0147 commented on a change in pull request #8305:
URL: https://github.com/apache/incubator-doris/pull/8305#discussion_r820847392



##
File path: fe/fe-core/src/main/java/org/apache/doris/catalog/Column.java
##
@@ -373,6 +373,9 @@ private void toChildrenThrift(Column column, TColumn 
tColumn) {
 childrenTColumnType.setIndexLen(children.getOlapColumnIndexSize());
 childrenTColumn.setColumnType(childrenTColumnType);
 childrenTColumn.setIsAllowNull(children.isAllowNull());
+if (tColumn.getAggregationType() != null) {
+
childrenTColumn.setAggregationType(tColumn.getAggregationType());

Review comment:
   The parent aggregate type is `NONE` by default for array type.
   If we don't set the aggregate type for the children column, after 
deserializing in BE, the aggregate type of the children column is 
`TAggregationType::type::SUM`, because the value of 
`TAggregationType::type::SUM` is zero.  That is incorrect.




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] carlvinhust2012 commented on a change in pull request #8364: [Function] Add compare_version function and ut

2022-03-07 Thread GitBox


carlvinhust2012 commented on a change in pull request #8364:
URL: https://github.com/apache/incubator-doris/pull/8364#discussion_r821251157



##
File path: be/src/exprs/utility_functions.cpp
##
@@ -39,4 +39,82 @@ BooleanVal UtilityFunctions::sleep(FunctionContext* ctx, 
const IntVal& seconds)
 return BooleanVal(true);
 }
 
+static void convert_to_string(doris_udf::StringVal& src, std::string& str_dst) 
{

Review comment:
   just want to convert the type from 'StringVal' to 'string'.




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] carlvinhust2012 commented on a change in pull request #8364: [Function] Add compare_version function and ut

2022-03-07 Thread GitBox


carlvinhust2012 commented on a change in pull request #8364:
URL: https://github.com/apache/incubator-doris/pull/8364#discussion_r821251569



##
File path: be/src/exprs/utility_functions.cpp
##
@@ -39,4 +39,82 @@ BooleanVal UtilityFunctions::sleep(FunctionContext* ctx, 
const IntVal& seconds)
 return BooleanVal(true);
 }
 
+static void convert_to_string(doris_udf::StringVal& src, std::string& str_dst) 
{
+for (int i = 0; i < src.len; i++) {
+str_dst += src.ptr[i];
+}
+return;
+}
+
+static BooleanVal split_string(std::string& str_src, std::vector& 
str_vec_dst) {

Review comment:
   I will re-check this split function.




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris-flink-connector] hf200012 merged pull request #12: [refactor]doris sink enable 2pc

2022-03-07 Thread GitBox


hf200012 merged pull request #12:
URL: https://github.com/apache/incubator-doris-flink-connector/pull/12


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris-flink-connector] hf200012 commented on pull request #13: Feature/20220305 support flush when batch bytes reach threshold

2022-03-07 Thread GitBox


hf200012 commented on pull request #13:
URL: 
https://github.com/apache/incubator-doris-flink-connector/pull/13#issuecomment-1061321741


   Resolve a conflict @bridgeDream 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] carlvinhust2012 closed pull request #8364: [Function] Add compare_version function and ut

2022-03-07 Thread GitBox


carlvinhust2012 closed pull request #8364:
URL: https://github.com/apache/incubator-doris/pull/8364


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] carlvinhust2012 commented on pull request #8364: [Function] Add compare_version function and ut

2022-03-07 Thread GitBox


carlvinhust2012 commented on pull request #8364:
URL: https://github.com/apache/incubator-doris/pull/8364#issuecomment-1061322368


   > Why not `compare_version(ver1 , ver2)` return 0 if equal, 1 if ver1 > 
ver2, -1 if ver1 < ver2 ? In addition, there is still a lot of room for 
optimization in the implementation of the program
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] carlvinhust2012 commented on pull request #8364: [Function] Add compare_version function and ut

2022-03-07 Thread GitBox


carlvinhust2012 commented on pull request #8364:
URL: https://github.com/apache/incubator-doris/pull/8364#issuecomment-1061326259


   > Why not `compare_version(ver1 , ver2)` return 0 if equal, 1 if ver1 > 
ver2, -1 if ver1 < ver2 ? In addition, there is still a lot of room for 
optimization in the implementation of the program
   
   We discussed this approach with the business, but they still prefer to use 
the current approach.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] carlvinhust2012 edited a comment on pull request #8364: [Function] Add compare_version function and ut

2022-03-07 Thread GitBox


carlvinhust2012 edited a comment on pull request #8364:
URL: https://github.com/apache/incubator-doris/pull/8364#issuecomment-1061326259


   > Why not `compare_version(ver1 , ver2)` return 0 if equal, 1 if ver1 > 
ver2, -1 if ver1 < ver2 ? In addition, there is still a lot of room for 
optimization in the implementation of the program
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] carlvinhust2012 commented on pull request #8364: [Function] Add compare_version function and ut

2022-03-07 Thread GitBox


carlvinhust2012 commented on pull request #8364:
URL: https://github.com/apache/incubator-doris/pull/8364#issuecomment-1061326860


   > > Why not `compare_version(ver1 , ver2)` return 0 if equal, 1 if ver1 > 
ver2, -1 if ver1 < ver2 ? In addition, there is still a lot of room for 
optimization in the implementation of the program
   
   We discussed this approach with the business, but they still prefer to use 
the current approach.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] carlvinhust2012 removed a comment on pull request #8364: [Function] Add compare_version function and ut

2022-03-07 Thread GitBox


carlvinhust2012 removed a comment on pull request #8364:
URL: https://github.com/apache/incubator-doris/pull/8364#issuecomment-1061326259


   > Why not `compare_version(ver1 , ver2)` return 0 if equal, 1 if ver1 > 
ver2, -1 if ver1 < ver2 ? In addition, there is still a lot of room for 
optimization in the implementation of the program
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] yangzhg commented on a change in pull request #8336: [improvment](broker load) fix broker load check logic

2022-03-07 Thread GitBox


yangzhg commented on a change in pull request #8336:
URL: https://github.com/apache/incubator-doris/pull/8336#discussion_r821258659



##
File path: fe/fe-core/src/main/java/org/apache/doris/analysis/ExportStmt.java
##
@@ -232,16 +232,17 @@ public static String checkPath(String path, 
StorageBackend.StorageType type) thr
 if (Strings.isNullOrEmpty(path)) {
 throw new AnalysisException("No dest path specified.");
 }
-
+if (type == StorageBackend.StorageType.BROKER) {
+if (!path.toLowerCase().contains("bos") && 
!path.toLowerCase().contains("afs")
+&& !path.toLowerCase().contains("hdfs")) {
+throw new AnalysisException("Invalid export path. please use 
valid 'HDFS://', 'AFS://' or 'BOS://' path.");
+}
+return path;
+}

Review comment:
   How  about a path  `nvalid_schema://xxbos/xxafs_path/path-contians-hdfs`




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] wangbo opened a new issue #8388: [Enhancement] Avoid read index bitmap when no index is provided

2022-03-07 Thread GitBox


wangbo opened a new issue #8388:
URL: https://github.com/apache/incubator-doris/issues/8388


   ### Search before asking
   
   - [X] I had searched in the 
[issues](https://github.com/apache/incubator-doris/issues?q=is%3Aissue) and 
found no similar issues.
   
   
   ### Description
   
   When a query hits index, then index bitmap is reset;
   Storage layer read rows by index bitmap.
   In the current implementation, even a query doesn't hit any index, storage 
still need to read rows by index bitmap.
   This could bring two costs:
   1 Read data from roaring bitmap.
   2 The foreach to read rowids from roaring bitmap.
   
   There is still optimization room for query without indexes.
   
   ### Solution
   
   _No response_
   
   ### Are you willing to submit PR?
   
   - [X] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of 
Conduct](https://www.apache.org/foundation/policies/conduct)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] Henry2SS commented on pull request #8336: [improvment](broker load) fix broker load check logic

2022-03-07 Thread GitBox


Henry2SS commented on pull request #8336:
URL: https://github.com/apache/incubator-doris/pull/8336#issuecomment-1061349709


   > 
   
   It is. 
   URI not support '{}' '[]'. And when the path contains '{}' '[]', 
`URISyntaxException` will be thrown out. 
   This is the reason for the problem described in #8335.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] Henry2SS removed a comment on pull request #8336: [improvment](broker load) fix broker load check logic

2022-03-07 Thread GitBox


Henry2SS removed a comment on pull request #8336:
URL: https://github.com/apache/incubator-doris/pull/8336#issuecomment-1061349709


   > 
   
   It is. 
   URI not support '{}' '[]'. And when the path contains '{}' '[]', 
`URISyntaxException` will be thrown out. 
   This is the reason for the problem described in #8335.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] wangbo commented on a change in pull request #8386: [Bug][Vectorized] Fix the datetime type read error and is_same set error in reader

2022-03-07 Thread GitBox


wangbo commented on a change in pull request #8386:
URL: https://github.com/apache/incubator-doris/pull/8386#discussion_r821275926



##
File path: be/src/vec/olap/vcollect_iterator.cpp
##
@@ -129,7 +129,14 @@ bool 
VCollectIterator::LevelIteratorComparator::operator()(LevelIterator* lhs, L
 // for UNIQUE_KEYS just read the highest version and no need agg_update.
 // for AGG_KEYS if a version is deleted, the lower version no need to 
agg_update
 bool lower = (cmp_res != 0) ? (cmp_res < 0) : (lhs->version() < 
rhs->version());

Review comment:
   ```suggestion
   bool lower = lhs->version() < rhs->version();
   ```




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] Henry2SS commented on pull request #8336: [improvment](broker load) fix broker load check logic

2022-03-07 Thread GitBox


Henry2SS commented on pull request #8336:
URL: https://github.com/apache/incubator-doris/pull/8336#issuecomment-1061350644


   > Seems like the code you changed in this PR does not related the problem 
you described in #8335 ?
   
   It is.
   URI doesn't support '{}' '[]', and when a path contains '{}', '[]', a 
`URISyntaxException` will be thrown out.
   It is the reason for the problem described in #8335. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] Henry2SS commented on a change in pull request #8336: [improvment](broker load) fix broker load check logic

2022-03-07 Thread GitBox


Henry2SS commented on a change in pull request #8336:
URL: https://github.com/apache/incubator-doris/pull/8336#discussion_r821278406



##
File path: fe/fe-core/src/main/java/org/apache/doris/analysis/ExportStmt.java
##
@@ -232,16 +232,17 @@ public static String checkPath(String path, 
StorageBackend.StorageType type) thr
 if (Strings.isNullOrEmpty(path)) {
 throw new AnalysisException("No dest path specified.");
 }
-
+if (type == StorageBackend.StorageType.BROKER) {
+if (!path.toLowerCase().contains("bos") && 
!path.toLowerCase().contains("afs")
+&& !path.toLowerCase().contains("hdfs")) {
+throw new AnalysisException("Invalid export path. please use 
valid 'HDFS://', 'AFS://' or 'BOS://' path.");
+}
+return path;
+}

Review comment:
   `checkPath` only do a pre-check, it checks the input style.
   If the path is not a valid one, an error will be thrown out as expected.
   ```
type:ETL_RUN_FAIL; msg:errCode = 2, detailMessage = Broker list path failed.
   ```
   
   And maybe change `path.contains` to `path.startsWith`, is it better? What's 
your opinions?




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] adonis0147 commented on a change in pull request #8368: [feature-wip][array-type] Array data can be loaded in stream load.

2022-03-07 Thread GitBox


adonis0147 commented on a change in pull request #8368:
URL: https://github.com/apache/incubator-doris/pull/8368#discussion_r821279648



##
File path: be/src/util/array_parser.hpp
##
@@ -0,0 +1,215 @@
+// Licensed to the Apache Software Foundation (ASF) under one
+// or more contributor license agreements.  See the NOTICE file
+// distributed with this work for additional information
+// regarding copyright ownership.  The ASF licenses this file
+// to you under the Apache License, Version 2.0 (the
+// "License"); you may not use this file except in compliance
+// with the License.  You may obtain a copy of the License at
+//
+//   http://www.apache.org/licenses/LICENSE-2.0
+//
+// Unless required by applicable law or agreed to in writing,
+// software distributed under the License is distributed on an
+// "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+// KIND, either express or implied.  See the License for the
+// specific language governing permissions and limitations
+// under the License.
+
+#ifndef DORIS_BE_SRC_UTIL_ARRAY_PARSER_HPP
+#define DORIS_BE_SRC_UTIL_ARRAY_PARSER_HPP
+
+#include 
+
+#include 
+
+#include "common/status.h"
+#include "exprs/anyval_util.h"
+#include "runtime/collection_value.h"
+#include "runtime/primitive_type.h"
+#include "runtime/types.h"
+#include "util/mem_util.hpp"
+
+namespace doris {
+
+template 
+using ConstArray = typename rapidjson::GenericValue::ConstArray;
+
+template 
+using ConstArrayIterator = typename ConstArray::ValueIterator;
+
+class ArrayParser {
+public:
+static Status parse(CollectionVal& array_val, FunctionContext* context,
+const StringVal& str_val) {
+rapidjson::Document document;
+if (document.Parse(reinterpret_cast(str_val.ptr), 
str_val.len).HasParseError()) {
+return Status::RuntimeError("Failed to parse the json to array.");
+}
+if (document.IsNull()) {
+array_val = CollectionVal::null();
+return Status::OK();
+} else if (!document.IsArray()) {
+return Status::RuntimeError("Failed to parse the json to array.");
+}
+auto type_desc = convertToTypeDescriptor(context->get_return_type());
+return parse>(
+array_val, context,
+reinterpret_cast(&document)->GetArray(), type_desc);
+}
+
+private:
+static TypeDescriptor convertToTypeDescriptor(FunctionContext::TypeDesc 
function_type_desc) {

Review comment:
   Done. PTAL @morningman .




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] hf200012 commented on pull request #8369: [docs] Update documentation configuration parameter `sink.batch.bytes…

2022-03-07 Thread GitBox


hf200012 commented on pull request #8369:
URL: https://github.com/apache/incubator-doris/pull/8369#issuecomment-1061355224


   fix the code problem


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] wangbo commented on a change in pull request #8386: [Bug][Vectorized] Fix the datetime type read error and is_same set error in reader

2022-03-07 Thread GitBox


wangbo commented on a change in pull request #8386:
URL: https://github.com/apache/incubator-doris/pull/8386#discussion_r821275926



##
File path: be/src/vec/olap/vcollect_iterator.cpp
##
@@ -129,7 +129,14 @@ bool 
VCollectIterator::LevelIteratorComparator::operator()(LevelIterator* lhs, L
 // for UNIQUE_KEYS just read the highest version and no need agg_update.
 // for AGG_KEYS if a version is deleted, the lower version no need to 
agg_update
 bool lower = (cmp_res != 0) ? (cmp_res < 0) : (lhs->version() < 
rhs->version());

Review comment:
   ```suggestion
   bool lower = lhs->version() < rhs->version();
   ```




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] HappenLee commented on pull request #8384: [Feature][Vectorized] support replace()

2022-03-07 Thread GitBox


HappenLee commented on pull request #8384:
URL: https://github.com/apache/incubator-doris/pull/8384#issuecomment-1061363767






-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] caiconghui commented on pull request #8387: [Refactor](storage_engine) Use std::shared_lock to replace RWMutex

2022-03-07 Thread GitBox


caiconghui commented on pull request #8387:
URL: https://github.com/apache/incubator-doris/pull/8387#issuecomment-1061365634


   > Please describe the motivation and any possible performance impact, so 
that other reviewers can directly know what is going on.
   
   the motivation is to use the function of STL instead of wrapper of 
pthread_rwlock_t by ourselves, which is more portable.
   
   Exact behavior is not specified by the standard. But here are some clues:
   On POSIX systems shared_mutex will most likely be implemented on top of 
pthread_rwlock_t and implementations usually give preference to readers because 
of its requirement to support recursive read locks.
   
   reference as following:
   
https://www.quora.com/Why-use-std-shared_mutex-instead-of-a-C-wrapper-of-pthread_rwlock_t
   
https://stackoverflow.com/questions/33770500/when-to-use-c11-mutex-lock-unique-lock-shared-lock-etc
   
   anyway, I will make some performance test for the replacement


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] caiconghui commented on pull request #8324: [Feature](proc) Support proc dir for showing tablet health status

2022-03-07 Thread GitBox


caiconghui commented on pull request #8324:
URL: https://github.com/apache/incubator-doris/pull/8324#issuecomment-1061366922


   > Why don't just add new columns to `show proc "/statistic"`?
   
   cannot find a suitable place to append this, which may cause /statistic info 
verbose and not easy to display


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] lordk911 commented on issue #7502: Doris Roadmap 2022

2022-03-07 Thread GitBox


lordk911 commented on issue #7502:
URL: 
https://github.com/apache/incubator-doris/issues/7502#issuecomment-1061367317


   want to know when will vectorized query engine could be released.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] github-actions[bot] commented on pull request #8267: [fix](vectorization)Some small fix for SegmentIter Vectorization

2022-03-07 Thread GitBox


github-actions[bot] commented on pull request #8267:
URL: https://github.com/apache/incubator-doris/pull/8267#issuecomment-1061369587






-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] github-actions[bot] commented on pull request #8307: [Vectorized][Function] support regexp&&sm4&&aes functions

2022-03-07 Thread GitBox


github-actions[bot] commented on pull request #8307:
URL: https://github.com/apache/incubator-doris/pull/8307#issuecomment-1061375913






-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] morningman commented on pull request #8324: [Feature](proc) Support proc dir for showing tablet health status

2022-03-07 Thread GitBox


morningman commented on pull request #8324:
URL: https://github.com/apache/incubator-doris/pull/8324#issuecomment-1061394173


   > 
   
   So maybe it is better to move some column from `/statistic` to 
`/cluster_health`? such as `UnhealthyTabletNum` and `InconsistentTabletNum`? 
Otherwise we need to check them here and there.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] caiconghui commented on pull request #8324: [Feature](proc) Support proc dir for showing tablet health status

2022-03-07 Thread GitBox


caiconghui commented on pull request #8324:
URL: https://github.com/apache/incubator-doris/pull/8324#issuecomment-1061396132


   > > 
   > 
   > So maybe it is better to move some column from `/statistic` to 
`/cluster_health`? such as `UnhealthyTabletNum` and `InconsistentTabletNum`? 
Otherwise we need to check them here and there.
   
   move UnhealthyTabletNum | InconsistentTabletNum | CloningTabletNum | 
BadTabletNum |  to /cluster_health is more reasonable. BadTabletNum is 
uncoverableTabletNum


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] morningman merged pull request #8267: [fix](vectorization)Some small fix for SegmentIter Vectorization

2022-03-07 Thread GitBox


morningman merged pull request #8267:
URL: https://github.com/apache/incubator-doris/pull/8267


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[incubator-doris] branch master updated: [fix](vectorization)Some small fix for SegmentIter Vectorization (#8267)

2022-03-07 Thread morningman
This is an automated email from the ASF dual-hosted git repository.

morningman pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/incubator-doris.git


The following commit(s) were added to refs/heads/master by this push:
 new d711d64  [fix](vectorization)Some small fix for SegmentIter 
Vectorization  (#8267)
d711d64 is described below

commit d711d64dda3b92f3b51154ee585bf0c92b9aed0d
Author: wangbo 
AuthorDate: Tue Mar 8 13:13:17 2022 +0800

[fix](vectorization)Some small fix for SegmentIter Vectorization  (#8267)

1. No longer using short-circuit to evaluate date type, because the cost of 
read date type is small,
lazy materialization has higher costs.
2. Fix read hll/bitmap/date type error results.
---
 be/src/olap/rowset/segment_v2/segment_iterator.cpp |  5 +
 be/src/vec/columns/column_complex.h| 16 
 be/src/vec/columns/column_vector.h |  4 ++--
 3 files changed, 19 insertions(+), 6 deletions(-)

diff --git a/be/src/olap/rowset/segment_v2/segment_iterator.cpp 
b/be/src/olap/rowset/segment_v2/segment_iterator.cpp
index 2883e63..6a8febb 100644
--- a/be/src/olap/rowset/segment_v2/segment_iterator.cpp
+++ b/be/src/olap/rowset/segment_v2/segment_iterator.cpp
@@ -613,10 +613,7 @@ void SegmentIterator::_vec_init_lazy_materialization() {
 _is_pred_column[cid] = true;
 pred_column_ids.insert(cid);
 
-// for date type which can not be executed in a vectorized way, 
using short circuit execution
-if (type == OLAP_FIELD_TYPE_VARCHAR || type == 
OLAP_FIELD_TYPE_CHAR ||
-type == OLAP_FIELD_TYPE_DECIMAL || type == 
OLAP_FIELD_TYPE_DATE ||
-predicate->is_in_predicate()) {
+if (type == OLAP_FIELD_TYPE_VARCHAR || type == 
OLAP_FIELD_TYPE_CHAR || type == OLAP_FIELD_TYPE_STRING || 
predicate->is_in_predicate()) {
 short_cir_pred_col_id_set.insert(cid);
 _short_cir_eval_predicate.push_back(predicate);
 _is_all_column_basic_type = false;
diff --git a/be/src/vec/columns/column_complex.h 
b/be/src/vec/columns/column_complex.h
index cd26c7f..df2b582 100644
--- a/be/src/vec/columns/column_complex.h
+++ b/be/src/vec/columns/column_complex.h
@@ -63,6 +63,7 @@ public:
 }
 
 void insert_many_binary_data(char* data_array, uint32_t* len_array, 
uint32_t* start_offset_array, size_t num) override {
+resize(num);
 if constexpr (std::is_same_v) {
 for (size_t i = 0; i < num; i++) {
 uint32_t len = len_array[i];
@@ -76,6 +77,21 @@ public:
 *pvalue = 
std::move(*reinterpret_cast(data_array + start_offset));   
 }
 }
+} else if constexpr (std::is_same_v) {
+for (size_t i = 0; i < num; i++) {
+uint32_t len = len_array[i];
+uint32_t start_offset = start_offset_array[i];
+HyperLogLog* pvalue = &get_element(size() - 1);
+if (len != 0) {
+HyperLogLog value;
+value.deserialize(Slice(data_array + start_offset, len));
+*pvalue = std::move(value);
+} else {
+*pvalue = 
std::move(*reinterpret_cast(data_array + start_offset));
+}
+}
+} else {
+LOG(FATAL) << "Unexpected type in column complex";
 }
 }
 
diff --git a/be/src/vec/columns/column_vector.h 
b/be/src/vec/columns/column_vector.h
index c6b2311..216e8fd 100644
--- a/be/src/vec/columns/column_vector.h
+++ b/be/src/vec/columns/column_vector.h
@@ -179,7 +179,7 @@ public:
 value |= *(unsigned char*)(cur_ptr);
 vectorized::VecDateTimeValue date;
 date.from_olap_date(value);
-data.push_back_without_reserve(date);
+this->insert_data(reinterpret_cast(&date), 0);
 }
 }
 
@@ -189,7 +189,7 @@ public:
 const char* cur_ptr = data_ptr + value_size * i;
 uint64_t value = *reinterpret_cast(cur_ptr);
 vectorized::VecDateTimeValue date(value);
-data.push_back_without_reserve(date);
+this->insert_data(reinterpret_cast(&date), 0);
 }
 }
 

-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] morningman closed issue #8306: [Vectorized][Feature] support regexp&&sm4&&aes function

2022-03-07 Thread GitBox


morningman closed issue #8306:
URL: https://github.com/apache/incubator-doris/issues/8306


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[incubator-doris] branch master updated: [feature](vectorize)(function) support regexp&&sm4&&aes functions (#8307)

2022-03-07 Thread morningman
This is an automated email from the ASF dual-hosted git repository.

morningman pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/incubator-doris.git


The following commit(s) were added to refs/heads/master by this push:
 new 454b45b  [feature](vectorize)(function) support regexp&&sm4&&aes 
functions  (#8307)
454b45b is described below

commit 454b45bea3cb5d0c6eb0c2391465a07a19aceed4
Author: zhangstar333 <87313068+zhangstar...@users.noreply.github.com>
AuthorDate: Tue Mar 8 13:14:02 2022 +0800

[feature](vectorize)(function) support regexp&&sm4&&aes functions  (#8307)
---
 be/src/exprs/encryption_functions.cpp  |  21 --
 be/src/exprs/encryption_functions.h|  21 +-
 be/src/exprs/string_functions.cpp  |   4 +-
 be/src/exprs/string_functions.h|   3 +
 be/src/vec/CMakeLists.txt  |   3 +
 .../aggregate_function_percentile_approx.cpp   |   1 -
 be/src/vec/functions/function_encryption.cpp   | 245 +
 be/src/vec/functions/function_hex.cpp  | 183 +++
 be/src/vec/functions/function_regexp.cpp   | 216 ++
 be/src/vec/functions/function_string.cpp   | 132 +--
 be/src/vec/functions/function_string.h |  21 +-
 be/src/vec/functions/math.cpp  |  51 -
 be/src/vec/functions/simple_function_factory.h |   6 +
 be/test/vec/function/function_like_test.cpp|  59 +
 be/test/vec/function/function_string_test.cpp  | 241 +++-
 gensrc/script/doris_builtins_functions.py  |  74 +++
 16 files changed, 1034 insertions(+), 247 deletions(-)

diff --git a/be/src/exprs/encryption_functions.cpp 
b/be/src/exprs/encryption_functions.cpp
index 19ec1a7..a4d93b7 100644
--- a/be/src/exprs/encryption_functions.cpp
+++ b/be/src/exprs/encryption_functions.cpp
@@ -22,34 +22,13 @@
 #include "runtime/string_value.h"
 #include "runtime/tuple_row.h"
 #include "util/debug_util.h"
-#include "util/encryption_util.h"
 #include "util/md5.h"
 #include "util/sm3.h"
-#include "util/string_util.h"
 #include "util/url_coding.h"
 
 namespace doris {
 void EncryptionFunctions::init() {}
 
-StringCaseUnorderedMap aes_mode_map {
-{"AES_128_ECB", AES_128_ECB},   {"AES_192_ECB", AES_192_ECB},
-{"AES_256_ECB", AES_256_ECB},   {"AES_128_CBC", AES_128_CBC},
-{"AES_192_CBC", AES_192_CBC},   {"AES_256_CBC", AES_256_CBC},
-{"AES_128_CFB", AES_128_CFB},   {"AES_192_CFB", AES_192_CFB},
-{"AES_256_CFB", AES_256_CFB},   {"AES_128_CFB1", AES_128_CFB1},
-{"AES_192_CFB1", AES_192_CFB1}, {"AES_256_CFB1", AES_256_CFB1},
-{"AES_128_CFB8", AES_128_CFB8}, {"AES_192_CFB8", AES_192_CFB8},
-{"AES_256_CFB8", AES_256_CFB8}, {"AES_128_CFB128", AES_128_CFB128},
-{"AES_192_CFB128", AES_192_CFB128}, {"AES_256_CFB128", AES_256_CFB128},
-{"AES_128_CTR", AES_128_CTR},   {"AES_192_CTR", AES_192_CTR},
-{"AES_256_CTR", AES_256_CTR},   {"AES_128_OFB", AES_128_OFB},
-{"AES_192_OFB", AES_192_OFB},   {"AES_256_OFB", AES_256_OFB}};
-StringCaseUnorderedMap sm4_mode_map {{"SM4_128_ECB", 
SM4_128_ECB},
- {"SM4_128_CBC", 
SM4_128_CBC},
- {"SM4_128_CFB128", 
SM4_128_CFB128},
- {"SM4_128_OFB", 
SM4_128_OFB},
- {"SM4_128_CTR", 
SM4_128_CTR}};
-
 StringVal encrypt(FunctionContext* ctx, const StringVal& src, const StringVal& 
key,
   const StringVal& iv, EncryptionMode mode) {
 if (src.len == 0 || src.is_null) {
diff --git a/be/src/exprs/encryption_functions.h 
b/be/src/exprs/encryption_functions.h
index 8a7e45d..0046da8 100644
--- a/be/src/exprs/encryption_functions.h
+++ b/be/src/exprs/encryption_functions.h
@@ -22,13 +22,32 @@
 
 #include "udf/udf.h"
 #include "udf/udf_internal.h"
+#include "util/encryption_util.h"
+#include "util/string_util.h"
 
 namespace doris {
 
 class Expr;
 struct ExprValue;
 class TupleRow;
-
+static StringCaseUnorderedMap aes_mode_map {
+{"AES_128_ECB", AES_128_ECB},   {"AES_192_ECB", AES_192_ECB},
+{"AES_256_ECB", AES_256_ECB},   {"AES_128_CBC", AES_128_CBC},
+{"AES_192_CBC", AES_192_CBC},   {"AES_256_CBC", AES_256_CBC},
+{"AES_128_CFB", AES_128_CFB},   {"AES_192_CFB", AES_192_CFB},
+{"AES_256_CFB", AES_256_CFB},   {"AES_128_CFB1", AES_128_CFB1},
+{"AES_192_CFB1", AES_192_CFB1}, {"AES_256_CFB1", AES_256_CFB1},
+{"AES_128_CFB8", AES_128_CFB8}, {"AES_192_CFB8", AES_192_CFB8},
+{"AES_256_CFB8", AES_256_CFB8}, {"AES_128_CFB128", AES_128_CFB128},
+{"AES_192_CFB128", AES_192_CFB128}, {"AES_256_CFB128", AES_256_CFB128},
+{"AE

[GitHub] [incubator-doris] morningman merged pull request #8307: [Vectorized][Function] support regexp&&sm4&&aes functions

2022-03-07 Thread GitBox


morningman merged pull request #8307:
URL: https://github.com/apache/incubator-doris/pull/8307


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] morningman commented on a change in pull request #8355: [improvement][fix](insert)(replay) support SHOW LAST INSERT stmt and fix json replay bug

2022-03-07 Thread GitBox


morningman commented on a change in pull request #8355:
URL: https://github.com/apache/incubator-doris/pull/8355#discussion_r821328248



##
File path: 
fe/fe-core/src/main/java/org/apache/doris/analysis/ShowLastInsertStmt.java
##
@@ -0,0 +1,55 @@
+// Licensed to the Apache Software Foundation (ASF) under one
+// or more contributor license agreements.  See the NOTICE file
+// distributed with this work for additional information
+// regarding copyright ownership.  The ASF licenses this file
+// to you under the Apache License, Version 2.0 (the
+// "License"); you may not use this file except in compliance
+// with the License.  You may obtain a copy of the License at
+//
+//   http://www.apache.org/licenses/LICENSE-2.0
+//
+// Unless required by applicable law or agreed to in writing,
+// software distributed under the License is distributed on an
+// "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+// KIND, either express or implied.  See the License for the
+// specific language governing permissions and limitations
+// under the License.
+
+package org.apache.doris.analysis;
+
+import org.apache.doris.catalog.Column;
+import org.apache.doris.catalog.ScalarType;
+import org.apache.doris.qe.ShowResultSetMetaData;
+
+// SHOW LAST INSERT
+public class ShowLastInsertStmt extends ShowStmt {
+private static final ShowResultSetMetaData META_DATA =
+ShowResultSetMetaData.builder()
+.addColumn(new Column("TransactionId", 
ScalarType.createVarchar(128)))
+.addColumn(new Column("Label", 
ScalarType.createVarchar(128)))
+.addColumn(new Column("Database", 
ScalarType.createVarchar(128)))
+.addColumn(new Column("Table", 
ScalarType.createVarchar(128)))
+.addColumn(new Column("TransactionStatus", 
ScalarType.createVarchar(64)))
+.addColumn(new Column("LoadedRows", 
ScalarType.createVarchar(128)))
+.addColumn(new Column("FilteredRows", 
ScalarType.createVarchar(128)))
+.build();
+
+@Override
+public void analyze(Analyzer analyzer) {

Review comment:
   No need to check priv here. Bacause `show last insert` can only view the 
insert result of current session.
   So if user does not have priv to insert, than there is no result to show.




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[GitHub] [incubator-doris] morningman commented on a change in pull request #8355: [improvement][fix](insert)(replay) support SHOW LAST INSERT stmt and fix json replay bug

2022-03-07 Thread GitBox


morningman commented on a change in pull request #8355:
URL: https://github.com/apache/incubator-doris/pull/8355#discussion_r821328248



##
File path: 
fe/fe-core/src/main/java/org/apache/doris/analysis/ShowLastInsertStmt.java
##
@@ -0,0 +1,55 @@
+// Licensed to the Apache Software Foundation (ASF) under one
+// or more contributor license agreements.  See the NOTICE file
+// distributed with this work for additional information
+// regarding copyright ownership.  The ASF licenses this file
+// to you under the Apache License, Version 2.0 (the
+// "License"); you may not use this file except in compliance
+// with the License.  You may obtain a copy of the License at
+//
+//   http://www.apache.org/licenses/LICENSE-2.0
+//
+// Unless required by applicable law or agreed to in writing,
+// software distributed under the License is distributed on an
+// "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+// KIND, either express or implied.  See the License for the
+// specific language governing permissions and limitations
+// under the License.
+
+package org.apache.doris.analysis;
+
+import org.apache.doris.catalog.Column;
+import org.apache.doris.catalog.ScalarType;
+import org.apache.doris.qe.ShowResultSetMetaData;
+
+// SHOW LAST INSERT
+public class ShowLastInsertStmt extends ShowStmt {
+private static final ShowResultSetMetaData META_DATA =
+ShowResultSetMetaData.builder()
+.addColumn(new Column("TransactionId", 
ScalarType.createVarchar(128)))
+.addColumn(new Column("Label", 
ScalarType.createVarchar(128)))
+.addColumn(new Column("Database", 
ScalarType.createVarchar(128)))
+.addColumn(new Column("Table", 
ScalarType.createVarchar(128)))
+.addColumn(new Column("TransactionStatus", 
ScalarType.createVarchar(64)))
+.addColumn(new Column("LoadedRows", 
ScalarType.createVarchar(128)))
+.addColumn(new Column("FilteredRows", 
ScalarType.createVarchar(128)))
+.build();
+
+@Override
+public void analyze(Analyzer analyzer) {

Review comment:
   No need to check priv here. Bacause `show last insert` can only view the 
insert result of current session.
   So if user does not have priv to insert, than there is no result to show.
   Add this to the comment




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[incubator-doris] 03/05: [fix](vectorization)Some small fix for SegmentIter Vectorization (#8267)

2022-03-07 Thread morningman
This is an automated email from the ASF dual-hosted git repository.

morningman pushed a commit to branch dev-1.0.0
in repository https://gitbox.apache.org/repos/asf/incubator-doris.git

commit 9b6de9be424279ad5d33bfd74de9c8dfbfcc547e
Author: wangbo 
AuthorDate: Tue Mar 8 13:13:17 2022 +0800

[fix](vectorization)Some small fix for SegmentIter Vectorization  (#8267)

1. No longer using short-circuit to evaluate date type, because the cost of 
read date type is small,
lazy materialization has higher costs.
2. Fix read hll/bitmap/date type error results.
---
 be/src/olap/rowset/segment_v2/segment_iterator.cpp |  5 +
 be/src/vec/columns/column_complex.h| 16 
 be/src/vec/columns/column_vector.h |  4 ++--
 3 files changed, 19 insertions(+), 6 deletions(-)

diff --git a/be/src/olap/rowset/segment_v2/segment_iterator.cpp 
b/be/src/olap/rowset/segment_v2/segment_iterator.cpp
index 2883e63..6a8febb 100644
--- a/be/src/olap/rowset/segment_v2/segment_iterator.cpp
+++ b/be/src/olap/rowset/segment_v2/segment_iterator.cpp
@@ -613,10 +613,7 @@ void SegmentIterator::_vec_init_lazy_materialization() {
 _is_pred_column[cid] = true;
 pred_column_ids.insert(cid);
 
-// for date type which can not be executed in a vectorized way, 
using short circuit execution
-if (type == OLAP_FIELD_TYPE_VARCHAR || type == 
OLAP_FIELD_TYPE_CHAR ||
-type == OLAP_FIELD_TYPE_DECIMAL || type == 
OLAP_FIELD_TYPE_DATE ||
-predicate->is_in_predicate()) {
+if (type == OLAP_FIELD_TYPE_VARCHAR || type == 
OLAP_FIELD_TYPE_CHAR || type == OLAP_FIELD_TYPE_STRING || 
predicate->is_in_predicate()) {
 short_cir_pred_col_id_set.insert(cid);
 _short_cir_eval_predicate.push_back(predicate);
 _is_all_column_basic_type = false;
diff --git a/be/src/vec/columns/column_complex.h 
b/be/src/vec/columns/column_complex.h
index cd26c7f..df2b582 100644
--- a/be/src/vec/columns/column_complex.h
+++ b/be/src/vec/columns/column_complex.h
@@ -63,6 +63,7 @@ public:
 }
 
 void insert_many_binary_data(char* data_array, uint32_t* len_array, 
uint32_t* start_offset_array, size_t num) override {
+resize(num);
 if constexpr (std::is_same_v) {
 for (size_t i = 0; i < num; i++) {
 uint32_t len = len_array[i];
@@ -76,6 +77,21 @@ public:
 *pvalue = 
std::move(*reinterpret_cast(data_array + start_offset));   
 }
 }
+} else if constexpr (std::is_same_v) {
+for (size_t i = 0; i < num; i++) {
+uint32_t len = len_array[i];
+uint32_t start_offset = start_offset_array[i];
+HyperLogLog* pvalue = &get_element(size() - 1);
+if (len != 0) {
+HyperLogLog value;
+value.deserialize(Slice(data_array + start_offset, len));
+*pvalue = std::move(value);
+} else {
+*pvalue = 
std::move(*reinterpret_cast(data_array + start_offset));
+}
+}
+} else {
+LOG(FATAL) << "Unexpected type in column complex";
 }
 }
 
diff --git a/be/src/vec/columns/column_vector.h 
b/be/src/vec/columns/column_vector.h
index c6b2311..216e8fd 100644
--- a/be/src/vec/columns/column_vector.h
+++ b/be/src/vec/columns/column_vector.h
@@ -179,7 +179,7 @@ public:
 value |= *(unsigned char*)(cur_ptr);
 vectorized::VecDateTimeValue date;
 date.from_olap_date(value);
-data.push_back_without_reserve(date);
+this->insert_data(reinterpret_cast(&date), 0);
 }
 }
 
@@ -189,7 +189,7 @@ public:
 const char* cur_ptr = data_ptr + value_size * i;
 uint64_t value = *reinterpret_cast(cur_ptr);
 vectorized::VecDateTimeValue date(value);
-data.push_back_without_reserve(date);
+this->insert_data(reinterpret_cast(&date), 0);
 }
 }
 

-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[incubator-doris] 01/05: Revert "[refactor] remove unused new_in_predicate code (#8263)" (#8372)

2022-03-07 Thread morningman
This is an automated email from the ASF dual-hosted git repository.

morningman pushed a commit to branch dev-1.0.0
in repository https://gitbox.apache.org/repos/asf/incubator-doris.git

commit cadf19ae38bc2e4a88c27263bb703762695ccf4e
Author: Zhengguo Yang <780531...@qq.com>
AuthorDate: Mon Mar 7 15:55:38 2022 +0800

Revert "[refactor] remove unused new_in_predicate code (#8263)" (#8372)

This reverts commit 757e35744d4f6319e936fca84b4be13cf043a578.
---
 be/src/common/daemon.cpp  |   2 +
 be/src/exprs/CMakeLists.txt   |   1 +
 be/src/exprs/new_in_predicate.cpp | 185 
 be/src/exprs/new_in_predicate.h   | 348 ++
 4 files changed, 536 insertions(+)

diff --git a/be/src/common/daemon.cpp b/be/src/common/daemon.cpp
index 04625e3..044feda 100644
--- a/be/src/common/daemon.cpp
+++ b/be/src/common/daemon.cpp
@@ -37,6 +37,7 @@
 #include "exprs/json_functions.h"
 #include "exprs/like_predicate.h"
 #include "exprs/math_functions.h"
+#include "exprs/new_in_predicate.h"
 #include "exprs/operators.h"
 #include "exprs/string_functions.h"
 #include "exprs/table_function/dummy_table_functions.h"
@@ -249,6 +250,7 @@ void Daemon::init(int argc, char** argv, const 
std::vector& paths) {
 StringFunctions::init();
 ArrayFunctions::init();
 CastFunctions::init();
+InPredicate::init();
 MathFunctions::init();
 EncryptionFunctions::init();
 TimestampFunctions::init();
diff --git a/be/src/exprs/CMakeLists.txt b/be/src/exprs/CMakeLists.txt
index f266fd9..c71f23e 100644
--- a/be/src/exprs/CMakeLists.txt
+++ b/be/src/exprs/CMakeLists.txt
@@ -41,6 +41,7 @@ add_library(Exprs
   expr.cpp
   expr_context.cpp
   in_predicate.cpp
+  new_in_predicate.cpp
   bloomfilter_predicate.cpp
   block_bloom_filter_avx_impl.cc
   block_bloom_filter_impl.cc
diff --git a/be/src/exprs/new_in_predicate.cpp 
b/be/src/exprs/new_in_predicate.cpp
new file mode 100644
index 000..b0a6390
--- /dev/null
+++ b/be/src/exprs/new_in_predicate.cpp
@@ -0,0 +1,185 @@
+// Licensed to the Apache Software Foundation (ASF) under one
+// or more contributor license agreements.  See the NOTICE file
+// distributed with this work for additional information
+// regarding copyright ownership.  The ASF licenses this file
+// to you under the Apache License, Version 2.0 (the
+// "License"); you may not use this file except in compliance
+// with the License.  You may obtain a copy of the License at
+//
+//   http://www.apache.org/licenses/LICENSE-2.0
+//
+// Unless required by applicable law or agreed to in writing,
+// software distributed under the License is distributed on an
+// "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+// KIND, either express or implied.  See the License for the
+// specific language governing permissions and limitations
+// under the License.
+
+#include "exprs/new_in_predicate.h"
+
+#include 
+
+#include "exprs/anyval_util.h"
+#include "runtime/string_value.hpp"
+
+namespace doris {
+
+void InPredicate::init() {}
+
+// Templated getter functions for extracting 'SetType' values from AnyVals
+template 
+SetType get_val(const FunctionContext::TypeDesc* type, const T& x) {
+DCHECK(!x.is_null);
+return x.val;
+}
+
+template <>
+StringValue get_val(const FunctionContext::TypeDesc* type, const StringVal& x) 
{
+DCHECK(!x.is_null);
+return StringValue::from_string_val(x);
+}
+
+template <>
+DateTimeValue get_val(const FunctionContext::TypeDesc* type, const 
DateTimeVal& x) {
+return DateTimeValue::from_datetime_val(x);
+}
+
+template <>
+DecimalV2Value get_val(const FunctionContext::TypeDesc* type, const 
DecimalV2Val& x) {
+return DecimalV2Value::from_decimal_val(x);
+}
+
+template 
+void InPredicate::set_lookup_prepare(FunctionContext* ctx,
+ FunctionContext::FunctionStateScope 
scope) {
+if (scope != FunctionContext::FRAGMENT_LOCAL) {
+return;
+}
+
+SetLookupState* state = new SetLookupState;
+state->type = ctx->get_arg_type(0);
+state->contains_null = false;
+for (int i = 1; i < ctx->get_num_args(); ++i) {
+DCHECK(ctx->is_arg_constant(i));
+T* arg = reinterpret_cast(ctx->get_constant_arg(i));
+if (arg->is_null) {
+state->contains_null = true;
+} else {
+state->val_set.insert(get_val(state->type, *arg));
+}
+}
+ctx->set_function_state(scope, state);
+}
+
+template 
+void InPredicate::set_lookup_close(FunctionContext* ctx,
+   FunctionContext::FunctionStateScope scope) {
+if (scope != FunctionContext::FRAGMENT_LOCAL) {
+return;
+}
+SetLookupState* state =
+
reinterpret_cast*>(ctx->get_function_state(scope));
+delete state;
+}
+
+template 
+BooleanVal InPredicate::templated_in(FunctionContext* ctx, const T& val, int 
num_args,
+ const T* args) {
+if (val.is_null) {
+

[incubator-doris] 05/05: [improvement] Support show tablets stmt (#7970)

2022-03-07 Thread morningman
This is an automated email from the ASF dual-hosted git repository.

morningman pushed a commit to branch dev-1.0.0
in repository https://gitbox.apache.org/repos/asf/incubator-doris.git

commit d2b88edfdab7179f580e587c85007e27f7818cf1
Author: ChPi 
AuthorDate: Sat Mar 5 15:25:57 2022 +0800

[improvement] Support show tablets stmt (#7970)

change `show tablet from tbl` to `show tablets from tbl`
---
 .../Data Manipulation/SHOW TABLET.md   | 13 ++---
 .../Data Manipulation/SHOW TABLETS.md  | 56 ++
 .../Data Manipulation/SHOW TABLET.md   | 33 ++---
 .../{SHOW TABLET.md => SHOW TABLETS.md}| 46 +-
 fe/fe-core/src/main/cup/sql_parser.cup |  4 +-
 fe/fe-core/src/main/jflex/sql_scanner.flex |  1 +
 6 files changed, 89 insertions(+), 64 deletions(-)

diff --git a/docs/en/sql-reference/sql-statements/Data Manipulation/SHOW 
TABLET.md b/docs/en/sql-reference/sql-statements/Data Manipulation/SHOW 
TABLET.md
index 17a559e..21ccbcc 100644
--- a/docs/en/sql-reference/sql-statements/Data Manipulation/SHOW TABLET.md 
+++ b/docs/en/sql-reference/sql-statements/Data Manipulation/SHOW TABLET.md 
@@ -26,17 +26,14 @@ under the License.
 
 # SHOW TABLET
 ## Description
-This statement is used to display tablet-related information (for 
administrators only)
-Grammar:
-SHOW TABLET
-[From [db-uu name]] table U name.; Table U Id]
+This statement is used to display information of the specified tablet (for 
administrators only)
+Grammar:
+SHOW TABLET tablet_id
 
 ## example
-1. Display all tablet information in the specified table below the specified DB
-SHOW TABLET FROM example_db.table_name;
 
-2. Display parent level ID information of tablet with specified tablet ID of 
1
-Performance board 1;
+// Display parent level ID information of tablet with specified tablet 
id of 1
+SHOW TABLET 1;
 
 ## keyword
 SHOW,TABLET
diff --git a/docs/en/sql-reference/sql-statements/Data Manipulation/SHOW 
TABLETS.md b/docs/en/sql-reference/sql-statements/Data Manipulation/SHOW 
TABLETS.md
new file mode 100644
index 000..0ed338d
--- /dev/null
+++ b/docs/en/sql-reference/sql-statements/Data Manipulation/SHOW TABLETS.md
@@ -0,0 +1,56 @@
+---
+{
+"title": "SHOW TABLETS",
+"language": "en"
+}
+---
+
+
+
+# SHOW TABLETS
+## Description
+This statement is used to display tablet-related information (for 
administrators only)
+Grammar:
+SHOW TABLETS
+[FROM [db_name.]table_name] [partiton(partition_name_1, 
partition_name_1)]
+[where [version=1] [and backendid=1] [and 
state="NORMAL|ROLLUP|CLONE|DECOMMISSION"]]
+[order by order_column]
+[limit [offset,]size]
+
+## example
+// Display all tablets information in the specified table below the 
specified DB
+SHOW TABLETS FROM example_db.table_name;
+
+SHOW TABLETS FROM example_db.table_name partition(p1, p2);
+
+// display 10 tablets information in the table
+SHOW TABLETS FROM example_db.table_name limit 10;
+
+SHOW TABLETS FROM example_db.table_name limit 5,10;
+
+// display the tablets that fulfill some conditions
+SHOW TABLETS FROM example_db.table_name where backendid=1 and 
version=1 and state="NORMAL";
+
+SHOW TABLETS FROM example_db.table_name where backendid=1 order by 
version;
+
+SHOW TABLETS FROM example_db.table_name where indexname="t1_rollup";
+
+## keyword
+SHOW,TABLETS,LIMIT
diff --git a/docs/zh-CN/sql-reference/sql-statements/Data Manipulation/SHOW 
TABLET.md b/docs/zh-CN/sql-reference/sql-statements/Data Manipulation/SHOW 
TABLET.md
index fc6fb77..d2de94f 100644
--- a/docs/zh-CN/sql-reference/sql-statements/Data Manipulation/SHOW TABLET.md  
+++ b/docs/zh-CN/sql-reference/sql-statements/Data Manipulation/SHOW TABLET.md  
@@ -26,40 +26,13 @@ under the License.
 
 # SHOW TABLET
 ## description
-该语句用于显示 tablet 相关的信息(仅管理员使用)
+该语句用于显示指定tablet id 信息(仅管理员使用)
 语法:
-SHOW TABLET
-[FROM [db_name.]table_name | tablet_id] [partiton(partition_name_1, 
partition_name_1)]
-[where [version=1] [and backendid=1] [and 
state="NORMAL|ROLLUP|CLONE|DECOMMISSION"]]
-[order by order_column]
-[limit [offset,]size]
+SHOW TABLET tablet_id
 
-现在show tablet命令支持按照按照以下字段进行过滤:partition, index name, version, backendid,
-state,同时支持按照任意字段进行排序,并且提供limit限制返回条数。
 
 ## example
-1. 显示指定 db 的下指定表所有 tablet 信息
-SHOW TABLET FROM example_db.table_name;
-
-// 获取partition p1和p2的tablet信息
-SHOW TABLET FROM example_db.table_name partition(p1, p2);
-
-// 获取10个结果
-SHOW TABLET FROM example_db.table_name limit 10;
-
-// 从偏移5开始获取10个结果
-SHOW TABLET FROM example_db.table_name limit 5,10;
-
-// 按照backendid/version/state字段进行过滤
-SHOW

[incubator-doris] 02/05: [chore] fix build with parallel parameter only (#8352)

2022-03-07 Thread morningman
This is an automated email from the ASF dual-hosted git repository.

morningman pushed a commit to branch dev-1.0.0
in repository https://gitbox.apache.org/repos/asf/incubator-doris.git

commit 26f55dabcd87fff37180cce6ecbdd122f591f90d
Author: dataalive <99398130+dataal...@users.noreply.github.com>
AuthorDate: Mon Mar 7 23:19:19 2022 +0800

[chore] fix build with parallel parameter only (#8352)
---
 build.sh | 13 -
 1 file changed, 12 insertions(+), 1 deletion(-)

diff --git a/build.sh b/build.sh
index 2c883c8..bc44fb8 100755
--- a/build.sh
+++ b/build.sh
@@ -121,6 +121,8 @@ BUILD_UI=
 BUILD_SPARK_DPP=
 CLEAN=
 HELP=0
+PARAMETER_COUNT=$#
+PARAMETER_FLAG=0
 if [ $# == 1 ] ; then
 # default
 BUILD_BE=1
@@ -146,11 +148,20 @@ else
 --clean) CLEAN=1 ; shift ;;
 -h) HELP=1; shift ;;
 --help) HELP=1; shift ;;
--j) PARALLEL=$2; shift 2 ;;
+-j) PARALLEL=$2; PARAMETER_FLAG=1; shift 2 ;;
 --) shift ;  break ;;
 *) echo "Internal error" ; exit 1 ;;
 esac
 done
+#only ./build.sh -j xx then build all 
+if [[ ${PARAMETER_COUNT} -eq 3 ]] && [[ ${PARAMETER_FLAG} -eq 1 ]];then
+BUILD_BE=1
+BUILD_FE=1
+BUILD_BROKER=1
+BUILD_UI=1
+BUILD_SPARK_DPP=1
+CLEAN=0
+fi
 fi
 
 if [[ ${HELP} -eq 1 ]]; then

-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



[incubator-doris] 04/05: [feature](vectorize)(function) support regexp&&sm4&&aes functions (#8307)

2022-03-07 Thread morningman
This is an automated email from the ASF dual-hosted git repository.

morningman pushed a commit to branch dev-1.0.0
in repository https://gitbox.apache.org/repos/asf/incubator-doris.git

commit 209c5a173b1335514af0f1e57f033b2c21803097
Author: zhangstar333 <87313068+zhangstar...@users.noreply.github.com>
AuthorDate: Tue Mar 8 13:14:02 2022 +0800

[feature](vectorize)(function) support regexp&&sm4&&aes functions  (#8307)
---
 be/src/exprs/encryption_functions.cpp  |  21 --
 be/src/exprs/encryption_functions.h|  21 +-
 be/src/exprs/string_functions.cpp  |   4 +-
 be/src/exprs/string_functions.h|   3 +
 be/src/vec/CMakeLists.txt  |   3 +
 .../aggregate_function_percentile_approx.cpp   |   1 -
 be/src/vec/functions/function_encryption.cpp   | 245 +
 be/src/vec/functions/function_hex.cpp  | 183 +++
 be/src/vec/functions/function_regexp.cpp   | 216 ++
 be/src/vec/functions/function_string.cpp   | 132 +--
 be/src/vec/functions/function_string.h |  21 +-
 be/src/vec/functions/math.cpp  |  51 -
 be/src/vec/functions/simple_function_factory.h |   6 +
 be/test/vec/function/function_like_test.cpp|  59 +
 be/test/vec/function/function_string_test.cpp  | 241 +++-
 gensrc/script/doris_builtins_functions.py  |  74 +++
 16 files changed, 1034 insertions(+), 247 deletions(-)

diff --git a/be/src/exprs/encryption_functions.cpp 
b/be/src/exprs/encryption_functions.cpp
index 19ec1a7..a4d93b7 100644
--- a/be/src/exprs/encryption_functions.cpp
+++ b/be/src/exprs/encryption_functions.cpp
@@ -22,34 +22,13 @@
 #include "runtime/string_value.h"
 #include "runtime/tuple_row.h"
 #include "util/debug_util.h"
-#include "util/encryption_util.h"
 #include "util/md5.h"
 #include "util/sm3.h"
-#include "util/string_util.h"
 #include "util/url_coding.h"
 
 namespace doris {
 void EncryptionFunctions::init() {}
 
-StringCaseUnorderedMap aes_mode_map {
-{"AES_128_ECB", AES_128_ECB},   {"AES_192_ECB", AES_192_ECB},
-{"AES_256_ECB", AES_256_ECB},   {"AES_128_CBC", AES_128_CBC},
-{"AES_192_CBC", AES_192_CBC},   {"AES_256_CBC", AES_256_CBC},
-{"AES_128_CFB", AES_128_CFB},   {"AES_192_CFB", AES_192_CFB},
-{"AES_256_CFB", AES_256_CFB},   {"AES_128_CFB1", AES_128_CFB1},
-{"AES_192_CFB1", AES_192_CFB1}, {"AES_256_CFB1", AES_256_CFB1},
-{"AES_128_CFB8", AES_128_CFB8}, {"AES_192_CFB8", AES_192_CFB8},
-{"AES_256_CFB8", AES_256_CFB8}, {"AES_128_CFB128", AES_128_CFB128},
-{"AES_192_CFB128", AES_192_CFB128}, {"AES_256_CFB128", AES_256_CFB128},
-{"AES_128_CTR", AES_128_CTR},   {"AES_192_CTR", AES_192_CTR},
-{"AES_256_CTR", AES_256_CTR},   {"AES_128_OFB", AES_128_OFB},
-{"AES_192_OFB", AES_192_OFB},   {"AES_256_OFB", AES_256_OFB}};
-StringCaseUnorderedMap sm4_mode_map {{"SM4_128_ECB", 
SM4_128_ECB},
- {"SM4_128_CBC", 
SM4_128_CBC},
- {"SM4_128_CFB128", 
SM4_128_CFB128},
- {"SM4_128_OFB", 
SM4_128_OFB},
- {"SM4_128_CTR", 
SM4_128_CTR}};
-
 StringVal encrypt(FunctionContext* ctx, const StringVal& src, const StringVal& 
key,
   const StringVal& iv, EncryptionMode mode) {
 if (src.len == 0 || src.is_null) {
diff --git a/be/src/exprs/encryption_functions.h 
b/be/src/exprs/encryption_functions.h
index 8a7e45d..0046da8 100644
--- a/be/src/exprs/encryption_functions.h
+++ b/be/src/exprs/encryption_functions.h
@@ -22,13 +22,32 @@
 
 #include "udf/udf.h"
 #include "udf/udf_internal.h"
+#include "util/encryption_util.h"
+#include "util/string_util.h"
 
 namespace doris {
 
 class Expr;
 struct ExprValue;
 class TupleRow;
-
+static StringCaseUnorderedMap aes_mode_map {
+{"AES_128_ECB", AES_128_ECB},   {"AES_192_ECB", AES_192_ECB},
+{"AES_256_ECB", AES_256_ECB},   {"AES_128_CBC", AES_128_CBC},
+{"AES_192_CBC", AES_192_CBC},   {"AES_256_CBC", AES_256_CBC},
+{"AES_128_CFB", AES_128_CFB},   {"AES_192_CFB", AES_192_CFB},
+{"AES_256_CFB", AES_256_CFB},   {"AES_128_CFB1", AES_128_CFB1},
+{"AES_192_CFB1", AES_192_CFB1}, {"AES_256_CFB1", AES_256_CFB1},
+{"AES_128_CFB8", AES_128_CFB8}, {"AES_192_CFB8", AES_192_CFB8},
+{"AES_256_CFB8", AES_256_CFB8}, {"AES_128_CFB128", AES_128_CFB128},
+{"AES_192_CFB128", AES_192_CFB128}, {"AES_256_CFB128", AES_256_CFB128},
+{"AES_128_CTR", AES_128_CTR},   {"AES_192_CTR", AES_192_CTR},
+{"AES_256_CTR", AES_256_CTR},   {"AES_128_OFB", AES_128_OFB},
+{"AES_192_OFB", AES_192_OFB},   {"AES_25

[incubator-doris] branch dev-1.0.0 updated (d7f8a5d -> d2b88ed)

2022-03-07 Thread morningman
This is an automated email from the ASF dual-hosted git repository.

morningman pushed a change to branch dev-1.0.0
in repository https://gitbox.apache.org/repos/asf/incubator-doris.git.


from d7f8a5d  [feature](vec) Support update stmt in vec query engine (#8296)
 new cadf19a  Revert "[refactor] remove unused new_in_predicate code 
(#8263)" (#8372)
 new 26f55da  [chore] fix build with parallel parameter only (#8352)
 new 9b6de9b  [fix](vectorization)Some small fix for SegmentIter 
Vectorization  (#8267)
 new 209c5a1  [feature](vectorize)(function) support regexp&&sm4&&aes 
functions  (#8307)
 new d2b88ed  [improvement] Support show tablets stmt (#7970)

The 5 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 be/src/common/daemon.cpp   |   2 +
 be/src/exprs/CMakeLists.txt|   1 +
 be/src/exprs/encryption_functions.cpp  |  21 --
 be/src/exprs/encryption_functions.h|  21 +-
 be/src/exprs/new_in_predicate.cpp  | 185 +++
 be/src/exprs/new_in_predicate.h| 348 +
 be/src/exprs/string_functions.cpp  |   4 +-
 be/src/exprs/string_functions.h|   3 +
 be/src/olap/rowset/segment_v2/segment_iterator.cpp |   5 +-
 be/src/vec/CMakeLists.txt  |   3 +
 .../aggregate_function_percentile_approx.cpp   |   1 -
 be/src/vec/columns/column_complex.h|  16 +
 be/src/vec/columns/column_vector.h |   4 +-
 be/src/vec/functions/function_encryption.cpp   | 245 +++
 be/src/vec/functions/function_hex.cpp  | 183 +++
 be/src/vec/functions/function_regexp.cpp   | 216 +
 be/src/vec/functions/function_string.cpp   | 132 +---
 be/src/vec/functions/function_string.h |  21 +-
 be/src/vec/functions/math.cpp  |  51 ---
 be/src/vec/functions/simple_function_factory.h |   6 +
 be/test/vec/function/function_like_test.cpp|  59 
 be/test/vec/function/function_string_test.cpp  | 241 +-
 build.sh   |  13 +-
 .../Data Manipulation/SHOW TABLET.md   |  13 +-
 .../Data Manipulation/SHOW TABLETS.md  |  56 
 .../Data Manipulation/SHOW TABLET.md   |  33 +-
 .../{SHOW TABLET.md => SHOW TABLETS.md}|  46 ++-
 fe/fe-core/src/main/cup/sql_parser.cup |   4 +-
 fe/fe-core/src/main/jflex/sql_scanner.flex |   1 +
 gensrc/script/doris_builtins_functions.py  |  74 ++---
 30 files changed, 1690 insertions(+), 318 deletions(-)
 create mode 100644 be/src/exprs/new_in_predicate.cpp
 create mode 100644 be/src/exprs/new_in_predicate.h
 create mode 100644 be/src/vec/functions/function_encryption.cpp
 create mode 100644 be/src/vec/functions/function_hex.cpp
 create mode 100644 be/src/vec/functions/function_regexp.cpp
 create mode 100644 docs/en/sql-reference/sql-statements/Data Manipulation/SHOW 
TABLETS.md
 copy docs/zh-CN/sql-reference/sql-statements/Data Manipulation/{SHOW TABLET.md 
=> SHOW TABLETS.md} (60%)

-
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org



  1   2   >