amorynan opened a new pull request, #21699:
URL: https://github.com/apache/doris/pull/21699

   ## Proposed changes
   
   Issue Number: close #xxx
   1. when cal array hash, elem size is not need to seed hash 
   ```
   hash = HashUtil::zlib_crc_hash(reinterpret_cast<const char*>(&elem_size),
                                                      sizeof(elem_size), hash);
   ```
   but we need to be care [[], [1]] vs [[1], []], when array nested array , and 
nested array is empty, we should make hash seed to 
   make difference
   2.  use range for one hash value to avoid virtual function call in loop.
   which double the performance. I make it in ut
   
    column: array[int64]
     50 rows , and single array has 1000000 elements
   
   before : 
   <img width="797" alt="截屏2023-07-10 22 20 45" 
src="https://github.com/apache/doris/assets/18551114/25b70cb0-e407-4fff-b45e-d516527cdf34";>
   
   after : 
   <img width="680" alt="截屏2023-07-10 22 23 08" 
src="https://github.com/apache/doris/assets/18551114/5cb78933-2cbe-4cbc-9a9d-28bf7a76fcf4";>
   
   
   <!--Describe your changes.-->
   
   ## Further comments
   
   If this is a relatively large or complex change, kick off the discussion at 
[d...@doris.apache.org](mailto:d...@doris.apache.org) by explaining why you 
chose the solution you did and what alternatives you considered, etc...
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org

Reply via email to