Csaba Ringhofer has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/22049 )

Change subject: WIP IMPALA-10319: Support arbitrary encodings on Text/Sequence 
files
......................................................................


Patch Set 15:

(4 comments)

http://gerrit.cloudera.org:8080/#/c/22049/15/be/src/runtime/mem-tracker.cc
File be/src/runtime/mem-tracker.cc:

http://gerrit.cloudera.org:8080/#/c/22049/15/be/src/runtime/mem-tracker.cc@587
PS15, Line 587:   if (tracker_ != nullptr) {
              :     tracker_->Release(size_);
              :   }
We should also consider whether the allocation actually happened and was 
successful.

My suggestion is to set size_ to 0 in the constructor and increase it in 
TryConsume based on a parameter if it was successful. This would allow calling 
TryConsume multiple times.


http://gerrit.cloudera.org:8080/#/c/22049/14/fe/src/main/java/org/apache/impala/analysis/AlterTableSetTblProperties.java
File 
fe/src/main/java/org/apache/impala/analysis/AlterTableSetTblProperties.java:

http://gerrit.cloudera.org:8080/#/c/22049/14/fe/src/main/java/org/apache/impala/analysis/AlterTableSetTblProperties.java@297
PS14, Line 297:     if (tbl instanceof FeIcebergTable) {
I am unsure about the detailed verification.
- I am not sure if FeIcebergTable should support any serde property in general
- this strictness also disallows unsetting the property if it incorrectly set 
in another engine
- the same verification should be done for create table too


http://gerrit.cloudera.org:8080/#/c/22049/14/fe/src/test/java/org/apache/impala/analysis/AnalyzeDDLTest.java
File fe/src/test/java/org/apache/impala/analysis/AnalyzeDDLTest.java:

http://gerrit.cloudera.org:8080/#/c/22049/14/fe/src/test/java/org/apache/impala/analysis/AnalyzeDDLTest.java@1068
PS14, Line 1068: "alter table
CREATE TABLE ... SERDEPROPERTIES could be also tested. See 
https://github.com/apache/impala/blob/b121a40d20107bad6c04732ba580f26639acd43a/docker/quickstart_client/load_tpcds_parquet.sql#L78
 for example.

Meanwhile I am not sure about the serde prop verification, see my comment in 
AlterTableSetTblProperties.java


http://gerrit.cloudera.org:8080/#/c/22049/14/tests/query_test/test_charcodec.py
File tests/query_test/test_charcodec.py:

http://gerrit.cloudera.org:8080/#/c/22049/14/tests/query_test/test_charcodec.py@77
PS14, Line 77: lambda v: v.get_value('table_format').file_format == 'text'
> IIRC we have previously agreed to move SequenceFiles decoding to a separate
It is not clear to me how this patch handles Sequence files - 
AlterTableSetTblProperties.java line 319 suggests that it is supported.



--
To view, visit http://gerrit.cloudera.org:8080/22049
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I787cd01caa52a19d6645519a6cedabe0a5253a65
Gerrit-Change-Number: 22049
Gerrit-PatchSet: 15
Gerrit-Owner: Mihaly Szjatinya <msz...@pm.me>
Gerrit-Reviewer: Csaba Ringhofer <csringho...@cloudera.com>
Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com>
Gerrit-Reviewer: Mihaly Szjatinya <msz...@pm.me>
Gerrit-Reviewer: Quanlong Huang <huangquanl...@gmail.com>
Gerrit-Comment-Date: Mon, 31 Mar 2025 12:19:40 +0000
Gerrit-HasComments: Yes

Reply via email to