Csaba Ringhofer has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/22289 )

Change subject: IMPALA-12927: Support specifying format for reading JSON BINARY 
columns
......................................................................


Patch Set 6:

(4 comments)

http://gerrit.cloudera.org:8080/#/c/22289/6/fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java
File fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java:

http://gerrit.cloudera.org:8080/#/c/22289/6/fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java@567
PS6, Line 567:       TJsonBinaryFormat specificFormat = 
partition.getInputFormatDescriptor()
Will this return the table level serde property in a partitioned table where it 
was set at a table level? It would be better if there was a test with a 
partitioned table.

I think that it is actually ok to forbid the case when different partitions 
have different JSON_BINARY_FORMAT to simplify testing. It could be added later 
if someone actuallyn needs it.


http://gerrit.cloudera.org:8080/#/c/22289/6/testdata/datasets/functional/functional_schema_template.sql
File testdata/datasets/functional/functional_schema_template.sql:

http://gerrit.cloudera.org:8080/#/c/22289/6/testdata/datasets/functional/functional_schema_template.sql@4499
PS6, Line 4499: alter table {db_name}{db_suffix}.{table_name} set 
serdeproperties ('json.binary.format'='base64');
It seems cleaner to me to do the ALTER before the INSERT - or that would lead 
to a different result?


http://gerrit.cloudera.org:8080/#/c/22289/6/testdata/datasets/functional/functional_schema_template.sql@4544
PS6, Line 4544: alter table {db_name}{db_suffix}.{table_name} partition(year) 
set serdeproperties ('json.binary.format'='base64');
Same as line 4499?


http://gerrit.cloudera.org:8080/#/c/22289/6/testdata/workloads/functional-query/queries/QueryTest/json-binary-format.test
File 
testdata/workloads/functional-query/queries/QueryTest/json-binary-format.test:

http://gerrit.cloudera.org:8080/#/c/22289/6/testdata/workloads/functional-query/queries/QueryTest/json-binary-format.test@72
PS6, Line 72: # Unsetting the property and setting the query option to 'base64' 
will have the same effect.
Nit: can you break the lines to fit into column width 90?  This is not needed 
---- CATCH strings, but should be done query text/comments



--
To view, visit http://gerrit.cloudera.org:8080/22289
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Idf61fa3afc0f33caa63fbc05393e975733165e82
Gerrit-Change-Number: 22289
Gerrit-PatchSet: 6
Gerrit-Owner: Zihao Ye <eyiz...@163.com>
Gerrit-Reviewer: Csaba Ringhofer <csringho...@cloudera.com>
Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com>
Gerrit-Reviewer: Zihao Ye <eyiz...@163.com>
Gerrit-Comment-Date: Tue, 18 Mar 2025 07:47:26 +0000
Gerrit-HasComments: Yes

Reply via email to