parthchandra commented on code in PR #1756:
URL: https://github.com/apache/datafusion-comet/pull/1756#discussion_r2101451928


##########
native/core/src/parquet/parquet_exec.rs:
##########
@@ -61,12 +63,14 @@ pub(crate) fn init_datasource_exec(
     file_groups: Vec<Vec<PartitionedFile>>,
     projection_vector: Option<Vec<usize>>,
     data_filters: Option<Vec<Arc<dyn PhysicalExpr>>>,
+    default_values: Option<HashMap<usize, ScalarValue>>,
     session_timezone: &str,
 ) -> Result<Arc<DataSourceExec>, ExecutionError> {
     let (table_parquet_options, spark_parquet_options) = 
get_options(session_timezone);
-    let mut parquet_source = 
ParquetSource::new(table_parquet_options).with_schema_adapter_factory(
-        Arc::new(SparkSchemaAdapterFactory::new(spark_parquet_options)),
-    );
+    let mut parquet_source =
+        
ParquetSource::new(table_parquet_options).with_schema_adapter_factory(Arc::new(
+            SparkSchemaAdapterFactory::new(spark_parquet_options, 
default_values),

Review Comment:
   I don't think it makes sense to do that even though it might make the code. 
a little bit simpler. `default_values` are not exactly options. But I'm not 
going to argue if you choose to do it that way.



##########
native/core/src/parquet/mod.rs:
##########
@@ -715,6 +715,7 @@ pub unsafe extern "system" fn 
Java_org_apache_comet_parquet_Native_initRecordBat
             file_groups,
             None,
             data_filters,
+            None,

Review Comment:
   It's handled in the `ConstantColumnReader` which is shared between 
native_comet and native_iceberg_compat.
   Also see `ResolveDefaultColumns.getExistenceDefaultValues`. Not quite sure 
what the difference between ExistenceDefaultValues and simply default values is.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to