Neuw84 opened a new issue, #10598:
URL: https://github.com/apache/incubator-gluten/issues/10598

   ### Backend
   
   VL (Velox)
   
   ### Bug description
   
   I am trying to use Gluten with EMR Serverless and I get to this point. Seems 
that some code is trying to use functionality that is not available on EMR 
Serverless. I am trying to use a custom 
[image](https://docs.aws.amazon.com/emr/latest/EMR-Serverless-UserGuide/application-custom-image.html).
 
   
   
   ```
   
   # Set timezone environment variables for Velox
   ENV TZ=America/New_York
   ENV TZDIR=/usr/share/zoneinfo
   
   RUN ln -sf /usr/share/zoneinfo/America/New_York /etc/localtime
   ```
   However, no matter how I try to build it, I get this exception. 
   
   ```
   org.apache.gluten.exception.GlutenException: discover_tz_dir failed to find 
zoneinfo
   
        at 
org.apache.gluten.vectorized.PlanEvaluatorJniWrapper.nativeValidateWithFailureReason(Native
 Method) ~[gluten-velox-bundle-spark3.5_2.12-linux_amd64-1.4.0.jar:1.4.0]
        at 
org.apache.gluten.vectorized.NativePlanEvaluator.doNativeValidateWithFailureReason(NativePlanEvaluator.java:51)
 ~[gluten-velox-bundle-spark3.5_2.12-linux_amd64-1.4.0.jar:1.4.0]
   
   ```
   
   Any help or guidance would be great, at least from what the native code is 
trying to read/open. 
   
   ### Gluten version
   
   Gluten-1.4
   
   ### Spark version
   
   Spark-3.5.x
   
   ### Spark configurations
   
   On the application code: 
   
       .config("spark.gluten.enabled", "true") \
       .config("spark.gluten.sql.columnar.backend.lib", "velox") \
       .config("spark.memory.offHeap.enabled", "true") \
       .config("spark.memory.offHeap.size", "8g") \
       .config("spark.sql.legacy.parquet.nanosAsLong", "true") \
   
   On the Run config.
   
   --conf spark.plugins=org.apache.gluten.GlutenPlugin           --conf 
spark.executor.cores=4 --conf spark.executor.memory=16g --conf 
spark.executor.memoryOverhead=4096         --conf 
spark.dynamicAllocation.enabled=false  --conf spark.executor.instances=2        
  --conf spark.driver.cores=2 --conf spark.driver.memory=4g   --conf 
spark.sql.adaptive.enabled=false --conf 
spark.jars=s3://velox-gluten-emr-data-378683551918-us-east-2/gluten-velox/gluten-velox-bundle-spark3.5_2.12-linux_amd64-1.4.0.jar
 
   
   ### System information
   
   EMR Serverless 7.5.0 Spark 3.5.2
   
   ### Relevant logs
   
   ```bash
   
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to