kaka11chen opened a new pull request, #23260:
URL: https://github.com/apache/doris/pull/23260

   
   ## Proposed changes
   
   ### Issue
   ```
   W0814 20:33:44.143478 427556 scanner_scheduler.cpp:377] Scan thread read 
VScanner failed: [INTERNAL_ERROR]InternalError: null
   W0814 20:33:44.505172 427550 jni-util.cpp:239] java.lang.InternalError
           at org.apache.hadoop.io.compress.zlib.ZlibDecompressor.init(Native 
Method)
           at 
org.apache.hadoop.io.compress.zlib.ZlibDecompressor.<init>(ZlibDecompressor.java:114)
           at 
org.apache.hadoop.io.compress.GzipCodec$GzipZlibDecompressor.<init>(GzipCodec.java:229)
           at 
org.apache.hadoop.io.compress.GzipCodec.createDecompressor(GzipCodec.java:188)
           at 
org.apache.hadoop.io.compress.CodecPool.getDecompressor(CodecPool.java:183)
           at 
org.apache.parquet.hadoop.CodecFactory$HeapBytesDecompressor.<init>(CodecFactory.java:99)
           at 
org.apache.parquet.hadoop.CodecFactory.createDecompressor(CodecFactory.java:223)
           at 
org.apache.parquet.hadoop.CodecFactory.getDecompressor(CodecFactory.java:212)
           at 
org.apache.parquet.hadoop.CodecFactory.getDecompressor(CodecFactory.java:43)
           at 
org.apache.parquet.hadoop.ParquetFileReader$Chunk.readAllPages(ParquetFileReader.java:1655)
           at 
org.apache.parquet.hadoop.ParquetFileReader$Chunk.readAllPages(ParquetFileReader.java:1538)
           at 
org.apache.parquet.hadoop.ParquetFileReader.readChunkPages(ParquetFileReader.java:1148)
   ```
   
   - Fix zlib init error by using doris's zlib shared library.  
   This error occurs when `ZlibDecompressor` loads the zlib shared library via 
`dlopen()` and calls the `inflateInit2_()` function. It will return error code 
`-2`. 
   Through testing, it is found that this may be because we have also linked 
the zlib library, resulting in different zlib library versions. And this 
situation only occurs under clang after testing. Anyway, we use a version to 
solve the current problem.
   
   - Fix the problem that the java log does not output in some environment's. 
The root cause is  `$HADOOP_CONF_DIR` has log4j.properties, it will override 
commons jars's log4j.properties.
   
   
   ## Further comments
   
   If this is a relatively large or complex change, kick off the discussion at 
[d...@doris.apache.org](mailto:d...@doris.apache.org) by explaining why you 
chose the solution you did and what alternatives you considered, etc...
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org

Reply via email to