[
https://issues.apache.org/jira/browse/ARROW-243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15402593#comment-15402593
]
Wes McKinney commented on ARROW-243:
------------------------------------
Yes -- part of the reason for using dlopen for libhdfs is that it's typically
part of a Hadoop distribution, so {{libhdfs.so}} will likely not be in
LD_LIBRARY_PATH. Since we already have this code in place
(https://github.com/apache/arrow/blob/master/cpp/src/arrow/io/libhdfs_shim.cc#L500),
making libhdfs3.so also a soft (i.e. not loaded when libarrow_io.so is loaded)
dependency isn't too much extra work.
This code would benefit from some refactoring (for example: there is only one
possible set of function pointers available in the shim layer -- you could put
these in static shim structs) to make switching between the libraries as
seamless as possible.
> Use generic HDFS component instead of libhdfs
> ---------------------------------------------
>
> Key: ARROW-243
> URL: https://issues.apache.org/jira/browse/ARROW-243
> Project: Apache Arrow
> Issue Type: New Feature
> Reporter: Ryan Lewis
>
> I would like to use for example libhdfs3 from pivotal to read apache parquet
> files. This would be a small change to the hdfs layer of apache arrow to
> support this.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)